56 datasets found

Amazon employees 2007-2024
statista.com
gruabehub.com
Updated Jun 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Amazon employees 2007-2024 [Dataset]. https://www.statista.com/statistics/234488/number-of-amazon-employees/
Explore at:
Dataset updated
Jun 25, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
United States, Worldwide
Description
The combined number of full- and part-time employees of Amazon.com has increased significantly since 2017. Amazon’s headcount peaked in 2021 when the American multinational e-commerce company employed ********* full- and part-time employees, not counting external contractors. However, in 2024, the number dropped to *********. E-commerce crunch The workforce reduction of Amazon follows the mass layoffs hitting the entire e-commerce sector. With the full reopening of physical stores after the COVID-19 pandemic, online shopping demand decreased, leading online retailers to restructure their businesses, including personnel costs. Diversifying business With online retail sales growing slower due to recession and inflation, Amazon can still leverage other profitable revenue segments — from media subscriptions to server hosting and cloud services. On top of that, in 2023 Amazon monitored small enterprises operating in different fields and strategically invested in them, as disclosed startup acquisitions indicate.

Amazon Employee Access Challenge

kaggle.com

Updated Aug 11, 2021

Facebook

Twitter

Click to copy link

Link copied

Cite

Luca Massaron (2021). Amazon Employee Access Challenge [Dataset]. https://www.kaggle.com/lucamassaron/amazon-employee-access-challenge/code

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Aug 11, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Luca Massaron

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

When an employee at any company starts work, they first need to obtain the computer access necessary to fulfill their role. This access may allow an employee to read/manipulate resources through various applications or web portals. It is assumed that employees fulfilling the functions of a given role will access the same or similar resources. It is often the case that employees figure out the access they need as they encounter roadblocks during their daily work (e.g. not able to log into a reporting portal). A knowledgeable supervisor then takes time to manually grant the needed access in order to overcome access obstacles. As employees move throughout a company, this access discovery/recovery cycle wastes a nontrivial amount of time and money.

There is a considerable amount of data regarding an employee’s role within an organization and the resources to which they have access. Given the data related to current employees and their provisioned access, models can be built that automatically determine access privileges as employees enter and leave roles within a company. These auto-access models seek to minimize the human involvement required to grant or revoke employee access.

Part of the competition "Amazon.com - Employee Access Challenge" (https://www.kaggle.com/c/amazon-employee-access-challenge), the data consists of real historical data collected from 2010 & 2011. Employees are manually allowed or denied access to resources over time. Your task is to create an algorithm capable of learning from this historical data to predict approval/denial for an unseen set of employees.

Content

The data comes from Amazon Inc. collected from 2010-2011 (published on Kaggle platform). The training set consists of 32769 samples and the testing one of 58922 samples. The training set has one label attribute named “ACTION”, whose value “1” indicates an application is approved whereas “0” indicates rejection. As predictors of this state, there are eight features, indicating characteristics of the required resource anf the role and work group of the employee at Amazon requesting access.

train.csv - The training set. Each row has the ACTION (ground truth), RESOURCE, and information about the employee's role at the time of approval

test.csv - The test set for which predictions should be made. Each row asks whether an employee having the listed characteristics should have access to the listed resource.

Column Name	Description
ACTION	ACTION is 1 if the resource was approved, 0 if the resource was not
RESOURCE	An ID for each resource
MGR_ID	The EMPLOYEE ID of the manager of the current EMPLOYEE ID record; an employee may have only one manager at a time
ROLE_ROLLUP_1	Company role grouping category id 1 (e.g. US Engineering)
ROLE_ROLLUP_2	Company role grouping category id 2 (e.g. US Retail)
ROLE_DEPTNAME	Company role department description (e.g. Retail)
ROLE_TITLE	Company role business title description (e.g. Senior Engineering Retail Manager)
ROLE_FAMILY_DESC	Company role family extended description (e.g. Retail Manager, Software Engineering)
ROLE_FAMILY	Company role family description (e.g. Retail Manager)
ROLE_CODE	Company role code; this code is unique to each role (e.g. Manager)

Models are judged on area under the ROC curve (https://en.wikipedia.org/wiki/Receiver_operating_characteristic)

Acknowledgements

The data has been donated by Amazon and the original competition has been hosted in collaboration with the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013)

w
Dataset of business metrics of companies called Amazon
workwithdata.com
Updated May 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Dataset of business metrics of companies called Amazon [Dataset]. https://www.workwithdata.com/datasets/companies?col=ceo%2Cceo_approval%2Cceo_gender%2Ccity%2Cemployees&f=1&fcol0=company&fop0=%3D&fval0=Amazon
Explore at:
Dataset updated
May 6, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about companies. It has 19 rows and is filtered where the company is Amazon. It features 5 columns: employees, CEO, CEO gender, and CEO approval.
m
Amazon.com Inc - Total-Cash-From-Operating-Activities
macro-rankings.com
csv, excel
Updated Mar 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Amazon.com Inc - Total-Cash-From-Operating-Activities [Dataset]. https://www.macro-rankings.com/Markets/Stocks?Entity=AMZN.US&Item=Total-Cash-From-Operating-Activities
Explore at:
csv, excelAvailable download formats
Dataset updated
Mar 18, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
united states
Description
Total-Cash-From-Operating-Activities Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.
g
Webpage capture on the news article of Amazon workers are rising up around...
gimi9.com
Updated Mar 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Webpage capture on the news article of Amazon workers are rising up around the world to say: enough | gimi9.com [Dataset]. https://gimi9.com/dataset/mekong_649f22c56c6202c630cf37e32a4ce3b94fba2647
Explore at:
Dataset updated
Mar 23, 2025
Description
This webpage capture is the reference for Labor incidents dataset. It contains news articles from international newspapers.
m
Amazon.com Inc - Operating-Expenses
macro-rankings.com
csv, excel
Updated Jul 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Amazon.com Inc - Operating-Expenses [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/income-statement/operating-expenses
Explore at:
csv, excelAvailable download formats
Dataset updated
Jul 24, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
united states
Description
Operating-Expenses Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.
s
Sentiment popularity - Amazon Mechanical Turk dataset
eprints.soton.ac.uk
Updated Nov 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
VENANZI, MATTEO; Teacy, William; Rogers, Alexander; Jennings, Nicholas (2023). Sentiment popularity - Amazon Mechanical Turk dataset [Dataset]. http://doi.org/10.5258/SOTON/376544
Explore at:
Unique identifier
https://doi.org/10.5258/SOTON/376544
Dataset updated
Nov 4, 2023
Dataset provided by
University of Southampton
Authors
VENANZI, MATTEO; Teacy, William; Rogers, Alexander; Jennings, Nicholas
Description
Dataset re-collected from an original dataset collected by Pang, B., and Lee, L. 2004. "A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts". In Proceedings of the 42nd annual meeting on Association for Computational Linguistics. The dataset presents a binary classification problem, with workers asked to select either positive (1) or negative (0) for a 500 sentences extracted from movie reviews, with gold labels assigned by the website. It contains 10,000 sentiment judgements collected from 143 using the Amazon Mechanical Turk platform. Each row is in the format WorkerID, TaskID, Worker label, Gold label, time spent on the judgement by the worker
d
More than 1,070,574 Verified Contacts of companies that use Amazon AWS
datarade.ai
Updated Aug 20, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DataCaptive (2021). More than 1,070,574 Verified Contacts of companies that use Amazon AWS [Dataset]. https://datarade.ai/data-providers/datacaptive/data-products/more-than-1-070-574-verified-contacts-of-companies-that-use-a-datacaptive
Explore at:
.json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Aug 20, 2021
Dataset authored and provided by
DataCaptive
Area covered
Rwanda, Kyrgyzstan, Niger, Tunisia, Singapore, Virgin Islands (British), British Indian Ocean Territory, Saint Helena, Iceland, Tonga
Description
Amazon AWS - Cloud Platforms & Services

Companies using Amazon AWS

We have data on 1,070,574 companies that use Amazon AWS. The companies using Amazon AWS are most often found in United States and in the Computer Software industry. Amazon AWS is most often used by companies with 10-50 employees and 1M-10M dollars in revenue. Our data for Amazon AWS usage goes back as far as 2 years and 1 months.

What is Amazon AWS?

Amazon Web Services (AWS) is a collection of remote computing services, also called web services that make up a cloud computing platform offered by Amazon.com.

Top Industries that use Amazon AWS

Looking at Amazon AWS customers by industry, we find that Computer Software (6%) is the largest segment.

Distribution of companies using Amazon AWS by Industry

 Computer software - 67, 537 companies  Hospitals & Healthcare - 54, 293 companies  Retail - 39, 543 companies  Information Technology and Services - 35, 382 companies  Real Estate - 31, 676 companies  Restaurants - 30, 302 companies  Construction - 29, 207 companies  Automotive - 28, 469 companies  Financial Services - 23, 680 companies  Education Management - 21, 548 companies

Top Countries that use Amazon AWS

49% of Amazon AWS customers are in United States and 7% are in United Kingdom.

Distribution of companies using Amazon AWS by country

 United Sates – 616 2275 companies  United Kingdom – 68 219 companies  Australia – 44 601 companies  Canada – 42 770 companies  Germany – 31 541 companies  India – 30 949 companies  Netherlands – 19 543 companies  Brazil – 17 165 companies  Italy – 14 876 companies  Spain – 14 675 companies

Contact Information of Fields Include:-

• Company Name • Business contact number • Title
• Name • Email Address • Country, State, City, Zip Code • Phone, Mobile and Fax • Website • Industry • SIC & NAICS Code • Employees Size
• Revenue Size
• And more…

Why Buy AWS Users List from DataCaptive?

• More than 1,070,574 companies
• Responsive database • Customizable as per your requirements • Email and Tele-verified list • Team of 100+ market researchers • Authentic data sources

What’s in for you?

Over choosing us, here are a few advantages we authenticate-

• Locate, target, and prospect leads from 170+ countries • Design and execute ABM and multi-channel campaigns • Seamless and smooth pre-and post-sale customer service • Connect with old leads and build a fruitful customer relationship • Analyze the market for product development and sales campaigns • Boost sales and ROI with increased customer acquisition and retention

Our security compliance

We use of globally recognized data laws like –

GDPR, CCPA, ACMA, EDPS, CAN-SPAM and ANTI CAN-SPAM to ensure the privacy and security of our database. We engage certified auditors to validate our security and privacy by providing us with certificates to represent our security compliance.

Our USPs- what makes us your ideal choice?

At DataCaptive™, we strive consistently to improve our services and cater to the needs of businesses around the world while keeping up with industry trends.

• Elaborate data mining from credible sources • 7-tier verification, including manual quality check • Strict adherence to global and local data policies • Guaranteed 95% accuracy or cash-back • Free sample database available on request

Guaranteed benefits of our Amazon AWS users email database!

85% email deliverability and 95% accuracy on other data fields

We understand the importance of data accuracy and employ every avenue to keep our database fresh and updated. We execute a multi-step QC process backed by our Patented AI and Machine learning tools to prevent anomalies in consistency and data precision. This cycle repeats every 45 days. Although maintaining 100% accuracy is quite impractical, since data such as email, physical addresses, and phone numbers are subjected to change, we guarantee 85% email deliverability and 95% accuracy on other data points.

100% replacement in case of hard bounces

Every data point is meticulously verified and then re-verified to ensure you get the best. Data Accuracy is paramount in successfully penetrating a new market or working within a familiar one. We are committed to precision. However, in an unlikely event where hard bounces or inaccuracies exceed the guaranteed percentage, we offer replacement with immediate effect. If need be, we even offer credits and/or refunds for inaccurate contacts.

Other promised benefits

• Contacts are for the perpetual usage • The database comprises consent-based opt-in contacts only • The list is free of duplicate contacts and generic emails • Round-the-clock customer service assistance • 360-degree database solutions
e
The average laboratory samples a population of 7,300 Amazon Mechanical Turk...
b2find.eudat.eu
Updated Apr 8, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2015). The average laboratory samples a population of 7,300 Amazon Mechanical Turk workers 2012-2017 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/76cb46bd-03f6-56b0-9302-f4a3e57462a5
Explore at:
Dataset updated
Apr 8, 2015
Description
Using capture-recapture analysis we estimate the effective size of the active Amazon Mechanical Turk (MTurk) population that a typical laboratory can access to be about 7,300 workers. We also estimate that the time taken for half of the workers to leave the MTurk pool and be replaced is about 7 months. Each laboratory has its own population pool which overlaps, often extensively, with the hundreds of other laboratories using MTurk. Our estimate is based on a sample of 114,460 completed sessions from 33,408 unique participants and 689 sessions across seven laboratories in the US, Europe, and Australia from January 2012 to March 2015.This network project brings together economists, psychologists, computer and complexity scientists from three leading centres for behavioural social science at Nottingham, Warwick and UEA. This group will lead a research programme with two broad objectives: to develop and test cross-disciplinary models of human behaviour and behaviour change; to draw out their implications for the formulation and evaluation of public policy. Foundational research will focus on three inter-related themes: understanding individual behaviour and behaviour change; understanding social and interactive behaviour; rethinking the foundations of policy analysis. The project will explore implications of the basic science for policy via a series of applied projects connecting naturally with the three themes. These will include: the determinants of consumer credit behaviour; the formation of social values; strategies for evaluation of policies affecting health and safety. The research will integrate theoretical perspectives from multiple disciplines and utilise a wide range of complementary methodologies including: theoretical modeling of individuals, groups and complex systems; conceptual analysis; lab and field experiments; analysis of large data sets. The Network will promote high quality cross-disciplinary research and serve as a policy forum for understanding behaviour and behaviour change. Experimental data. We used an open-population capture-recapture analysis (Cormack, 1989), which allows for MTurk workers to enter and leave the population. As we found moderate turnover rates, these open-population models are more appropriate than the closed-population models (Otis, Burnham, White, & Anderson, 1978).
Z
Data from: SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural...
data.niaid.nih.gov
zenodo.org
Updated Mar 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tsiakoulis, Pirros (2025). SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7119399
Explore at:
Dataset updated
Mar 7, 2025
Dataset provided by
Ellinas, Nikolaos
Maniati, Georgia
Chalamandaris, Aimilios
Tsiakoulis, Pirros
Sung, June Sig
Vioni, Alexandra
Klapsas, Konstantinos
Nikitaras, Karolos
Jho, Gunu
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
This is the public release of the Samsung Open Mean Opinion Scores (SOMOS) dataset for the evaluation of neural text-to-speech (TTS) synthesis, which consists of audio files generated with a public domain voice from trained TTS models based on bibliography, and numbers assigned to each audio as quality (naturalness) evaluations by several crowdsourced listeners.DescriptionThe SOMOS dataset contains 20,000 synthetic utterances (wavs), 100 natural utterances and 374,955 naturalness evaluations (human-assigned scores in the range 1-5). The synthetic utterances are single-speaker, generated by training several Tacotron-like acoustic models and an LPCNet vocoder on the LJ Speech voice public dataset. 2,000 text sentences were synthesized, selected from Blizzard Challenge texts of years 2007-2016, the LJ Speech corpus as well as Wikipedia and general domain data from the Internet.Naturalness evaluations were collected via crowdsourcing a listening test on Amazon Mechanical Turk in the US, GB and CA locales. The records of listening test participants (workers) are fully anonymized. Statistics on the reliability of the scores assigned by the workers are also included, generated through processing the scores and validation controls per submission page.

To listen to audio samples of the dataset, please see our Github page.

The dataset release comes with a carefully designed train-validation-test split (70%-15%-15%) with unseen systems, listeners and texts, which can be used for experimentation on MOS prediction.

This version also contains the necessary resources to obtain the transcripts corresponding to all dataset audios.

Terms of use

The dataset may be used for research purposes only, for non-commercial purposes only, and may be distributed with the same terms.

Every time you produce research that has used this dataset, please cite the dataset appropriately.

Cite as:

@inproceedings{maniati22_interspeech, author={Georgia Maniati and Alexandra Vioni and Nikolaos Ellinas and Karolos Nikitaras and Konstantinos Klapsas and June Sig Sung and Gunu Jho and Aimilios Chalamandaris and Pirros Tsiakoulis}, title={{SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis}}, year=2022, booktitle={Proc. Interspeech 2022}, pages={2388--2392}, doi={10.21437/Interspeech.2022-10922} }

References of resources & models used

Voice & synthesized texts:K. Ito and L. Johnson, “The LJ Speech Dataset,” https://keithito.com/LJ-Speech-Dataset/, 2017.

Vocoder:J.-M. Valin and J. Skoglund, “LPCNet: Improving neural speech synthesis through linear prediction,” in Proc. ICASSP, 2019.R. Vipperla, S. Park, K. Choo, S. Ishtiaq, K. Min, S. Bhattacharya, A. Mehrotra, A. G. C. P. Ramos, and N. D. Lane, “Bunched lpcnet: Vocoder for low-cost neural text-to-speech systems,” in Proc. Interspeech, 2020.

Acoustic models:N. Ellinas, G. Vamvoukakis, K. Markopoulos, A. Chalamandaris, G. Maniati, P. Kakoulidis, S. Raptis, J. S. Sung, H. Park, and P. Tsiakoulis, “High quality streaming speech synthesis with low, sentence-length-independent latency,” in Proc. Interspeech, 2020.Y. Wang, R. Skerry-Ryan, D. Stanton, Y. Wu, R. J. Weiss, N. Jaitly, Z. Yang, Y. Xiao, Z. Chen, S. Bengio et al., “Tacotron: Towards End-to-End Speech Synthesis,” in Proc. Interspeech, 2017.J. Shen, R. Pang, R. J. Weiss, M. Schuster, N. Jaitly, Z. Yang, Z. Chen, Y. Zhang, Y. Wang, R. Skerrv-Ryan et al., “Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions,” in Proc. ICASSP, 2018.J. Shen, Y. Jia, M. Chrzanowski, Y. Zhang, I. Elias, H. Zen, and Y. Wu, “Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling,” arXiv preprint arXiv:2010.04301, 2020.M. Honnibal and M. Johnson, “An Improved Non-monotonic Transition System for Dependency Parsing,” in Proc. EMNLP, 2015.M. Dominguez, P. L. Rohrer, and J. Soler-Company, “PyToBI: A Toolkit for ToBI Labeling Under Python,” in Proc. Interspeech, 2019.Y. Zou, S. Liu, X. Yin, H. Lin, C. Wang, H. Zhang, and Z. Ma, “Fine-grained prosody modeling in neural speech synthesis using ToBI representation,” in Proc. Interspeech, 2021.K. Klapsas, N. Ellinas, J. S. Sung, H. Park, and S. Raptis, “WordLevel Style Control for Expressive, Non-attentive Speech Synthesis,” in Proc. SPECOM, 2021.T. Raitio, R. Rasipuram, and D. Castellani, “Controllable neural text-to-speech synthesis using intuitive prosodic features,” in Proc. Interspeech, 2020.

Synthesized texts from the Blizzard Challenges 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2016:M. Fraser and S. King, "The Blizzard Challenge 2007," in Proc. SSW6, 2007.V. Karaiskos, S. King, R. A. Clark, and C. Mayo, "The Blizzard Challenge 2008," in Proc. Blizzard Challenge Workshop, 2008.A. W. Black, S. King, and K. Tokuda, "The Blizzard Challenge 2009," in Proc. Blizzard Challenge, 2009.S. King and V. Karaiskos, "The Blizzard Challenge 2010," 2010.S. King and V. Karaiskos, "The Blizzard Challenge 2011," 2011.S. King and V. Karaiskos, "The Blizzard Challenge 2012," 2012.S. King and V. Karaiskos, "The Blizzard Challenge 2013," 2013.S. King and V. Karaiskos, "The Blizzard Challenge 2016," 2016.

Contact

Alexandra Vioni - a.vioni@samsung.com

If you have any questions or comments about the dataset, please feel free to write to us.

We are interested in knowing if you find our dataset useful! If you use our dataset, please email us and tell us about your research.
i
Human Annotated GAN Images
ieee-dataport.org
Updated Apr 25, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Juanyong Duan (2019). Human Annotated GAN Images [Dataset]. https://ieee-dataport.org/documents/human-annotated-gan-images
Explore at:
Dataset updated
Apr 25, 2019
Authors
Juanyong Duan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
we have collected an annotated dataset that contains 600 fake images and 400 real images. All the fake images are generated by a generative adversarial net and all the real images are downsampled images from the ImageNet dataset. These images are evaluated by 10 workers from the Amazon Mechanical Turk (AMT) based on eight carefully defined attributes.
f
Crowdsourced datasets to study the generation and impact of text...
figshare.com
bin
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jorge Ramírez; Marcos Baez; Fabio Casati; Boualem Benatallah (2023). Crowdsourced datasets to study the generation and impact of text highlighting in classification tasks [Dataset]. http://doi.org/10.6084/m9.figshare.9917162.v4
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.9917162.v4
Dataset updated
May 30, 2023
Dataset provided by
figshare
Authors
Jorge Ramírez; Marcos Baez; Fabio Casati; Boualem Benatallah
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Here we present the datasets derived from our experiments on using crowdsourcing for document classification tasks. These experiments resemble a two-step process that first highlights excerpts from the text and then leverage these to workers for classification. Thus our experiments groups into highlighting generation and classification. For generating highlights, we leverage crowdsourcing and automatic approaches such us extractive summarization and question answering models. For our classification experiments, we consider documents from two different domains: systematic literature reviews and amazon product reviews. Specifically, we study how highlighting text passages could aid workers in judging the relevance of a document given an input question. We spec these datasets to benefit not only to study these particular problem domains but a broader set of classification problems where individual judgments from workers are scarce.In a nutshell, the datasets represent two kinds of tasks:- classification tasks with highlighting support.- highlighting tasks, where the workers highlight evidence.Classification tasksIn this task, workers classified documents based on a given predicate. classification tasks using crowdsourced highlightsFiles:- classification_amazon-crowd-highlights.csv- classification_oa-crowd-highlights.csv- classification_tech-crowd-highlights.csv- classification_tech-3x12-crowd-highlights.csv- classification_tech-6x6-crowd-highlights.csvclassification tasks using ML-generated highlightsFiles:- classification_amazon-ML-highlights.csv- classification_oa-ML-highlights.csv- classification_tech-ML-highlights.csvHighlighting taskscrowdsourced highlightsIn this task, workers highlighted excerpts from documents that are relevant to a given predicate, to support future classification tasks.File: crowdsourced_highlights.csv.The file contains one line per highlight (generated by one worker); the column that holds the highlighted fragment(s) is highlighted_text. The highlighted_text is a "list of lists" (Python syntax), so iterating over this list will give you the text fragment generated by one worker. Also, the experiment column indicates domain + task design. So, to get the highlights used in the classification experiments, use the rows that end with "-highlight".ML-generated highlightsWe also consider automatic approaches to generate text highlights — specifically, extractive summarization and question-answering models.File: ml_highlights.csv.
Operator Financial Figures - Revenue, Profit/Loss, Employees for commercial...
datarade.ai
.csv
Updated Dec 31, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ch-aviation (2011). Operator Financial Figures - Revenue, Profit/Loss, Employees for commercial airlines worldwide [Dataset]. https://datarade.ai/data-providers/ch-aviation/data-products/operator-financial-figures-revenue-profit-loss-employees-ch-aviation
Explore at:
.csvAvailable download formats
Dataset updated
Dec 31, 2011
Dataset provided by
ch-aviation GmbHhttp://www.ch-aviation.com/
Authors
ch-aviation
Area covered
Haiti, Holy See, Estonia, Congo (Democratic Republic of the), Tonga, Namibia, Benin, Finland, Afghanistan, Iceland
Description
Operator Financial Figures provided by ch-aviation include annual revenue figures (broken down to passenger, cargo, other, and total revenue), operating and net profit as well as employee numbers for each financial year including the financial year end date.

The data set is updated monthly.

The sample dataset shows financial figures for Swiss, Alaska Airlines, and Horizon Air from 2011.

Contact us to get access to ch-aviation's AWS S3 sample data bucket as well allowing you to build proof of concepts with all of our sample data.

The direct bucket URL for this data set is: https://eu-central-1.console.aws.amazon.com/s3/buckets/dataservices-standardised-samples?region=eu-central-1&bucketType=general&prefix=operator_financial_figures/&showversions=false

Full Technical Data Dictionary: https://about.ch-aviation.com/operator-financial-figures/
Tech layoffs worldwide 2020-2024, by quarter
statista.com
Updated Jul 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Tech layoffs worldwide 2020-2024, by quarter [Dataset]. https://www.statista.com/statistics/199999/worldwide-tech-layoffs-covid-19/
Explore at:
Dataset updated
Jul 1, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The tech industry had a rough start to 2024. Technology companies worldwide saw a significant reduction in their workforce in the first quarter of 2024, with over ** thousand employees being laid off. By the second quarter, layoffs impacted more than ** thousand tech employees. In the final quarter of the year around ** thousand employees were laid off. Layoffs impacting all global tech giants Layoffs in the global market escalated dramatically in the first quarter of 2023, when the sector saw a staggering record high of ***** thousand employees losing their jobs. Major tech giants such as Google, Microsoft, Meta, and IBM all contributed to this figure during this quarter. Amazon, in particular, conducted the most rounds of layoffs with the highest number of employees laid off among global tech giants. Industries most affected include the consumer, hardware, food, and healthcare sectors. Notable companies that have laid off a significant number of staff include Flink, Booking.com, Uber, PayPal, LinkedIn, and Peloton, among others. Overhiring led the trend, but will AI keep it going? Layoffs in the technology sector started following an overhiring spree during the COVID-19 pandemic. Initially, companies expanded their workforce to meet increased demand for digital services during lockdowns. However, as lockdowns ended, economic uncertainties persisted and companies reevaluated their strategies, layoffs became inevitable, resulting in a record number of *** thousand laid off employees in the global tech sector by the end of 2022. Moreover, it is still unclear how advancements in artificial intelligence (AI) will impact layoff trends in the tech sector. AI-driven automation can replace manual tasks leading to workforce redundancies. Whether through chatbots handling customer inquiries or predictive algorithms optimizing supply chains, the pursuit of efficiency and cost savings may result in more tech industry layoffs in the future.
m
Amazon.com Inc - Cogs-Excluding-Depreciation-and-Amortization
macro-rankings.com
csv, excel
Updated Aug 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Amazon.com Inc - Cogs-Excluding-Depreciation-and-Amortization [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/income-statement/cogs-excluding-depreciation-and-amortization
Explore at:
csv, excelAvailable download formats
Dataset updated
Aug 11, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
united states
Description
Cogs-Excluding-Depreciation-and-Amortization Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It also provides AgentCore services, such as AgentCore Runtime, AgentCore Memory, AgentCore Observability, AgentCore Identity, AgentCore Gateway, AgentCore Browser, and AgentCore Code Interpreter. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.
H
Large Dataset of Generalization Patterns in the Number Game
dataverse.harvard.edu
search.dataone.org
Updated Aug 10, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eric J. Bigelow; Steven T. Piantadosi (2018). Large Dataset of Generalization Patterns in the Number Game [Dataset]. http://doi.org/10.7910/DVN/A8ZWLF
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/A8ZWLF
Dataset updated
Aug 10, 2018
Dataset provided by
Harvard Dataverse
Authors
Eric J. Bigelow; Steven T. Piantadosi
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
272,700 two-alternative forced choice responses in a simple numerical task modeled after Tenenbaum (1999, 2000), collected from 606 Amazon Mechanical Turk workers. Subjects were shown sets of numbers length 1 to 4 from the range 1 to 100 (e.g. {12, 16}), and asked what other numbers were likely to belong to that set (e.g. 1, 5, 2, 98). Their generalization patterns reflect both rule-like (e.g. “even numbers,” “powers of two”) and distance-based (e.g. numbers near 50) generalization. This data set is available for further analysis of these simple and intuitive inferences, developing of hands-on modeling instruction, and attempts to understand how probability and rules interact in human cognition.
d
Data for Anxiety Undermined Job Satisfaction Among Essential Workers in the...
search.dataone.org
dataverse.harvard.edu
Updated Nov 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CHOI, HYERI (2023). Data for Anxiety Undermined Job Satisfaction Among Essential Workers in the U.S. during the COVID-19 Pandemic [Dataset]. http://doi.org/10.7910/DVN/RMUJNX
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/RMUJNX
Dataset updated
Nov 14, 2023
Dataset provided by
Harvard Dataverse
Authors
CHOI, HYERI
Description
This is a dataset collected by the author through Amazon Mechanical Turk for a manuscript - Anxiety Undermined Job Satisfaction Among Essential Workers in the U.S. during the COVID-19 Pandemic. Codebook or other information can be provided upon request.
h
OffensiveLang
huggingface.co
Updated Jun 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Amit Das (2024). OffensiveLang [Dataset]. https://huggingface.co/datasets/AmitDasRup123/OffensiveLang
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 23, 2024
Authors
Amit Das
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
OffensiveLang

Overview

OffensiveLang is a community based implicit offensive language dataset generated by ChatGPT 3.5 containing data for 38 different target groups. It has been meticulously annotated by Amazon MTurk workers, ensuring high-quality labeling of hate speech. Additionally, a prompt-based zero-shot method was employed with ChatGPT and the detection results were compared between human annotation and ChatGPT annotation. This dataset is invaluable for… See the full description on the dataset page: https://huggingface.co/datasets/AmitDasRup123/OffensiveLang.
Data from: DiPCo -- Dinner Party Corpus
zenodo.org
application/gzip, pdf
Updated Jul 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas; Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas (2024). DiPCo -- Dinner Party Corpus [Dataset]. http://doi.org/10.21437/interspeech.2020-2800
Explore at:
application/gzip, pdfAvailable download formats
Unique identifier
https://doi.org/10.21437/interspeech.2020-2800
Dataset updated
Jul 11, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas; Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas
License
https://cdla.io/permissive-1-0https://cdla.io/permissive-1-0
Description
We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English around a dining table. The participants were recorded by a single-channel close-talk microphone and by five far-field 7-microphone array devices positioned at different locations in the recording room. The dataset contains the audio recordings and human labeled transcripts of a total of 10 sessions with a duration between 15 and 45 minutes. The corpus was created to advance in the field of noise robust and distant speech processing and is intended to serve as a public research and benchmarking data set.
h
clcifar10
huggingface.co
Updated Jul 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NTU Computational Learning Lab (2025). clcifar10 [Dataset]. https://huggingface.co/datasets/ntucllab/clcifar10
Explore at:
Dataset updated
Jul 4, 2025
Dataset authored and provided by
NTU Computational Learning Lab
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset Card for CLCIFAR10

This Complementary labeled CIFAR10 dataset contains 3 human-annotated complementary labels for all 50000 images in the training split of CIFAR10. The workers are from Amazon Mechanical Turk. We randomly sampled 4 different labels for 3 different annotators, so each image would have 3 (probably repeated) complementary labels. For more details, please visit our github or paper.

Dataset Structure Data Instances

A sample from the… See the full description on the dataset page: https://huggingface.co/datasets/ntucllab/clcifar10.

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2025). Amazon employees 2007-2024 [Dataset]. https://www.statista.com/statistics/234488/number-of-amazon-employees/

Amazon employees 2007-2024

Explore at:

43 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Jun 25, 2025

Dataset authored and provided by

Statistahttp://statista.com/

Area covered

United States, Worldwide

Description

The combined number of full- and part-time employees of Amazon.com has increased significantly since 2017. Amazon’s headcount peaked in 2021 when the American multinational e-commerce company employed ********* full- and part-time employees, not counting external contractors. However, in 2024, the number dropped to *********. E-commerce crunch The workforce reduction of Amazon follows the mass layoffs hitting the entire e-commerce sector. With the full reopening of physical stores after the COVID-19 pandemic, online shopping demand decreased, leading online retailers to restructure their businesses, including personnel costs. Diversifying business With online retail sales growing slower due to recession and inflation, Amazon can still leverage other profitable revenue segments — from media subscriptions to server hosting and cloud services. On top of that, in 2023 Amazon monitored small enterprises operating in different fields and strategically invested in them, as disclosed startup acquisitions indicate.

Clear search

Close search

Google apps

Main menu

Amazon employees 2007-2024

Amazon Employee Access Challenge

Context

Content

Acknowledgements

Dataset of business metrics of companies called Amazon

Amazon.com Inc - Total-Cash-From-Operating-Activities

Webpage capture on the news article of Amazon workers are rising up around...

Amazon.com Inc - Operating-Expenses

Sentiment popularity - Amazon Mechanical Turk dataset

More than 1,070,574 Verified Contacts of companies that use Amazon AWS

The average laboratory samples a population of 7,300 Amazon Mechanical Turk...

Data from: SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural...

Human Annotated GAN Images

Crowdsourced datasets to study the generation and impact of text...

Operator Financial Figures - Revenue, Profit/Loss, Employees for commercial...

Tech layoffs worldwide 2020-2024, by quarter

Amazon.com Inc - Cogs-Excluding-Depreciation-and-Amortization

Large Dataset of Generalization Patterns in the Number Game

Data for Anxiety Undermined Job Satisfaction Among Essential Workers in the...

OffensiveLang

Data from: DiPCo -- Dinner Party Corpus

clcifar10

Amazon employees 2007-2024