56 datasets found
  1. Amazon employees 2007-2024

    • statista.com
    • gruabehub.com
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Amazon employees 2007-2024 [Dataset]. https://www.statista.com/statistics/234488/number-of-amazon-employees/
    Explore at:
    Dataset updated
    Jun 25, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States, Worldwide
    Description

    The combined number of full- and part-time employees of Amazon.com has increased significantly since 2017. Amazon’s headcount peaked in 2021 when the American multinational e-commerce company employed ********* full- and part-time employees, not counting external contractors. However, in 2024, the number dropped to *********. E-commerce crunch The workforce reduction of Amazon follows the mass layoffs hitting the entire e-commerce sector. With the full reopening of physical stores after the COVID-19 pandemic, online shopping demand decreased, leading online retailers to restructure their businesses, including personnel costs. Diversifying business With online retail sales growing slower due to recession and inflation, Amazon can still leverage other profitable revenue segments — from media subscriptions to server hosting and cloud services. On top of that, in 2023 Amazon monitored small enterprises operating in different fields and strategically invested in them, as disclosed startup acquisitions indicate.

  2. Amazon Employee Access Challenge

    • kaggle.com
    Updated Aug 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luca Massaron (2021). Amazon Employee Access Challenge [Dataset]. https://www.kaggle.com/lucamassaron/amazon-employee-access-challenge/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 11, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Luca Massaron
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    When an employee at any company starts work, they first need to obtain the computer access necessary to fulfill their role. This access may allow an employee to read/manipulate resources through various applications or web portals. It is assumed that employees fulfilling the functions of a given role will access the same or similar resources. It is often the case that employees figure out the access they need as they encounter roadblocks during their daily work (e.g. not able to log into a reporting portal). A knowledgeable supervisor then takes time to manually grant the needed access in order to overcome access obstacles. As employees move throughout a company, this access discovery/recovery cycle wastes a nontrivial amount of time and money.

    There is a considerable amount of data regarding an employee’s role within an organization and the resources to which they have access. Given the data related to current employees and their provisioned access, models can be built that automatically determine access privileges as employees enter and leave roles within a company. These auto-access models seek to minimize the human involvement required to grant or revoke employee access.

    Part of the competition "Amazon.com - Employee Access Challenge" (https://www.kaggle.com/c/amazon-employee-access-challenge), the data consists of real historical data collected from 2010 & 2011. Employees are manually allowed or denied access to resources over time. Your task is to create an algorithm capable of learning from this historical data to predict approval/denial for an unseen set of employees.

    Content

    The data comes from Amazon Inc. collected from 2010-2011 (published on Kaggle platform). The training set consists of 32769 samples and the testing one of 58922 samples. The training set has one label attribute named “ACTION”, whose value “1” indicates an application is approved whereas “0” indicates rejection. As predictors of this state, there are eight features, indicating characteristics of the required resource anf the role and work group of the employee at Amazon requesting access.

    train.csv - The training set. Each row has the ACTION (ground truth), RESOURCE, and information about the employee's role at the time of approval

    test.csv - The test set for which predictions should be made. Each row asks whether an employee having the listed characteristics should have access to the listed resource.

    Column NameDescription
    ACTIONACTION is 1 if the resource was approved, 0 if the resource was not
    RESOURCEAn ID for each resource
    MGR_IDThe EMPLOYEE ID of the manager of the current EMPLOYEE ID record; an employee may have only one manager at a time
    ROLE_ROLLUP_1Company role grouping category id 1 (e.g. US Engineering)
    ROLE_ROLLUP_2Company role grouping category id 2 (e.g. US Retail)
    ROLE_DEPTNAMECompany role department description (e.g. Retail)
    ROLE_TITLECompany role business title description (e.g. Senior Engineering Retail Manager)
    ROLE_FAMILY_DESCCompany role family extended description (e.g. Retail Manager, Software Engineering)
    ROLE_FAMILYCompany role family description (e.g. Retail Manager)
    ROLE_CODECompany role code; this code is unique to each role (e.g. Manager)

    Models are judged on area under the ROC curve (https://en.wikipedia.org/wiki/Receiver_operating_characteristic)

    Acknowledgements

    The data has been donated by Amazon and the original competition has been hosted in collaboration with the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013)

  3. w

    Dataset of business metrics of companies called Amazon

    • workwithdata.com
    Updated May 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of business metrics of companies called Amazon [Dataset]. https://www.workwithdata.com/datasets/companies?col=ceo%2Cceo_approval%2Cceo_gender%2Ccity%2Cemployees&f=1&fcol0=company&fop0=%3D&fval0=Amazon
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about companies. It has 19 rows and is filtered where the company is Amazon. It features 5 columns: employees, CEO, CEO gender, and CEO approval.

  4. m

    Amazon.com Inc - Total-Cash-From-Operating-Activities

    • macro-rankings.com
    csv, excel
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Amazon.com Inc - Total-Cash-From-Operating-Activities [Dataset]. https://www.macro-rankings.com/Markets/Stocks?Entity=AMZN.US&Item=Total-Cash-From-Operating-Activities
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Mar 18, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    united states
    Description

    Total-Cash-From-Operating-Activities Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.

  5. g

    Webpage capture on the news article of Amazon workers are rising up around...

    • gimi9.com
    Updated Mar 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Webpage capture on the news article of Amazon workers are rising up around the world to say: enough | gimi9.com [Dataset]. https://gimi9.com/dataset/mekong_649f22c56c6202c630cf37e32a4ce3b94fba2647
    Explore at:
    Dataset updated
    Mar 23, 2025
    Description

    This webpage capture is the reference for Labor incidents dataset. It contains news articles from international newspapers.

  6. m

    Amazon.com Inc - Operating-Expenses

    • macro-rankings.com
    csv, excel
    Updated Jul 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Amazon.com Inc - Operating-Expenses [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/income-statement/operating-expenses
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Jul 24, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    united states
    Description

    Operating-Expenses Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.

  7. s

    Sentiment popularity - Amazon Mechanical Turk dataset

    • eprints.soton.ac.uk
    Updated Nov 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VENANZI, MATTEO; Teacy, William; Rogers, Alexander; Jennings, Nicholas (2023). Sentiment popularity - Amazon Mechanical Turk dataset [Dataset]. http://doi.org/10.5258/SOTON/376544
    Explore at:
    Dataset updated
    Nov 4, 2023
    Dataset provided by
    University of Southampton
    Authors
    VENANZI, MATTEO; Teacy, William; Rogers, Alexander; Jennings, Nicholas
    Description

    Dataset re-collected from an original dataset collected by Pang, B., and Lee, L. 2004. "A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts". In Proceedings of the 42nd annual meeting on Association for Computational Linguistics. The dataset presents a binary classification problem, with workers asked to select either positive (1) or negative (0) for a 500 sentences extracted from movie reviews, with gold labels assigned by the website. It contains 10,000 sentiment judgements collected from 143 using the Amazon Mechanical Turk platform. Each row is in the format WorkerID, TaskID, Worker label, Gold label, time spent on the judgement by the worker

  8. d

    More than 1,070,574 Verified Contacts of companies that use Amazon AWS

    • datarade.ai
    Updated Aug 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DataCaptive (2021). More than 1,070,574 Verified Contacts of companies that use Amazon AWS [Dataset]. https://datarade.ai/data-providers/datacaptive/data-products/more-than-1-070-574-verified-contacts-of-companies-that-use-a-datacaptive
    Explore at:
    .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Aug 20, 2021
    Dataset authored and provided by
    DataCaptive
    Area covered
    Rwanda, Kyrgyzstan, Niger, Tunisia, Singapore, Virgin Islands (British), British Indian Ocean Territory, Saint Helena, Iceland, Tonga
    Description

    Amazon AWS - Cloud Platforms & Services

    Companies using Amazon AWS

    We have data on 1,070,574 companies that use Amazon AWS. The companies using Amazon AWS are most often found in United States and in the Computer Software industry. Amazon AWS is most often used by companies with 10-50 employees and 1M-10M dollars in revenue. Our data for Amazon AWS usage goes back as far as 2 years and 1 months.

    What is Amazon AWS?

    Amazon Web Services (AWS) is a collection of remote computing services, also called web services that make up a cloud computing platform offered by Amazon.com.

    Top Industries that use Amazon AWS

    Looking at Amazon AWS customers by industry, we find that Computer Software (6%) is the largest segment.

    Distribution of companies using Amazon AWS by Industry

     Computer software - 67, 537 companies  Hospitals & Healthcare - 54, 293 companies  Retail - 39, 543 companies  Information Technology and Services - 35, 382 companies  Real Estate - 31, 676 companies  Restaurants - 30, 302 companies  Construction - 29, 207 companies  Automotive - 28, 469 companies  Financial Services - 23, 680 companies  Education Management - 21, 548 companies

    Top Countries that use Amazon AWS

    49% of Amazon AWS customers are in United States and 7% are in United Kingdom.

    Distribution of companies using Amazon AWS by country

     United Sates – 616 2275 companies  United Kingdom – 68 219 companies  Australia – 44 601 companies  Canada – 42 770 companies  Germany – 31 541 companies  India – 30 949 companies  Netherlands – 19 543 companies  Brazil – 17 165 companies  Italy – 14 876 companies  Spain – 14 675 companies

    Contact Information of Fields Include:-

    • Company Name • Business contact number • Title
    • Name • Email Address • Country, State, City, Zip Code • Phone, Mobile and Fax • Website • Industry • SIC & NAICS Code • Employees Size
    • Revenue Size
    • And more…

    Why Buy AWS Users List from DataCaptive?

    • More than 1,070,574 companies
    • Responsive database • Customizable as per your requirements • Email and Tele-verified list • Team of 100+ market researchers • Authentic data sources

    What’s in for you?

    Over choosing us, here are a few advantages we authenticate-

    • Locate, target, and prospect leads from 170+ countries • Design and execute ABM and multi-channel campaigns • Seamless and smooth pre-and post-sale customer service • Connect with old leads and build a fruitful customer relationship • Analyze the market for product development and sales campaigns • Boost sales and ROI with increased customer acquisition and retention

    Our security compliance

    We use of globally recognized data laws like –

    GDPR, CCPA, ACMA, EDPS, CAN-SPAM and ANTI CAN-SPAM to ensure the privacy and security of our database. We engage certified auditors to validate our security and privacy by providing us with certificates to represent our security compliance.

    Our USPs- what makes us your ideal choice?

    At DataCaptive™, we strive consistently to improve our services and cater to the needs of businesses around the world while keeping up with industry trends.

    • Elaborate data mining from credible sources • 7-tier verification, including manual quality check • Strict adherence to global and local data policies • Guaranteed 95% accuracy or cash-back • Free sample database available on request

    Guaranteed benefits of our Amazon AWS users email database!

    85% email deliverability and 95% accuracy on other data fields

    We understand the importance of data accuracy and employ every avenue to keep our database fresh and updated. We execute a multi-step QC process backed by our Patented AI and Machine learning tools to prevent anomalies in consistency and data precision. This cycle repeats every 45 days. Although maintaining 100% accuracy is quite impractical, since data such as email, physical addresses, and phone numbers are subjected to change, we guarantee 85% email deliverability and 95% accuracy on other data points.

    100% replacement in case of hard bounces

    Every data point is meticulously verified and then re-verified to ensure you get the best. Data Accuracy is paramount in successfully penetrating a new market or working within a familiar one. We are committed to precision. However, in an unlikely event where hard bounces or inaccuracies exceed the guaranteed percentage, we offer replacement with immediate effect. If need be, we even offer credits and/or refunds for inaccurate contacts.

    Other promised benefits

    • Contacts are for the perpetual usage • The database comprises consent-based opt-in contacts only • The list is free of duplicate contacts and generic emails • Round-the-clock customer service assistance • 360-degree database solutions

  9. e

    The average laboratory samples a population of 7,300 Amazon Mechanical Turk...

    • b2find.eudat.eu
    Updated Apr 8, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2015). The average laboratory samples a population of 7,300 Amazon Mechanical Turk workers 2012-2017 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/76cb46bd-03f6-56b0-9302-f4a3e57462a5
    Explore at:
    Dataset updated
    Apr 8, 2015
    Description

    Using capture-recapture analysis we estimate the effective size of the active Amazon Mechanical Turk (MTurk) population that a typical laboratory can access to be about 7,300 workers. We also estimate that the time taken for half of the workers to leave the MTurk pool and be replaced is about 7 months. Each laboratory has its own population pool which overlaps, often extensively, with the hundreds of other laboratories using MTurk. Our estimate is based on a sample of 114,460 completed sessions from 33,408 unique participants and 689 sessions across seven laboratories in the US, Europe, and Australia from January 2012 to March 2015.This network project brings together economists, psychologists, computer and complexity scientists from three leading centres for behavioural social science at Nottingham, Warwick and UEA. This group will lead a research programme with two broad objectives: to develop and test cross-disciplinary models of human behaviour and behaviour change; to draw out their implications for the formulation and evaluation of public policy. Foundational research will focus on three inter-related themes: understanding individual behaviour and behaviour change; understanding social and interactive behaviour; rethinking the foundations of policy analysis. The project will explore implications of the basic science for policy via a series of applied projects connecting naturally with the three themes. These will include: the determinants of consumer credit behaviour; the formation of social values; strategies for evaluation of policies affecting health and safety. The research will integrate theoretical perspectives from multiple disciplines and utilise a wide range of complementary methodologies including: theoretical modeling of individuals, groups and complex systems; conceptual analysis; lab and field experiments; analysis of large data sets. The Network will promote high quality cross-disciplinary research and serve as a policy forum for understanding behaviour and behaviour change. Experimental data. We used an open-population capture-recapture analysis (Cormack, 1989), which allows for MTurk workers to enter and leave the population. As we found moderate turnover rates, these open-population models are more appropriate than the closed-population models (Otis, Burnham, White, & Anderson, 1978).

  10. Z

    Data from: SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Mar 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tsiakoulis, Pirros (2025). SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7119399
    Explore at:
    Dataset updated
    Mar 7, 2025
    Dataset provided by
    Ellinas, Nikolaos
    Maniati, Georgia
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    Sung, June Sig
    Vioni, Alexandra
    Klapsas, Konstantinos
    Nikitaras, Karolos
    Jho, Gunu
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This is the public release of the Samsung Open Mean Opinion Scores (SOMOS) dataset for the evaluation of neural text-to-speech (TTS) synthesis, which consists of audio files generated with a public domain voice from trained TTS models based on bibliography, and numbers assigned to each audio as quality (naturalness) evaluations by several crowdsourced listeners.DescriptionThe SOMOS dataset contains 20,000 synthetic utterances (wavs), 100 natural utterances and 374,955 naturalness evaluations (human-assigned scores in the range 1-5). The synthetic utterances are single-speaker, generated by training several Tacotron-like acoustic models and an LPCNet vocoder on the LJ Speech voice public dataset. 2,000 text sentences were synthesized, selected from Blizzard Challenge texts of years 2007-2016, the LJ Speech corpus as well as Wikipedia and general domain data from the Internet.Naturalness evaluations were collected via crowdsourcing a listening test on Amazon Mechanical Turk in the US, GB and CA locales. The records of listening test participants (workers) are fully anonymized. Statistics on the reliability of the scores assigned by the workers are also included, generated through processing the scores and validation controls per submission page.

    To listen to audio samples of the dataset, please see our Github page.

    The dataset release comes with a carefully designed train-validation-test split (70%-15%-15%) with unseen systems, listeners and texts, which can be used for experimentation on MOS prediction.

    This version also contains the necessary resources to obtain the transcripts corresponding to all dataset audios.

    Terms of use

    The dataset may be used for research purposes only, for non-commercial purposes only, and may be distributed with the same terms.

    Every time you produce research that has used this dataset, please cite the dataset appropriately.

    Cite as:

    @inproceedings{maniati22_interspeech, author={Georgia Maniati and Alexandra Vioni and Nikolaos Ellinas and Karolos Nikitaras and Konstantinos Klapsas and June Sig Sung and Gunu Jho and Aimilios Chalamandaris and Pirros Tsiakoulis}, title={{SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis}}, year=2022, booktitle={Proc. Interspeech 2022}, pages={2388--2392}, doi={10.21437/Interspeech.2022-10922} }

    References of resources & models used

    Voice & synthesized texts:K. Ito and L. Johnson, “The LJ Speech Dataset,” https://keithito.com/LJ-Speech-Dataset/, 2017.

    Vocoder:J.-M. Valin and J. Skoglund, “LPCNet: Improving neural speech synthesis through linear prediction,” in Proc. ICASSP, 2019.R. Vipperla, S. Park, K. Choo, S. Ishtiaq, K. Min, S. Bhattacharya, A. Mehrotra, A. G. C. P. Ramos, and N. D. Lane, “Bunched lpcnet: Vocoder for low-cost neural text-to-speech systems,” in Proc. Interspeech, 2020.

    Acoustic models:N. Ellinas, G. Vamvoukakis, K. Markopoulos, A. Chalamandaris, G. Maniati, P. Kakoulidis, S. Raptis, J. S. Sung, H. Park, and P. Tsiakoulis, “High quality streaming speech synthesis with low, sentence-length-independent latency,” in Proc. Interspeech, 2020.Y. Wang, R. Skerry-Ryan, D. Stanton, Y. Wu, R. J. Weiss, N. Jaitly, Z. Yang, Y. Xiao, Z. Chen, S. Bengio et al., “Tacotron: Towards End-to-End Speech Synthesis,” in Proc. Interspeech, 2017.J. Shen, R. Pang, R. J. Weiss, M. Schuster, N. Jaitly, Z. Yang, Z. Chen, Y. Zhang, Y. Wang, R. Skerrv-Ryan et al., “Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions,” in Proc. ICASSP, 2018.J. Shen, Y. Jia, M. Chrzanowski, Y. Zhang, I. Elias, H. Zen, and Y. Wu, “Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling,” arXiv preprint arXiv:2010.04301, 2020.M. Honnibal and M. Johnson, “An Improved Non-monotonic Transition System for Dependency Parsing,” in Proc. EMNLP, 2015.M. Dominguez, P. L. Rohrer, and J. Soler-Company, “PyToBI: A Toolkit for ToBI Labeling Under Python,” in Proc. Interspeech, 2019.Y. Zou, S. Liu, X. Yin, H. Lin, C. Wang, H. Zhang, and Z. Ma, “Fine-grained prosody modeling in neural speech synthesis using ToBI representation,” in Proc. Interspeech, 2021.K. Klapsas, N. Ellinas, J. S. Sung, H. Park, and S. Raptis, “WordLevel Style Control for Expressive, Non-attentive Speech Synthesis,” in Proc. SPECOM, 2021.T. Raitio, R. Rasipuram, and D. Castellani, “Controllable neural text-to-speech synthesis using intuitive prosodic features,” in Proc. Interspeech, 2020.

    Synthesized texts from the Blizzard Challenges 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2016:M. Fraser and S. King, "The Blizzard Challenge 2007," in Proc. SSW6, 2007.V. Karaiskos, S. King, R. A. Clark, and C. Mayo, "The Blizzard Challenge 2008," in Proc. Blizzard Challenge Workshop, 2008.A. W. Black, S. King, and K. Tokuda, "The Blizzard Challenge 2009," in Proc. Blizzard Challenge, 2009.S. King and V. Karaiskos, "The Blizzard Challenge 2010," 2010.S. King and V. Karaiskos, "The Blizzard Challenge 2011," 2011.S. King and V. Karaiskos, "The Blizzard Challenge 2012," 2012.S. King and V. Karaiskos, "The Blizzard Challenge 2013," 2013.S. King and V. Karaiskos, "The Blizzard Challenge 2016," 2016.

    Contact

    Alexandra Vioni - a.vioni@samsung.com

    If you have any questions or comments about the dataset, please feel free to write to us.

    We are interested in knowing if you find our dataset useful! If you use our dataset, please email us and tell us about your research.

  11. i

    Human Annotated GAN Images

    • ieee-dataport.org
    Updated Apr 25, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juanyong Duan (2019). Human Annotated GAN Images [Dataset]. https://ieee-dataport.org/documents/human-annotated-gan-images
    Explore at:
    Dataset updated
    Apr 25, 2019
    Authors
    Juanyong Duan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    we have collected an annotated dataset that contains 600 fake images and 400 real images. All the fake images are generated by a generative adversarial net and all the real images are downsampled images from the ImageNet dataset. These images are evaluated by 10 workers from the Amazon Mechanical Turk (AMT) based on eight carefully defined attributes.

  12. f

    Crowdsourced datasets to study the generation and impact of text...

    • figshare.com
    bin
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jorge Ramírez; Marcos Baez; Fabio Casati; Boualem Benatallah (2023). Crowdsourced datasets to study the generation and impact of text highlighting in classification tasks [Dataset]. http://doi.org/10.6084/m9.figshare.9917162.v4
    Explore at:
    binAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Jorge Ramírez; Marcos Baez; Fabio Casati; Boualem Benatallah
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Here we present the datasets derived from our experiments on using crowdsourcing for document classification tasks. These experiments resemble a two-step process that first highlights excerpts from the text and then leverage these to workers for classification. Thus our experiments groups into highlighting generation and classification. For generating highlights, we leverage crowdsourcing and automatic approaches such us extractive summarization and question answering models. For our classification experiments, we consider documents from two different domains: systematic literature reviews and amazon product reviews. Specifically, we study how highlighting text passages could aid workers in judging the relevance of a document given an input question. We spec these datasets to benefit not only to study these particular problem domains but a broader set of classification problems where individual judgments from workers are scarce.In a nutshell, the datasets represent two kinds of tasks:- classification tasks with highlighting support.- highlighting tasks, where the workers highlight evidence.Classification tasksIn this task, workers classified documents based on a given predicate. classification tasks using crowdsourced highlightsFiles:- classification_amazon-crowd-highlights.csv- classification_oa-crowd-highlights.csv- classification_tech-crowd-highlights.csv- classification_tech-3x12-crowd-highlights.csv- classification_tech-6x6-crowd-highlights.csvclassification tasks using ML-generated highlightsFiles:- classification_amazon-ML-highlights.csv- classification_oa-ML-highlights.csv- classification_tech-ML-highlights.csvHighlighting taskscrowdsourced highlightsIn this task, workers highlighted excerpts from documents that are relevant to a given predicate, to support future classification tasks.File: crowdsourced_highlights.csv.The file contains one line per highlight (generated by one worker); the column that holds the highlighted fragment(s) is highlighted_text. The highlighted_text is a "list of lists" (Python syntax), so iterating over this list will give you the text fragment generated by one worker. Also, the experiment column indicates domain + task design. So, to get the highlights used in the classification experiments, use the rows that end with "-highlight".ML-generated highlightsWe also consider automatic approaches to generate text highlights — specifically, extractive summarization and question-answering models.File: ml_highlights.csv.

  13. Operator Financial Figures - Revenue, Profit/Loss, Employees for commercial...

    • datarade.ai
    .csv
    Updated Dec 31, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ch-aviation (2011). Operator Financial Figures - Revenue, Profit/Loss, Employees for commercial airlines worldwide [Dataset]. https://datarade.ai/data-providers/ch-aviation/data-products/operator-financial-figures-revenue-profit-loss-employees-ch-aviation
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Dec 31, 2011
    Dataset provided by
    ch-aviation GmbHhttp://www.ch-aviation.com/
    Authors
    ch-aviation
    Area covered
    Haiti, Holy See, Estonia, Congo (Democratic Republic of the), Tonga, Namibia, Benin, Finland, Afghanistan, Iceland
    Description

    Operator Financial Figures provided by ch-aviation include annual revenue figures (broken down to passenger, cargo, other, and total revenue), operating and net profit as well as employee numbers for each financial year including the financial year end date.

    The data set is updated monthly.

    The sample dataset shows financial figures for Swiss, Alaska Airlines, and Horizon Air from 2011.

    Contact us to get access to ch-aviation's AWS S3 sample data bucket as well allowing you to build proof of concepts with all of our sample data.

    The direct bucket URL for this data set is: https://eu-central-1.console.aws.amazon.com/s3/buckets/dataservices-standardised-samples?region=eu-central-1&bucketType=general&prefix=operator_financial_figures/&showversions=false

    Full Technical Data Dictionary: https://about.ch-aviation.com/operator-financial-figures/

  14. Tech layoffs worldwide 2020-2024, by quarter

    • statista.com
    Updated Jul 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Tech layoffs worldwide 2020-2024, by quarter [Dataset]. https://www.statista.com/statistics/199999/worldwide-tech-layoffs-covid-19/
    Explore at:
    Dataset updated
    Jul 1, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    The tech industry had a rough start to 2024. Technology companies worldwide saw a significant reduction in their workforce in the first quarter of 2024, with over ** thousand employees being laid off. By the second quarter, layoffs impacted more than ** thousand tech employees. In the final quarter of the year around ** thousand employees were laid off. Layoffs impacting all global tech giants Layoffs in the global market escalated dramatically in the first quarter of 2023, when the sector saw a staggering record high of ***** thousand employees losing their jobs. Major tech giants such as Google, Microsoft, Meta, and IBM all contributed to this figure during this quarter. Amazon, in particular, conducted the most rounds of layoffs with the highest number of employees laid off among global tech giants. Industries most affected include the consumer, hardware, food, and healthcare sectors. Notable companies that have laid off a significant number of staff include Flink, Booking.com, Uber, PayPal, LinkedIn, and Peloton, among others. Overhiring led the trend, but will AI keep it going? Layoffs in the technology sector started following an overhiring spree during the COVID-19 pandemic. Initially, companies expanded their workforce to meet increased demand for digital services during lockdowns. However, as lockdowns ended, economic uncertainties persisted and companies reevaluated their strategies, layoffs became inevitable, resulting in a record number of *** thousand laid off employees in the global tech sector by the end of 2022. Moreover, it is still unclear how advancements in artificial intelligence (AI) will impact layoff trends in the tech sector. AI-driven automation can replace manual tasks leading to workforce redundancies. Whether through chatbots handling customer inquiries or predictive algorithms optimizing supply chains, the pursuit of efficiency and cost savings may result in more tech industry layoffs in the future.

  15. m

    Amazon.com Inc - Cogs-Excluding-Depreciation-and-Amortization

    • macro-rankings.com
    csv, excel
    Updated Aug 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Amazon.com Inc - Cogs-Excluding-Depreciation-and-Amortization [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/income-statement/cogs-excluding-depreciation-and-amortization
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Aug 11, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    united states
    Description

    Cogs-Excluding-Depreciation-and-Amortization Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It also provides AgentCore services, such as AgentCore Runtime, AgentCore Memory, AgentCore Observability, AgentCore Identity, AgentCore Gateway, AgentCore Browser, and AgentCore Code Interpreter. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.

  16. H

    Large Dataset of Generalization Patterns in the Number Game

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Aug 10, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eric J. Bigelow; Steven T. Piantadosi (2018). Large Dataset of Generalization Patterns in the Number Game [Dataset]. http://doi.org/10.7910/DVN/A8ZWLF
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 10, 2018
    Dataset provided by
    Harvard Dataverse
    Authors
    Eric J. Bigelow; Steven T. Piantadosi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    272,700 two-alternative forced choice responses in a simple numerical task modeled after Tenenbaum (1999, 2000), collected from 606 Amazon Mechanical Turk workers. Subjects were shown sets of numbers length 1 to 4 from the range 1 to 100 (e.g. {12, 16}), and asked what other numbers were likely to belong to that set (e.g. 1, 5, 2, 98). Their generalization patterns reflect both rule-like (e.g. “even numbers,” “powers of two”) and distance-based (e.g. numbers near 50) generalization. This data set is available for further analysis of these simple and intuitive inferences, developing of hands-on modeling instruction, and attempts to understand how probability and rules interact in human cognition.

  17. d

    Data for Anxiety Undermined Job Satisfaction Among Essential Workers in the...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CHOI, HYERI (2023). Data for Anxiety Undermined Job Satisfaction Among Essential Workers in the U.S. during the COVID-19 Pandemic [Dataset]. http://doi.org/10.7910/DVN/RMUJNX
    Explore at:
    Dataset updated
    Nov 14, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    CHOI, HYERI
    Description

    This is a dataset collected by the author through Amazon Mechanical Turk for a manuscript - Anxiety Undermined Job Satisfaction Among Essential Workers in the U.S. during the COVID-19 Pandemic. Codebook or other information can be provided upon request.

  18. h

    OffensiveLang

    • huggingface.co
    Updated Jun 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amit Das (2024). OffensiveLang [Dataset]. https://huggingface.co/datasets/AmitDasRup123/OffensiveLang
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 23, 2024
    Authors
    Amit Das
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OffensiveLang

      Overview
    

    OffensiveLang is a community based implicit offensive language dataset generated by ChatGPT 3.5 containing data for 38 different target groups. It has been meticulously annotated by Amazon MTurk workers, ensuring high-quality labeling of hate speech. Additionally, a prompt-based zero-shot method was employed with ChatGPT and the detection results were compared between human annotation and ChatGPT annotation. This dataset is invaluable for… See the full description on the dataset page: https://huggingface.co/datasets/AmitDasRup123/OffensiveLang.

  19. Data from: DiPCo -- Dinner Party Corpus

    • zenodo.org
    application/gzip, pdf
    Updated Jul 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas; Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas (2024). DiPCo -- Dinner Party Corpus [Dataset]. http://doi.org/10.21437/interspeech.2020-2800
    Explore at:
    application/gzip, pdfAvailable download formats
    Dataset updated
    Jul 11, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas; Maarten Van Segbroeck; Ahmed Zaid; Ksenia Kutsenko; Cirenia Huerta; Tinh Nguyen; Xuewen Luo; Björn Hoffmeister; Jan Trmal; Maurizio Omologo; Roland Maas
    License

    https://cdla.io/permissive-1-0https://cdla.io/permissive-1-0

    Description

    We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English around a dining table. The participants were recorded by a single-channel close-talk microphone and by five far-field 7-microphone array devices positioned at different locations in the recording room. The dataset contains the audio recordings and human labeled transcripts of a total of 10 sessions with a duration between 15 and 45 minutes. The corpus was created to advance in the field of noise robust and distant speech processing and is intended to serve as a public research and benchmarking data set.

  20. h

    clcifar10

    • huggingface.co
    Updated Jul 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NTU Computational Learning Lab (2025). clcifar10 [Dataset]. https://huggingface.co/datasets/ntucllab/clcifar10
    Explore at:
    Dataset updated
    Jul 4, 2025
    Dataset authored and provided by
    NTU Computational Learning Lab
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for CLCIFAR10

    This Complementary labeled CIFAR10 dataset contains 3 human-annotated complementary labels for all 50000 images in the training split of CIFAR10. The workers are from Amazon Mechanical Turk. We randomly sampled 4 different labels for 3 different annotators, so each image would have 3 (probably repeated) complementary labels. For more details, please visit our github or paper.

      Dataset Structure
    
    
    
    
    
    
    
      Data Instances
    

    A sample from the… See the full description on the dataset page: https://huggingface.co/datasets/ntucllab/clcifar10.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2025). Amazon employees 2007-2024 [Dataset]. https://www.statista.com/statistics/234488/number-of-amazon-employees/
Organization logo

Amazon employees 2007-2024

Explore at:
43 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 25, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
United States, Worldwide
Description

The combined number of full- and part-time employees of Amazon.com has increased significantly since 2017. Amazon’s headcount peaked in 2021 when the American multinational e-commerce company employed ********* full- and part-time employees, not counting external contractors. However, in 2024, the number dropped to *********. E-commerce crunch The workforce reduction of Amazon follows the mass layoffs hitting the entire e-commerce sector. With the full reopening of physical stores after the COVID-19 pandemic, online shopping demand decreased, leading online retailers to restructure their businesses, including personnel costs. Diversifying business With online retail sales growing slower due to recession and inflation, Amazon can still leverage other profitable revenue segments — from media subscriptions to server hosting and cloud services. On top of that, in 2023 Amazon monitored small enterprises operating in different fields and strategically invested in them, as disclosed startup acquisitions indicate.

Search
Clear search
Close search
Google apps
Main menu