100+ datasets found
  1. Top Companies Dataset

    • kaggle.com
    Updated Jun 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    informrohit1 (2024). Top Companies Dataset [Dataset]. https://www.kaggle.com/datasets/informrohit1/top-companies-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 10, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    informrohit1
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This is Top 1000 companies dataset and it is web scraped form a website . This dataset includes 4 columns and 1000 unique rows . 4 columns includes Name of the company , Rating out of 5 , Reviews given by employees and Overall data. also Rows includes different 1000 Companies name.

    Dataset is good for beginner level Data analysis project and Company Recommendation projects etc.

  2. Top Global Companies Innovators & Giants 🌍🏢

    • kaggle.com
    Updated Jun 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sheikh Muhammad Abdullah (2024). Top Global Companies Innovators & Giants 🌍🏢 [Dataset]. https://www.kaggle.com/datasets/abdmental01/top-companies
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 7, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sheikh Muhammad Abdullah
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Data Description

    The dataset provided includes information about various companies, their stock symbols, financial metrics such as price-to-book ratio and share price, as well as details about their origin countries. Additionally, the dataset contains frequency distribution information for certain ranges of price-to-book ratios and share prices.

    About Data

    The dataset appears to be a compilation of financial data for different companies, likely for investment analysis or comparison purposes. It includes the following key components:

    • Rank: Rank of the company based on some criteria (not explicitly mentioned).
    • Company: Name of the company.
    • Stock Symbol: Symbol used to identify the company's stock in trading.
    • Price to Book Ratio: Financial metric indicating the relationship between a company's market value and its book value.
    • Share Price (USD): Price of a single share of the company's stock in US dollars.
    • Company Origin: Country where the company is based.
    • Label Count: Frequency distribution information for certain ranges of price-to-book ratios and share prices.

    This dataset can be utilized for various financial analyses such as company valuation, comparison of financial metrics across companies, and investment decision-making.

  3. h

    twt-kaggle-data

    • huggingface.co
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    megha manoj (2023). twt-kaggle-data [Dataset]. https://huggingface.co/datasets/mochi-skz/twt-kaggle-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 8, 2023
    Authors
    megha manoj
    Description

    mochi-skz/twt-kaggle-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. Company Purchasing Dataset

    • kaggle.com
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shahriar Kabir (2025). Company Purchasing Dataset [Dataset]. https://www.kaggle.com/datasets/shahriarkabir/company-purchasing-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 16, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Shahriar Kabir
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description
    • Dataset Title: Company Purchasing Dataset
    • Rows: 500 | Columns: 9
    • Domain: Supply Chain, Procurement, Finance
    • Use Case: Spend Analytics, Supplier Performance, Cost Optimization

    This synthetic dataset simulates procurement transactions for a mid-sized organization over 2024. It includes purchases across multiple categories (electronics, furniture, stationery, etc.) from various suppliers and buyers. Ideal for practicing descriptive analytics, spend analysis, and supplier performance evaluation.

  5. h

    test-dataset-kaggle

    • huggingface.co
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gholamreza Dar (2024). test-dataset-kaggle [Dataset]. https://huggingface.co/datasets/Gholamreza/test-dataset-kaggle
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 15, 2024
    Authors
    Gholamreza Dar
    Description

    Gholamreza/test-dataset-kaggle dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    kaggle-entity-annotated-corpus-ner-dataset

    • huggingface.co
    Updated Jul 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafael Arias Calles (2022). kaggle-entity-annotated-corpus-ner-dataset [Dataset]. https://huggingface.co/datasets/rjac/kaggle-entity-annotated-corpus-ner-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 10, 2022
    Authors
    Rafael Arias Calles
    License

    https://choosealicense.com/licenses/odbl/https://choosealicense.com/licenses/odbl/

    Description

    Date: 2022-07-10 Files: ner_dataset.csv Source: Kaggle entity annotated corpus notes: The dataset only contains the tokens and ner tag labels. Labels are uppercase.

      About Dataset
    

    from Kaggle Datasets

      Context
    

    Annotated Corpus for Named Entity Recognition using GMB(Groningen Meaning Bank) corpus for entity classification with enhanced and popular features by Natural Language Processing applied to the data set. Tip: Use Pandas Dataframe to load dataset if using Python for… See the full description on the dataset page: https://huggingface.co/datasets/rjac/kaggle-entity-annotated-corpus-ner-dataset.

  7. A

    ‘Indian Company Dataset’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Dec 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Indian Company Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-indian-company-dataset-67c3/5da84e4e/?iid=003-487&v=presentation
    Explore at:
    Dataset updated
    Dec 24, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    India
    Description

    Analysis of ‘Indian Company Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yogeshrampariya/indian-company-dataset on 28 January 2022.

    --- Dataset description provided by original source is as follows ---

    This dataset is about Indian recruitment company, who hire best of brains in India in software, banking, manufacturing, analysis and Education background

    Columns:

    Name: Name of Company Rating : Overall Rating given by Rating agency Reviews: Overall Google Reviews Company: Company type like Public, Private Head_Quarters : Location where headquarter of company is Company_age : How old the company is No_of_Employee : Number of employee company has

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

    This dataset will test your coding skills where you will be handling categorical variables, its a classification problem

    --- Original source retains full ownership of the source dataset ---

  8. 50 US Startups: Financial and Geographical Insight

    • kaggle.com
    Updated Dec 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Irfan Ahmad (2023). 50 US Startups: Financial and Geographical Insight [Dataset]. https://www.kaggle.com/datasets/irfanahmad1/50-us-startups-financial-and-geographical-insight
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 20, 2023
    Dataset provided by
    Kaggle
    Authors
    Irfan Ahmad
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Overview:

    This dataset provides a comprehensive look into the financial expenditures and profits of 50 startups based in the United States. It is an invaluable resource for analysts, economists, and business strategists seeking to understand the correlation between different types of spending and profitability in startup ventures.

    Attributes: 1. R&D Spend: - Description: The amount of money each company has invested in Research and Development activities. - Data Type: Numeric (US dollars) - Importance: Indicates the company's commitment to innovation and technological advancement. 2. Administration: - Description: Expenditure on administrative functions and operations. - Data Type: Numeric (US dollars) - Relevance: Reflects the overhead costs associated with managing the company. 3. Marketing Spend: - Description: Investment in marketing and promotional activities. - Data Type: Numeric (US dollars) - Significance: A key factor in revenue generation and market penetration. 4. State: - Description: The U.S. state where the company is operating. - Data Type: Categorical (California, New York, or Florida) - Purpose: Provides geographical context and allows for regional analysis. 5. Profit: - Description: The net profit earned by the company. - Data Type: Numeric (US dollars) - Utility: A direct measure of the company’s financial success.

    Potential Uses: - Business Analysis: Understanding how different types of spending (R&D, administration, marketing) affect profitability. - Regional Studies: Examining the impact of geographical location on business success. - Startup Growth: Insights into the financial practices of successful startups. - Economic Research: Data-driven study of the startup ecosystem in the U.S.

    Target Audience: - Business Analysts and Economists - Marketing Strategists - Startup Consultants - Data Science Enthusiasts - Academic Researchers

    Conclusion: This dataset is a rich resource for anyone looking to delve into the financial dynamics of startups in the U.S. It offers a unique perspective on how different types of investments correlate with company success across various states.

    Please note that the data is anonymized and does not include any confidential information about the companies listed. The dataset is intended for educational and research purposes.

  9. Sales Dataset of USA [Updated]

    • kaggle.com
    Updated Jun 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sulaiman ahmed (2023). Sales Dataset of USA [Updated] [Dataset]. https://www.kaggle.com/datasets/sulaimanahmed/sales-dataset-of-usa-updated
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 20, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sulaiman ahmed
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Area covered
    United States
    Description

    The given dataset appears to be a sales dataset containing information about different orders. Here is a description of the data:

    1. Row ID: An identifier for each row in the dataset.
    2. Order ID: Unique identifier for each order.
    3. Order Date: The date when the order was placed.
    4. Ship Date: The date when the order was shipped.
    5. Ship Mode: The mode of shipping chosen for the order.
    6. Customer ID: Unique identifier for each customer.
    7. Customer Name: Name of the customer who placed the order.
    8. Segment: The segment to which the customer belongs (e.g., consumer, corporate).
    9. Country: The country where the order was placed (in this case, United States).
    10. City: The city where the order was placed.
    11. State: The state where the order was placed.
    12. Postal Code: The postal code associated with the order's location.
    13. Region: The region of the country where the order was placed.
    14. Product ID: Unique identifier for each product.
    15. Category: The category to which the product belongs (e.g., furniture, office supplies).
    16. Sub-Category: The sub-category to which the product belongs (e.g., bookcases, chairs).
    17. Product Name: The name of the product.
    18. Cost: The cost of the product.
    19. Price: The price at which the product was sold.
    20. Profit: The profit made from the sale of the product.
    21. Quantity: The quantity of the product ordered.
    22. Sales: The total sales generated from the order (quantity multiplied by price).

    The dataset provides detailed information about each order, including customer details, product details, sales information, and shipping information. It can be used to analyze various aspects of the sales data, such as profitability, customer segments, product categories, and regional sales performance.

  10. h

    kaggle

    • huggingface.co
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmad Khan (2024). kaggle [Dataset]. https://huggingface.co/datasets/ahmadkhan1022/kaggle
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2024
    Dataset authored and provided by
    Ahmad Khan
    License

    https://choosealicense.com/licenses/pddl/https://choosealicense.com/licenses/pddl/

    Description

    Dataset Card for MergedDataset

      Dataset Summary
    
    
    
    
    
      Supported Tasks and Leaderboards
    

    [More Information Needed]

      Languages
    

    [More Information Needed]

      Dataset Structure
    
    
    
    
    
      Data Instances
    

    [More Information Needed]

      Data Fields
    

    [More Information Needed]

      Data Splits
    

    [More Information Needed]

      Dataset Creation
    
    
    
    
    
      Curation Rationale
    

    [More Information Needed]

      Source Data
    
    
    
    
    
      Initial Data… See the full description on the dataset page: https://huggingface.co/datasets/ahmadkhan1022/kaggle.
    
  11. h

    AppleStockData2025

    • huggingface.co
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TableGPT (2025). AppleStockData2025 [Dataset]. https://huggingface.co/datasets/tablegpt/AppleStockData2025
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    TableGPT
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Apple Stock Data 2025

    This is a dataset copied from Kaggle. You can see the original dataset here: https://www.kaggle.com/datasets/umerhaddii/apple-stock-data-2025

    The following is the original readme of this dataset:

      About Dataset
    
    
    
    
    
      Context
    

    Apple Inc. is an American hardware and software developer and technology company that develops and sells computers, smartphones and consumer electronics as well as operating systems and application software. Apple also… See the full description on the dataset page: https://huggingface.co/datasets/tablegpt/AppleStockData2025.

  12. h

    kaggle-recipe-categorized-chunk-8

    • huggingface.co
    Updated Sep 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeff Schmitz (2024). kaggle-recipe-categorized-chunk-8 [Dataset]. https://huggingface.co/datasets/Schmitz005/kaggle-recipe-categorized-chunk-8
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 11, 2024
    Authors
    Jeff Schmitz
    Description

    Schmitz005/kaggle-recipe-categorized-chunk-8 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. A

    ‘Top 100 Biggest Restaurant Chains 2021’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Jan 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Top 100 Biggest Restaurant Chains 2021’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-top-100-biggest-restaurant-chains-2021-94e5/52e35c93/?iid=003-302&v=presentation
    Explore at:
    Dataset updated
    Jan 28, 2022
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Top 100 Biggest Restaurant Chains 2021’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/johnharshith/top-100-biggest-restaurant-chains-2021 on 28 January 2022.

    --- Dataset description provided by original source is as follows ---

    https://i.insider.com/5db704d7045a311ad239369b?width=1300&format=jpeg&auto=webp" alt="Popular Restaurant Chains">

    Context

    This Dataset contains the data compiled by Technomic and reported by Restaurant Business magazine, the top 100 most popular restaurant chains in the United States in terms of the latest 2020 sales which were responsible for three-fourths of the total industry sales growth last year.

    Content

    The data was obtained from the Restaurant Business magazine website. The columns contain stats such as position of restaurant chains, 2020 U.S. sales, YOY sales change, 2020 U.S. units, YOY unit change, segment and menu types. This data can be found from the website https://www.restaurantbusinessonline.com/top-500-chains with detailed analysis.

    Inspiration

    While 2016 was a rough year for chain restaurants, more than half of the industry wealth of $521.9 billion still comes from the Top 500 chains and nearly 94% of those dollars and 93% of those units are represented in the Top 250. These stats have made me curious to find out interesting profit patterns from this dataset.

    Dataset Usage

    This Dataset can be used to study interesting patterns using various classification techniques and arrive at some exciting conclusions. One can create amazing visualisations using the different columns of the dataset. We can also find out and design an effective business model from the given dataset and take one step closer to your most successful restaurant chain startup ever!

    --- Original source retains full ownership of the source dataset ---

  14. E-commerce Business Transaction

    • kaggle.com
    Updated May 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Ramos (2022). E-commerce Business Transaction [Dataset]. https://www.kaggle.com/datasets/gabrielramos87/an-online-shop-business
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 14, 2022
    Dataset provided by
    Kaggle
    Authors
    Gabriel Ramos
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    E-commerce has become a new channel to support businesses development. Through e-commerce, businesses can get access and establish a wider market presence by providing cheaper and more efficient distribution channels for their products or services. E-commerce has also changed the way people shop and consume products and services. Many people are turning to their computers or smart devices to order goods, which can easily be delivered to their homes.

    Content

    This is a sales transaction data set of UK-based e-commerce (online retail) for one year. This London-based shop has been selling gifts and homewares for adults and children through the website since 2007. Their customers come from all over the world and usually make direct purchases for themselves. There are also small businesses that buy in bulk and sell to other customers through retail outlet channels.

    The data set contains 500K rows and 8 columns. The following is the description of each column. 1. TransactionNo (categorical): a six-digit unique number that defines each transaction. The letter “C” in the code indicates a cancellation. 2. Date (numeric): the date when each transaction was generated. 3. ProductNo (categorical): a five or six-digit unique character used to identify a specific product. 4. Product (categorical): product/item name. 5. Price (numeric): the price of each product per unit in pound sterling (£). 6. Quantity (numeric): the quantity of each product per transaction. Negative values related to cancelled transactions. 7. CustomerNo (categorical): a five-digit unique number that defines each customer. 8. Country (categorical): name of the country where the customer resides.

    There is a small percentage of order cancellation in the data set. Most of these cancellations were due to out-of-stock conditions on some products. Under this situation, customers tend to cancel an order as they want all products delivered all at once.

    Inspiration

    Information is a main asset of businesses nowadays. The success of a business in a competitive environment depends on its ability to acquire, store, and utilize information. Data is one of the main sources of information. Therefore, data analysis is an important activity for acquiring new and useful information. Analyze this dataset and try to answer the following questions. 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

    Photo by CardMapr on Unsplash

  15. Dataset : Business Intelligence Research Trends

    • kaggle.com
    Updated Nov 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MUHAMMAD AKMAL HAKIM (2024). Dataset : Business Intelligence Research Trends [Dataset]. http://doi.org/10.34740/kaggle/ds/5643554
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 14, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    MUHAMMAD AKMAL HAKIM
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Overview

    This dataset presents a meticulously compiled collection of 387 academic publications that explore various aspects of social media and business intelligence. The dataset includes detailed metadata about each publication, such as titles, authorship, abstracts, publication years, article types, and the journals or conferences where they were published. Citations and research areas are also included, making this dataset a valuable resource for bibliometric analysis, trend detection, and literature reviews in the fields of social media analytics, sentiment analysis, business intelligence, and related disciplines.

    Content

    The dataset comprises 15 columns, each capturing specific attributes of the research papers. Below is a description of each column:

    • ID: A unique identifier for each record in the dataset.
    • Title: The title of the academic paper.
    • DOI: The Digital Object Identifier, which provides a permanent link to the publication.
    • Author: List of authors who contributed to the paper.
    • Abstract: A summary of the research paper, providing insights into the study's objectives, methods, and findings.
    • Year: The year the paper was published.
    • Article Type: Indicates the type of publication (e.g., Proceedings Paper, Article, Book Chapter).
    • Publication Name: The name of the journal or conference where the paper was published.
    • Number-of-Cited-References: The number of references cited in the paper.
    • Times Cited: The number of times the paper has been cited by other works.
    • Research Areas: The general research area(s) the paper pertains to (e.g., Computer Science, Engineering).
    • WOS Category: Specific categories or subfields relevant to Web of Science classification.
    • WOS Index: The index within Web of Science where the paper is listed.
    • Keywords: Keywords provided by the authors to describe the main topics of the paper.
    • Keyword Plus: Additional keywords derived from the titles of the paper’s cited references.

    Applications

    This dataset can be utilized for a variety of purposes, including but not limited to:

    • Trend Analysis: Identify emerging trends and popular topics in social media and business intelligence research.
    • Citation Analysis: Analyze citation patterns to determine the impact and relevance of specific publications.
    • Collaborative Networks: Map out authorship and institutional collaboration trends.
    • Text Mining: Perform text mining on abstracts and keywords to uncover latent themes and topics.
    • Research Evaluation: Conduct bibliometric evaluations to assess the productivity and impact of researchers and institutions in the field.

    Data Collection and Preprocessing

    The dataset was curated by extracting bibliometric data from Web of Science (WOS), ensuring the inclusion of comprehensive and high-quality metadata. All records have been standardized for consistency and completeness to facilitate easier analysis.

  16. Complete Company DataSet for EDA,data cleaning

    • kaggle.com
    Updated Jul 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shital Gaikwad (2021). Complete Company DataSet for EDA,data cleaning [Dataset]. https://www.kaggle.com/datasets/shitalgaikwad123/complete-company-dataset-for-edadata-cleaning/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 22, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Shital Gaikwad
    Description

    Dataset

    This dataset was created by Shital Gaikwad

    Contents

  17. Machine Learning Job Postings in the US

    • kaggle.com
    • opendatabay.com
    Updated Apr 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ivan Kumeyko (2025). Machine Learning Job Postings in the US [Dataset]. https://www.kaggle.com/datasets/ivankmk/thousand-ml-jobs-in-usa
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 20, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ivan Kumeyko
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    This dataset contains 1,000 job postings for Machine Learning-related roles across the United States, scraped between late 2024 and early 2025. The data was collected directly from company career pages and job boards, focusing on full job descriptions and associated company information.

    Column Descriptions

    ColumnDescription
    job_posted_dateThe date the job was posted (format: YYYY-MM-DD).
    company_address_localityThe city or locality of the job or company.
    company_address_regionThe U.S. state or region where the job is located.
    company_nameThe name of the company posting the job.
    company_websiteThe official website of the company.
    company_descriptionA short description or mission statement of the company.
    job_description_textThe full job description text as listed in the original posting.
    seniority_levelThe required seniority level (e.g., Internship, Entry level, Mid-Senior).
    job_titleThe full job title listed in the posting.
  18. T-Mart

    • kaggle.com
    Updated Aug 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gowtham G (2023). T-Mart [Dataset]. https://www.kaggle.com/datasets/imgowthamg/t-mart
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 10, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gowtham G
    Description

    The provided dataset appears to be a sales dataset from a company called "**T-Mart.**" The dataset contains various columns with information about the sales transactions, including the date of the transaction, product details, quantity, sales type, location, payment mode, product category, unit of measurement (UOM), purchase price, and some additional labels and counts.

    Based on the given information, here's a brief description of the dataset:

    The "T-Mart" sales dataset captures sales transactions with details such as the transaction date, unique product identifier (PRODUCT ID), quantity sold, sales type (Direct Sales, Online, etc.), sales location (e.g., California, Alabama), payment mode (Cash, Online), product details (PRODUCT, CATEGORY, UOM), purchase price, and some additional label-based information.

    This dataset provides insights into various aspects of the company's sales operations, including the distribution of sales across different categories, products, and locations, as well as information about the payment modes used for transactions.

    Analyzing this dataset can help identify trends, popular products, sales performance by location, and preferred payment methods. It's essential for understanding the company's sales dynamics and making informed business decisions.

    This dataset appears to be rich in information, and with the right data visualization techniques, we can uncover valuable insights that can be used for strategic planning and optimizing sales strategies.

  19. Fundamental stock data

    • kaggle.com
    Updated Dec 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artem Burenok (2022). Fundamental stock data [Dataset]. https://www.kaggle.com/datasets/artemburenok/fundamental-stock-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 8, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Artem Burenok
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    It is not so often that one can find fundamental data of companies on which it would be possible to accurately assess the value of a company.

    So I decided to use yahoo_fin api to collect some fundamentals of 48 companies from the S&P 500 index.

    The content of indicators in each table: - total assets. - cash. - stockholder equity. - profit. - revenue. - return on equity, return on assets, profit margin. - trailing P/E, P/S, P/B, PEG, forward P/E.

    In addition, the dataset has prices for all stocks for four years.

  20. Global Startup Success Dataset

    • kaggle.com
    Updated Mar 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hamna Kaleem (2025). Global Startup Success Dataset [Dataset]. https://www.kaggle.com/datasets/hamnakaleemds/global-startup-success-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hamna Kaleem
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    📊 Dataset Features This dataset includes 5,000 startups from 10 countries and contains 15 key features: Startup Name: Name of the startup Founded Year: Year the startup was founded Country: Country where the startup is based Industry: Industry category (Tech, FinTech, AI, etc.) Funding Stage: Stage of investment (Seed, Series A, etc.) Total Funding ($M): Total funding received (in million $) Number of Employees: Number of employees in the startup Annual Revenue ($M): Annual revenue in million dollars Valuation ($B): Startup's valuation in billion dollars Success Score: Score from 1 to 10 based on growth Acquired?: Whether the startup was acquired (Yes/No) IPO?: Did the startup go public? (Yes/No) Customer Base (Millions): Number of active customers Tech Stack: Technologies used by the startup Social Media Followers: Total followers on social platforms Analysis Ideas 📈 What Can You Do with This Dataset? Here are some exciting analyses you can perform:

    Predict Startup Success: Train a machine learning model to predict the success score. Industry Trends: Analyze which industries get the most funding. **Valuation vs. Funding: **Explore the correlation between funding and valuation. Acquisition Analysis: Investigate the factors that contribute to startups being acquired.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
informrohit1 (2024). Top Companies Dataset [Dataset]. https://www.kaggle.com/datasets/informrohit1/top-companies-dataset
Organization logo

Top Companies Dataset

Web scraped Dataset of Top 1000 Companies.

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 10, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
informrohit1
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

This is Top 1000 companies dataset and it is web scraped form a website . This dataset includes 4 columns and 1000 unique rows . 4 columns includes Name of the company , Rating out of 5 , Reviews given by employees and Overall data. also Rows includes different 1000 Companies name.

Dataset is good for beginner level Data analysis project and Company Recommendation projects etc.

Search
Clear search
Close search
Google apps
Main menu