100+ datasets found

Top Companies Dataset
kaggle.com
Updated Jun 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
informrohit1 (2024). Top Companies Dataset [Dataset]. https://www.kaggle.com/datasets/informrohit1/top-companies-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 10, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
informrohit1
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This is Top 1000 companies dataset and it is web scraped form a website . This dataset includes 4 columns and 1000 unique rows . 4 columns includes Name of the company , Rating out of 5 , Reviews given by employees and Overall data. also Rows includes different 1000 Companies name.

Dataset is good for beginner level Data analysis project and Company Recommendation projects etc.
Top Global Companies Innovators & Giants 🌍🏢
kaggle.com
Updated Jun 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sheikh Muhammad Abdullah (2024). Top Global Companies Innovators & Giants 🌍🏢 [Dataset]. https://www.kaggle.com/datasets/abdmental01/top-companies
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 7, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sheikh Muhammad Abdullah
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Data Description

The dataset provided includes information about various companies, their stock symbols, financial metrics such as price-to-book ratio and share price, as well as details about their origin countries. Additionally, the dataset contains frequency distribution information for certain ranges of price-to-book ratios and share prices.

About Data

The dataset appears to be a compilation of financial data for different companies, likely for investment analysis or comparison purposes. It includes the following key components:

Rank: Rank of the company based on some criteria (not explicitly mentioned).

Company: Name of the company.

Stock Symbol: Symbol used to identify the company's stock in trading.

Price to Book Ratio: Financial metric indicating the relationship between a company's market value and its book value.

Share Price (USD): Price of a single share of the company's stock in US dollars.

Company Origin: Country where the company is based.

Label Count: Frequency distribution information for certain ranges of price-to-book ratios and share prices.

This dataset can be utilized for various financial analyses such as company valuation, comparison of financial metrics across companies, and investment decision-making.
h
twt-kaggle-data
huggingface.co
Updated Dec 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
megha manoj (2023). twt-kaggle-data [Dataset]. https://huggingface.co/datasets/mochi-skz/twt-kaggle-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 8, 2023
Authors
megha manoj
Description
mochi-skz/twt-kaggle-data dataset hosted on Hugging Face and contributed by the HF Datasets community
Company Purchasing Dataset
kaggle.com
Updated Apr 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shahriar Kabir (2025). Company Purchasing Dataset [Dataset]. https://www.kaggle.com/datasets/shahriarkabir/company-purchasing-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 16, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shahriar Kabir
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset Title: Company Purchasing Dataset

Rows: 500 | Columns: 9

Domain: Supply Chain, Procurement, Finance

Use Case: Spend Analytics, Supplier Performance, Cost Optimization

This synthetic dataset simulates procurement transactions for a mid-sized organization over 2024. It includes purchases across multiple categories (electronics, furniture, stationery, etc.) from various suppliers and buyers. Ideal for practicing descriptive analytics, spend analysis, and supplier performance evaluation.
h
test-dataset-kaggle
huggingface.co
Updated Feb 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gholamreza Dar (2024). test-dataset-kaggle [Dataset]. https://huggingface.co/datasets/Gholamreza/test-dataset-kaggle
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 15, 2024
Authors
Gholamreza Dar
Description
Gholamreza/test-dataset-kaggle dataset hosted on Hugging Face and contributed by the HF Datasets community
h
kaggle-entity-annotated-corpus-ner-dataset
huggingface.co
Updated Jul 10, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rafael Arias Calles (2022). kaggle-entity-annotated-corpus-ner-dataset [Dataset]. https://huggingface.co/datasets/rjac/kaggle-entity-annotated-corpus-ner-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 10, 2022
Authors
Rafael Arias Calles
License
https://choosealicense.com/licenses/odbl/https://choosealicense.com/licenses/odbl/
Description
Date: 2022-07-10 Files: ner_dataset.csv Source: Kaggle entity annotated corpus notes: The dataset only contains the tokens and ner tag labels. Labels are uppercase.

About Dataset

from Kaggle Datasets

Context

Annotated Corpus for Named Entity Recognition using GMB(Groningen Meaning Bank) corpus for entity classification with enhanced and popular features by Natural Language Processing applied to the data set. Tip: Use Pandas Dataframe to load dataset if using Python for… See the full description on the dataset page: https://huggingface.co/datasets/rjac/kaggle-entity-annotated-corpus-ner-dataset.
A
‘Indian Company Dataset’ analyzed by Analyst-2
analyst-2.ai
Updated Dec 24, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Indian Company Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-indian-company-dataset-67c3/5da84e4e/?iid=003-487&v=presentation
Explore at:
Dataset updated
Dec 24, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
India
Description
Analysis of ‘Indian Company Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yogeshrampariya/indian-company-dataset on 28 January 2022.

--- Dataset description provided by original source is as follows ---

This dataset is about Indian recruitment company, who hire best of brains in India in software, banking, manufacturing, analysis and Education background

Columns:

Name: Name of Company Rating : Overall Rating given by Rating agency Reviews: Overall Google Reviews Company: Company type like Public, Private Head_Quarters : Location where headquarter of company is Company_age : How old the company is No_of_Employee : Number of employee company has

Your data will be in front of the world's largest data science community. What questions do you want to see answered?

This dataset will test your coding skills where you will be handling categorical variables, its a classification problem

--- Original source retains full ownership of the source dataset ---
50 US Startups: Financial and Geographical Insight
kaggle.com
Updated Dec 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Irfan Ahmad (2023). 50 US Startups: Financial and Geographical Insight [Dataset]. https://www.kaggle.com/datasets/irfanahmad1/50-us-startups-financial-and-geographical-insight
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 20, 2023
Dataset provided by
Kaggle
Authors
Irfan Ahmad
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Description
Overview:

This dataset provides a comprehensive look into the financial expenditures and profits of 50 startups based in the United States. It is an invaluable resource for analysts, economists, and business strategists seeking to understand the correlation between different types of spending and profitability in startup ventures.

Attributes: 1. R&D Spend: - Description: The amount of money each company has invested in Research and Development activities. - Data Type: Numeric (US dollars) - Importance: Indicates the company's commitment to innovation and technological advancement. 2. Administration: - Description: Expenditure on administrative functions and operations. - Data Type: Numeric (US dollars) - Relevance: Reflects the overhead costs associated with managing the company. 3. Marketing Spend: - Description: Investment in marketing and promotional activities. - Data Type: Numeric (US dollars) - Significance: A key factor in revenue generation and market penetration. 4. State: - Description: The U.S. state where the company is operating. - Data Type: Categorical (California, New York, or Florida) - Purpose: Provides geographical context and allows for regional analysis. 5. Profit: - Description: The net profit earned by the company. - Data Type: Numeric (US dollars) - Utility: A direct measure of the company’s financial success.

Potential Uses: - Business Analysis: Understanding how different types of spending (R&D, administration, marketing) affect profitability. - Regional Studies: Examining the impact of geographical location on business success. - Startup Growth: Insights into the financial practices of successful startups. - Economic Research: Data-driven study of the startup ecosystem in the U.S.

Target Audience: - Business Analysts and Economists - Marketing Strategists - Startup Consultants - Data Science Enthusiasts - Academic Researchers

Conclusion: This dataset is a rich resource for anyone looking to delve into the financial dynamics of startups in the U.S. It offers a unique perspective on how different types of investments correlate with company success across various states.

Please note that the data is anonymized and does not include any confidential information about the companies listed. The dataset is intended for educational and research purposes.
Sales Dataset of USA [Updated]
kaggle.com
Updated Jun 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sulaiman ahmed (2023). Sales Dataset of USA [Updated] [Dataset]. https://www.kaggle.com/datasets/sulaimanahmed/sales-dataset-of-usa-updated
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 20, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sulaiman ahmed
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Area covered
United States
Description
The given dataset appears to be a sales dataset containing information about different orders. Here is a description of the data:

Row ID: An identifier for each row in the dataset.

Order ID: Unique identifier for each order.

Order Date: The date when the order was placed.

Ship Date: The date when the order was shipped.

Ship Mode: The mode of shipping chosen for the order.

Customer ID: Unique identifier for each customer.

Customer Name: Name of the customer who placed the order.

Segment: The segment to which the customer belongs (e.g., consumer, corporate).

Country: The country where the order was placed (in this case, United States).

City: The city where the order was placed.

State: The state where the order was placed.

Postal Code: The postal code associated with the order's location.

Region: The region of the country where the order was placed.

Product ID: Unique identifier for each product.

Category: The category to which the product belongs (e.g., furniture, office supplies).

Sub-Category: The sub-category to which the product belongs (e.g., bookcases, chairs).

Product Name: The name of the product.

Cost: The cost of the product.

Price: The price at which the product was sold.

Profit: The profit made from the sale of the product.

Quantity: The quantity of the product ordered.

Sales: The total sales generated from the order (quantity multiplied by price).

The dataset provides detailed information about each order, including customer details, product details, sales information, and shipping information. It can be used to analyze various aspects of the sales data, such as profitability, customer segments, product categories, and regional sales performance.
h
kaggle
huggingface.co
Updated Feb 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahmad Khan (2024). kaggle [Dataset]. https://huggingface.co/datasets/ahmadkhan1022/kaggle
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 2, 2024
Dataset authored and provided by
Ahmad Khan
License
https://choosealicense.com/licenses/pddl/https://choosealicense.com/licenses/pddl/
Description
Dataset Card for MergedDataset

Dataset Summary Supported Tasks and Leaderboards

[More Information Needed]

Languages

[More Information Needed]

Dataset Structure Data Instances

[More Information Needed]

Data Fields

[More Information Needed]

Data Splits

[More Information Needed]

Dataset Creation Curation Rationale

[More Information Needed]

Source Data Initial Data… See the full description on the dataset page: https://huggingface.co/datasets/ahmadkhan1022/kaggle.
h
AppleStockData2025
huggingface.co
Updated Jan 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TableGPT (2025). AppleStockData2025 [Dataset]. https://huggingface.co/datasets/tablegpt/AppleStockData2025
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 7, 2025
Dataset authored and provided by
TableGPT
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Apple Stock Data 2025

This is a dataset copied from Kaggle. You can see the original dataset here: https://www.kaggle.com/datasets/umerhaddii/apple-stock-data-2025

The following is the original readme of this dataset:

About Dataset Context

Apple Inc. is an American hardware and software developer and technology company that develops and sells computers, smartphones and consumer electronics as well as operating systems and application software. Apple also… See the full description on the dataset page: https://huggingface.co/datasets/tablegpt/AppleStockData2025.
h
kaggle-recipe-categorized-chunk-8
huggingface.co
Updated Sep 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jeff Schmitz (2024). kaggle-recipe-categorized-chunk-8 [Dataset]. https://huggingface.co/datasets/Schmitz005/kaggle-recipe-categorized-chunk-8
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 11, 2024
Authors
Jeff Schmitz
Description
Schmitz005/kaggle-recipe-categorized-chunk-8 dataset hosted on Hugging Face and contributed by the HF Datasets community
A
‘Top 100 Biggest Restaurant Chains 2021’ analyzed by Analyst-2
analyst-2.ai
Updated Jan 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Top 100 Biggest Restaurant Chains 2021’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-top-100-biggest-restaurant-chains-2021-94e5/52e35c93/?iid=003-302&v=presentation
Explore at:
Dataset updated
Jan 28, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Top 100 Biggest Restaurant Chains 2021’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/johnharshith/top-100-biggest-restaurant-chains-2021 on 28 January 2022.

--- Dataset description provided by original source is as follows ---

https://i.insider.com/5db704d7045a311ad239369b?width=1300&format=jpeg&auto=webp" alt="Popular Restaurant Chains">

Context

This Dataset contains the data compiled by Technomic and reported by Restaurant Business magazine, the top 100 most popular restaurant chains in the United States in terms of the latest 2020 sales which were responsible for three-fourths of the total industry sales growth last year.

Content

The data was obtained from the Restaurant Business magazine website. The columns contain stats such as position of restaurant chains, 2020 U.S. sales, YOY sales change, 2020 U.S. units, YOY unit change, segment and menu types. This data can be found from the website https://www.restaurantbusinessonline.com/top-500-chains with detailed analysis.

Inspiration

While 2016 was a rough year for chain restaurants, more than half of the industry wealth of $521.9 billion still comes from the Top 500 chains and nearly 94% of those dollars and 93% of those units are represented in the Top 250. These stats have made me curious to find out interesting profit patterns from this dataset.

Dataset Usage

This Dataset can be used to study interesting patterns using various classification techniques and arrive at some exciting conclusions. One can create amazing visualisations using the different columns of the dataset. We can also find out and design an effective business model from the given dataset and take one step closer to your most successful restaurant chain startup ever!

--- Original source retains full ownership of the source dataset ---
E-commerce Business Transaction
kaggle.com
Updated May 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gabriel Ramos (2022). E-commerce Business Transaction [Dataset]. https://www.kaggle.com/datasets/gabrielramos87/an-online-shop-business
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 14, 2022
Dataset provided by
Kaggle
Authors
Gabriel Ramos
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

E-commerce has become a new channel to support businesses development. Through e-commerce, businesses can get access and establish a wider market presence by providing cheaper and more efficient distribution channels for their products or services. E-commerce has also changed the way people shop and consume products and services. Many people are turning to their computers or smart devices to order goods, which can easily be delivered to their homes.

Content

This is a sales transaction data set of UK-based e-commerce (online retail) for one year. This London-based shop has been selling gifts and homewares for adults and children through the website since 2007. Their customers come from all over the world and usually make direct purchases for themselves. There are also small businesses that buy in bulk and sell to other customers through retail outlet channels.

The data set contains 500K rows and 8 columns. The following is the description of each column. 1. TransactionNo (categorical): a six-digit unique number that defines each transaction. The letter “C” in the code indicates a cancellation. 2. Date (numeric): the date when each transaction was generated. 3. ProductNo (categorical): a five or six-digit unique character used to identify a specific product. 4. Product (categorical): product/item name. 5. Price (numeric): the price of each product per unit in pound sterling (£). 6. Quantity (numeric): the quantity of each product per transaction. Negative values related to cancelled transactions. 7. CustomerNo (categorical): a five-digit unique number that defines each customer. 8. Country (categorical): name of the country where the customer resides.

There is a small percentage of order cancellation in the data set. Most of these cancellations were due to out-of-stock conditions on some products. Under this situation, customers tend to cancel an order as they want all products delivered all at once.

Inspiration

Information is a main asset of businesses nowadays. The success of a business in a competitive environment depends on its ability to acquire, store, and utilize information. Data is one of the main sources of information. Therefore, data analysis is an important activity for acquiring new and useful information. Analyze this dataset and try to answer the following questions. 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

Photo by CardMapr on Unsplash
Dataset : Business Intelligence Research Trends
kaggle.com
Updated Nov 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MUHAMMAD AKMAL HAKIM (2024). Dataset : Business Intelligence Research Trends [Dataset]. http://doi.org/10.34740/kaggle/ds/5643554
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/ds/5643554
Dataset updated
Nov 14, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
MUHAMMAD AKMAL HAKIM
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset Overview

This dataset presents a meticulously compiled collection of 387 academic publications that explore various aspects of social media and business intelligence. The dataset includes detailed metadata about each publication, such as titles, authorship, abstracts, publication years, article types, and the journals or conferences where they were published. Citations and research areas are also included, making this dataset a valuable resource for bibliometric analysis, trend detection, and literature reviews in the fields of social media analytics, sentiment analysis, business intelligence, and related disciplines.

Content

The dataset comprises 15 columns, each capturing specific attributes of the research papers. Below is a description of each column:

ID: A unique identifier for each record in the dataset.

Title: The title of the academic paper.

DOI: The Digital Object Identifier, which provides a permanent link to the publication.

Author: List of authors who contributed to the paper.

Abstract: A summary of the research paper, providing insights into the study's objectives, methods, and findings.

Year: The year the paper was published.

Article Type: Indicates the type of publication (e.g., Proceedings Paper, Article, Book Chapter).

Publication Name: The name of the journal or conference where the paper was published.

Number-of-Cited-References: The number of references cited in the paper.

Times Cited: The number of times the paper has been cited by other works.

Research Areas: The general research area(s) the paper pertains to (e.g., Computer Science, Engineering).

WOS Category: Specific categories or subfields relevant to Web of Science classification.

WOS Index: The index within Web of Science where the paper is listed.

Keywords: Keywords provided by the authors to describe the main topics of the paper.

Keyword Plus: Additional keywords derived from the titles of the paper’s cited references.

Applications

This dataset can be utilized for a variety of purposes, including but not limited to:

Trend Analysis: Identify emerging trends and popular topics in social media and business intelligence research.

Citation Analysis: Analyze citation patterns to determine the impact and relevance of specific publications.

Collaborative Networks: Map out authorship and institutional collaboration trends.

Text Mining: Perform text mining on abstracts and keywords to uncover latent themes and topics.

Research Evaluation: Conduct bibliometric evaluations to assess the productivity and impact of researchers and institutions in the field.

Data Collection and Preprocessing

The dataset was curated by extracting bibliometric data from Web of Science (WOS), ensuring the inclusion of comprehensive and high-quality metadata. All records have been standardized for consistency and completeness to facilitate easier analysis.
Complete Company DataSet for EDA,data cleaning
kaggle.com
Updated Jul 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shital Gaikwad (2021). Complete Company DataSet for EDA,data cleaning [Dataset]. https://www.kaggle.com/datasets/shitalgaikwad123/complete-company-dataset-for-edadata-cleaning/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 22, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shital Gaikwad
Description
Dataset

This dataset was created by Shital Gaikwad

Contents

Machine Learning Job Postings in the US

kaggle.com
opendatabay.com

Updated Apr 20, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Ivan Kumeyko (2025). Machine Learning Job Postings in the US [Dataset]. https://www.kaggle.com/datasets/ivankmk/thousand-ml-jobs-in-usa

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Apr 20, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Ivan Kumeyko

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered

United States

Description

This dataset contains 1,000 job postings for Machine Learning-related roles across the United States, scraped between late 2024 and early 2025. The data was collected directly from company career pages and job boards, focusing on full job descriptions and associated company information.

Column Descriptions

Column	Description
`job_posted_date`	The date the job was posted (format: YYYY-MM-DD).
`company_address_locality`	The city or locality of the job or company.
`company_address_region`	The U.S. state or region where the job is located.
`company_name`	The name of the company posting the job.
`company_website`	The official website of the company.
`company_description`	A short description or mission statement of the company.
`job_description_text`	The full job description text as listed in the original posting.
`seniority_level`	The required seniority level (e.g., Internship, Entry level, Mid-Senior).
`job_title`	The full job title listed in the posting.

T-Mart
kaggle.com
Updated Aug 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gowtham G (2023). T-Mart [Dataset]. https://www.kaggle.com/datasets/imgowthamg/t-mart
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 10, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Gowtham G
Description
The provided dataset appears to be a sales dataset from a company called "**T-Mart.**" The dataset contains various columns with information about the sales transactions, including the date of the transaction, product details, quantity, sales type, location, payment mode, product category, unit of measurement (UOM), purchase price, and some additional labels and counts.

Based on the given information, here's a brief description of the dataset:

The "T-Mart" sales dataset captures sales transactions with details such as the transaction date, unique product identifier (PRODUCT ID), quantity sold, sales type (Direct Sales, Online, etc.), sales location (e.g., California, Alabama), payment mode (Cash, Online), product details (PRODUCT, CATEGORY, UOM), purchase price, and some additional label-based information.

This dataset provides insights into various aspects of the company's sales operations, including the distribution of sales across different categories, products, and locations, as well as information about the payment modes used for transactions.

Analyzing this dataset can help identify trends, popular products, sales performance by location, and preferred payment methods. It's essential for understanding the company's sales dynamics and making informed business decisions.

This dataset appears to be rich in information, and with the right data visualization techniques, we can uncover valuable insights that can be used for strategic planning and optimizing sales strategies.
Fundamental stock data
kaggle.com
Updated Dec 8, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artem Burenok (2022). Fundamental stock data [Dataset]. https://www.kaggle.com/datasets/artemburenok/fundamental-stock-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 8, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Artem Burenok
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
It is not so often that one can find fundamental data of companies on which it would be possible to accurately assess the value of a company.

So I decided to use yahoo_fin api to collect some fundamentals of 48 companies from the S&P 500 index.

The content of indicators in each table: - total assets. - cash. - stockholder equity. - profit. - revenue. - return on equity, return on assets, profit margin. - trailing P/E, P/S, P/B, PEG, forward P/E.

In addition, the dataset has prices for all stocks for four years.
Global Startup Success Dataset
kaggle.com
Updated Mar 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hamna Kaleem (2025). Global Startup Success Dataset [Dataset]. https://www.kaggle.com/datasets/hamnakaleemds/global-startup-success-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 1, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Hamna Kaleem
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
📊 Dataset Features This dataset includes 5,000 startups from 10 countries and contains 15 key features: Startup Name: Name of the startup Founded Year: Year the startup was founded Country: Country where the startup is based Industry: Industry category (Tech, FinTech, AI, etc.) Funding Stage: Stage of investment (Seed, Series A, etc.) Total Funding ($M): Total funding received (in million $) Number of Employees: Number of employees in the startup Annual Revenue ($M): Annual revenue in million dollars Valuation ($B): Startup's valuation in billion dollars Success Score: Score from 1 to 10 based on growth Acquired?: Whether the startup was acquired (Yes/No) IPO?: Did the startup go public? (Yes/No) Customer Base (Millions): Number of active customers Tech Stack: Technologies used by the startup Social Media Followers: Total followers on social platforms Analysis Ideas 📈 What Can You Do with This Dataset? Here are some exciting analyses you can perform:

Predict Startup Success: Train a machine learning model to predict the success score. Industry Trends: Analyze which industries get the most funding. **Valuation vs. Funding: **Explore the correlation between funding and valuation. Acquisition Analysis: Investigate the factors that contribute to startups being acquired.

Facebook

Twitter

Click to copy link

Link copied

Cite

informrohit1 (2024). Top Companies Dataset [Dataset]. https://www.kaggle.com/datasets/informrohit1/top-companies-dataset

Top Companies Dataset

Web scraped Dataset of Top 1000 Companies.

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jun 10, 2024

Dataset provided by

Kagglehttp://kaggle.com/

Authors

informrohit1

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

This is Top 1000 companies dataset and it is web scraped form a website . This dataset includes 4 columns and 1000 unique rows . 4 columns includes Name of the company , Rating out of 5 , Reviews given by employees and Overall data. also Rows includes different 1000 Companies name.

Dataset is good for beginner level Data analysis project and Company Recommendation projects etc.

Clear search

Close search

Google apps

Main menu

Top Companies Dataset

Top Global Companies Innovators & Giants 🌍🏢

Data Description

About Data

twt-kaggle-data

Company Purchasing Dataset

test-dataset-kaggle

kaggle-entity-annotated-corpus-ner-dataset

‘Indian Company Dataset’ analyzed by Analyst-2

50 US Startups: Financial and Geographical Insight

Sales Dataset of USA [Updated]

kaggle

AppleStockData2025

kaggle-recipe-categorized-chunk-8

‘Top 100 Biggest Restaurant Chains 2021’ analyzed by Analyst-2

Context

Content

Inspiration

Dataset Usage

E-commerce Business Transaction

Context

Content

Inspiration

Dataset : Business Intelligence Research Trends

Complete Company DataSet for EDA,data cleaning

Dataset

Contents

Machine Learning Job Postings in the US

Column Descriptions

T-Mart

Fundamental stock data

Global Startup Success Dataset

Top Companies Dataset

Web scraped Dataset of Top 1000 Companies.