https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains information about world's biggest companies.
Among them you can find companies founded in the US, the UK, Europe, Asia, South America, South Africa, Australia.
The dataset contains information about the year the company was founded, its' revenue and net income in years 2018 - 2020, and the industry.
I have included 2 csv files: the raw csv file if you want to practice cleaning the data, and the clean csv ready to be analyzed.
The third dataset includes the name of all the companies included in the previous datasets and 2 additional columns: number of employees and name of the founder.
In addition there's tesla.csv file containing shares prices for Tesla.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Forbes is one of the most respected sources for financial information and global rankings. Their Global 2000 list includes the world’s largest public companies at 2024, providing invaluable insights into global markets and leading economic players.
Resource Link : https://www.forbes.com/lists/global2000/
Financial metrics (Sales , Profit , Assets , Market Value ) are in billions of dollars.
Since 2003, Forbes’ Global 2000 list has measured the world’s largest public companies in terms of four equally weighted metrics: assets, market value, sales and profits.
You can find the biggest 500 companies in this data.
Data is taken from here.
This dataset shows the locations of the 50 companies that had the largest number of employees among the Fortune Global 500. The dataset also shows the company's revenue rank for 2007 The list comes from cnnmoney.com whose Fortune section does analysis on the top corporations throughout the world
CompanyData.com, powered by BoldData, delivers high-quality, verified B2B company information from official trade registers around the world. Our India company database includes 32,468,995 verified business records, giving you powerful insight into one of the fastest-growing economies on the planet.
Each company profile is rich with firmographic data, including company name, CIN (Corporate Identification Number), registration number, legal status, industry classification (NIC codes), revenue range, and employee size. Many records are enhanced with contact details such as email addresses, phone numbers, and names of key decision-makers, supporting direct outreach and smarter segmentation.
Our India dataset is designed for a wide range of business applications — from KYC and AML compliance, due diligence, and regulatory checks, to B2B sales, lead generation, marketing campaigns, CRM enrichment, and AI model training. Whether you’re targeting local startups or large enterprises, our data helps you connect with the right businesses at the right time.
Delivery is flexible to suit your needs. Choose from customized lists, full databases in Excel or CSV, access via our real-time API, or our intuitive self-service platform. We also offer data enrichment and cleansing services to refresh and improve your existing datasets with accurate, up-to-date company information from India.
With access to 32,468,995 verified companies across more than 200 countries, CompanyData.com helps businesses grow confidently — in India and beyond. Rely on our precise, structured data to fuel your strategies and scale with speed and accuracy.
This dataset shows the locations of the largest corporations in the world by the city where their headquarters is located. The rankings are based on the amount of total revenue that the corporation earned during the year 2006. These 500 corporations are known as being a part of the Fortune 500 group which is an annual poll of the top corporations in the world. The poll is conducted by cnnmoney.com.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
South Africa number dataset is crucial for those who want their company to be successful and appealing. The database contains the contacts for important people. These people are crucial and have the power to influence major choices for their organization or business. So, if you buy the list, you can get their contact information. You may now quickly connect with them thanks to the details. South Africa number dataset is GDPR-ready and usable on all CRM platforms. Consequently, using our directory will result in an immediate return on investment (ROI). List to Data guarantees that we will support you in all circumstances and can give you the information you need to launch a successful marketing campaign. Therefore, trust us and adopt the South Africa number dataset for your marketing issues. South Africa phone data will help you find the right customers for your organization. You may now accomplish that using this directory, and we are swearing an oath to our customers to do so. Your chances of obtaining a good contract from a higher business will increase as a result. Therefore, you must take it into account for your own welfare. On the other hand, the ideal remedy for your prior marketing issues is this library. South Africa phone data is a full-featured dataset package. Everything you require or desire for telemarketing will be provided for you. Every country is experiencing a financial crisis, and the world as a whole is struggling. You can therefore conduct a successful campaign with the South Africa phone data. South Africa phone number list contains all the answers that will greatly simplify your issues. We can only state that all businesspeople who are serious about their businesses should subscribe to our list. Choose the pre-built and most accurate contact directory if you also want something remarkable from your company. Without the database, it is impossible to expand in the modern world. South Africa phone number list is now available at a low rate. No matter what the circumstance, List to Data is always here to help. You can get in touch with us for more questions. We have a strong support team that is available 24/7 hours to assist you. In short, you can get in touch with us whenever you believe your company needs South Africa phone number list.
CompanyKG is a heterogeneous graph consisting of 1,169,931 nodes and 50,815,503 undirected edges, with each node representing a real-world company and each edge signifying a relationship between the connected pair of companies.
Edges: We model 15 different inter-company relations as undirected edges, each of which corresponds to a unique edge type. These edge types capture various forms of similarity between connected company pairs. Associated with each edge of a certain type, we calculate a real-numbered weight as an approximation of the similarity level of that type. It is important to note that the constructed edges do not represent an exhaustive list of all possible edges due to incomplete information. Consequently, this leads to a sparse and occasionally skewed distribution of edges for individual relation/edge types. Such characteristics pose additional challenges for downstream learning tasks. Please refer to our paper for a detailed definition of edge types and weight calculations.
Nodes: The graph includes all companies connected by edges defined previously. Each node represents a company and is associated with a descriptive text, such as "Klarna is a fintech company that provides support for direct and post-purchase payments ...". To comply with privacy and confidentiality requirements, we encoded the text into numerical embeddings using four different pre-trained text embedding models: mSBERT (multilingual Sentence BERT), ADA2, SimCSE (fine-tuned on the raw company descriptions) and PAUSE.
Evaluation Tasks. The primary goal of CompanyKG is to develop algorithms and models for quantifying the similarity between pairs of companies. In order to evaluate the effectiveness of these methods, we have carefully curated three evaluation tasks:
Background and Motivation
In the investment industry, it is often essential to identify similar companies for a variety of purposes, such as market/competitor mapping and Mergers & Acquisitions (M&A). Identifying comparable companies is a critical task, as it can inform investment decisions, help identify potential synergies, and reveal areas for growth and improvement. The accurate quantification of inter-company similarity, also referred to as company similarity quantification, is the cornerstone to successfully executing such tasks. However, company similarity quantification is often a challenging and time-consuming process, given the vast amount of data available on each company, and the complex and diversified relationships among them.
While there is no universally agreed definition of company similarity, researchers and practitioners in PE industry have adopted various criteria to measure similarity, typically reflecting the companies' operations and relationships. These criteria can embody one or more dimensions such as industry sectors, employee profiles, keywords/tags, customers' review, financial performance, co-appearance in news, and so on. Investment professionals usually begin with a limited number of companies of interest (a.k.a. seed companies) and require an algorithmic approach to expand their search to a larger list of companies for potential investment.
In recent years, transformer-based Language Models (LMs) have become the preferred method for encoding textual company descriptions into vector-space embeddings. Then companies that are similar to the seed companies can be searched in the embedding space using distance metrics like cosine similarity. The rapid advancements in Large LMs (LLMs), such as GPT-3/4 and LLaMA, have significantly enhanced the performance of general-purpose conversational models. These models, such as ChatGPT, can be employed to answer questions related to similar company discovery and quantification in a Q&A format.
However, graph is still the most natural choice for representing and learning diverse company relations due to its ability to model complex relationships between a large number of entities. By representing companies as nodes and their relationships as edges, we can form a Knowledge Graph (KG). Utilizing this KG allows us to efficiently capture and analyze the network structure of the business landscape. Moreover, KG-based approaches allow us to leverage powerful tools from network science, graph theory, and graph-based machine learning, such as Graph Neural Networks (GNNs), to extract insights and patterns to facilitate similar company analysis. While there are various company datasets (mostly commercial/proprietary and non-relational) and graph datasets available (mostly for single link/node/graph-level predictions), there is a scarcity of datasets and benchmarks that combine both to create a large-scale KG dataset expressing rich pairwise company relations.
Source Code and Tutorial:
https://github.com/llcresearch/CompanyKG2
Paper: to be published
CompanyData.com powered by BoldData provides verified, high-quality company information sourced directly from official trade registers around the world. We help businesses unlock insights, drive compliance and scale with confidence using reliable data.
Our Russia company database contains over 3,178,171 verified business records, covering companies across all regions of the Russian Federation. Each record includes in-depth firmographic data such as industry codes, company size and turnover, as well as corporate hierarchies, and verified contact information including emails, phone numbers and decision-makers when available.
This comprehensive dataset supports a wide range of business applications - Compliance and KYC verification - AML checks and regulatory reporting - Sales and business development - B2B marketing and targeting - CRM enrichment and database cleansing - AI training and market research
We offer flexible delivery options to suit your business requirements - Custom company lists filtered by location, sector or company size - Full Russian company database delivered in Excel or CSV - Real-time API access for seamless integration - Data enrichment services for updating and validating your internal records
Russia is part of our global coverage of more than 3,178,171 verified companies across 200+ countries, making CompanyData.com a trusted partner for international data-driven growth. Whether you are navigating compliance in Eastern Europe or expanding your sales footprint, our verified data ensures you move forward with clarity and confidence.
Partner with CompanyData.com to gain access to the most reliable company data in Russia—delivered how and when you need it.
Between fiscal year 2008 and 2024, Oracle’s total employee number had grown substantially, increasing from around ****** to *******. Oracle's annual revenues, on the other hand, has reached a record high in FY2023. Oracle Corporation Founded in 1977 by Larry Ellison, the small database start-up has grown into one of the biggest names in the database market around the world. For years, the company’s database products have been some of the most successful and widely used platforms in the industry. Massive growth in the tech industry and increased need for big data storage and analysis tools have transformed the small California start-up into one of the largest companies in the world in terms of market value. Oracle has shown great ability to adapt to the changing tech environment, quickly establishing itself in the cloud services business and constantly improving its database products. Despite the growing presence of free, open-source database software, Oracle’s quality and reputation within the industry has ensured that it remains one of the most popular platforms in the market. The fiscal year end of the company is May, 31st.
Global B2B Mobile Phone Number Database | 100M+ Verified Contacts | 95% Accuracy Forager.ai provides the world’s most reliable mobile phone number data for businesses that refuse to compromise on quality. With 100 million+ professionally verified mobile numbers refreshed every 3 weeks, our database ensures 95% accuracy – so your teams never waste time on dead-end leads.
Why Our Data Wins ✅ Accuracy You Can Trust 95% of mobile numbers are verified against live carrier records and tied to current job roles. Say goodbye to “disconnected number” voicemails.
✅ Depth Beyond Digits Each contact includes 150+ data points:
Direct mobile numbers
Current job title, company, and department
Full career history + education background
Location data + LinkedIn profiles
Company size, industry, and revenue
✅ Freshness Guaranteed Bi-weekly updates combat job-hopping and role changes – critical for sales teams targeting decision-makers.
✅ Ethically Sourced & Compliant First-party collected data with full GDPR/CCPA compliance.
Who Uses This Data?
Sales Teams: Cold-call C-suite prospects with verified mobile numbers.
Marketers: Run hyper-personalized SMS/WhatsApp campaigns.
Recruiters: Source passive candidates with up-to-date contact intel.
Data Vendors: License premium datasets to enhance your product.
Tech Platforms: Power your SaaS tools via API with enterprise-grade B2B data.
Flexible Delivery, Instant Results
API (REST): Real-time integration for CRMs, dialers, or marketing stacks
CSV/JSON: Campaign-ready files.
PostgreSQL: Custom databases for large-scale enrichment
Compliance: Full audit trails + opt-out management
Why Forager.ai? → Proven ROI: Clients see 62% higher connect rates vs. industry averages (request case studies). → No Guesswork: Test-drive free samples before committing. → Scalable Pricing: Pay per record, license datasets, or get unlimited API access.
B2B Mobile Phone Data | Verified Contact Database | Sales Prospecting Lists | CRM Enrichment | Recruitment Phone Numbers | Marketing Automation | Phone Number Datasets | GDPR-Compliant Leads | Direct Dial Contacts | Decision-Maker Data
Need Proof? Contact us to see why Fortune 500 companies and startups alike trust Forager.ai for mission-critical outreach.
The Ethics and Morality of a Corporation Human behavior and corporations ultimately stem from justice, honesty, and most importantly responsibility. With the rise of globalism and multinational corporations, this raises the question on the pursuit of business ethics and morality in training and safe workplaces. Comparing the United States and third world country working professionals we will see the difference between both groups and what we could do about it. The attached CSV file is the dataset used for this study.
On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.
This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collect information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.
MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England but some (0101, 0103, 0201, 0501, 1401) tables are for Great Britain split by nation. In the past the Department for Communities and Local Government (who previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for devolved administrations are available at https://www.firescotland.gov.uk/about/statistics/">Scotland: Fire and Rescue Statistics, https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety">Wales: Community safety and https://www.nifrs.org/home/about-us/publications/">Northern Ireland: Fire and Rescue Statistics.
If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.
Fire statistics guidance
Fire statistics incident level datasets
https://assets.publishing.service.gov.uk/media/686d2aa22557debd867cbe14/FIRE0101.xlsx">FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 153 KB) Previous FIRE0101 tables
https://assets.publishing.service.gov.uk/media/686d2ab52557debd867cbe15/FIRE0102.xlsx">FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.19 MB) Previous FIRE0102 tables
https://assets.publishing.service.gov.uk/media/686d2aca10d550c668de3c69/FIRE0103.xlsx">FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 201 KB) Previous FIRE0103 tables
https://assets.publishing.service.gov.uk/media/686d2ad92557debd867cbe16/FIRE0104.xlsx">FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 492 KB) Previous FIRE0104 tables
https://assets.publishing.service.gov.uk/media/686d2af42cfe301b5fb6789f/FIRE0201.xlsx">FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet, 192 KB) Previous FIRE0201 tables
<span class="gem
• 3M+ Contact Profiles • 5M+ Worldwide eCommerce Brands • Direct Contact Info for Decision Makers • Contact Direct Email and Mobile Number • 15+ eCommerce Platforms • 20+ Data Points • Lifetime Support Until You 100% Satisfied
Buy eCommerce leads from our eCommerce leads database today. Reach out to eCommerce companies to expand your business. Now is the time to buy eCommerce leads and start running a campaign to attract new customers. We provide current and accurate information that will assist you in achieving your goals.
Our database is made up of highly valuable and interested leads who are ready to make online purchases. You can always filter our data and choose the database that best meets your needs if you need eCommerce leads based on industry.
We have millions of eCommerce data ready to go no matter where you are. We’ve acquired hundreds of clients from all over the world over the years and delivered data that they’re happy with.
We were able to do so by obtaining data from various locations around the world. As a result, our database is widely accessible, and anyone can use it from any location on the planet. Please contact us if you want the best eCommerce leads .
We sell eCommerce leads that can be filtered by industry. We know what you’re going through and what you’ll need for your next project. As a result, we’ve compiled a list of eCommerce leads that are exactly what you require. With the most potential data we provide, you can grow your business and achieve your business goals. All of our eCommerce leads are generated professionally, with real people – not bots – entering data.
We’re a leading brand in the industry because we source data from the most well-known platforms, ensuring that the information you receive from us is accurate and reliable. That’s especially true because we verify each and every piece of information in order to provide you with yet another benefit in your life.
The majority of our customers have had success with the information we’ve provided. That is why they keep contacting us for our services. You can count on our business-to-business eCommerce sales leads. Contact us to work with one of the most effective lead generation companies in the industry, which has already helped thousands of potential members achieve success.
Every month, we update our eCommerce store sales leads in order to provide our clients with the most accurate data possible. We have a team of professionals who strive for excellence when it comes to gathering the right leads to ensure you get the number of sales you need. Our experts also double-check that all of the sales data we receive is genuine and accurate.
The accuracy of our eCommerce database is why the majority of our clients choose us. Furthermore, we offer round-the-clock support to provide on-demand solutions. We take care of everything so you can spend less time evaluating our product database and more time becoming one of them.
This dataset was created by sriswaroopkoundinya
The 2007 World Bank Group Entrepreneurship Survey measures entrepreneurial activity in 84 developing and industrial countries over the period 2003-2005. The database includes cross-country, time-series data on the number of total and newly registered businesses, collected directly from Registrar of Companies around the world. In its second year, this survey incorporates improvements in methodology, and expanded participation from countries covered, allowing for greater cross-border compatibility of data compared with the 2006 survey. This joint effort by the IFC SME Department and the World Bank Developing Research Group is the most comprehensive dataset on cross-country firm entry data available today. This database The World Bank Group Entrepreneurship Dataaset presents data collected primarily from country business registries using the first annual World Bank Group Questionnaire on Entrepreneurship (alternative sources were tax authorities, finance ministries, and national statistics offices). For more information on the author of the database, Leora Klapper, visit: http://go.worldbank.org/DK5AHCQSO0. This data was access at the preceeding link, on October 11, 2007. Please visit the link for more information in regards to this dataset.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Dow Jones 30 is a price-weighted measurement stock market index of 30 prominent companies listed on stock exchanges in the United States. It is one of the most concise Indices as compared to other comprehensive ones such as NASDAQ or S&P 500.
The value index can be sum of the stock prices of the companies included in the index, divided by a factor which is currently approximately 0.152. The factor is changed whenever a constituent company undergoes a stock split so that the value of the index is unaffected by the stock split.
In the dataset, You'll find stock prices of 16 major companies which are listed in DOW JONES 30 Index before Apr 1, 1999 (Obviously there are 14 others which they've replaced others over time and were added in the index after the said date)
Date: Simply the data in m/d/yyyy format (which works in the US)
MMM to Walt Disney: Average stock price of the day of the corresponding companies
DJIA: "Dow Jones Industrial Average" which can be said the target column of the dataset. It is the pool in which all the 30 stocks have their influence. Target can be seen as how much impact each and every stock lays on the pool.
The data is obtained from the Moneycontrol website.
Predicting the influence of major stocks on one of the most weight carrying indices of the world.
As of June 2024, the most popular database management system (DBMS) worldwide was Oracle, with a ranking score of *******; MySQL and Microsoft SQL server rounded out the top three. Although the database management industry contains some of the largest companies in the tech industry, such as Microsoft, Oracle and IBM, a number of free and open-source DBMSs such as PostgreSQL and MariaDB remain competitive. Database Management Systems As the name implies, DBMSs provide a platform through which developers can organize, update, and control large databases. Given the business world’s growing focus on big data and data analytics, knowledge of SQL programming languages has become an important asset for software developers around the world, and database management skills are seen as highly desirable. In addition to providing developers with the tools needed to operate databases, DBMS are also integral to the way that consumers access information through applications, which further illustrates the importance of the software.
This dataset shows the locations of the 100 companies that had the greatest revenue increase from 2005 to 2006 that are among the Fortune Global 500 for 2006. The list comes from cnnmoney.com whose Fortune section does analysis on the top corporations throughout the world
During the second quarter of 2025, data breaches exposed more than ** million records worldwide. Since the first quarter of 2020, the highest number of data records were exposed in the third quarter of ****, more than *** billion data sets. Data breaches remain among the biggest concerns of company leaders worldwide. The most common causes of sensitive information loss were operating system vulnerabilities on endpoint devices. Which industries see the most data breaches? Meanwhile, certain conditions make some industry sectors more prone to data breaches than others. According to the latest observations, the public administration experienced the highest number of data breaches between 2021 and 2022. The industry saw *** reported data breach incidents with confirmed data loss. The second were financial institutions, with *** data breach cases, followed by healthcare providers. Data breach cost Data breach incidents have various consequences, the most common impact being financial losses and business disruptions. As of 2023, the average data breach cost across businesses worldwide was **** million U.S. dollars. Meanwhile, a leaked data record cost about *** U.S. dollars. The United States saw the highest average breach cost globally, at **** million U.S. dollars.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains information about world's biggest companies.
Among them you can find companies founded in the US, the UK, Europe, Asia, South America, South Africa, Australia.
The dataset contains information about the year the company was founded, its' revenue and net income in years 2018 - 2020, and the industry.
I have included 2 csv files: the raw csv file if you want to practice cleaning the data, and the clean csv ready to be analyzed.
The third dataset includes the name of all the companies included in the previous datasets and 2 additional columns: number of employees and name of the founder.
In addition there's tesla.csv file containing shares prices for Tesla.