By Arthur Keen [source]
This dataset contains the top 100 global banks ranked by total assets on December 31, 2017. With a detailed list of key information for each bank's rank, country, balance sheet and US Total Assets (in billions), this data will be invaluable for those looking to research and study the current status of some of the world's leading financial organizations. From billion-dollar mega-banks such as JP Morgan Chase to small, local savings & loans institutions like BancorpSouth; this comprehensive overview allows researchers and analysts to gain a better understanding of who holds power in the world economy today
For more datasets, click here.
- šØ Your notebook can be here! šØ!
This dataset contains the rank and total asset information of the top 100 global banks as of December 31, 2017. It is a useful resource for researchers who wish to study how key financial institutions' asset information relate to each other across countries.
Using this dataset is relatively straightforward ā it consists of three columns - rank (the order in which each bank appears in the list), country (the country in which the bank is located) and total assets US billions (the total value expressed in US dollars). Additionally, there is a fourth column containing the balance sheet information for each bank as well.
In order to make full use of this dataset, one should analyse it by creating comparison grids based on different factors such as region, size or ownership structures. This can provide an interesting insight into how financial markets are structured within different economies and allow researchers to better understand some banking sector dynamics that are particularly relevant for certain countries or regions. Additionally, one can compare any two banks side-by-side using their respective balance sheets or distribution plot graphs based on size or concentration metrics by leverage or other financial ratios as well.
Overall, this dataset provides useful resources that can be put into practice through data visualization making an interesting reference point for trends analysis and forecasting purposes focusing on certain banking activities worldwide
Analyzing the differences in total assets across countries. By comparing and contrasting data, patterns could be found that give insight into the factors driving differences in banksā assets between different markets.
Using predictive models to identify which banks are more likely to perform better based on their balance sheet data, such as by predicting future profits or cashflows of said banks.
Leveraging the information on holdings and investments of ātop-rankedā banks as a guide for personal investments decisions or informing investment strategies of large financial institutions or hedge funds
If you use this dataset in your research, please credit the original authors. Data Source
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: top50banks2017-03-31.csv | Column name | Description | |:----------------------|:------------------------------------------------------------------------| | rank | The rank of the bank globally based on total assets. (Integer) | | country | The country where the bank is located. (String) | | total_assets_us_b | The total assets of a bank expressed in billions of US dollars. (Float) | | balance_sheet | A snapshot of banking activities for a specific date. (Date) |
File: top100banks2017-12-31.csv | Column name | Description | |:----------------------|:--------------------------------------------...
This clean dataset is a refined version of our company datasets, consisting of 35M+ data records.
Itās an excellent data solution for companies with limited data engineering capabilities and those who want to reduce their time to value. You get filtered, cleaned, unified, and standardized B2B data. After cleaning, this data is also enriched by leveraging a carefully instructed large language model (LLM).
AI-powered data enrichment offers more accurate information in key data fields, such as company descriptions. It also produces over 20 additional data points that are very valuable to B2B businesses. Enhancing and highlighting the most important information in web data contributes to quicker time to value, making data processing much faster and easier.
For your convenience, you can choose from multiple data formats (Parquet, JSON, JSONL, or CSV) and select suitable delivery frequency (quarterly, monthly, or weekly).
Coresignal is a leading public business data provider in the web data sphere with an extensive focus on firmographic data and public employee profiles. More than 3B data records in different categories enable companies to build data-driven products and generate actionable insights. Coresignal is exceptional in terms of data freshness, with 890M+ records updated monthly for unprecedented accuracy and relevance.
LinkedIn companies use datasets to access public company data for machine learning, ecosystem mapping, and strategic decisions. Popular use cases include competitive analysis, CRM enrichment, and lead generation.
Use our LinkedIn Companies Information dataset to access comprehensive data on companies worldwide, including business size, industry, employee profiles, and corporate activity. This dataset provides key company insights, organizational structure, and competitive landscape, tailored for market researchers, HR professionals, business analysts, and recruiters.
Leverage the LinkedIn Companies dataset to track company growth, analyze industry trends, and refine your recruitment strategies. By understanding company dynamics and employee movements, you can optimize sourcing efforts, enhance business development opportunities, and gain a strategic edge in your market. Stay informed and make data-backed decisions with this essential resource for understanding global company ecosystems.
This dataset is ideal for:
- Market Research: Identifying key trends and patterns across different industries and geographies.
- Business Development: Analyzing potential partners, competitors, or customers.
- Investment Analysis: Assessing investment potential based on company size, funding, and industries.
- Recruitment & Talent Analytics: Understanding the workforce size and specialties of various companies.
CUSTOM
Please review the respective licenses below:
Success.ai offers a cutting-edge solution for businesses and organizations seeking Company Financial Data on private and public companies. Our comprehensive database is meticulously crafted to provide verified profiles, including contact details for financial decision-makers such as CFOs, financial analysts, corporate treasurers, and other key stakeholders. This robust dataset is continuously updated and validated using AI technology to ensure accuracy and relevance, empowering businesses to make informed decisions and optimize their financial strategies.
Key Features of Success.ai's Company Financial Data:
Global Coverage: Access data from over 70 million businesses worldwide, including public and private companies across all major industries and regions. Our datasets span 250+ countries, offering extensive reach for your financial analysis and market research.
Detailed Financial Profiles: Gain insights into company financials, including revenue, profit margins, funding rounds, and operational costs. Profiles are enriched with key contact details, including work emails, phone numbers, and physical addresses, ensuring direct access to decision-makers.
Industry-Specific Data: Tailored datasets for sectors such as financial services, manufacturing, technology, healthcare, and energy, among others. Each dataset is customized to meet the unique needs of industry professionals and analysts.
Real-Time Accuracy: With continuous updates powered by AI-driven validation, our financial data maintains a 99% accuracy rate, ensuring you have access to the most reliable and up-to-date information available.
Compliance and Security: All data is collected and processed in strict adherence to global compliance standards, including GDPR, ensuring ethical and lawful usage.
Why Choose Success.ai for Company Financial Data?
Best Price Guarantee: We pride ourselves on offering the most competitive pricing in the industry, ensuring you receive unparalleled value for comprehensive financial data.
AI-Validated Accuracy: Our advanced AI algorithms meticulously verify every data point to ensure precision and reliability, helping you avoid costly errors in your financial decision-making.
Customized Data Solutions: Whether you need data for a specific region, industry, or type of business, we tailor our datasets to align perfectly with your requirements.
Scalable Data Access: From small startups to global enterprises, our platform caters to businesses of all sizes, delivering scalable solutions to suit your operational needs.
Comprehensive Use Cases for Financial Data:
Leverage our detailed financial profiles to create accurate budgets, forecasts, and strategic plans. Gain insights into competitorsā financial health and market positions to make data-driven decisions.
Access key financial details and contact information to streamline your M&A processes. Identify potential acquisition targets or partners with verified profiles and financial data.
Evaluate the financial performance of public and private companies for informed investment decisions. Use our data to identify growth opportunities and assess risk factors.
Enhance your sales outreach by targeting CFOs, financial analysts, and other decision-makers with verified contact details. Utilize accurate email and phone data to increase conversion rates.
Understand market trends and financial benchmarks with our industry-specific datasets. Use the data for competitive analysis, benchmarking, and identifying market gaps.
APIs to Power Your Financial Strategies:
Enrichment API: Integrate real-time updates into your systems with our Enrichment API. Keep your financial data accurate and current to drive dynamic decision-making and maintain a competitive edge.
Lead Generation API: Supercharge your lead generation efforts with access to verified contact details for key financial decision-makers. Perfect for personalized outreach and targeted campaigns.
Tailored Solutions for Industry Professionals:
Financial Services Firms: Gain detailed insights into revenue streams, funding rounds, and operational costs for competitor analysis and client acquisition.
Corporate Finance Teams: Enhance decision-making with precise data on industry trends and benchmarks.
Consulting Firms: Deliver informed recommendations to clients with access to detailed financial datasets and key stakeholder profiles.
Investment Firms: Identify potential investment opportunities with verified data on financial performance and market positioning.
What Sets Success.ai Apart?
Extensive Database: Access detailed financial data for 70M+ companies worldwide, including small businesses, startups, and large corporations.
Ethical Practices: Our data collection and processing methods are fully comp...
To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.
Point of Interest (POI) is defined as an entity (such as a business) at a ground location (point) which may be (of interest). We provide high-quality POI data that is fresh, consistent, customizable, easy to use and with high-density coverage for all countries of the world.
This is our process flow:
Our machine learning systems continuously crawl for new POI data
Our geoparsing and geocoding calculates their geo locations
Our categorization systems cleanup and standardize the datasets
Our data pipeline API publishes the datasets on our data store
A new POI comes into existence. It could be a bar, a stadium, a museum, a restaurant, a cinema, or store, etc.. In today's interconnected world its information will appear very quickly in social media, pictures, websites, press releases. Soon after that, our systems will pick it up.
POI Data is in constant flux. Every minute worldwide over 200 businesses will move, over 600 new businesses will open their doors and over 400 businesses will cease to exist. And over 94% of all businesses have a public online presence of some kind tracking such changes. When a business changes, their website and social media presence will change too. We'll then extract and merge the new information, thus creating the most accurate and up-to-date business information dataset across the globe.
We offer our customers perpetual data licenses for any dataset representing this ever changing information, downloaded at any given point in time. This makes our company's licensing model unique in the current Data as a Service - DaaS Industry. Our customers don't have to delete our data after the expiration of a certain "Term", regardless of whether the data was purchased as a one time snapshot, or via our data update pipeline.
Customers requiring regularly updated datasets may subscribe to our Annual subscription plans. Our data is continuously being refreshed, therefore subscription plans are recommended for those who need the most up to date data. The main differentiators between us vs the competition are our flexible licensing terms and our data freshness.
Data samples may be downloaded at https://store.poidata.xyz/us
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
š Dataset Features This dataset includes 5,000 startups from 10 countries and contains 15 key features: Startup Name: Name of the startup Founded Year: Year the startup was founded Country: Country where the startup is based Industry: Industry category (Tech, FinTech, AI, etc.) Funding Stage: Stage of investment (Seed, Series A, etc.) Total Funding ($M): Total funding received (in million $) Number of Employees: Number of employees in the startup Annual Revenue ($M): Annual revenue in million dollars Valuation ($B): Startup's valuation in billion dollars Success Score: Score from 1 to 10 based on growth Acquired?: Whether the startup was acquired (Yes/No) IPO?: Did the startup go public? (Yes/No) Customer Base (Millions): Number of active customers Tech Stack: Technologies used by the startup Social Media Followers: Total followers on social platforms Analysis Ideas š What Can You Do with This Dataset? Here are some exciting analyses you can perform:
Predict Startup Success: Train a machine learning model to predict the success score. Industry Trends: Analyze which industries get the most funding. **Valuation vs. Funding: **Explore the correlation between funding and valuation. Acquisition Analysis: Investigate the factors that contribute to startups being acquired.
Success.aiās LinkedIn Data Solutions offer unparalleled access to a vast dataset of 700 million public LinkedIn profiles and 70 million LinkedIn company records, making it one of the most comprehensive and reliable LinkedIn datasets available on the market today. Our employee data and LinkedIn data are ideal for businesses looking to streamline recruitment efforts, build highly targeted lead lists, or develop personalized B2B marketing campaigns.
Whether youāre looking for recruiting data, conducting investment research, or seeking to enrich your CRM systems with accurate and up-to-date LinkedIn profile data, Success.ai provides everything you need with pinpoint precision. By tapping into LinkedIn company data, youāll have access to over 40 critical data points per profile, including education, professional history, and skills.
Key Benefits of Success.aiās LinkedIn Data: Our LinkedIn data solution offers more than just a dataset. With GDPR-compliant data, AI-enhanced accuracy, and a price match guarantee, Success.ai ensures you receive the highest-quality data at the best price in the market. Our datasets are delivered in Parquet format for easy integration into your systems, and with millions of profiles updated daily, you can trust that youāre always working with fresh, relevant data.
Global Reach and Industry Coverage: Our LinkedIn data covers professionals across all industries and sectors, providing you with detailed insights into businesses around the world. Our geographic coverage spans 259M profiles in the United States, 22M in the United Kingdom, 27M in India, and thousands of profiles in regions such as Europe, Latin America, and Asia Pacific. With LinkedIn company data, you can access profiles of top companies from the United States (6M+), United Kingdom (2M+), and beyond, helping you scale your outreach globally.
Why Choose Success.aiās LinkedIn Data: Success.ai stands out for its tailored approach and white-glove service, making it easy for businesses to receive exactly the data they need without managing complex data platforms. Our dedicated Success Managers will curate and deliver your dataset based on your specific requirements, so you can focus on what matters mostāreaching the right audience. Whether youāre sourcing employee data, LinkedIn profile data, or recruiting data, our service ensures a seamless experience with 99% data accuracy.
Key Use Cases:
LinkedIn URL: Access direct links to LinkedIn profiles for immediate insights. Full Name: Verified first and last names. Job Title: Current job titles, and prior experience. Company Information: Company name, LinkedIn URL, domain, and location. Work and Per...
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of āBrazil Largest Companiesā provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/brazil-largest-companiese on 13 February 2022.
--- Dataset description provided by original source is as follows ---
From the Forbes Global 2000 listā last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. āThe Forbes Global 2000 weighāsā sales, profits, assets and market valueā equallyā so companies can be ranked by size. Figures for all companies are in US dollars.
āSource: Economy Watch
This dataset was created by Finance and contains around 0 samples along with Assets ($billion), Sales ($billion), technical information and other features such as: - Profits ($billion) - Market Value ($billion) - and more.
- Analyze Assets ($billion) in relation to Sales ($billion)
- Study the influence of Profits ($billion) on Market Value ($billion)
- More datasets
If you use this dataset in your research, please credit Finance
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of āGreece Largest Companiesā provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/greece-largest-companiese on 13 February 2022.
--- Dataset description provided by original source is as follows ---
From the Forbes Global 2000 listā last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. āThe Forbes Global 2000 weighāsā sales, profits, assets and market valueā equallyā so companies can be ranked by size. Figures for all companies are in US dollars.
āSource: Economy Watch
This dataset was created by Finance and contains around 0 samples along with Market Value ($billion), Assets ($billion), technical information and other features such as: - Profits ($billion) - Market Value ($billion) - and more.
- Analyze Assets ($billion) in relation to Profits ($billion)
- Study the influence of Market Value ($billion) on Assets ($billion)
- More datasets
If you use this dataset in your research, please credit Finance
--- Original source retains full ownership of the source dataset ---
š Worldwide B2B Company Dataset | 65M+ Verified Records | Firmographics & API Access Power your sales, marketing, and investment strategies with the most comprehensive global B2B company dataāverified, AI-driven, and updated bi-weekly.
The Forager.ai Global Company Dataset delivers 65M+ high-quality firmographic records, covering public and private companies worldwide. Leveraging AI-powered validation and bi-weekly updates, our dataset ensures accuracy, freshness, and depthāmaking it ideal for sales intelligence, market analysis, and CRM enrichment.
š Key Features & Coverage ā 65M+ Company Records ā The largest, most reliable B2B firmographic dataset available. ā Bi-Weekly Updates ā Stay ahead with refreshed data every two weeks. ā AI-Driven Accuracy ā Sophisticated algorithms verify and enrich every record. ā Global Coverage ā Companies across North America, Europe, APAC, and emerging markets.
š Core Data Fields: ā Company Name, LinkedIn URL, & Domain ā Industries ā Job postings, Revenue, Employee Size, Funding Status ā Location (HQ + Regional Offices) ā Tech Stack & Firmographic Signals ā LinkedIn Profile details
šÆ Top Use Cases š¹ Sales & Lead Generation
Build targeted prospect lists using firmographics (size, industry, revenue).
Enhance lead scoring with technographic insights.
š¹ Market & Competitive Intelligence
Track company growth, expansions, and trends.
Benchmark competitors using real-time private company data.
š¹ Venture Capital & Private Equity
Discover investment opportunities with granular sector-level insights.
Monitor portfolio companies and industry shifts.
š¹ ABM & Marketing Automation
Enrich CRM data for hyper-targeted campaigns.
Power intent data and predictive analytics.
ā” Delivery & Integration Choose the best method for your workflow:
REST API ā Real-time access for developers.
Flat Files (CSV, JSON) ā Delivered via S3, Wasabi, Snowflake.
Custom Solutions ā Scalable enterprise integrations.
š Data Quality & Compliance 95%+ Field Completeness ā Minimize gaps in your analysis.
Ethically Sourced ā Compliant with GDPR, CCPA, and global privacy laws.
Transparent Licensing ā Clear usage terms for peace of mind.
š Why Forager.ai? ā AI-Powered Accuracy ā Better data, fewer false leads. ā Enterprise-Grade Freshness ā Bi-weekly updates keep insights relevant. ā Flexible Access ā API, bulk files, or custom database solutions. ā Dedicated Support ā Onboarding and SLA-backed assistance.
Tags: B2B Company Data |LinkedIn Job Postings | Firmographics | Global Business Intelligence | Sales Leads | VC & PE Data | Technographics | CRM Enrichment | API Access | AI-Validated Data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Object recognition predominately still relies on many high-quality training examples per object category. In contrast, learning new objects from only a few examples could enable many impactful applications from robotics to user personalization. Most few-shot learning research, however, has been driven by benchmark datasets that lack the high variation that these applications will face when deployed in the real-world. To close this gap, we present the ORBIT dataset, grounded in a real-world application of teachable object recognizers for people who are blind/low vision. We provide a full, unfiltered dataset of 4,733 videos of 588 objects recorded by 97 people who are blind/low-vision on their mobile phones, and a benchmark dataset of 3,822 videos of 486 objects collected by 77 collectors. The code for loading the dataset, computing all benchmark metrics, and running the baseline models is available at https://github.com/microsoft/ORBIT-DatasetThis version comprises several zip files:- train, validation, test: benchmark dataset, organised by collector, with raw videos split into static individual frames in jpg format at 30FPS- other: data not in the benchmark set, organised by collector, with raw videos split into static individual frames in jpg format at 30FPS (please note that the train, validation, test, and other files make up the unfiltered dataset)- *_224: as for the benchmark, but static individual frames are scaled down to 224 pixels.- *_unfiltered_videos: full unfiltered dataset, organised by collector, in mp4 format.
A changing climate and increasing human population necessitate understanding global freshwater availability and temporal variability. To examine lake freshwater availability from local-to-global and monthly-to-decadal scales, we created the Global Lake area, Climate, and Population (GLCP) dataset, which contains annual lake surface area for 1.42 million lakes with paired annual basin-level climate and population data. Building off an existing data product infrastructure, the next generation of the GLCP includes monthly lake ice area, snow basin area, and more climate variables including specific humidity, longwave and shortwave radiation, as well as cloud cover. The new generation of the GLCP continues previous FAIR data efforts by expanding its scripting repository and maintaining unique relational keys for merging with external data products. Compared to the original version, the new GLCP contains an even richer suite of variables capable of addressing disparate analyses of lake water trends at wide spatial and temporal scales.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of āGermany Largest Companiesā provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/germany-largest-companiese on 13 February 2022.
--- Dataset description provided by original source is as follows ---
From the Forbes Global 2000 listā last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. āThe Forbes Global 2000 weighāsā sales, profits, assets and market valueā equallyā so companies can be ranked by size. Figures for all companies are in US dollars.
āSource: Economy Watch
This dataset was created by Finance and contains around 0 samples along with Profits ($billion), Assets ($billion), technical information and other features such as: - Sales ($billion) - Market Value ($billion) - and more.
- Analyze Global Rank in relation to Profits ($billion)
- Study the influence of Assets ($billion) on Sales ($billion)
- More datasets
If you use this dataset in your research, please credit Finance
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for EASE OF DOING BUSINESS INDEX 1 MOST BUSINESS reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A comprehensive dataset containing crowdsourced rankings of nearly all ski resorts worldwide. The dataset includes detailed information on each resort, such as location, snowfall, number of lifts and slopes, total slope length, and vertical drop. The dataset is updated regularly as more votes are collected.
https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSO20g5cBn_b3UvD4HrPSKMrujGXq8LfT2NQP3LC3F3k8ufSV6TP97l7Har-625Bju08bc&usqp=CAU" alt="File:Yahoo Finance Logo 2013.svg - Wikipedia">
Yahoo! Finance is a media property that is part of the Yahoo! network. It provides financial news, data and commentary including stock quotes, press releases, financial reports, and original content. It also offers some online tools for personal finance management. In addition to posting partner content from other web sites, it posts original stories by its team of staff journalists. It is ranked 20th by Similar Web on the list of largest news and media websites.
###
python
1.Content:
2.Symbol:
3.Name:
4.Price:
5.Volume:
6.Market cap:
7.P/E ratio:
The data is sourced from Yahoo Finance and is updated daily, providing users with the most up-to-date financial information for each company listed.
The dataset is suitable for anyone interested in analyzing or predicting stock market trends and is particularly useful for financial analysts, investors, and traders.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The company which provided the dataset is the world leader in manufacturing of construction and mining equipment, diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives. The current revenue of the company is estimated to be on the order of tens of billions and they sell products and parts via a worldwide dealer network. The company sells more than 3 million products and 700,000 parts in more than 20 countries around the world every year. They operate with more than 3,000 suppliers and 3,000 dealerships and their logistics operations alone are worth more than 60 million dollars per year. The dataset provided is one example of supply chain problem for one product of the company - a medium size excavator. In the current dataset, the number of dealers, production facilities and shipping ports is the same as in the original problem; it is only the demand figures, the production capacities, the transportation times and costs and the sale prices that have been randomly generated. The figures have been randomly generated in an interval between 0 and an upper limit which is a random increase over the maximum value in the original data, according to a negative exponential distribution.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global AI training dataset market size was valued at approximately USD 1.2 billion in 2023 and is projected to reach USD 6.5 billion by 2032, growing at a compound annual growth rate (CAGR) of 20.5% from 2024 to 2032. This substantial growth is driven by the increasing adoption of artificial intelligence across various industries, the necessity for large-scale and high-quality datasets to train AI models, and the ongoing advancements in AI and machine learning technologies.
One of the primary growth factors in the AI training dataset market is the exponential increase in data generation across multiple sectors. With the proliferation of internet usage, the expansion of IoT devices, and the digitalization of industries, there is an unprecedented volume of data being generated daily. This data is invaluable for training AI models, enabling them to learn and make more accurate predictions and decisions. Moreover, the need for diverse and comprehensive datasets to improve AI accuracy and reliability is further propelling market growth.
Another significant factor driving the market is the rising investment in AI and machine learning by both public and private sectors. Governments around the world are recognizing the potential of AI to transform economies and improve public services, leading to increased funding for AI research and development. Simultaneously, private enterprises are investing heavily in AI technologies to gain a competitive edge, enhance operational efficiency, and innovate new products and services. These investments necessitate high-quality training datasets, thereby boosting the market.
The proliferation of AI applications in various industries, such as healthcare, automotive, retail, and finance, is also a major contributor to the growth of the AI training dataset market. In healthcare, AI is being used for predictive analytics, personalized medicine, and diagnostic automation, all of which require extensive datasets for training. The automotive industry leverages AI for autonomous driving and vehicle safety systems, while the retail sector uses AI for personalized shopping experiences and inventory management. In finance, AI assists in fraud detection and risk management. The diverse applications across these sectors underline the critical need for robust AI training datasets.
As the demand for AI applications continues to grow, the role of Ai Data Resource Service becomes increasingly vital. These services provide the necessary infrastructure and tools to manage, curate, and distribute datasets efficiently. By leveraging Ai Data Resource Service, organizations can ensure that their AI models are trained on high-quality and relevant data, which is crucial for achieving accurate and reliable outcomes. The service acts as a bridge between raw data and AI applications, streamlining the process of data acquisition, annotation, and validation. This not only enhances the performance of AI systems but also accelerates the development cycle, enabling faster deployment of AI-driven solutions across various sectors.
Regionally, North America currently dominates the AI training dataset market due to the presence of major technology companies and extensive R&D activities in the region. However, Asia Pacific is expected to witness the highest growth rate during the forecast period, driven by rapid technological advancements, increasing investments in AI, and the growing adoption of AI technologies across various industries in countries like China, India, and Japan. Europe and Latin America are also anticipated to experience significant growth, supported by favorable government policies and the increasing use of AI in various sectors.
The data type segment of the AI training dataset market encompasses text, image, audio, video, and others. Each data type plays a crucial role in training different types of AI models, and the demand for specific data types varies based on the application. Text data is extensively used in natural language processing (NLP) applications such as chatbots, sentiment analysis, and language translation. As the use of NLP is becoming more widespread, the demand for high-quality text datasets is continually rising. Companies are investing in curated text datasets that encompass diverse languages and dialects to improve the accuracy and efficiency of NLP models.
Image data is critical for computer vision application
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
About
We provide a comprehensive talking-head video dataset with over 50,000 videos, totaling more than 500 hours of footage and featuring 23,841 unique identities from around the world.
Distribution
Detailing the format, size, and structure of the dataset: Data Volume: -Total Size: 2.5TB
-Total Videos: 47,200
-Identities Covered: 23,000
-Resolution: 60% 4k(1980), 33% fullHD(1080)
-Formats: MP4
-Full-length videos with visible mouth movements in every frame.
-Minimum face size of 400 pixels.
-Video durations range from 20 seconds to 5 minutes.
-Faces have not been cut out, full screen videos including backgrounds.
Usage
This dataset is ideal for a variety of applications:
Face Recognition & Verification: Training and benchmarking facial recognition models.
Action Recognition: Identifying human activities and behaviors.
Re-Identification (Re-ID): Tracking identities across different videos and environments.
Deepfake Detection: Developing methods to detect manipulated videos.
Generative AI: Training high-resolution video generation models.
Lip Syncing Applications: Enhancing AI-driven lip-syncing models for dubbing and virtual avatars.
Background AI Applications: Developing AI models for automated background replacement, segmentation, and enhancement.
Coverage
Explaining the scope and coverage of the dataset:
Geographic Coverage: Worldwide
Time Range: Time range and size of the videos have been noted in the CSV file.
Demographics: Includes information about age, gender, ethnicity, format, resolution, and file size.
Languages Covered (Videos):
English: 23,038 videos
Portuguese: 1,346 videos
Spanish: 677 videos
Norwegian: 1,266 videos
Swedish: 1,056 videos
Korean: 848 videos
Polish: 1,807 videos
Indonesian: 1,163 videos
French: 1,102 videos
German: 1,276 videos
Japanese: 1,433 videos
Dutch: 1,666 videos
Indian: 1,163 videos
Czech: 590 videos
Chinese: 685 videos
Italian: 975 videos
Who Can Use It
List examples of intended users and their use cases:
Data Scientists: Training machine learning models for video-based AI applications.
Researchers: Studying human behavior, facial analysis, or video AI advancements.
Businesses: Developing facial recognition systems, video analytics, or AI-driven media applications.
Additional Notes
Ensure ethical usage and compliance with privacy regulations. The datasetās quality and scale make it valuable for high-performance AI training. Potential preprocessing (cropping, down sampling) may be needed for different use cases. Dataset has not been completed yet and expands daily, please contact for most up to date CSV file. The dataset has been divided into 100GB zipped files and is hosted on a private server (with the option to upload to the cloud if needed). To verify the dataset's quality, please contact me for the full CSV file. Iād be happy to provide example videos selected by the potential buyer.
By Arthur Keen [source]
This dataset contains the top 100 global banks ranked by total assets on December 31, 2017. With a detailed list of key information for each bank's rank, country, balance sheet and US Total Assets (in billions), this data will be invaluable for those looking to research and study the current status of some of the world's leading financial organizations. From billion-dollar mega-banks such as JP Morgan Chase to small, local savings & loans institutions like BancorpSouth; this comprehensive overview allows researchers and analysts to gain a better understanding of who holds power in the world economy today
For more datasets, click here.
- šØ Your notebook can be here! šØ!
This dataset contains the rank and total asset information of the top 100 global banks as of December 31, 2017. It is a useful resource for researchers who wish to study how key financial institutions' asset information relate to each other across countries.
Using this dataset is relatively straightforward ā it consists of three columns - rank (the order in which each bank appears in the list), country (the country in which the bank is located) and total assets US billions (the total value expressed in US dollars). Additionally, there is a fourth column containing the balance sheet information for each bank as well.
In order to make full use of this dataset, one should analyse it by creating comparison grids based on different factors such as region, size or ownership structures. This can provide an interesting insight into how financial markets are structured within different economies and allow researchers to better understand some banking sector dynamics that are particularly relevant for certain countries or regions. Additionally, one can compare any two banks side-by-side using their respective balance sheets or distribution plot graphs based on size or concentration metrics by leverage or other financial ratios as well.
Overall, this dataset provides useful resources that can be put into practice through data visualization making an interesting reference point for trends analysis and forecasting purposes focusing on certain banking activities worldwide
Analyzing the differences in total assets across countries. By comparing and contrasting data, patterns could be found that give insight into the factors driving differences in banksā assets between different markets.
Using predictive models to identify which banks are more likely to perform better based on their balance sheet data, such as by predicting future profits or cashflows of said banks.
Leveraging the information on holdings and investments of ātop-rankedā banks as a guide for personal investments decisions or informing investment strategies of large financial institutions or hedge funds
If you use this dataset in your research, please credit the original authors. Data Source
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: top50banks2017-03-31.csv | Column name | Description | |:----------------------|:------------------------------------------------------------------------| | rank | The rank of the bank globally based on total assets. (Integer) | | country | The country where the bank is located. (String) | | total_assets_us_b | The total assets of a bank expressed in billions of US dollars. (Float) | | balance_sheet | A snapshot of banking activities for a specific date. (Date) |
File: top100banks2017-12-31.csv | Column name | Description | |:----------------------|:--------------------------------------------...