100+ datasets found
  1. Bank Rankings by Total Assets

    • kaggle.com
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2022). Bank Rankings by Total Assets [Dataset]. https://www.kaggle.com/datasets/thedevastator/global-banking-rankings-by-total-assets-2017-12
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2022
    Dataset provided by
    Kaggle
    Authors
    The Devastator
    Description

    Bank Rankings by Total Assets

    Tracking the Financial Performance of the Top Banks

    By Arthur Keen [source]

    About this dataset

    This dataset contains the top 100 global banks ranked by total assets on December 31, 2017. With a detailed list of key information for each bank's rank, country, balance sheet and US Total Assets (in billions), this data will be invaluable for those looking to research and study the current status of some of the world's leading financial organizations. From billion-dollar mega-banks such as JP Morgan Chase to small, local savings & loans institutions like BancorpSouth; this comprehensive overview allows researchers and analysts to gain a better understanding of who holds power in the world economy today

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset contains the rank and total asset information of the top 100 global banks as of December 31, 2017. It is a useful resource for researchers who wish to study how key financial institutions' asset information relate to each other across countries.

    Using this dataset is relatively straightforward – it consists of three columns - rank (the order in which each bank appears in the list), country (the country in which the bank is located) and total assets US billions (the total value expressed in US dollars). Additionally, there is a fourth column containing the balance sheet information for each bank as well.

    In order to make full use of this dataset, one should analyse it by creating comparison grids based on different factors such as region, size or ownership structures. This can provide an interesting insight into how financial markets are structured within different economies and allow researchers to better understand some banking sector dynamics that are particularly relevant for certain countries or regions. Additionally, one can compare any two banks side-by-side using their respective balance sheets or distribution plot graphs based on size or concentration metrics by leverage or other financial ratios as well.

    Overall, this dataset provides useful resources that can be put into practice through data visualization making an interesting reference point for trends analysis and forecasting purposes focusing on certain banking activities worldwide

    Research Ideas

    • Analyzing the differences in total assets across countries. By comparing and contrasting data, patterns could be found that give insight into the factors driving differences in banks’ assets between different markets.

    • Using predictive models to identify which banks are more likely to perform better based on their balance sheet data, such as by predicting future profits or cashflows of said banks.

    • Leveraging the information on holdings and investments of ā€œtop-rankedā€ banks as a guide for personal investments decisions or informing investment strategies of large financial institutions or hedge funds

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

    Columns

    File: top50banks2017-03-31.csv | Column name | Description | |:----------------------|:------------------------------------------------------------------------| | rank | The rank of the bank globally based on total assets. (Integer) | | country | The country where the bank is located. (String) | | total_assets_us_b | The total assets of a bank expressed in billions of US dollars. (Float) | | balance_sheet | A snapshot of banking activities for a specific date. (Date) |

    File: top100banks2017-12-31.csv | Column name | Description | |:----------------------|:--------------------------------------------...

  2. d

    Coresignal | Clean Data | Company Data | AI-Enriched Datasets | Global /...

    • datarade.ai
    .json, .csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coresignal, Coresignal | Clean Data | Company Data | AI-Enriched Datasets | Global / 35M+ Records / Updated Weekly [Dataset]. https://datarade.ai/data-products/coresignal-clean-data-company-data-ai-enriched-datasets-coresignal
    Explore at:
    .json, .csvAvailable download formats
    Dataset authored and provided by
    Coresignal
    Area covered
    Hungary, Niue, Panama, Guatemala, Chile, Saint BarthƩlemy, Guinea-Bissau, Andorra, Guadeloupe, Namibia
    Description

    This clean dataset is a refined version of our company datasets, consisting of 35M+ data records.

    It’s an excellent data solution for companies with limited data engineering capabilities and those who want to reduce their time to value. You get filtered, cleaned, unified, and standardized B2B data. After cleaning, this data is also enriched by leveraging a carefully instructed large language model (LLM).

    AI-powered data enrichment offers more accurate information in key data fields, such as company descriptions. It also produces over 20 additional data points that are very valuable to B2B businesses. Enhancing and highlighting the most important information in web data contributes to quicker time to value, making data processing much faster and easier.

    For your convenience, you can choose from multiple data formats (Parquet, JSON, JSONL, or CSV) and select suitable delivery frequency (quarterly, monthly, or weekly).

    Coresignal is a leading public business data provider in the web data sphere with an extensive focus on firmographic data and public employee profiles. More than 3B data records in different categories enable companies to build data-driven products and generate actionable insights. Coresignal is exceptional in terms of data freshness, with 890M+ records updated monthly for unprecedented accuracy and relevance.

  3. o

    LinkedIn company information

    • opendatabay.com
    .undefined
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2025). LinkedIn company information [Dataset]. https://www.opendatabay.com/data/premium/bd1786ac-7b2e-45e3-957b-f98ebd46181c
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    May 23, 2025
    Dataset authored and provided by
    Bright Data
    Area covered
    Social Media and Networking
    Description

    LinkedIn companies use datasets to access public company data for machine learning, ecosystem mapping, and strategic decisions. Popular use cases include competitive analysis, CRM enrichment, and lead generation.

    Use our LinkedIn Companies Information dataset to access comprehensive data on companies worldwide, including business size, industry, employee profiles, and corporate activity. This dataset provides key company insights, organizational structure, and competitive landscape, tailored for market researchers, HR professionals, business analysts, and recruiters.

    Leverage the LinkedIn Companies dataset to track company growth, analyze industry trends, and refine your recruitment strategies. By understanding company dynamics and employee movements, you can optimize sourcing efforts, enhance business development opportunities, and gain a strategic edge in your market. Stay informed and make data-backed decisions with this essential resource for understanding global company ecosystems.

    Dataset Features

    • timestamp: Represents the date and time when the company data was collected.
    • id: Unique identifier for each company in the dataset.
    • company_id: Identifier linking the company to an external database or internal system.
    • url: Website or URL for more information about the company.
    • name: The name of the company.
    • about: Brief description of the company.
    • description: More detailed information about the company's operations and offerings.
    • organization_type: Type of the organization (e.g., private, public).
    • industries: List of industries the company operates in.
    • followers: Number of followers on the company's platform.
    • headquarters: Location of the company's headquarters.
    • country_code: Code for the country where the company is located.
    • country_codes_array: List of country codes associated with the company (may represent various locations or markets).
    • locations: Locations where the company operates.
    • get_directions_url: URL to get directions to the company's location(s).
    • formatted_locations: Human-readable format of the company's locations.
    • website: The official website of the company.
    • website_simplified: A simplified version of the company's website URL.
    • company_size: Number of employees or company size.
    • employees_in_linkedin: Number of employees listed on LinkedIn.
    • employees: URL of employees.
    • specialties: List of the company’s specializations or services.
    • updates: Recent updates or news related to the company.
    • crunchbase_url: Link to the company’s profile on Crunchbase.
    • founded: Year when the company was founded.
    • funding: Information on funding rounds or financial data.
    • investors: Investors who have funded the company.
    • alumni: Notable alumni from the company.
    • alumni_information: Details about the alumni, their roles, or achievements.
    • stock_info: Stock market information for publicly traded companies.
    • affiliated: Companies or organizations affiliated with the company.
    • image: Image representing the company.
    • logo: URL of the official logo of the company.
    • slogan: Company’s slogan or tagline.
    • similar: URL of companies similar to this one.

    Distribution

    • Data Volume: 56.51M rows and 35 columns.
    • Structure: Tabular format (CSV, Excel).

    Usage

    This dataset is ideal for:
    - Market Research: Identifying key trends and patterns across different industries and geographies.
    - Business Development: Analyzing potential partners, competitors, or customers.
    - Investment Analysis: Assessing investment potential based on company size, funding, and industries.
    - Recruitment & Talent Analytics: Understanding the workforce size and specialties of various companies.

    Coverage

    • Geographic Coverage: Global, with company locations and headquarters spanning multiple countries.
    • Time Range: Data likely covers both current and historical information about companies.
    • Demographics: Focuses on company attributes rather than demographics, but may contain information about the company's workforce.

    License

    CUSTOM

    Please review the respective licenses below:

    1. Data Provider's License

    Who Can Use It

    • Data Scientists: For building models, conducting research, or enhancing machine learning algorithms with business data.
    • Researchers: For academic analysis in fields like economics, business, or technology.
    • Businesses: For analysis, competitive benchmarking, and strategic development.
    • Investors: For identifying and evaluating potential investment opportunities.

    Dataset Name Ideas

    • Global Company Profile Database
    • **Business Intellige
  4. Company Financial Data | Private & Public Companies | Verified Profiles &...

    • datarade.ai
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai, Company Financial Data | Private & Public Companies | Verified Profiles & Contact Data | Best Price Guaranteed [Dataset]. https://datarade.ai/data-products/b2b-contact-data-premium-us-contact-data-us-b2b-contact-d-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset provided by
    Area covered
    Togo, Guam, Suriname, Antigua and Barbuda, Dominican Republic, United Kingdom, Montserrat, Iceland, Georgia, Korea (Democratic People's Republic of)
    Description

    Success.ai offers a cutting-edge solution for businesses and organizations seeking Company Financial Data on private and public companies. Our comprehensive database is meticulously crafted to provide verified profiles, including contact details for financial decision-makers such as CFOs, financial analysts, corporate treasurers, and other key stakeholders. This robust dataset is continuously updated and validated using AI technology to ensure accuracy and relevance, empowering businesses to make informed decisions and optimize their financial strategies.

    Key Features of Success.ai's Company Financial Data:

    Global Coverage: Access data from over 70 million businesses worldwide, including public and private companies across all major industries and regions. Our datasets span 250+ countries, offering extensive reach for your financial analysis and market research.

    Detailed Financial Profiles: Gain insights into company financials, including revenue, profit margins, funding rounds, and operational costs. Profiles are enriched with key contact details, including work emails, phone numbers, and physical addresses, ensuring direct access to decision-makers.

    Industry-Specific Data: Tailored datasets for sectors such as financial services, manufacturing, technology, healthcare, and energy, among others. Each dataset is customized to meet the unique needs of industry professionals and analysts.

    Real-Time Accuracy: With continuous updates powered by AI-driven validation, our financial data maintains a 99% accuracy rate, ensuring you have access to the most reliable and up-to-date information available.

    Compliance and Security: All data is collected and processed in strict adherence to global compliance standards, including GDPR, ensuring ethical and lawful usage.

    Why Choose Success.ai for Company Financial Data?

    Best Price Guarantee: We pride ourselves on offering the most competitive pricing in the industry, ensuring you receive unparalleled value for comprehensive financial data.

    AI-Validated Accuracy: Our advanced AI algorithms meticulously verify every data point to ensure precision and reliability, helping you avoid costly errors in your financial decision-making.

    Customized Data Solutions: Whether you need data for a specific region, industry, or type of business, we tailor our datasets to align perfectly with your requirements.

    Scalable Data Access: From small startups to global enterprises, our platform caters to businesses of all sizes, delivering scalable solutions to suit your operational needs.

    Comprehensive Use Cases for Financial Data:

    1. Strategic Financial Planning:

    Leverage our detailed financial profiles to create accurate budgets, forecasts, and strategic plans. Gain insights into competitors’ financial health and market positions to make data-driven decisions.

    1. Mergers and Acquisitions (M&A):

    Access key financial details and contact information to streamline your M&A processes. Identify potential acquisition targets or partners with verified profiles and financial data.

    1. Investment Analysis:

    Evaluate the financial performance of public and private companies for informed investment decisions. Use our data to identify growth opportunities and assess risk factors.

    1. Lead Generation and Sales:

    Enhance your sales outreach by targeting CFOs, financial analysts, and other decision-makers with verified contact details. Utilize accurate email and phone data to increase conversion rates.

    1. Market Research:

    Understand market trends and financial benchmarks with our industry-specific datasets. Use the data for competitive analysis, benchmarking, and identifying market gaps.

    APIs to Power Your Financial Strategies:

    Enrichment API: Integrate real-time updates into your systems with our Enrichment API. Keep your financial data accurate and current to drive dynamic decision-making and maintain a competitive edge.

    Lead Generation API: Supercharge your lead generation efforts with access to verified contact details for key financial decision-makers. Perfect for personalized outreach and targeted campaigns.

    Tailored Solutions for Industry Professionals:

    Financial Services Firms: Gain detailed insights into revenue streams, funding rounds, and operational costs for competitor analysis and client acquisition.

    Corporate Finance Teams: Enhance decision-making with precise data on industry trends and benchmarks.

    Consulting Firms: Deliver informed recommendations to clients with access to detailed financial datasets and key stakeholder profiles.

    Investment Firms: Identify potential investment opportunities with verified data on financial performance and market positioning.

    What Sets Success.ai Apart?

    Extensive Database: Access detailed financial data for 70M+ companies worldwide, including small businesses, startups, and large corporations.

    Ethical Practices: Our data collection and processing methods are fully comp...

  5. n

    Dataset of development of business during the COVID-19 crisis

    • narcis.nl
    • data.mendeley.com
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Litvinova, T (via Mendeley Data) (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Dataset provided by
    Data Archiving and Networked Services (DANS)
    Authors
    Litvinova, T (via Mendeley Data)
    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  6. d

    US Restaurant POI dataset with metadata

    • datarade.ai
    .csv
    Updated Jul 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Geolytica (2022). US Restaurant POI dataset with metadata [Dataset]. https://datarade.ai/data-products/us-restaurant-poi-dataset-with-metadata-geolytica
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jul 30, 2022
    Dataset authored and provided by
    Geolytica
    Area covered
    United States of America
    Description

    Point of Interest (POI) is defined as an entity (such as a business) at a ground location (point) which may be (of interest). We provide high-quality POI data that is fresh, consistent, customizable, easy to use and with high-density coverage for all countries of the world.

    This is our process flow:

    Our machine learning systems continuously crawl for new POI data
    Our geoparsing and geocoding calculates their geo locations
    Our categorization systems cleanup and standardize the datasets
    Our data pipeline API publishes the datasets on our data store
    

    A new POI comes into existence. It could be a bar, a stadium, a museum, a restaurant, a cinema, or store, etc.. In today's interconnected world its information will appear very quickly in social media, pictures, websites, press releases. Soon after that, our systems will pick it up.

    POI Data is in constant flux. Every minute worldwide over 200 businesses will move, over 600 new businesses will open their doors and over 400 businesses will cease to exist. And over 94% of all businesses have a public online presence of some kind tracking such changes. When a business changes, their website and social media presence will change too. We'll then extract and merge the new information, thus creating the most accurate and up-to-date business information dataset across the globe.

    We offer our customers perpetual data licenses for any dataset representing this ever changing information, downloaded at any given point in time. This makes our company's licensing model unique in the current Data as a Service - DaaS Industry. Our customers don't have to delete our data after the expiration of a certain "Term", regardless of whether the data was purchased as a one time snapshot, or via our data update pipeline.

    Customers requiring regularly updated datasets may subscribe to our Annual subscription plans. Our data is continuously being refreshed, therefore subscription plans are recommended for those who need the most up to date data. The main differentiators between us vs the competition are our flexible licensing terms and our data freshness.

    Data samples may be downloaded at https://store.poidata.xyz/us

  7. Global Startup Success Dataset

    • kaggle.com
    Updated Mar 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hamna Kaleem (2025). Global Startup Success Dataset [Dataset]. https://www.kaggle.com/datasets/hamnakaleemds/global-startup-success-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hamna Kaleem
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    šŸ“Š Dataset Features This dataset includes 5,000 startups from 10 countries and contains 15 key features: Startup Name: Name of the startup Founded Year: Year the startup was founded Country: Country where the startup is based Industry: Industry category (Tech, FinTech, AI, etc.) Funding Stage: Stage of investment (Seed, Series A, etc.) Total Funding ($M): Total funding received (in million $) Number of Employees: Number of employees in the startup Annual Revenue ($M): Annual revenue in million dollars Valuation ($B): Startup's valuation in billion dollars Success Score: Score from 1 to 10 based on growth Acquired?: Whether the startup was acquired (Yes/No) IPO?: Did the startup go public? (Yes/No) Customer Base (Millions): Number of active customers Tech Stack: Technologies used by the startup Social Media Followers: Total followers on social platforms Analysis Ideas šŸ“ˆ What Can You Do with This Dataset? Here are some exciting analyses you can perform:

    Predict Startup Success: Train a machine learning model to predict the success score. Industry Trends: Analyze which industries get the most funding. **Valuation vs. Funding: **Explore the correlation between funding and valuation. Acquisition Analysis: Investigate the factors that contribute to startups being acquired.

  8. Success.ai | LinkedIn Full Dataset – 700M Public Profiles & 70M Companies –...

    • datarade.ai
    Updated Jan 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2022). Success.ai | LinkedIn Full Dataset – 700M Public Profiles & 70M Companies – Global Dataset – Best Price and Quality Guarantee [Dataset]. https://datarade.ai/data-products/success-ai-linkedin-full-dataset-700m-public-profiles-7-success-ai
    Explore at:
    .json, .csv, .bin, .xml, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 1, 2022
    Dataset provided by
    Area covered
    Mali, Bulgaria, Equatorial Guinea, Holy See, Jamaica, Cambodia, Tajikistan, Anguilla, Samoa, Norfolk Island
    Description

    Success.ai’s LinkedIn Data Solutions offer unparalleled access to a vast dataset of 700 million public LinkedIn profiles and 70 million LinkedIn company records, making it one of the most comprehensive and reliable LinkedIn datasets available on the market today. Our employee data and LinkedIn data are ideal for businesses looking to streamline recruitment efforts, build highly targeted lead lists, or develop personalized B2B marketing campaigns.

    Whether you’re looking for recruiting data, conducting investment research, or seeking to enrich your CRM systems with accurate and up-to-date LinkedIn profile data, Success.ai provides everything you need with pinpoint precision. By tapping into LinkedIn company data, you’ll have access to over 40 critical data points per profile, including education, professional history, and skills.

    Key Benefits of Success.ai’s LinkedIn Data: Our LinkedIn data solution offers more than just a dataset. With GDPR-compliant data, AI-enhanced accuracy, and a price match guarantee, Success.ai ensures you receive the highest-quality data at the best price in the market. Our datasets are delivered in Parquet format for easy integration into your systems, and with millions of profiles updated daily, you can trust that you’re always working with fresh, relevant data.

    Global Reach and Industry Coverage: Our LinkedIn data covers professionals across all industries and sectors, providing you with detailed insights into businesses around the world. Our geographic coverage spans 259M profiles in the United States, 22M in the United Kingdom, 27M in India, and thousands of profiles in regions such as Europe, Latin America, and Asia Pacific. With LinkedIn company data, you can access profiles of top companies from the United States (6M+), United Kingdom (2M+), and beyond, helping you scale your outreach globally.

    Why Choose Success.ai’s LinkedIn Data: Success.ai stands out for its tailored approach and white-glove service, making it easy for businesses to receive exactly the data they need without managing complex data platforms. Our dedicated Success Managers will curate and deliver your dataset based on your specific requirements, so you can focus on what matters most—reaching the right audience. Whether you’re sourcing employee data, LinkedIn profile data, or recruiting data, our service ensures a seamless experience with 99% data accuracy.

    • Best Price Guarantee: We offer unbeatable pricing on LinkedIn data, and we’ll match any competitor.
    • Global Scale: Access 700 million LinkedIn profiles and 70 million company records globally.
    • AI-Verified Accuracy: Enjoy 99% data accuracy through our advanced AI and manual validation processes.
    • Real-Time Data: Profiles are updated daily, ensuring you always have the most relevant insights.
    • Tailored Solutions: Get custom-curated LinkedIn data delivered directly, without managing platforms.
    • Ethically Sourced Data: Compliant with global privacy laws, ensuring responsible data usage.
    • Comprehensive Profiles: Over 40 data points per profile, including job titles, skills, and company details.
    • Wide Industry Coverage: Covering sectors from tech to finance across regions like the US, UK, Europe, and Asia.

    Key Use Cases:

    • Sales Prospecting and Lead Generation: Build targeted lead lists using LinkedIn company data and professional profiles, helping sales teams engage decision-makers at high-value accounts.
    • Recruitment and Talent Sourcing: Use LinkedIn profile data to identify and reach top candidates globally. Our employee data includes work history, skills, and education, providing all the details you need for successful recruitment.
    • Account-Based Marketing (ABM): Use our LinkedIn company data to tailor marketing campaigns to key accounts, making your outreach efforts more personalized and effective.
    • Investment Research & Due Diligence: Identify companies with strong growth potential using LinkedIn company data. Access key data points such as funding history, employee count, and company trends to fuel investment decisions.
    • Competitor Analysis: Stay ahead of your competition by tracking hiring trends, employee movement, and company growth through LinkedIn data. Use these insights to adjust your market strategy and improve your competitive positioning.
    • CRM Data Enrichment: Enhance your CRM systems with real-time updates from Success.ai’s LinkedIn data, ensuring that your sales and marketing teams are always working with accurate and up-to-date information.
    • Comprehensive Data Points for LinkedIn Profiles: Our LinkedIn profile data includes over 40 key data points for every individual and company, ensuring a complete understanding of each contact:

    LinkedIn URL: Access direct links to LinkedIn profiles for immediate insights. Full Name: Verified first and last names. Job Title: Current job titles, and prior experience. Company Information: Company name, LinkedIn URL, domain, and location. Work and Per...

  9. A

    ā€˜Brazil Largest Companies’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Aug 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ā€˜Brazil Largest Companies’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-brazil-largest-companies-41a1/latest
    Explore at:
    Dataset updated
    Aug 4, 2020
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Brazil
    Description

    Analysis of ā€˜Brazil Largest Companies’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/brazil-largest-companiese on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    About this dataset

    From the Forbes Global 2000 list​ last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. ​The Forbes Global 2000 weigh​s​ sales, profits, assets and market value​ equally​ so companies can be ranked by size. Figures for all companies are in US dollars.

    ​Source: Economy Watch

    This dataset was created by Finance and contains around 0 samples along with Assets ($billion), Sales ($billion), technical information and other features such as: - Profits ($billion) - Market Value ($billion) - and more.

    How to use this dataset

    • Analyze Assets ($billion) in relation to Sales ($billion)
    • Study the influence of Profits ($billion) on Market Value ($billion)
    • More datasets

    Acknowledgements

    If you use this dataset in your research, please credit Finance

    Start A New Notebook!

    --- Original source retains full ownership of the source dataset ---

  10. A

    ā€˜Greece Largest Companies’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Apr 2, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2017). ā€˜Greece Largest Companies’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-greece-largest-companies-e639/latest
    Explore at:
    Dataset updated
    Apr 2, 2017
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Greece
    Description

    Analysis of ā€˜Greece Largest Companies’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/greece-largest-companiese on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    About this dataset

    From the Forbes Global 2000 list​ last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. ​The Forbes Global 2000 weigh​s​ sales, profits, assets and market value​ equally​ so companies can be ranked by size. Figures for all companies are in US dollars.

    ​Source: Economy Watch

    This dataset was created by Finance and contains around 0 samples along with Market Value ($billion), Assets ($billion), technical information and other features such as: - Profits ($billion) - Market Value ($billion) - and more.

    How to use this dataset

    • Analyze Assets ($billion) in relation to Profits ($billion)
    • Study the influence of Market Value ($billion) on Assets ($billion)
    • More datasets

    Acknowledgements

    If you use this dataset in your research, please credit Finance

    Start A New Notebook!

    --- Original source retains full ownership of the source dataset ---

  11. d

    Global Company Funding Data | 65M+ Records| Bi-Weekly Updates

    • datarade.ai
    .json, .csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Forager.ai, Global Company Funding Data | 65M+ Records| Bi-Weekly Updates [Dataset]. https://datarade.ai/data-products/global-company-funding-data-61m-records-with-historical-ins-forager-ai
    Explore at:
    .json, .csvAvailable download formats
    Dataset provided by
    Forager.ai
    Area covered
    Heard Island and McDonald Islands, Namibia, Honduras, United Arab Emirates, Malaysia, Eritrea, Malawi, Australia, Luxembourg, Sierra Leone
    Description

    šŸŒ Worldwide B2B Company Dataset | 65M+ Verified Records | Firmographics & API Access Power your sales, marketing, and investment strategies with the most comprehensive global B2B company data—verified, AI-driven, and updated bi-weekly.

    The Forager.ai Global Company Dataset delivers 65M+ high-quality firmographic records, covering public and private companies worldwide. Leveraging AI-powered validation and bi-weekly updates, our dataset ensures accuracy, freshness, and depth—making it ideal for sales intelligence, market analysis, and CRM enrichment.

    šŸ“Š Key Features & Coverage āœ… 65M+ Company Records – The largest, most reliable B2B firmographic dataset available. āœ… Bi-Weekly Updates – Stay ahead with refreshed data every two weeks. āœ… AI-Driven Accuracy – Sophisticated algorithms verify and enrich every record. āœ… Global Coverage – Companies across North America, Europe, APAC, and emerging markets.

    šŸ“‹ Core Data Fields: āœ” Company Name, LinkedIn URL, & Domain āœ” Industries āœ” Job postings, Revenue, Employee Size, Funding Status āœ” Location (HQ + Regional Offices) āœ” Tech Stack & Firmographic Signals āœ” LinkedIn Profile details

    šŸŽÆ Top Use Cases šŸ”¹ Sales & Lead Generation

    Build targeted prospect lists using firmographics (size, industry, revenue).

    Enhance lead scoring with technographic insights.

    šŸ”¹ Market & Competitive Intelligence

    Track company growth, expansions, and trends.

    Benchmark competitors using real-time private company data.

    šŸ”¹ Venture Capital & Private Equity

    Discover investment opportunities with granular sector-level insights.

    Monitor portfolio companies and industry shifts.

    šŸ”¹ ABM & Marketing Automation

    Enrich CRM data for hyper-targeted campaigns.

    Power intent data and predictive analytics.

    ⚔ Delivery & Integration Choose the best method for your workflow:

    REST API – Real-time access for developers.

    Flat Files (CSV, JSON) – Delivered via S3, Wasabi, Snowflake.

    Custom Solutions – Scalable enterprise integrations.

    šŸ”’ Data Quality & Compliance 95%+ Field Completeness – Minimize gaps in your analysis.

    Ethically Sourced – Compliant with GDPR, CCPA, and global privacy laws.

    Transparent Licensing – Clear usage terms for peace of mind.

    šŸš€ Why Forager.ai? āœ” AI-Powered Accuracy – Better data, fewer false leads. āœ” Enterprise-Grade Freshness – Bi-weekly updates keep insights relevant. āœ” Flexible Access – API, bulk files, or custom database solutions. āœ” Dedicated Support – Onboarding and SLA-backed assistance.

    Tags: B2B Company Data |LinkedIn Job Postings | Firmographics | Global Business Intelligence | Sales Leads | VC & PE Data | Technographics | CRM Enrichment | API Access | AI-Validated Data

  12. f

    ORBIT: A real-world few-shot dataset for teachable object recognition...

    • city.figshare.com
    bin
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniela Massiceti; Lida Theodorou; Luisa Zintgraf; Matthew Tobias Harris; Simone Stumpf; Cecily Morrison; Edward Cutrell; Katja Hofmann (2023). ORBIT: A real-world few-shot dataset for teachable object recognition collected from people who are blind or low vision [Dataset]. http://doi.org/10.25383/city.14294597.v3
    Explore at:
    binAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    City, University of London
    Authors
    Daniela Massiceti; Lida Theodorou; Luisa Zintgraf; Matthew Tobias Harris; Simone Stumpf; Cecily Morrison; Edward Cutrell; Katja Hofmann
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Object recognition predominately still relies on many high-quality training examples per object category. In contrast, learning new objects from only a few examples could enable many impactful applications from robotics to user personalization. Most few-shot learning research, however, has been driven by benchmark datasets that lack the high variation that these applications will face when deployed in the real-world. To close this gap, we present the ORBIT dataset, grounded in a real-world application of teachable object recognizers for people who are blind/low vision. We provide a full, unfiltered dataset of 4,733 videos of 588 objects recorded by 97 people who are blind/low-vision on their mobile phones, and a benchmark dataset of 3,822 videos of 486 objects collected by 77 collectors. The code for loading the dataset, computing all benchmark metrics, and running the baseline models is available at https://github.com/microsoft/ORBIT-DatasetThis version comprises several zip files:- train, validation, test: benchmark dataset, organised by collector, with raw videos split into static individual frames in jpg format at 30FPS- other: data not in the benchmark set, organised by collector, with raw videos split into static individual frames in jpg format at 30FPS (please note that the train, validation, test, and other files make up the unfiltered dataset)- *_224: as for the benchmark, but static individual frames are scaled down to 224 pixels.- *_unfiltered_videos: full unfiltered dataset, organised by collector, in mp4 format.

  13. d

    The Extended Global Lake area, Climate, and Population Dataset (GLCP)

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Aug 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). The Extended Global Lake area, Climate, and Population Dataset (GLCP) [Dataset]. https://catalog.data.gov/dataset/the-extended-global-lake-area-climate-and-population-dataset-glcp
    Explore at:
    Dataset updated
    Aug 25, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    A changing climate and increasing human population necessitate understanding global freshwater availability and temporal variability. To examine lake freshwater availability from local-to-global and monthly-to-decadal scales, we created the Global Lake area, Climate, and Population (GLCP) dataset, which contains annual lake surface area for 1.42 million lakes with paired annual basin-level climate and population data. Building off an existing data product infrastructure, the next generation of the GLCP includes monthly lake ice area, snow basin area, and more climate variables including specific humidity, longwave and shortwave radiation, as well as cloud cover. The new generation of the GLCP continues previous FAIR data efforts by expanding its scripting repository and maintaining unique relational keys for merging with external data products. Compared to the original version, the new GLCP contains an even richer suite of variables capable of addressing disparate analyses of lake water trends at wide spatial and temporal scales.

  14. A

    ā€˜Germany Largest Companies’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Aug 5, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2017). ā€˜Germany Largest Companies’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-germany-largest-companies-c49e/b48dcbae/?iid=000-848&v=presentation
    Explore at:
    Dataset updated
    Aug 5, 2020
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Germany
    Description

    Analysis of ā€˜Germany Largest Companies’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/germany-largest-companiese on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    About this dataset

    From the Forbes Global 2000 list​ last updated on May 2013. Forbes publishes an annual list of the world's 2000 largest publicly listed corporations. ​The Forbes Global 2000 weigh​s​ sales, profits, assets and market value​ equally​ so companies can be ranked by size. Figures for all companies are in US dollars.

    ​Source: Economy Watch

    This dataset was created by Finance and contains around 0 samples along with Profits ($billion), Assets ($billion), technical information and other features such as: - Sales ($billion) - Market Value ($billion) - and more.

    How to use this dataset

    • Analyze Global Rank in relation to Profits ($billion)
    • Study the influence of Assets ($billion) on Sales ($billion)
    • More datasets

    Acknowledgements

    If you use this dataset in your research, please credit Finance

    Start A New Notebook!

    --- Original source retains full ownership of the source dataset ---

  15. T

    EASE OF DOING BUSINESS INDEX 1 MOST BUSINESS by Country Dataset

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2018). EASE OF DOING BUSINESS INDEX 1 MOST BUSINESS by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/ease-of-doing-business-index-1-most-business-
    Explore at:
    xml, json, excel, csvAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    World
    Description

    This dataset provides values for EASE OF DOING BUSINESS INDEX 1 MOST BUSINESS reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  16. m

    Global Ski Resort Rankings Dataset

    • data.mendeley.com
    • openicpsr.org
    • +1more
    Updated Sep 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teddie Tank (2024). Global Ski Resort Rankings Dataset [Dataset]. http://doi.org/10.17632/5thd2k26sz.1
    Explore at:
    Dataset updated
    Sep 5, 2024
    Authors
    Teddie Tank
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A comprehensive dataset containing crowdsourced rankings of nearly all ski resorts worldwide. The dataset includes detailed information on each resort, such as location, snowfall, number of lifts and slopes, total slope length, and vertical drop. The dataset is updated regularly as more votes are collected.

  17. Yahoo Finance - Industries - Dataset

    • kaggle.com
    Updated May 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Belayet HossainDS (2023). Yahoo Finance - Industries - Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/5678079
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 13, 2023
    Dataset provided by
    Kaggle
    Authors
    Belayet HossainDS
    Description

    https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSO20g5cBn_b3UvD4HrPSKMrujGXq8LfT2NQP3LC3F3k8ufSV6TP97l7Har-625Bju08bc&usqp=CAU" alt="File:Yahoo Finance Logo 2013.svg - Wikipedia">

    Yahoo! Finance is a media property that is part of the Yahoo! network. It provides financial news, data and commentary including stock quotes, press releases, financial reports, and original content. It also offers some online tools for personal finance management. In addition to posting partner content from other web sites, it posts original stories by its team of staff journalists. It is ranked 20th by Similar Web on the list of largest news and media websites.

    Description: This dataset contains financial information for companies listed on major stock exchanges around the world, as provided by Yahoo Finance. The data covers a range of industries and includes key financial metrics such as price, volume, market capitalization, P/E ratio, and more.

    ### python 1.Content: 2.Symbol: 3.Name: 4.Price: 5.Volume: 6.Market cap: 7.P/E ratio:

    The data is sourced from Yahoo Finance and is updated daily, providing users with the most up-to-date financial information for each company listed.

    The dataset is suitable for anyone interested in analyzing or predicting stock market trends and is particularly useful for financial analysts, investors, and traders.

  18. Mining Company's Global Supply Chain - Random Logistics Data for a Medium...

    • figshare.com
    zip
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marco Veluscek; Tatiana Kalganova (2023). Mining Company's Global Supply Chain - Random Logistics Data for a Medium Size Excavator [Dataset]. http://doi.org/10.6084/m9.figshare.1595939.v4
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    figshare
    Authors
    Marco Veluscek; Tatiana Kalganova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The company which provided the dataset is the world leader in manufacturing of construction and mining equipment, diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives. The current revenue of the company is estimated to be on the order of tens of billions and they sell products and parts via a worldwide dealer network. The company sells more than 3 million products and 700,000 parts in more than 20 countries around the world every year. They operate with more than 3,000 suppliers and 3,000 dealerships and their logistics operations alone are worth more than 60 million dollars per year. The dataset provided is one example of supply chain problem for one product of the company - a medium size excavator. In the current dataset, the number of dealers, production facilities and shipping ports is the same as in the original problem; it is only the demand figures, the production capacities, the transportation times and costs and the sale prices that have been randomly generated. The figures have been randomly generated in an interval between 0 and an upper limit which is a random increase over the maximum value in the original data, according to a negative exponential distribution.

  19. AI Training Dataset Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). AI Training Dataset Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-ai-training-dataset-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    AI Training Dataset Market Outlook



    The global AI training dataset market size was valued at approximately USD 1.2 billion in 2023 and is projected to reach USD 6.5 billion by 2032, growing at a compound annual growth rate (CAGR) of 20.5% from 2024 to 2032. This substantial growth is driven by the increasing adoption of artificial intelligence across various industries, the necessity for large-scale and high-quality datasets to train AI models, and the ongoing advancements in AI and machine learning technologies.



    One of the primary growth factors in the AI training dataset market is the exponential increase in data generation across multiple sectors. With the proliferation of internet usage, the expansion of IoT devices, and the digitalization of industries, there is an unprecedented volume of data being generated daily. This data is invaluable for training AI models, enabling them to learn and make more accurate predictions and decisions. Moreover, the need for diverse and comprehensive datasets to improve AI accuracy and reliability is further propelling market growth.



    Another significant factor driving the market is the rising investment in AI and machine learning by both public and private sectors. Governments around the world are recognizing the potential of AI to transform economies and improve public services, leading to increased funding for AI research and development. Simultaneously, private enterprises are investing heavily in AI technologies to gain a competitive edge, enhance operational efficiency, and innovate new products and services. These investments necessitate high-quality training datasets, thereby boosting the market.



    The proliferation of AI applications in various industries, such as healthcare, automotive, retail, and finance, is also a major contributor to the growth of the AI training dataset market. In healthcare, AI is being used for predictive analytics, personalized medicine, and diagnostic automation, all of which require extensive datasets for training. The automotive industry leverages AI for autonomous driving and vehicle safety systems, while the retail sector uses AI for personalized shopping experiences and inventory management. In finance, AI assists in fraud detection and risk management. The diverse applications across these sectors underline the critical need for robust AI training datasets.



    As the demand for AI applications continues to grow, the role of Ai Data Resource Service becomes increasingly vital. These services provide the necessary infrastructure and tools to manage, curate, and distribute datasets efficiently. By leveraging Ai Data Resource Service, organizations can ensure that their AI models are trained on high-quality and relevant data, which is crucial for achieving accurate and reliable outcomes. The service acts as a bridge between raw data and AI applications, streamlining the process of data acquisition, annotation, and validation. This not only enhances the performance of AI systems but also accelerates the development cycle, enabling faster deployment of AI-driven solutions across various sectors.



    Regionally, North America currently dominates the AI training dataset market due to the presence of major technology companies and extensive R&D activities in the region. However, Asia Pacific is expected to witness the highest growth rate during the forecast period, driven by rapid technological advancements, increasing investments in AI, and the growing adoption of AI technologies across various industries in countries like China, India, and Japan. Europe and Latin America are also anticipated to experience significant growth, supported by favorable government policies and the increasing use of AI in various sectors.



    Data Type Analysis



    The data type segment of the AI training dataset market encompasses text, image, audio, video, and others. Each data type plays a crucial role in training different types of AI models, and the demand for specific data types varies based on the application. Text data is extensively used in natural language processing (NLP) applications such as chatbots, sentiment analysis, and language translation. As the use of NLP is becoming more widespread, the demand for high-quality text datasets is continually rising. Companies are investing in curated text datasets that encompass diverse languages and dialects to improve the accuracy and efficiency of NLP models.



    Image data is critical for computer vision application

  20. m

    THVD (Talking Head Video Dataset)

    • data.mendeley.com
    Updated Apr 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mario Peedor (2025). THVD (Talking Head Video Dataset) [Dataset]. http://doi.org/10.17632/ykhw8r7bfx.1
    Explore at:
    Dataset updated
    Apr 2, 2025
    Authors
    Mario Peedor
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Description

    About

    We provide a comprehensive talking-head video dataset with over 50,000 videos, totaling more than 500 hours of footage and featuring 23,841 unique identities from around the world.

    Distribution

    Detailing the format, size, and structure of the dataset: Data Volume: -Total Size: 2.5TB

    -Total Videos: 47,200

    -Identities Covered: 23,000

    -Resolution: 60% 4k(1980), 33% fullHD(1080)

    -Formats: MP4

    -Full-length videos with visible mouth movements in every frame.

    -Minimum face size of 400 pixels.

    -Video durations range from 20 seconds to 5 minutes.

    -Faces have not been cut out, full screen videos including backgrounds.

    Usage

    This dataset is ideal for a variety of applications:

    Face Recognition & Verification: Training and benchmarking facial recognition models.

    Action Recognition: Identifying human activities and behaviors.

    Re-Identification (Re-ID): Tracking identities across different videos and environments.

    Deepfake Detection: Developing methods to detect manipulated videos.

    Generative AI: Training high-resolution video generation models.

    Lip Syncing Applications: Enhancing AI-driven lip-syncing models for dubbing and virtual avatars.

    Background AI Applications: Developing AI models for automated background replacement, segmentation, and enhancement.

    Coverage

    Explaining the scope and coverage of the dataset:

    Geographic Coverage: Worldwide

    Time Range: Time range and size of the videos have been noted in the CSV file.

    Demographics: Includes information about age, gender, ethnicity, format, resolution, and file size.

    Languages Covered (Videos):

    English: 23,038 videos

    Portuguese: 1,346 videos

    Spanish: 677 videos

    Norwegian: 1,266 videos

    Swedish: 1,056 videos

    Korean: 848 videos

    Polish: 1,807 videos

    Indonesian: 1,163 videos

    French: 1,102 videos

    German: 1,276 videos

    Japanese: 1,433 videos

    Dutch: 1,666 videos

    Indian: 1,163 videos

    Czech: 590 videos

    Chinese: 685 videos

    Italian: 975 videos

    Who Can Use It

    List examples of intended users and their use cases:

    Data Scientists: Training machine learning models for video-based AI applications.

    Researchers: Studying human behavior, facial analysis, or video AI advancements.

    Businesses: Developing facial recognition systems, video analytics, or AI-driven media applications.

    Additional Notes

    Ensure ethical usage and compliance with privacy regulations. The dataset’s quality and scale make it valuable for high-performance AI training. Potential preprocessing (cropping, down sampling) may be needed for different use cases. Dataset has not been completed yet and expands daily, please contact for most up to date CSV file. The dataset has been divided into 100GB zipped files and is hosted on a private server (with the option to upload to the cloud if needed). To verify the dataset's quality, please contact me for the full CSV file. I’d be happy to provide example videos selected by the potential buyer.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
The Devastator (2022). Bank Rankings by Total Assets [Dataset]. https://www.kaggle.com/datasets/thedevastator/global-banking-rankings-by-total-assets-2017-12
Organization logo

Bank Rankings by Total Assets

Tracking the Financial Performance of the Top Banks

Explore at:
8 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 6, 2022
Dataset provided by
Kaggle
Authors
The Devastator
Description

Bank Rankings by Total Assets

Tracking the Financial Performance of the Top Banks

By Arthur Keen [source]

About this dataset

This dataset contains the top 100 global banks ranked by total assets on December 31, 2017. With a detailed list of key information for each bank's rank, country, balance sheet and US Total Assets (in billions), this data will be invaluable for those looking to research and study the current status of some of the world's leading financial organizations. From billion-dollar mega-banks such as JP Morgan Chase to small, local savings & loans institutions like BancorpSouth; this comprehensive overview allows researchers and analysts to gain a better understanding of who holds power in the world economy today

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset contains the rank and total asset information of the top 100 global banks as of December 31, 2017. It is a useful resource for researchers who wish to study how key financial institutions' asset information relate to each other across countries.

Using this dataset is relatively straightforward – it consists of three columns - rank (the order in which each bank appears in the list), country (the country in which the bank is located) and total assets US billions (the total value expressed in US dollars). Additionally, there is a fourth column containing the balance sheet information for each bank as well.

In order to make full use of this dataset, one should analyse it by creating comparison grids based on different factors such as region, size or ownership structures. This can provide an interesting insight into how financial markets are structured within different economies and allow researchers to better understand some banking sector dynamics that are particularly relevant for certain countries or regions. Additionally, one can compare any two banks side-by-side using their respective balance sheets or distribution plot graphs based on size or concentration metrics by leverage or other financial ratios as well.

Overall, this dataset provides useful resources that can be put into practice through data visualization making an interesting reference point for trends analysis and forecasting purposes focusing on certain banking activities worldwide

Research Ideas

  • Analyzing the differences in total assets across countries. By comparing and contrasting data, patterns could be found that give insight into the factors driving differences in banks’ assets between different markets.

  • Using predictive models to identify which banks are more likely to perform better based on their balance sheet data, such as by predicting future profits or cashflows of said banks.

  • Leveraging the information on holdings and investments of ā€œtop-rankedā€ banks as a guide for personal investments decisions or informing investment strategies of large financial institutions or hedge funds

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

Columns

File: top50banks2017-03-31.csv | Column name | Description | |:----------------------|:------------------------------------------------------------------------| | rank | The rank of the bank globally based on total assets. (Integer) | | country | The country where the bank is located. (String) | | total_assets_us_b | The total assets of a bank expressed in billions of US dollars. (Float) | | balance_sheet | A snapshot of banking activities for a specific date. (Date) |

File: top100banks2017-12-31.csv | Column name | Description | |:----------------------|:--------------------------------------------...

Search
Clear search
Close search
Google apps
Main menu