13 datasets found
  1. Developer Community and Code Datasets

    • datarade.ai
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oxylabs, Developer Community and Code Datasets [Dataset]. https://datarade.ai/data-products/developer-community-and-code-datasets-oxylabs
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset provided by
    oxylabs, UAB
    Authors
    Oxylabs
    Area covered
    Tuvalu, El Salvador, South Sudan, Guyana, Saint Pierre and Miquelon, Bahamas, United Kingdom, Marshall Islands, Philippines, Djibouti
    Description

    Unlock the power of ready-to-use data sourced from developer communities and repositories with Developer Community and Code Datasets.

    Data Sources:

    1. GitHub: Access comprehensive data about GitHub repositories, developer profiles, contributions, issues, social interactions, and more.

    2. StackShare: Receive information about companies, their technology stacks, reviews, tools, services, trends, and more.

    3. DockerHub: Dive into data from container images, repositories, developer profiles, contributions, usage statistics, and more.

    Developer Community and Code Datasets are a treasure trove of public data points gathered from tech communities and code repositories across the web.

    With our datasets, you'll receive:

    • Usernames;
    • Companies;
    • Locations;
    • Job Titles;
    • Follower Counts;
    • Contact Details;
    • Employability Statuses;
    • And More.

    Choose from various output formats, storage options, and delivery frequencies:

    • Get datasets in CSV, JSON, or other preferred formats.
    • Opt for data delivery via SFTP or directly to your cloud storage, such as AWS S3.
    • Receive datasets either once or as per your agreed-upon schedule.

    Why choose our Datasets?

    1. Fresh and accurate data: Access complete, clean, and structured data from scraping professionals, ensuring the highest quality.

    2. Time and resource savings: Let us handle data extraction and processing cost-effectively, freeing your resources for strategic tasks.

    3. Customized solutions: Share your unique data needs, and we'll tailor our data harvesting approach to fit your requirements perfectly.

    4. Legal compliance: Partner with a trusted leader in ethical data collection. Oxylabs is trusted by Fortune 500 companies and adheres to GDPR and CCPA standards.

    Pricing Options:

    Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.

    Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.

    Experience a seamless journey with Oxylabs:

    • Understanding your data needs: We work closely to understand your business nature and daily operations, defining your unique data requirements.
    • Developing a customized solution: Our experts create a custom framework to extract public data using our in-house web scraping infrastructure.
    • Delivering data sample: We provide a sample for your feedback on data quality and the entire delivery process.
    • Continuous data delivery: We continuously collect public data and deliver custom datasets per the agreed frequency.

    Empower your data-driven decisions with Oxylabs Developer Community and Code Datasets!

  2. Climate Change: Earth Surface Temperature Data

    • kaggle.com
    • redivis.com
    zip
    Updated May 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Berkeley Earth (2017). Climate Change: Earth Surface Temperature Data [Dataset]. https://www.kaggle.com/datasets/berkeleyearth/climate-change-earth-surface-temperature-data
    Explore at:
    zip(88843537 bytes)Available download formats
    Dataset updated
    May 1, 2017
    Dataset authored and provided by
    Berkeley Earthhttp://berkeleyearth.org/
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    Earth
    Description

    Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you can form your own view.

    us-climate-change

    Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.

    Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.

    We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.

    In this dataset, we have include several files:

    Global Land and Ocean-and-Land Temperatures (GlobalTemperatures.csv):

    • Date: starts in 1750 for average land temperature and 1850 for max and min land temperatures and global ocean and land temperatures
    • LandAverageTemperature: global average land temperature in celsius
    • LandAverageTemperatureUncertainty: the 95% confidence interval around the average
    • LandMaxTemperature: global average maximum land temperature in celsius
    • LandMaxTemperatureUncertainty: the 95% confidence interval around the maximum land temperature
    • LandMinTemperature: global average minimum land temperature in celsius
    • LandMinTemperatureUncertainty: the 95% confidence interval around the minimum land temperature
    • LandAndOceanAverageTemperature: global average land and ocean temperature in celsius
    • LandAndOceanAverageTemperatureUncertainty: the 95% confidence interval around the global average land and ocean temperature

    Other files include:

    • Global Average Land Temperature by Country (GlobalLandTemperaturesByCountry.csv)
    • Global Average Land Temperature by State (GlobalLandTemperaturesByState.csv)
    • Global Land Temperatures By Major City (GlobalLandTemperaturesByMajorCity.csv)
    • Global Land Temperatures By City (GlobalLandTemperaturesByCity.csv)

    The raw data comes from the Berkeley Earth data page.

  3. d

    ISTARI.AI | Points of Interest Dataset (POI) | Global Coverage | 35+...

    • datarade.ai
    Updated Aug 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Istari.AI (2025). ISTARI.AI | Points of Interest Dataset (POI) | Global Coverage | 35+ Attributes | 40M+ Verified Company Profiles [Dataset]. https://datarade.ai/data-products/istari-ai-points-of-interest-dataset-poi-global-coverag-istari-ai
    Explore at:
    .json, .csv, .xls, .parquetAvailable download formats
    Dataset updated
    Aug 6, 2025
    Dataset provided by
    Istari.AI
    Area covered
    Chile, Montserrat, Greenland, French Guiana, Bouvet Island, Angola, Northern Mariana Islands, Ethiopia, Djibouti, Bonaire
    Description

    📍 Looking for high-quality Point of Interest (POI) data worldwide? ISTARI.AI offers tailored POI datasets to fit your exact business needs – whether you’re looking for all restaurants, gyms, electricians, or any other specific type of location-based business.

    📊 Our POI data includes: - Organizational structure & key personnel - Products, services & partnerships - Verified contact & domain info - Tech stack & business descriptions - Detailed geographic data (address, region, country)

    We don’t offer one-size-fits-all datasets – instead, you tell us what you need. Whether it’s a global dataset of all fitness centers, a list of car repair shops in a specific region, or just all vegan restaurants in major cities across the world, we generate the dataset based on your POI category and geographic scope.

    This flexibility makes our data ideal for use cases in: - Location-based services & apps - Market analysis & competitive intelligence - Retail expansion & site planning - Ad targeting & geofencing - Lead generation & B2B outreach

    All POI data is machine-generated, frequently updated, and sourced from publicly available web data, ensuring high freshness and consistency.

    Tell us your POI requirements – we’ll handle the rest. With ISTARI.AI, you receive structured POI datasets ready for direct integration into your systems.

    ✅ Ensuring Data Quality - The webAI AI Agent was developed in close collaboration with academic experts to guarantee expert-level accuracy. - Developed together with researchers at the University of Mannheim - Validated in the award-winning academic study: "When is AI Adoption Contagious? Epidemic Effects and Relational Embeddedness in the Inter-Firm Diffusion of Artificial Intelligence" - Co-authored by scholars from University of Mannheim, University of Giessen, University of Hohenheim, and ETH Zurich

  4. Success.ai | B2B Company & Contact Data – 28M Verified Company Profiles -...

    • datarade.ai
    Updated Oct 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2024). Success.ai | B2B Company & Contact Data – 28M Verified Company Profiles - Global - Best Price Guarantee & 99% Data Accuracy [Dataset]. https://datarade.ai/data-products/success-ai-b2b-company-contact-data-28m-verified-compan-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 15, 2024
    Dataset provided by
    Area covered
    Solomon Islands, Burundi, United Republic of, Somalia, Niger, Greenland, India, Poland, Hungary, Côte d'Ivoire
    Description

    Success.ai’s Company Data Solutions provide businesses with powerful, enterprise-ready B2B company datasets, enabling you to unlock insights on over 28 million verified company profiles. Our solution is ideal for organizations seeking accurate and detailed B2B contact data, whether you’re targeting large enterprises, mid-sized businesses, or small business contact data.

    Success.ai offers B2B marketing data across industries and geographies, tailored to fit your specific business needs. With our white-glove service, you’ll receive curated, ready-to-use company datasets without the hassle of managing data platforms yourself. Whether you’re looking for UK B2B data or global datasets, Success.ai ensures a seamless experience with the most accurate and up-to-date information in the market.

    Why Choose Success.ai’s Company Data Solution? At Success.ai, we prioritize quality and relevancy. Every company profile is AI-validated for a 99% accuracy rate and manually reviewed to ensure you're accessing actionable and GDPR-compliant data. Our price match guarantee ensures you receive the best deal on the market, while our white-glove service provides personalized assistance in sourcing and delivering the data you need.

    Why Choose Success.ai?

    • Best Price Guarantee: We offer industry-leading pricing and beat any competitor.
    • Global Reach: Access over 28 million verified company profiles across 195 countries.
    • Comprehensive Data: Over 15 data points, including company size, industry, funding, and technologies used.
    • Accurate & Verified: AI-validated with a 99% accuracy rate, ensuring high-quality data.
    • Real-Time Updates: Stay ahead with continuously updated company information.
    • Ethically Sourced Data: Our B2B data is compliant with global privacy laws, ensuring responsible use.
    • Dedicated Service: Receive personalized, curated data without the hassle of managing platforms.
    • Tailored Solutions: Custom datasets are built to fit your unique business needs and industries.

    Our database spans 195 countries and covers 28 million public and private company profiles, with detailed insights into each company’s structure, size, funding history, and key technologies. We provide B2B company data for businesses of all sizes, from small business contact data to large corporations, with extensive coverage in regions such as North America, Europe, Asia-Pacific, and Latin America.

    Comprehensive Data Points: Success.ai delivers in-depth information on each company, with over 15 data points, including:

    Company Name: Get the full legal name of the company. LinkedIn URL: Direct link to the company's LinkedIn profile. Company Domain: Website URL for more detailed research. Company Description: Overview of the company’s services and products. Company Location: Geographic location down to the city, state, and country. Company Industry: The sector or industry the company operates in. Employee Count: Number of employees to help identify company size. Technologies Used: Insights into key technologies employed by the company, valuable for tech-based outreach. Funding Information: Track total funding and the most recent funding dates for investment opportunities. Maximize Your Sales Potential: With Success.ai’s B2B contact data and company datasets, sales teams can build tailored lists of target accounts, identify decision-makers, and access real-time company intelligence. Our curated datasets ensure you’re always focused on high-value leads—those who are most likely to convert into clients. Whether you’re conducting account-based marketing (ABM), expanding your sales pipeline, or looking to improve your lead generation strategies, Success.ai offers the resources you need to scale your business efficiently.

    Tailored for Your Industry: Success.ai serves multiple industries, including technology, healthcare, finance, manufacturing, and more. Our B2B marketing data solutions are particularly valuable for businesses looking to reach professionals in key sectors. You’ll also have access to small business contact data, perfect for reaching new markets or uncovering high-growth startups.

    From UK B2B data to contacts across Europe and Asia, our datasets provide global coverage to expand your business reach and identify new markets. With continuous data updates, Success.ai ensures you’re always working with the freshest information.

    Key Use Cases:

    • Targeted Lead Generation: Build accurate lead lists by filtering data by company size, industry, or location. Target decision-makers in key industries to streamline your B2B sales outreach.
    • Account-Based Marketing (ABM): Use B2B company data to personalize marketing campaigns, focusing on high-value accounts and improving conversion rates.
    • Investment Research: Track company growth, funding rounds, and employee trends to identify investment opportunities or potential M&A targets.
    • Market Research: Enrich your market intelligence initiatives by gain...
  5. Small Business Contact Data | Writing, Editing & Publishing Professionals...

    • datarade.ai
    Updated Oct 27, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2021). Small Business Contact Data | Writing, Editing & Publishing Professionals Worldwide | From 700M+ Dataset | Best Price Guarantee [Dataset]. https://datarade.ai/data-products/small-business-contact-data-small-business-owners-worldwide-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 27, 2021
    Dataset provided by
    Area covered
    Montserrat, Virgin Islands (U.S.), Mali, Jersey, Botswana, Nepal, Korea (Democratic People's Republic of), Malawi, Kenya, Lebanon
    Description

    Unlock the potential of the global writing, editing, and publishing industry with Success.ai's Small Business Contact Data. Our extensive database provides access to verified profiles of professionals worldwide, curated from a dataset that encompasses over 700 million global entries. This specialized collection includes work emails, phone numbers, and comprehensive professional information, tailored to meet the needs of small businesses and independent professionals in the writing, editing, and publishing sectors.

    Why Choose Success.ai’s Small Business Contact Data?

    Targeted Professional Data: Gain access to a niche market of small business owners and freelancers in the writing, editing, and publishing industries. Global Reach: Our dataset covers professionals from all over the world, enabling you to execute international marketing campaigns and network expansion. Verified Contact Information: Ensure the reliability of your outreach with work emails and phone numbers that are regularly updated and verified for accuracy. Data Features:

    Comprehensive Profiles: Detailed insights into the professional lives of industry experts, including their job roles, career history, and areas of expertise. Industry-Specific Details: Information tailored to the nuances of the writing, editing, and publishing fields, helping you to better understand and target potential leads. Segmentation Options: Easily segment data by geographic location, professional experience, or specific industry niches such as freelance writers, independent publishers, or small press editors. Customizable Delivery and Integration: Success.ai offers flexible data solutions that can be customized to fit your specific requirements. Whether you need a one-time download or continuous API access for real-time data integration, our formats are designed to seamlessly integrate into your existing business workflows.

    Competitive Pricing with Best Price Guarantee: We commit to providing not only the highest quality data but also the most affordable pricing in the industry. Our Best Price Guarantee ensures you receive the best market rate for your data needs.

    Ideal Use Cases for Small Business Contact Data:

    Direct Marketing Campaigns: Utilize accurate contact details to send personalized email or direct mail campaigns to industry professionals. Networking and Partnership Development: Connect with key industry players to forge partnerships or collaborate on publishing projects. Event Promotion: Target industry-specific events like writing workshops, book fairs, or literary conferences with tailored invitations. Market Research: Analyze trends in the publishing industry, track the rise of independent writing professionals, or assess market needs. Quality Assurance and Compliance:

    Data Quality: Our data undergoes rigorous validation processes to maintain high accuracy and usefulness. Legal Compliance: All data collection and processing are performed in strict accordance with global data protection regulations, including GDPR. Support and Professional Consultation:

    Dedicated Support: Our team is ready to assist you with any queries or custom requests regarding the dataset. Expert Consultation: Leverage our expertise in data-driven marketing to enhance your outreach strategies and achieve better results. Start Reaching Writing and Publishing Professionals Today: With Success.ai’s Small Business Contact Data, you can start connecting with writing, editing, and publishing professionals globally. Enhance your marketing efforts, expand your professional network, and grow your presence in the industry with our reliable and comprehensive data solutions.

    Contact us to explore our offerings and take your business to the next level with tailored data that meets your exact needs.

  6. Success.ai | User Profiles Data | Comprehensive 700M Dataset of LinkedIn...

    • datarade.ai
    Updated Jan 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2022). Success.ai | User Profiles Data | Comprehensive 700M Dataset of LinkedIn Profiles for B2B Strategy [Dataset]. https://datarade.ai/data-products/success-ai-user-profiles-data-comprehensive-700m-dataset-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 1, 2022
    Dataset provided by
    Area covered
    Uzbekistan, Wallis and Futuna, Chad, Liechtenstein, Saint Helena, Turkey, Marshall Islands, Aruba, Eritrea, Russian Federation
    Description

    Success.ai presents an unmatched opportunity with its User Profiles Data, offering in-depth access to LinkedIn profiles and company data that empowers businesses to develop ideal customer profiles, enrich company data, and sharpen competitive intelligence. Our LinkedIn Data Solutions are crafted to support your B2B strategies, providing a foundation for sales data enrichment and strategic market positioning.

    • User Profiles Data: Utilize detailed individual data for building precise customer profiles.
    • LinkedIn Data: Leverage the vast network of LinkedIn for enriched business insights.
    • LinkedIn Profile Data: Gain specific details from LinkedIn profiles to inform your marketing and sales strategies.
    • LinkedIn Company Data: Dive deep into company specifics to enhance your competitive edge.
    • Company Data: Access broad data sets for comprehensive market analysis and business planning.

    Key Use Cases:

    • Ideal Customer Profile Development: Craft detailed profiles that target the very core of your prospective markets.
    • Company Data Enrichment: Augment your existing databases with enriched factual data from LinkedIn.
    • Competitive Intelligence: Stay ahead by understanding market dynamics and competitor movements.
    • Sales Data Enrichment: Enrich your sales strategies with actionable insights and data points.
    • B2B Data Enrichment: Leverage enriched data to optimize your B2B processes and customer interactions.

    Why Success.ai is the Preferred Choice:

    • Data Precision: With a commitment to 99% accuracy, our data is constantly updated and verified.
    • Global Reach: Spanning 195 countries, our data solutions are as international as your business needs.
    • Adaptive Data Solutions: Customizable data options tailored to fit the specific needs of your business operations.
    • Legal Compliance: All data handling is GDPR-compliant, ensuring ethical practices in data usage.
    • Best Price Guarantee: We promise not only top-quality data but also the best prices in the industry.

    By choosing Success.ai, you gain access to a wealth of LinkedIn and user profile data that will enhance your market understanding, enrich customer interactions, and enable effective competitive strategies. Our extensive databases are the cornerstone of successful B2B engagements and strategic business planning.

    Get Started with Success.ai Now: Explore the potential of detailed LinkedIn data in your business strategy. Reach out to us for a consultation or start integrating our tailored data solutions today.

    And no one beats us on price. Period.

  7. d

    DATAANT | Amazon Data | Dataset, API | Product by keyword, by category, by...

    • datarade.ai
    Updated Feb 15, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataant (2021). DATAANT | Amazon Data | Dataset, API | Product by keyword, by category, by seller | 19 countries | 20+ Attributes [Dataset]. https://datarade.ai/data-products/amazon-product-data-by-keyword-by-category-by-seller-19-dataant
    Explore at:
    .json, .xml, .csv, .xls, .sqlAvailable download formats
    Dataset updated
    Feb 15, 2021
    Dataset authored and provided by
    Dataant
    Area covered
    Mexico, Australia, Turkey, Italy, Spain, Singapore, Netherlands, India, Sweden, France
    Description

    Get the needed Amazon product data right from the data extractor! Collect Amazon data product information from 19 Amazon countries from the following domains: - amazon.com - amazon.com.au - amazon.com.br - amazon.ca - amazon.cn - amazon.fr - amazon.de - amazon.in - amazon.it - amazon.com.mx - amazon.nl - amazon.sg - amazon.es - amazon.com.tr

    Request Ecommerce Product Data dataset by: - keyword - category - seller

    Amazon E-commerce Data datasets gathered by keyword and category contain: - product page position in search - product position on the page - product global position

    Data attributes contain: - ASIN - URL - Price (current price and discount information) - Reviews (total reviews and total rating). - Reviews information: each product can be enriched with the sub-dataset with reviews - Title - Description - Audio and Video And dozens of additional information.

    Amazon extraction results can be delivered by schedule or API request, so the data can be extracted in real-time.

    DATAANT uses the in-house web scraping service with no concurrency limitations, so unlimited data extractions can be performed simultaneously.

    Output can and attributes can be customized to fit your particular needs.

  8. d

    Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning...

    • datarade.ai
    .json, .csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xverum, Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning (DL), NLP & LLM Training [Dataset]. https://datarade.ai/data-products/xverum-company-data-b2b-data-belgium-netherlands-denm-xverum
    Explore at:
    .json, .csvAvailable download formats
    Dataset provided by
    Xverum LLC
    Authors
    Xverum
    Area covered
    Barbados, Sint Maarten (Dutch part), Cook Islands, India, Norway, Jordan, Dominican Republic, United Kingdom, Western Sahara, Oman
    Description

    Xverum’s AI & ML Training Data provides one of the most extensive datasets available for AI and machine learning applications, featuring 800M B2B profiles with 100+ attributes. This dataset is designed to enable AI developers, data scientists, and businesses to train robust and accurate ML models. From natural language processing (NLP) to predictive analytics, our data empowers a wide range of industries and use cases with unparalleled scale, depth, and quality.

    What Makes Our Data Unique?

    Scale and Coverage: - A global dataset encompassing 800M B2B profiles from a wide array of industries and geographies. - Includes coverage across the Americas, Europe, Asia, and other key markets, ensuring worldwide representation.

    Rich Attributes for Training Models: - Over 100 fields of detailed information, including company details, job roles, geographic data, industry categories, past experiences, and behavioral insights. - Tailored for training models in NLP, recommendation systems, and predictive algorithms.

    Compliance and Quality: - Fully GDPR and CCPA compliant, providing secure and ethically sourced data. - Extensive data cleaning and validation processes ensure reliability and accuracy.

    Annotation-Ready: - Pre-structured and formatted datasets that are easily ingestible into AI workflows. - Ideal for supervised learning with tagging options such as entities, sentiment, or categories.

    How Is the Data Sourced? - Publicly available information gathered through advanced, GDPR-compliant web aggregation techniques. - Proprietary enrichment pipelines that validate, clean, and structure raw data into high-quality datasets. This approach ensures we deliver comprehensive, up-to-date, and actionable data for machine learning training.

    Primary Use Cases and Verticals

    Natural Language Processing (NLP): Train models for named entity recognition (NER), text classification, sentiment analysis, and conversational AI. Ideal for chatbots, language models, and content categorization.

    Predictive Analytics and Recommendation Systems: Enable personalized marketing campaigns by predicting buyer behavior. Build smarter recommendation engines for ecommerce and content platforms.

    B2B Lead Generation and Market Insights: Create models that identify high-value leads using enriched company and contact information. Develop AI systems that track trends and provide strategic insights for businesses.

    HR and Talent Acquisition AI: Optimize talent-matching algorithms using structured job descriptions and candidate profiles. Build AI-powered platforms for recruitment analytics.

    How This Product Fits Into Xverum’s Broader Data Offering Xverum is a leading provider of structured, high-quality web datasets. While we specialize in B2B profiles and company data, we also offer complementary datasets tailored for specific verticals, including ecommerce product data, job listings, and customer reviews. The AI Training Data is a natural extension of our core capabilities, bridging the gap between structured data and machine learning workflows. By providing annotation-ready datasets, real-time API access, and customization options, we ensure our clients can seamlessly integrate our data into their AI development processes.

    Why Choose Xverum? - Experience and Expertise: A trusted name in structured web data with a proven track record. - Flexibility: Datasets can be tailored for any AI/ML application. - Scalability: With 800M profiles and more being added, you’ll always have access to fresh, up-to-date data. - Compliance: We prioritize data ethics and security, ensuring all data adheres to GDPR and other legal frameworks.

    Ready to supercharge your AI and ML projects? Explore Xverum’s AI Training Data to unlock the potential of 800M global B2B profiles. Whether you’re building a chatbot, predictive algorithm, or next-gen AI application, our data is here to help.

    Contact us for sample datasets or to discuss your specific needs.

  9. d

    Shopify, Woocommerce Data | Global Shopify Woocommerce Customers | 1.0M+...

    • datarade.ai
    Updated Jan 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Exellius Systems (2024). Shopify, Woocommerce Data | Global Shopify Woocommerce Customers | 1.0M+ Contacts | (Verified Email, Direct Dials) | Decision Makers | 20+ Attributes [Dataset]. https://datarade.ai/data-products/shopify-data-global-verified-shopify-customers-50m-conta-exellius-systems
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 24, 2024
    Dataset authored and provided by
    Exellius Systems
    Area covered
    Qatar, Turkmenistan, Philippines, Anguilla, Gabon, Iraq, Nicaragua, Togo, Honduras, Suriname
    Description
    • Summary: Unlock direct connections with potential customers using our Shopify & Woocommerce Database – a robust collection of 1.5M+ verified contacts. This unique resource provides direct B2B emails and valid phone numbers, enhancing your marketing and sales efforts. Sourced from 10 active publication sites and our dedicated Contact Discovery Team, this database ensures reliability and accuracy.

    • Description: Empower your business with the dynamic capabilities of our Shopify & Woocommerce Database. With over 1.5M+ million verified contacts, this resource is a game-changer. Our commitment to providing direct B2B emails and valid phone numbers sets it apart, enabling you to establish meaningful connections with your audience.

    We take pride in our dual-sourcing strategy. Ten active publication sites continuously feed our databases with real-time information, forming a foundation of reliable data. Additionally, our Contact Discovery Team conducts in-depth research to verify and enhance the accuracy of our database, ensuring you get the most reliable information.

    This versatile database caters to various industries. Marketing teams can fine-tune their campaigns with precision using direct B2B emails and valid phone numbers. Sales teams benefit from enriched customer profiles, while research and analytics teams can delve into market studies and trend analyses. It's a powerful tool for boosting customer engagement and increasing revenue.

    But it's not just a standalone product; it seamlessly integrates into our broader data offerings. While focusing on e-commerce specifics, it complements our wider business and consumer datasets. Whether you prefer targeted insights or a panoramic view, our Shopify Database gives you the flexibility to navigate diverse business landscapes.

    In essence, our Shopify & Woocommerce Database is designed to transform your approach to customer connections. It's more than data; it's a strategic asset that empowers your business to thrive in a dynamic digital landscape. Connect directly, engage meaningfully, and watch your opportunities for growth multiply.

    • Unique Features: Discover the power of our Shopify & Woocommerce Database, packed with 1M+ verified contacts. What makes it special? We provide direct B2B emails and valid phone numbers, making it easy for businesses to connect directly with potential customers. This unique blend of verified contacts and communication channels enhances the effectiveness of your marketing and sales efforts.

    • Data Sources: Wondering where we get our data? We have two main sources. Firstly, we keep tabs on 10 active publication sites, continuously updating our databases with real-time information. Secondly, our dedicated Contact Discovery Team conducts thorough research to make sure the data is accurate. This two-step process ensures our database is reliable and up-to-date.

    • Primary Uses: Our Shopify & Woocommerce Database is versatile and useful across different industries. Marketing teams can target specific audiences with campaigns tailored using direct B2B emails and valid phone numbers. Sales teams benefit from detailed customer profiles, while research and analytics teams can dig into market studies and trend analyses. It's a valuable tool for boosting customer engagement and increasing revenue.

    • Integration with Other Data: This database seamlessly fits into our bigger data offerings. While it focuses on e-commerce, it also complements our broader business and consumer datasets. You can choose customized packages or combine datasets for a complete view, gaining insights across various industries and customer groups. It's not just a standalone product but part of a larger system, giving you the flexibility to navigate different business landscapes.

    In simple terms, our Shopify & Woocommerce Database is here to help businesses connect with verified information, creating opportunities for direct engagement and growth. Whether you're diving into e-commerce details or looking for a broader perspective, this data product ensures you have the tools to understand and connect with your target audience effectively.

  10. d

    Coresignal | Employee Data | From the Largest Professional Network | Global...

    • datarade.ai
    .json, .csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coresignal, Coresignal | Employee Data | From the Largest Professional Network | Global / 712M+ Records / 5 Years of Historical Data / Updated Daily [Dataset]. https://datarade.ai/data-products/public-resume-data-coresignal
    Explore at:
    .json, .csvAvailable download formats
    Dataset authored and provided by
    Coresignal
    Area covered
    Latvia, Eritrea, Macao, Brunei Darussalam, Bosnia and Herzegovina, Russian Federation, French Guiana, Réunion, Christmas Island, Palestine
    Description

    ➡️ You can choose from multiple data formats, delivery frequency options, and delivery methods;

    ➡️ You can select raw or clean and AI-enriched datasets;

    ➡️ Multiple APIs designed for effortless search and enrichment (accessible using a user-friendly self-service tool);

    ➡️ Fresh data: daily updates, easy change tracking with dedicated data fields, and a constant flow of new data;

    ➡️ You get all necessary resources for evaluating our data: a free consultation, a data sample, or free credits for testing our APIs.

    Coresignal's employee data enables you to create and improve innovative data-driven solutions and extract actionable business insights. These datasets are popular among companies from different industries, including HR and sales technology and investment.

    Employee Data use cases:

    ✅ Source best-fit talent for your recruitment needs

    Coresignal's Employee Data can help source the best-fit talent for your recruitment needs by providing the most up-to-date information on qualified candidates globally.

    ✅ Fuel your lead generation pipeline

    Enhance lead generation with 712M+ up-to-date employee records from the largest professional network. Our Employee Data can help you develop a qualified list of potential clients and enrich your own database.

    ✅ Analyze talent for investment opportunities

    Employee Data can help you generate actionable signals and identify new investment opportunities earlier than competitors or perform deeper analysis of companies you're interested in.

    ➡️ Why 400+ data-powered businesses choose Coresignal:

    1. Experienced data provider (in the market since 2016);
    2. Exceptional client service;
    3. Responsible and secure data collection.
  11. Wirestock's AI/ML Image Training Data, 4.5M Files with Metadata

    • datarade.ai
    .csv
    Updated Jul 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WIRESTOCK (2023). Wirestock's AI/ML Image Training Data, 4.5M Files with Metadata [Dataset]. https://datarade.ai/data-products/wirestock-s-ai-ml-image-training-data-4-5m-files-with-metadata-wirestock
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jul 18, 2023
    Dataset provided by
    Wirestock, Inc.
    Authors
    WIRESTOCK
    Area covered
    Belarus, New Caledonia, Pakistan, Estonia, Georgia, Swaziland, Jersey, Sudan, Peru, Chile
    Description

    Wirestock's AI/ML Image Training Data, 4.5M Files with Metadata: This data product is a unique offering in the realm of AI/ML training data. What sets it apart is the sheer volume and diversity of the dataset, which includes 4.5 million files spanning across 20 different categories. These categories range from Animals/Wildlife and The Arts to Technology and Transportation, providing a rich and varied dataset for AI/ML applications.

    The data is sourced from Wirestock's platform, where creators upload and sell their photos, videos, and AI art online. This means that the data is not only vast but also constantly updated, ensuring a fresh and relevant dataset for your AI/ML needs. The data is collected in a GDPR-compliant manner, ensuring the privacy and rights of the creators are respected.

    The primary use-cases for this data product are numerous. It is ideal for training machine learning models for image recognition, improving computer vision algorithms, and enhancing AI applications in various industries such as retail, healthcare, and transportation. The diversity of the dataset also means it can be used for more niche applications, such as training AI to recognize specific objects or scenes.

    This data product fits into Wirestock's broader data offering as a key resource for AI/ML training. Wirestock is a platform for creators to sell their work, and this dataset is a collection of that work. It represents the breadth and depth of content available on Wirestock, making it a valuable resource for any company working with AI/ML.

    The core benefits of this dataset are its volume, diversity, and quality. With 4.5 million files, it provides a vast resource for AI training. The diversity of the dataset, spanning 20 categories, ensures a wide range of images for training purposes. The quality of the images is also high, as they are sourced from creators selling their work on Wirestock.

    In terms of how the data is collected, creators upload their work to Wirestock, where it is then sold on various marketplaces. This means the data is sourced directly from creators, ensuring a diverse and unique dataset. The data includes both the images themselves and associated metadata, providing additional context for each image.

    The different image categories included in this dataset are Animals/Wildlife, The Arts, Backgrounds/Textures, Beauty/Fashion, Buildings/Landmarks, Business/Finance, Celebrities, Education, Emotions, Food Drinks, Holidays, Industrial, Interiors, Nature Parks/Outdoor, People, Religion, Science, Signs/Symbols, Sports/Recreation, Technology, Transportation, Vintage, Healthcare/Medical, Objects, and Miscellaneous. This wide range of categories ensures a diverse dataset that can cater to a variety of AI/ML applications.

  12. d

    Coresignal | Web Scraping | Company Data | Global / 71M+ Records / Largest...

    • datarade.ai
    .json, .csv
    Updated Feb 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coresignal (2024). Coresignal | Web Scraping | Company Data | Global / 71M+ Records / Largest Professional Network / Updated Daily [Dataset]. https://datarade.ai/data-products/coresignal-web-scraping-company-data-global-69m-reco-coresignal
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Feb 26, 2024
    Dataset authored and provided by
    Coresignal
    Area covered
    Sweden, Korea (Democratic People's Republic of), Mauritania, Latvia, Cabo Verde, French Polynesia, Cayman Islands, Sri Lanka, Nicaragua, Saint Helena
    Description

    Our Web Scraping dataset includes such data points as company name, location, headcount, industry, and size, among others. It offers extensive fresh and historical data, including even companies that operate in stealth mode.

    For lead generation

    With millions of companies from around the globe, this scraped data enables you to filter potential clients based on specific criteria and hasten the conversion process.

    Use cases

    1. Filter potential clients according to location, size, and other criteria
    2. Enrich your existing database
    3. Improve conversion rates
    4. Use predictive models to identify potential leads
    5. Group your leads in segments for more accurate targeting

    For market and business analysis

    Our Web Scraping Data on companies gives information about millions of businesses, allowing you to evaluate your competitors.

    Use cases

    1. Know your competitors
    2. See your competitors' size, headcount, and revenue
    3. Come up with a data-driven strategy for the next quarter

    For Investors

    We recommend Web Scraping Data for investors to discover and evaluate businesses with the highest potential.

    Gain strategic business insights, enhance decision-making, and maintain algorithms that signal investment opportunities with Coresignal’s global Web Scraping Data.

    Use cases

    1. Screen startups and industries showing early signs of growth
    2. Identify companies looking for the next investment
    3. Check if a startup is about to reach its maturity
    4. Predict a startup's potential at the founding moment
    5. Choose companies that fit you in terms of size and headcount

    For sales prospecting

    Web Scraping Data saves time your employees would otherwise use it to find potential clients and choose the best prospects manually.

    Use cases

    1. Make a short list of the top prospects
    2. Define which companies are large or small enough to buy your product
    3. Based on the revenue, determine which companies are ready to convert
    4. Sort the companies by their distance from your warehouse to draw a line where selling won't result in satisfactory profit
  13. d

    Company Data | 249 Countries Coverage | 270m+ Companies

    • datarade.ai
    .json, .csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rhetorik, Company Data | 249 Countries Coverage | 270m+ Companies [Dataset]. https://datarade.ai/data-products/neuron360-companies-from-rhetorik-rhetorik
    Explore at:
    .json, .csvAvailable download formats
    Dataset authored and provided by
    Rhetorik
    Area covered
    United States
    Description

    Rhetorik360 is the ultimate tool to both segment your market and target, and find your ideal customer. Access detailed Firmographics and Technographics for more than 200 million+ companies globally to power your sales and marketing efforts and increase your business revenues.

    Available for lead lists, data enrichment, account and contact data hygiene and validation, company technographics, leads, ABM, recruiting and other uses. One time and annual use licensing available.

    Use the Rhetorik360 Company DB with its linked sister database, the Neuron 360 Global B2B Professionals Profiles Database to get the best global coverage of Companies, Offices and Professionals.

    230 Million Companies 800 Million Professional Profiles 109 Company Attributes 192 Professional Profile Attributes

    This is a new to market, uniquely sourced data set using the power of Rhetorik's proprietary AI. We amalgamate billions of data points from scores of sources to create a world class BTB Company and Contact data asset.

    Company Profile Information: Micro-target and reach your ideal customer faster by gaining access to your complete company profile. Our global company profile data feed is always clean, accurate, up to date and compliant. Target by Technographics, Firmographics and much more.

    Technographics: Our extensive technographics data sets allow you to understand the tech stack of your prospects and their interest in your products. Look forward to enhanced insight to power your company’s organizational and segmentation efforts, improve your qualification process, increase the effectiveness of your account base marketing, and shorten your sales cycles. Rhetorik's technology data organizes installed enterprise technologies across all major hardware and software product categories, allowing easy searching and filtering on buyers’ technology assets. We track:

    26 Million+ technology installs 20,000+ technology products 7,900+ technology vendors 180+ technology categories

    Firmographics: Our Firmographics will help you to more efficiently and effectively segment your company through comprehensive data-analysis of your target markets! Determining if a business is the right fit for your company has never been easier. Experience the power of targeted messaging which can be adapted to each and every target audience; taking into account business size, budget, and much more!

    Access it where and when you need it. Rhetorik360-Profiles is available via APIs, Snowflake Marketplace, or bulk delivery in JSON and CSV formats and supports a wide range of use cases. Data is refreshed weekly, so you can be sure your information is always up to date!

    North America: 55M+ Companies EMEA: 70M+ Companies APAC: 45M+ Companies LATAM: 30M+ Companies

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Oxylabs, Developer Community and Code Datasets [Dataset]. https://datarade.ai/data-products/developer-community-and-code-datasets-oxylabs
Organization logo

Developer Community and Code Datasets

Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset provided by
oxylabs, UAB
Authors
Oxylabs
Area covered
Tuvalu, El Salvador, South Sudan, Guyana, Saint Pierre and Miquelon, Bahamas, United Kingdom, Marshall Islands, Philippines, Djibouti
Description

Unlock the power of ready-to-use data sourced from developer communities and repositories with Developer Community and Code Datasets.

Data Sources:

  1. GitHub: Access comprehensive data about GitHub repositories, developer profiles, contributions, issues, social interactions, and more.

  2. StackShare: Receive information about companies, their technology stacks, reviews, tools, services, trends, and more.

  3. DockerHub: Dive into data from container images, repositories, developer profiles, contributions, usage statistics, and more.

Developer Community and Code Datasets are a treasure trove of public data points gathered from tech communities and code repositories across the web.

With our datasets, you'll receive:

  • Usernames;
  • Companies;
  • Locations;
  • Job Titles;
  • Follower Counts;
  • Contact Details;
  • Employability Statuses;
  • And More.

Choose from various output formats, storage options, and delivery frequencies:

  • Get datasets in CSV, JSON, or other preferred formats.
  • Opt for data delivery via SFTP or directly to your cloud storage, such as AWS S3.
  • Receive datasets either once or as per your agreed-upon schedule.

Why choose our Datasets?

  1. Fresh and accurate data: Access complete, clean, and structured data from scraping professionals, ensuring the highest quality.

  2. Time and resource savings: Let us handle data extraction and processing cost-effectively, freeing your resources for strategic tasks.

  3. Customized solutions: Share your unique data needs, and we'll tailor our data harvesting approach to fit your requirements perfectly.

  4. Legal compliance: Partner with a trusted leader in ethical data collection. Oxylabs is trusted by Fortune 500 companies and adheres to GDPR and CCPA standards.

Pricing Options:

Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.

Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.

Experience a seamless journey with Oxylabs:

  • Understanding your data needs: We work closely to understand your business nature and daily operations, defining your unique data requirements.
  • Developing a customized solution: Our experts create a custom framework to extract public data using our in-house web scraping infrastructure.
  • Delivering data sample: We provide a sample for your feedback on data quality and the entire delivery process.
  • Continuous data delivery: We continuously collect public data and deliver custom datasets per the agreed frequency.

Empower your data-driven decisions with Oxylabs Developer Community and Code Datasets!

Search
Clear search
Close search
Google apps
Main menu