https://choosealicense.com/licenses/other/
Dataset Card for "BrightData/Wikipedia-Articles"
Dataset Summary
Explore a collection of millions of Wikipedia articles with the Wikipedia dataset, comprising over 1.23M structured records and 10 data fields updated and refreshed regularly. Each entry includes all major data points such as timestamp, URLs, article titles, raw and cataloged text, images, "see also" references, external links, and a structured table of contents. For a complete list of data points, please… See the full description on the dataset page: https://huggingface.co/datasets/BrightData/Wikipedia-Articles.
https://brightdata.com/license
The Pinterest dataset contains 109M records and 13 attributes. Use our Pinterest profiles dataset (public data) to extract business and non-business insights from complete public profiles and filter by followers, number of boards, or location. Tailor your dataset to suit your specific requirements, whether you need the complete dataset or a customized subset.
https://choosealicense.com/licenses/other/
Booking Listings – Free Cancellation 🏨✈️
Booking Listings is a structured snapshot of accommodation offers worldwide as listed on Booking.com. This subset contains only properties that offer free cancellation, enabling analysts and data scientists to study flexible‑booking behaviour, derive pricing strategies, and build recommendation or revenue‑management systems.
Highlights
75 k hotels & apartments across 84 countries
Rich pricing & availability metadata (final vs. original… See the full description on the dataset page: https://huggingface.co/datasets/BrightData/Booking.com-Listings.
https://brightdata.com/license
Access our extensive Facebook datasets that provide detailed information on public posts, pages, and user engagement. Gain insights into post performance, audience interactions, page details, and content trends with our ethically sourced data. Free samples are available for evaluation.
Over 940M records available
Price starts at $250/100K records
Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet.
100% ethical and compliant data collection
Included datapoints:
Post ID
Post Content & URL
Date Posted
Hashtags
Number of Comments
Number of Shares
Likes & Reaction Counts (by type)
Video View Count
Page Name & Category
Page Followers & Likes
Page Verification Status
Page Website & Contact Info
Is Sponsored
Post Attachments (Images/Videos)
External Link Data
And much more
https://choosealicense.com/licenses/other/
Dataset Card for "BrightData/Goodreads-Books"
Dataset Summary
Explore a collection of millions of books with the Goodreads dataset, comprising over 6.3M structured records and 14 data fields updated and refreshed regularly. Each entry includes all major data points such as URLs, book IDs, titles, authors, ratings, number of ratings, reviews, summaries, genres, publication dates, author details and prices. For a complete list of data points, please refer to the full "Data… See the full description on the dataset page: https://huggingface.co/datasets/BrightData/Goodreads-Books.
https://brightdata.com/license
Bright Data’s datasets are created by utilizing proprietary technology for retrieving public web data at scale, resulting in fresh, complete, and accurate datasets. Crunchbase datasets provide unique insights into the latest industry trends. They enable the tracking of company growth, identifying key businesses and professionals, tracking employee movement between companies, as well as enabling more efficient competitive intelligence. Easily define your Crunchbase dataset using our smart filter capabilities, enabling you to customize pre-existing datasets, ensuring the data received fits your business needs. Bright Data’s Crunchbase company data includes over 2.8 million company profiles, with subsets available by industry, region, and any other parameters according to your requirements. There are over 70 data points per company, including overview, details, news, financials, investors, products, people, and more. Choose between full coverage or a subset. Get your Crunchbase dataset today!
Bright Data’s retail data collector is uniquely crafted to enable your digital commerce business to gain a competitive edge by collecting key data sets, including:
- Pricing
- Competitive landscape
- Special offers
- Customer reviews
- Pictures and videos
- Seller ratings
- Consumer search trends
- Search engine results for products, stores and websites
- Competitor advertisement scanning
This data enables you to be dynamic and adapt to real-time market realities and trends.
https://choosealicense.com/licenses/other/
Dataset Card for "BrightData/IMDb-Media"
Dataset Summary
Explore feature films, TV series, episodes, mini-series, documentaries, and more with this IMDb dataset, comprising over 249K structured records and 32 data fields updated and refreshed regularly. Each entry includes all major data points such as timestamp, title, URLs, release date, IMDb rating, reviews, awards, origin, category/genre, budget, cast, director, images, videos and more. For a complete list of data… See the full description on the dataset page: https://huggingface.co/datasets/BrightData/IMDb-Media.
https://brightdata.com/license
Use our YouTube profiles dataset to extract both business and non-business information from public channels and filter by channel name, views, creation date, or subscribers. Datapoints include URL, handle, banner image, profile image, name, subscribers, description, video count, create date, views, details, and more. You may purchase the entire dataset or a customized subset, depending on your needs. Popular use cases for this dataset include sentiment analysis, brand monitoring, influencer marketing, and more.
https://brightdata.com/license
Gain extensive insights with our Amazon datasets, encompassing detailed product information including pricing, reviews, ratings, brand names, product categories, sellers, ASINs, images, and much more. Ideal for market researchers, data analysts, and eCommerce professionals looking to excel in the competitive online marketplace.
Over 425M records available
Price starts at $250/100K records
Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet.
100% ethical and compliant data collection
Included datapoints:
Title Asin Main Image Brand Name Description Availability Subcategory Categories Parent Asin Type Product Type Name Model Number Manufacturer Color Size Date First Available Released Model Year Item Model Number Part Number Price Total Reviews Total Ratings Average Rating Features Best Sellers Rank Subcategory Buybox Buybox Seller Id Buybox Is Amazon Images Product URL And more
https://www.datainsightsmarket.com/privacy-policy
The Web Data Providers Software market is experiencing robust growth, driven by the increasing demand for real-time data insights across various industries. The market's expansion is fueled by the rising adoption of big data analytics, the proliferation of connected devices generating massive datasets, and the need for businesses to make data-driven decisions. Organizations across sectors like finance, marketing, and research leverage these software solutions to collect, process, and analyze web data for competitive intelligence, market research, and customer profiling. The market's growth is further accelerated by advancements in web scraping technologies, AI-powered data extraction, and cloud-based solutions offering scalable and cost-effective data access. While challenges like data privacy regulations and the ethical implications of web scraping exist, the overall market trajectory remains positive, indicating significant opportunities for established players and new entrants alike. We project a healthy Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, with a market size exceeding $5 billion by 2033, based on current market trends and the continued expansion of digital technologies.
The competitive landscape is highly dynamic, with a mix of established players like Microsoft and emerging specialized providers. The market is characterized by a diverse range of solutions, from comprehensive data extraction platforms to specialized tools focused on specific data types or industries. Success in this market hinges on factors such as data accuracy, speed of extraction, ease of use, compliance with data privacy regulations (like GDPR and CCPA), and the ability to integrate seamlessly with existing business intelligence systems. The market is also seeing a growing demand for ethical and responsible web scraping practices, further shaping the evolution of the technology and driving innovation within the sector. This includes the development of solutions that prioritize consent, respect website terms of service, and avoid overloading target servers. The trend towards automation and AI-powered data extraction is expected to continue, leading to greater efficiency and improved data quality.
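The projection above can be sanity-checked with the standard compound-growth formula. The sketch below is only a back-of-envelope reading: it assumes the $5 billion figure is the 2033 value itself (the text says "exceeding"), that 2025 is the base year, and that growth compounds once per year over 8 periods. The function names are illustrative.

```python
def implied_base(final_value: float, cagr: float, years: int) -> float:
    """Starting value implied by a final value and a constant annual growth rate."""
    return final_value / (1.0 + cagr) ** years


def project(base_value: float, cagr: float, years: int) -> float:
    """Value after compounding base_value at cagr for the given number of years."""
    return base_value * (1.0 + cagr) ** years


# Assumptions: $5.0B is treated as the 2033 market size, 2025 as the base year.
YEARS = 2033 - 2025          # 8 compounding periods
CAGR = 0.15                  # 15% per year, as stated in the text

base_2025 = implied_base(5.0, CAGR, YEARS)     # in billions of USD
check_2033 = project(base_2025, CAGR, YEARS)   # should recover 5.0

print(f"Implied 2025 market size: ${base_2025:.2f}B")
print(f"Projected 2033 market size: ${check_2033:.2f}B")
```

Under these assumptions a 15% CAGR roughly triples the market over the period, so a 2033 size above $5 billion implies a 2025 base above roughly $1.6 billion.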
https://brightdata.com/license
The Google Maps dataset is ideal for getting extensive information on businesses anywhere in the world. Easily filter by location, business type, and other factors to get the exact data you need. The Google Maps dataset includes all major data points: timestamp, name, category, address, description, open website, phone number, open_hours, open_hours_updated, reviews_count, rating, main_image, reviews, url, lat, lon, place_id, country, and more.
https://brightdata.com/license
Bright Data’s datasets are created using proprietary technology to collect public web data at scale, resulting in accurate, complete, and fresh insights. The PitchBook Companies Information dataset provides detailed profiles of global businesses, including company URLs, social handles, founding year, operational status, employee data, and financial details like deal types, funding rounds, and investments. Tailored for investors, analysts, and corporate strategists, this dataset supports due diligence, competitive analysis, and market trend evaluation. Customize your dataset using smart filters or choose full coverage to access key data points, including patent activity, research reports, and more. Get your PitchBook dataset today!
https://brightdata.com/license
Unlock the full potential of LinkedIn data with our extensive dataset that combines profiles, company information, and job listings into one powerful resource for business decision-making, strategic hiring, competitive analysis, and market trend insights. This all-encompassing dataset is ideal for professionals, recruiters, analysts, and marketers aiming to enhance their strategies and operations across various business functions.
Dataset Features
Profiles: Dive into detailed public profiles featuring names, titles, positions, experience, education, skills, and more. Utilize this data for talent sourcing, lead generation, and investment signaling, with a refresh rate ensuring up to 30 million records per month.
Companies: Access comprehensive company data including ID, country, industry, size, number of followers, website details, subsidiaries, and posts. Tailored subsets by industry or region provide invaluable insights for CRM enrichment, competitive intelligence, and understanding the startup ecosystem, updated monthly with up to 40 million records.
Job Listings: Explore current job opportunities detailed with job titles, company names, locations, and employment specifics such as seniority levels and employment functions. This dataset includes direct application links and real-time application numbers, serving as a crucial tool for job seekers and analysts looking to understand industry trends and the job market dynamics.
Customizable Subsets for Specific Needs
Our LinkedIn dataset offers the flexibility to tailor the dataset according to your specific business requirements. Whether you need comprehensive insights across all data points or are focused on specific segments like job listings, company profiles, or individual professional details, we can customize the dataset to match your needs. This modular approach ensures that you get only the data that is most relevant to your objectives, maximizing efficiency and relevance in your strategic applications.
Popular Use Cases
Strategic Hiring and Recruiting: Track talent movement, identify growth opportunities, and enhance your recruiting efforts with targeted data.
Market Analysis and Competitive Intelligence: Gain a competitive edge by analyzing company growth, industry trends, and strategic opportunities.
Lead Generation and CRM Enrichment: Enrich your database with up-to-date company and professional data for targeted marketing and sales strategies.
Job Market Insights and Trends: Leverage detailed job listings for a nuanced understanding of employment trends and opportunities, facilitating effective job matching and market analysis.
AI-Driven Predictive Analytics: Utilize AI algorithms to analyze large datasets for predicting industry shifts, optimizing business operations, and enhancing decision-making processes based on actionable data insights.
Whether you are mapping out competitive landscapes, sourcing new talent, or analyzing job market trends, our LinkedIn dataset provides the tools you need to succeed. Customize your access to fit specific needs, ensuring that you have the most relevant and timely data at your fingertips.
https://brightdata.com/license
Real estate datasets from various websites cover all major real estate data points, including property type, size, location, price, bedrooms, baths, address, history, images, and much more. Popular use cases include forecasting housing demand, analyzing price fluctuations, improving customer satisfaction, reviewing past prices to monitor market trends, and more.
https://brightdata.com/license
Gain a complete view of the real estate market with our Zillow datasets. Track price trends, rental/sale status, and price per square foot with the Zillow Price History dataset and explore detailed listings with prices, locations, and features using the Zillow Properties Listing dataset.
Over 134M records available
Price starts at $250/100K records
Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet.
100% ethical and compliant data collection
Included datapoints:
Zpid
City
State
Home Status
Street Address
Zipcode
Home Type
Living Area Value
Bedrooms
Bathrooms
Price
Property Type
Date Sold
Annual Homeowners Insurance
Price Per Square Foot
Rent Zestimate
Tax Assessed Value
Zestimate
Home Values
Lot Area
Lot Area Unit
Living Area
Living Area Units
Property Tax Rate
Page View Count
Favorite Count
Time On Zillow
Time Zone
Abbreviated Address
Brokerage Name
And much more
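As a rough illustration of working with one of the listed delivery formats, the sketch below reads NDJSON (one JSON object per line) and derives a price-per-square-foot value. The sample rows are made up, and the snake_case keys (`zpid`, `city`, `price`, `living_area`) are assumptions for illustration only; the actual field names in the delivered files may differ.

```python
import io
import json

# Made-up sample rows; key names are assumed, not taken from the real schema.
SAMPLE_NDJSON = """\
{"zpid": 1, "city": "Austin", "price": 450000, "living_area": 1800}
{"zpid": 2, "city": "Austin", "price": 600000, "living_area": 2400}
{"zpid": 3, "city": "Boise", "price": 380000, "living_area": 0}
"""


def read_ndjson(stream):
    """Yield one dict per non-empty line of an NDJSON stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def price_per_sqft(record):
    """Return price / living area, or None when either value is missing or the area is zero."""
    area = record.get("living_area")
    price = record.get("price")
    if not area or price is None:
        return None
    return price / area


records = list(read_ndjson(io.StringIO(SAMPLE_NDJSON)))
austin = [r for r in records if r["city"] == "Austin"]
ppsf = [price_per_sqft(r) for r in records]

print(len(austin), "Austin listings")
print(ppsf)
```

Reading line by line keeps memory use flat, which matters for files with millions of records; swapping `io.StringIO(...)` for an open file handle is the only change needed for a real file.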
https://brightdata.com/license
This dataset offers a comprehensive collection of ZoomInfo company data, providing an in-depth view of businesses across various industries. It includes key attributes such as company name, description, revenue, employee count, industry, leadership details, financial metrics, and much more. With subsets available by industry and company size, this dataset enables businesses to access and analyze valuable B2B information tailored to their needs. Users can leverage this dataset for competitive intelligence, lead generation, and market analysis. The data can help identify high-value prospects, track business growth, and enrich CRM systems with detailed company profiles and firmographic information. Whether you are looking to enhance your investment strategy or improve your sales targeting, this dataset serves as a crucial resource for driving informed decisions and staying ahead in the market.
https://brightdata.com/license
Access our extensive Reddit datasets that provide detailed information on posts, communities (subreddits), and user engagement. Gain insights into post performance, user comments, community statistics, and content trends with our ethically sourced data. Free samples are available for evaluation.
3M+ records available
Price starts at $250/100K records
Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet.
100% ethical and compliant data collection
Included datapoints:
Post ID, Title & URL
Post Description & Date
Username of Poster
Upvotes & Comment Count
Community Name, URL & Description
Community Member Count
Attached Photos & Videos
Full Post Comments
Related Posts
Post Karma
Post Tags
And more
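The "starts at $250/100K records" figure can be turned into a rough lower-bound estimate. The sketch below assumes pricing scales linearly with record count and, by default, that partial blocks of 100K are rounded up; both are assumptions for illustration, since the listing only states a starting price, and the function name is hypothetical.

```python
import math

BASE_PRICE_USD = 250          # from the listing: "Price starts at $250/100K records"
RECORDS_PER_BLOCK = 100_000


def starting_price(records: int, round_up_blocks: bool = True) -> float:
    """Lower-bound cost estimate, assuming linear pricing per 100K-record block."""
    if records < 0:
        raise ValueError("records must be non-negative")
    blocks = records / RECORDS_PER_BLOCK
    if round_up_blocks:
        # Assumption: a partial block is billed as a full one.
        blocks = math.ceil(blocks)
    return blocks * BASE_PRICE_USD


reddit_floor = starting_price(3_000_000)                             # the "3M+" figure above
partial = starting_price(250_000)                                    # 2.5 blocks rounded up to 3
partial_prorated = starting_price(250_000, round_up_blocks=False)    # 2.5 blocks, pro-rated

print(reddit_floor, partial, partial_prorated)
```

Treat the result as a floor rather than a quote: "starts at" suggests the real price can be higher depending on the subset and delivery options.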
https://brightdata.com/license
Use our constantly updated Walmart products dataset to get a complete snapshot of new products, categories, pricing, and consumer reviews. You may purchase the entire dataset or a customized subset, depending on your needs. Popular use cases: identify product inventory gaps and increased demand for certain products, analyze consumer sentiment, and define a pricing strategy by locating similar products and categories among your competitors. The dataset includes all major data points: product, SKU, GTIN, currency, timestamp, price, and more. Get your Walmart dataset today!
https://brightdata.com/license
Gain valuable insights with our comprehensive Social Media Dataset, designed to help businesses, marketers, and analysts track trends, monitor engagement, and optimize strategies. This dataset provides structured and reliable social media data from multiple platforms.
Dataset Features
User Profiles: Access public social media profiles, including usernames, bios, follower counts, engagement metrics, and more. Ideal for audience analysis, influencer marketing, and competitive research.
Posts & Content: Extract posts, captions, hashtags, media (images/videos), timestamps, and engagement metrics such as likes, shares, and comments. Useful for trend analysis, sentiment tracking, and content strategy optimization.
Comments & Interactions: Analyze user interactions, including replies, mentions, and discussions. This data helps brands understand audience sentiment and engagement patterns.
Hashtag & Trend Tracking: Monitor trending hashtags, topics, and viral content across platforms to stay ahead of industry trends and consumer interests.
Customizable Subsets for Specific Needs
Our Social Media Dataset is fully customizable, allowing you to filter data based on platform, region, keywords, engagement levels, or specific user profiles. Whether you need a broad dataset for market research or a focused subset for brand monitoring, we tailor the dataset to your needs.
Popular Use Cases
Brand Monitoring & Reputation Management: Track brand mentions, customer feedback, and sentiment analysis to manage online reputation effectively.
Influencer Marketing & Audience Analysis: Identify key influencers, analyze engagement metrics, and optimize influencer partnerships.
Competitive Intelligence: Monitor competitor activity, content performance, and audience engagement to refine marketing strategies.
Market Research & Consumer Insights: Analyze social media trends, customer preferences, and emerging topics to inform business decisions.
AI & Predictive Analytics: Leverage structured social media data for AI-driven trend forecasting, sentiment analysis, and automated content recommendations.
Whether you're tracking brand sentiment, analyzing audience engagement, or monitoring industry trends, our Social Media Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.