100+ datasets found

F
OER sample data-set
data.uni-hannover.de
csv
Updated Jan 20, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
L3S (2022). OER sample data-set [Dataset]. https://data.uni-hannover.de/dataset/oer-sample-data-set
Explore at:
csv(6260265)Available download formats
Dataset updated
Jan 20, 2022
Dataset authored and provided by
L3S
License
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
Description
This data-set includes information about a sample of 8,887 of Open Educational Resources (OERs) from SkillsCommons website. It contains title, description, URL, type, availability date, issued date, subjects, and the availability of following metadata: level, time_required to finish, and accessibility.

This data-set has been used to build a metadata scoring and quality prediction model for OERs.
d
Global Web Data | Web Scraping Data | Job Postings Data | Source: Company...
datarade.ai
.json
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PredictLeads, Global Web Data | Web Scraping Data | Job Postings Data | Source: Company Website | 214M+ Records [Dataset]. https://datarade.ai/data-products/predictleads-web-data-web-scraping-data-job-postings-dat-predictleads
Explore at:
.jsonAvailable download formats
Dataset authored and provided by
PredictLeads
Area covered
French Guiana, Bosnia and Herzegovina, El Salvador, Comoros, Bonaire, Kuwait, Guadeloupe, Virgin Islands (British), Northern Mariana Islands, Kosovo
Description
PredictLeads Job Openings Data provides high-quality hiring insights sourced directly from company websites - not job boards. Using advanced web scraping technology, our dataset offers real-time access to job trends, salaries, and skills demand, making it a valuable resource for B2B sales, recruiting, investment analysis, and competitive intelligence.

Key Features:

✅214M+ Job Postings Tracked – Data sourced from 92 Million company websites worldwide. ✅7,1M+ Active Job Openings – Updated in real-time to reflect hiring demand. ✅Salary & Compensation Insights – Extract salary ranges, contract types, and job seniority levels. ✅Technology & Skill Tracking – Identify emerging tech trends and industry demands. ✅Company Data Enrichment – Link job postings to employer domains, firmographics, and growth signals. ✅Web Scraping Precision – Directly sourced from employer websites for unmatched accuracy.

Primary Attributes:

id (string, UUID) – Unique identifier for the job posting.

type (string, constant: "job_opening") – Object type.

title (string) – Job title.

description (string) – Full job description, extracted from the job listing.

url (string, URL) – Direct link to the job posting.

first_seen_at – Timestamp when the job was first detected.

last_seen_at – Timestamp when the job was last detected.

last_processed_at – Timestamp when the job data was last processed.

Job Metadata:

contract_types (array of strings) – Type of employment (e.g., "full time", "part time", "contract").

categories (array of strings) – Job categories (e.g., "engineering", "marketing").

seniority (string) – Seniority level of the job (e.g., "manager", "non_manager").

status (string) – Job status (e.g., "open", "closed").

language (string) – Language of the job posting.

location (string) – Full location details as listed in the job description.

Location Data (location_data) (array of objects)

city (string, nullable) – City where the job is located.

state (string, nullable) – State or region of the job location.

zip_code (string, nullable) – Postal/ZIP code.

country (string, nullable) – Country where the job is located.

region (string, nullable) – Broader geographical region.

continent (string, nullable) – Continent name.

fuzzy_match (boolean) – Indicates whether the location was inferred.

Salary Data (salary_data)

salary (string) – Salary range extracted from the job listing.

salary_low (float, nullable) – Minimum salary in original currency.

salary_high (float, nullable) – Maximum salary in original currency.

salary_currency (string, nullable) – Currency of the salary (e.g., "USD", "EUR").

salary_low_usd (float, nullable) – Converted minimum salary in USD.

salary_high_usd (float, nullable) – Converted maximum salary in USD.

salary_time_unit (string, nullable) – Time unit for the salary (e.g., "year", "month", "hour").

Occupational Data (onet_data) (object, nullable)

code (string, nullable) – ONET occupation code.

family (string, nullable) – Broad occupational family (e.g., "Computer and Mathematical").

occupation_name (string, nullable) – Official ONET occupation title.

Additional Attributes:

tags (array of strings, nullable) – Extracted skills and keywords (e.g., "Python", "JavaScript").

📌 Trusted by enterprises, recruiters, and investors for high-precision job market insights.

PredictLeads Dataset: https://docs.predictleads.com/v3/guide/job_openings_dataset
d
Web Scraping Data | Key Customers Domain Name Data | Scanning Logos found on...
datarade.ai
.json
Updated Jun 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PredictLeads (2024). Web Scraping Data | Key Customers Domain Name Data | Scanning Logos found on Websites | 248M+ Records [Dataset]. https://datarade.ai/data-products/predictleads-web-scraping-data-domain-name-data-business-predictleads
Explore at:
.jsonAvailable download formats
Dataset updated
Jun 27, 2024
Dataset authored and provided by
PredictLeads
Area covered
Malaysia, Turkmenistan, Benin, Nigeria, Svalbard and Jan Mayen, Burkina Faso, Curaçao, Oman, Northern Mariana Islands, Colombia
Description
PredictLeads Key Customers Data provides essential business intelligence by analyzing company relationships, uncovering vendor partnerships, client connections, and strategic affiliations through advanced web scraping and logo recognition. This dataset captures business interactions directly from company websites, offering valuable insights into market positioning, competitive landscapes, and growth opportunities.

Use Cases:

✅ Account Profiling – Gain a 360-degree customer view by mapping company relationships and partnerships. ✅ Competitive Intelligence – Track vendor-client connections and business affiliations to identify key industry players. ✅ B2B Lead Targeting – Prioritize leads based on their business relationships, improving sales and marketing efficiency. ✅ CRM Data Enrichment – Enhance company records with detailed key customer data, ensuring data accuracy. ✅ Market Research – Identify emerging trends and industry networks to optimize strategic planning.

Key API Attributes:

id (string, UUID) – Unique identifier for the company connection.

category (string) – Type of relationship (e.g., vendor, client, partner).

source_category (string) – Where the connection was detected (e.g., partner page, case study).

source_url (string, URL) – Website where the relationship was found.

individual_source_url (string, URL) – Specific page confirming the connection.

context (string) – Extracted description of the business relationship (e.g., "Company X - partners with Company Y to enhance payment processing").

first_seen_at (ISO 8601 date-time) – Date the connection was first detected.

last_seen_at (ISO 8601 date-time) – Most recent confirmation of the relationship.

company1 & company2 (objects) – Details of the two connected companies, including:

- domain (string) – Company website domain.

- company_name (string) – Official company name.

- ticker (string, nullable) – Stock ticker, if available.

📌 PredictLeads Key Customers Data is an indispensable tool for B2B sales, marketing, and market intelligence teams, providing actionable relationship insights to drive targeted outreach, competitor tracking, and strategic decision-making.

PredictLeads Docs: https://docs.predictleads.com/v3/guide/connections_dataset
Data from: Afromoths, online database of Afrotropical moth species...
gbif.org
demo.gbif-test.org
+2more
Updated Oct 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jurate De Prins; Willy De Prins; Jurate De Prins; Willy De Prins (2024). Afromoths, online database of Afrotropical moth species (Lepidoptera) [Dataset]. http://doi.org/10.15468/s1kwuw
Explore at:
Unique identifier
https://doi.org/10.15468/s1kwuw
Dataset updated
Oct 21, 2024
Dataset provided by
Global Biodiversity Information Facilityhttps://www.gbif.org/
Belgian Biodiversity Platform
Authors
Jurate De Prins; Willy De Prins; Jurate De Prins; Willy De Prins
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered

Description
This dataset covers all relevant information on every Afrotropical moth species. The zoogeographic area covered can be defined as the Africa continent south of the Sahara (i.e. excl. Morocco, Algeria, Tunisia, Libya and Egypt), the islands in the Atlantic Ocean: Amsterdam Island, Ascension, Cape Verde Archipelago, Inaccessible Island, St. Helena, São Tomé and Principe, Tristan da Cunha, and the islands in the Indian Ocean: Comores (Anjouan, Grande Comore, Mayotte, Mohéli), Madagascar, Mascarene Islands (La Réunion, Mauritius, Rodrigues), Seychelles (Félicité, Mahé, Praslin, Silhouette, a.o.). Furthermore, also those moth species occurring in the transition zone to the Palaearctic fauna have been included, namely most of the Arabia Peninsula (Kuwait, Oman, Saudi Arabia, United Arab Emirates, Yemen with Socotra) but not Iraq, Jordan and further north. Also, some Saharan species have been included (e. g. Hoggar Mts. in Algeria, Tibesti Mts. in South Libya). Utmost care was taken that the data incorporated in the database are correct. We decline any responsibility in case of damage to soft- or hardware based on information used in this website. Persons retrieving information from this website for their own research or for applied aspects such as pest control programmes, should acknowledge the usage of data from this website in the following format: De Prins, J. & De Prins, W. 2011. Afromoths, online database of Afrotropical moth species (Lepidoptera). World Wide Web electronic publication (www.afromoths.net)
d
National Legal Database Website Instructions
data.gov.tw
csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Information Management, National Legal Database Website Instructions [Dataset]. https://data.gov.tw/en/datasets/24930
Explore at:
csvAvailable download formats
Dataset authored and provided by
Department of Information Management
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
Website usage instructions2. Website usage case instructions3. Website unit introduction
Data from: CottonGen: Cotton Database Resources
catalog.data.gov
agdatacommons.nal.usda.gov
Updated Apr 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). CottonGen: Cotton Database Resources [Dataset]. https://catalog.data.gov/dataset/cottongen-cotton-database-resources-151bf
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Agricultural Research Servicehttps://www.ars.usda.gov/
Description
CottonGen (https://www.cottongen.org) is a curated and integrated web-based relational database providing access to publicly available genomic, genetic and breeding data to enable basic, translational and applied research in cotton. Built using the open-source Tripal database infrastructure, CottonGen supersedes CottonDB and the Cotton Marker Database, which includes sequences, genetic and physical maps, genotypic and phenotypic markers and polymorphisms, quantitative trait loci (QTLs), pathogens, germplasm collections and trait evaluations, pedigrees, and relevant bibliographic citations, with enhanced tools for easier data sharing, mining, visualization, and data retrieval of cotton research data. CottonGen contains annotated whole genome sequences, unigenes from expressed sequence tags (ESTs), markers, trait loci, genetic maps, genes, taxonomy, germplasm, publications and communication resources for the cotton community. Annotated whole genome sequences of Gossypium raimondii are available with aligned genetic markers and transcripts. These whole genome data can be accessed through genome pages, search tools and GBrowse, a popular genome browser. Most of the published cotton genetic maps can be viewed and compared using CMap, a comparative map viewer, and are searchable via map search tools. Search tools also exist for markers, quantitative trait loci (QTLs), germplasm, publications and trait evaluation data. CottonGen also provides online analysis tools such as NCBI BLAST and Batch BLAST. This project is funded/supported by Cotton Incorporated, the USDA-ARS Crop Germplasm Research Unit at College Station, TX, the Southern Association of Agricultural Experiment Station Directors, Bayer CropScience, Corteva/Agriscience, Dow/Phytogen, Monsanto, Washington State University, and NRSP10. Resources in this dataset:Resource Title: Website Pointer for CottonGen. File Name: Web Page, url: https://www.cottongen.org/ Genomic, Genetic and Breeding Resources for Cotton Research Discovery and Crop Improvement organized by : Species (Gossypium arboreum, barbadense, herbaceum, hirsutum, raimondii, others), Data (Contributors, Download, Submission, Community Projects, Archives, Cotton Trait Ontology, Nomenclatures, and links to Variety Testing Data and NCBISRA Datasets), Search options (Colleague, Genes and Transcripts, Genotype, Germplasm, Map, Markers, Publications, QTLs, Sequences, Trait Evaluation, MegaSearch), Tools (BIMS, BLAST+, CottonCyc, JBrowse, Map Viewer, Primer3, Sequence Retrieval, Synteny Viewer), International Cotton Genome Initiative (ICGI), and Help sources (User manual, FAQs). Also provides Quick Start links for Major Species and Tools.
Website Statistics
data.wu.ac.at
data.europa.eu
csv, pdf
Updated Jun 11, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lincolnshire County Council (2018). Website Statistics [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/M2ZkZDBjOTUtMzNhYi00YWRjLWI1OWMtZmUzMzA5NjM0ZTdk
Explore at:
csv, pdfAvailable download formats
Dataset updated
Jun 11, 2018
Dataset provided by
Lincolnshire County Councilhttp://www.lincolnshire.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
This Website Statistics dataset has four resources showing usage of the Lincolnshire Open Data website. Web analytics terms used in each resource are defined in their accompanying Metadata file.

Website Usage Statistics: This document shows a statistical summary of usage of the Lincolnshire Open Data site for the latest calendar year.

Website Statistics Summary: This dataset shows a website statistics summary for the Lincolnshire Open Data site for the latest calendar year.

Webpage Statistics: This dataset shows statistics for individual Webpages on the Lincolnshire Open Data site by calendar year.

Dataset Statistics: This dataset shows cumulative totals for Datasets on the Lincolnshire Open Data site that have also been published on the national Open Data site Data.Gov.UK - see the Source link.

Note: Website and Webpage statistics (the first three resources above) show only UK users, and exclude API calls (automated requests for datasets). The Dataset Statistics are confined to users with javascript enabled, which excludes web crawlers and API calls.

These Website Statistics resources are updated annually in January by the Lincolnshire County Council Business Intelligence team. For any enquiries about the information contact opendata@lincolnshire.gov.uk.
D
Website Analytics
data.nola.gov
gimi9.com
+4more
csv, xlsx, xml
Updated Feb 2, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Information Technology and Innovation Web Team (2017). Website Analytics [Dataset]. https://data.nola.gov/City-Administration/Website-Analytics/62d3-pst8
Explore at:
xlsx, csv, xmlAvailable download formats
Dataset updated
Feb 2, 2017
Dataset authored and provided by
Information Technology and Innovation Web Team
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This data about nola.gov provides a window into how people are interacting with the the City of New Orleans online. The data comes from a unified Google Analytics account for New Orleans. We do not track individuals and we anonymize the IP addresses of all visitors.
p
Website Designers in Louisiana, United States - 540 Verified Listings...
poidata.io
csv, excel, json
Updated Jul 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Website Designers in Louisiana, United States - 540 Verified Listings Database [Dataset]. https://www.poidata.io/report/website-designer/united-states/louisiana
Explore at:
csv, excel, jsonAvailable download formats
Dataset updated
Jul 2, 2025
Dataset provided by
Poidata.io
Area covered
Louisiana, United States
Description
Comprehensive dataset of 540 Website designers in Louisiana, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
r
RAD-IT database
catalog.riits.net
Updated Nov 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). RAD-IT database [Dataset]. https://catalog.riits.net/dataset/rad-it-database
Explore at:
Dataset updated
Nov 16, 2021
Description
The RAD-IT database, based on version 8.1 of RAD-IT and ARC-IT, is available for download, and includes the interconnect details of the ITS elements and data flows. Interconnect diagrams developed from this database are used throughout the laconnect-it.com website.
R
Real-Time Index Database Report
marketreportanalytics.com
doc, pdf, ppt
Updated Apr 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Report Analytics (2025). Real-Time Index Database Report [Dataset]. https://www.marketreportanalytics.com/reports/real-time-index-database-75396
Explore at:
ppt, doc, pdfAvailable download formats
Dataset updated
Apr 10, 2025
Dataset authored and provided by
Market Report Analytics
License
https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The real-time index database market is experiencing robust growth, driven by the increasing demand for immediate insights from large volumes of data across diverse sectors. The market's expansion is fueled by the proliferation of IoT devices generating massive real-time data streams, the need for faster decision-making in competitive environments, and the rise of sophisticated analytics applications requiring rapid data access. Cloud-based solutions dominate the market due to their scalability, cost-effectiveness, and ease of deployment, attracting both individual users and large enterprises. However, concerns around data security and latency in cloud-based systems present some restraints. The on-premises segment, while smaller, continues to cater to businesses with stringent data sovereignty requirements or those managing exceptionally sensitive information. Key players like Elastic, Amazon Web Services, Apache Solr, Splunk, and Microsoft are shaping the market landscape through continuous innovation and competitive offerings. Geographic distribution reflects the concentration of technological infrastructure and data generation, with North America and Europe currently leading the market, followed by the Asia-Pacific region showing significant potential for future growth. The market's Compound Annual Growth Rate (CAGR) suggests a consistent upward trajectory, indicating continued investment and market expansion throughout the forecast period. The competitive dynamics are marked by a mix of established players and emerging entrants. Established players leverage their existing infrastructure and customer bases, while new entrants focus on niche areas and innovative solutions. The market is also witnessing increased adoption of hybrid models combining cloud and on-premises solutions to balance cost-efficiency, security, and performance. Future growth will depend on technological advancements, particularly in areas like distributed ledger technology and edge computing, which will enhance the real-time capabilities and scalability of index databases. Furthermore, the increasing focus on data governance and regulatory compliance will also influence market adoption and shape the development of future solutions. The market is anticipated to witness a sustained period of growth, fueled by the ever-growing demand for real-time data analytics and insights across various sectors and regions.
Z
Network Traffic Analysis: Data and Code
data.niaid.nih.gov
zenodo.org
Updated Jun 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Homan, Sophia (2024). Network Traffic Analysis: Data and Code [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11479410
Explore at:
Dataset updated
Jun 12, 2024
Dataset provided by
Homan, Sophia
Moran, Madeline
Soni, Shreena
Chan-Tin, Eric
Ferrell, Nathan
Honig, Joshua
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Code:

Packet_Features_Generator.py & Features.py

To run this code:

pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j

-h, --help show this help message and exit -i TXTFILE input text file -x X Add first X number of total packets as features. -y Y Add first Y number of negative packets as features. -z Z Add first Z number of positive packets as features. -ml Output to text file all websites in the format of websiteNumber1,feature1,feature2,... -s S Generate samples using size s. -j

Purpose:

Turns a text file containing lists of incomeing and outgoing network packet sizes into separate website objects with associative features.

Uses Features.py to calcualte the features.

startMachineLearning.sh & machineLearning.py

To run this code:

bash startMachineLearning.sh

This code then runs machineLearning.py in a tmux session with the nessisary file paths and flags

Options (to be edited within this file):

--evaluate-only to test 5 fold cross validation accuracy

--test-scaling-normalization to test 6 different combinations of scalers and normalizers

Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use

--grid-search to test the best grid search hyperparameters - note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:' - once best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'

Purpose:

Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and provides results using cross validation. These results include the best scaling and normailzation options for each data set as well as the best grid search hyperparameters based on the provided ranges.

Data

Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queres (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality head set.

Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:

First number is a classification number to denote what website, query, or vr action is taking place.

The remaining numbers in each line denote:

The size of a packet,

and the direction it is traveling.

negative numbers denote incoming packets

positive numbers denote outgoing packets

Figure 4 Data

This data uses specific lines from the Virtual Reality.txt file.

The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.

The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.

The .xlsx and .csv file are identical

Each file includes (from right to left):

The origional packet data,

each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,

and the final Cumulative Distrubution Function (CDF) caluclation that generated the Figure 4 Graph.
d
Data from: CottonGen CottonCyc Pathways Database
catalog.data.gov
agdatacommons.nal.usda.gov
+1more
Updated Apr 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). CottonGen CottonCyc Pathways Database [Dataset]. https://catalog.data.gov/dataset/cottongen-cottoncyc-pathways-database-a85f4
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Agricultural Research Service
Description
The CottonGen CottonCyc Pathways Database, part of CottonGen, supports searching and browsing the following CottonCyc databases: Cyc pathways for JGI v2.0 G. raimondii D5 genome assembly This Cyc database was constructed using PathwayTools version 20.0 using the gene models from the JGI v2.0 D5 genome assembly of Gossypium raimondii. There has been no manual curation of this Cyc database. Pathway predictions were made using PathwayTools and in-silico v2.1 annotations as provided by JGI. Cyc pathways for CGP-BGI v1.0 G. hirsutum AD1 genome assembly This Cyc database was constructed using PathwayTools version 20.0 using the gene models from the CGP-BGI v1.0 AD1 genome assembly of Gossypium hirsutum. There has been no manual curation of this Cyc database. Pathway predictions were made using PathwayTools and in-silico v1.0 annotations as provided by CGP-BGI. Search parameters include genes, proteins, RNAs, compounds, reactions, pathways, growth media, and BLAST search. Resources in this dataset:Resource Title: Website Pointer to CottonGen CottonCyc Pathways Database. File Name: Web Page, url: http://ptools.cottongen.org/
w
State of California - Data
data.wu.ac.at
Updated Oct 11, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Global (2013). State of California - Data [Dataset]. https://data.wu.ac.at/odso/datahub_io/NDZlMmFjNWEtMGY1ZS00ZWVhLTgzZWEtMmY5ZmFhMGQyMjEx
Explore at:
Dataset updated
Oct 11, 2013
Dataset provided by
Global
Description
About

Data from the State of California. From website:

Access raw State data files, databases, geographic data, and other data sources. Raw State data files can be reused by citizens and organizations for their own web applications and mashups.

Openness

Open. Effectively in the public domain. Terms of use page says:

In general, information presented on this web site, unless otherwise indicated, is considered in the public domain. It may be distributed or copied as permitted by law. However, the State does make use of copyrighted data (e.g., photographs) which may require additional permissions prior to your use. In order to use any information on this web site not owned or created by the State, you must seek permission directly from the owning (or holding) sources. The State shall have the unlimited right to use for any purpose, free of any charge, all information submitted via this site except those submissions made under separate legal contract. The State shall be free to use, for any purpose, any ideas, concepts, or techniques contained in information provided through this site.
d
Altosight | AI Custom Web Scraping Data | 100% Global | Free Unlimited Data...
datarade.ai
.json, .csv, .xls
Updated Sep 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Altosight (2024). Altosight | AI Custom Web Scraping Data | 100% Global | Free Unlimited Data Points | Bypassing All CAPTCHAs & Blocking Mechanisms | GDPR Compliant [Dataset]. https://datarade.ai/data-products/altosight-ai-custom-web-scraping-data-100-global-free-altosight
Explore at:
.json, .csv, .xlsAvailable download formats
Dataset updated
Sep 7, 2024
Dataset authored and provided by
Altosight
Area covered
Tajikistan, Guatemala, Chile, Wallis and Futuna, Paraguay, Svalbard and Jan Mayen, Czech Republic, Singapore, Côte d'Ivoire, Greenland
Description
Altosight | AI Custom Web Scraping Data

✦ Altosight provides global web scraping data services with AI-powered technology that bypasses CAPTCHAs, blocking mechanisms, and handles dynamic content.

We extract data from marketplaces like Amazon, aggregators, e-commerce, and real estate websites, ensuring comprehensive and accurate results.

✦ Our solution offers free unlimited data points across any project, with no additional setup costs.

We deliver data through flexible methods such as API, CSV, JSON, and FTP, all at no extra charge.

― Key Use Cases ―

➤ Price Monitoring & Repricing Solutions

🔹 Automatic repricing, AI-driven repricing, and custom repricing rules 🔹 Receive price suggestions via API or CSV to stay competitive 🔹 Track competitors in real-time or at scheduled intervals

➤ E-commerce Optimization

🔹 Extract product prices, reviews, ratings, images, and trends 🔹 Identify trending products and enhance your e-commerce strategy 🔹 Build dropshipping tools or marketplace optimization platforms with our data

➤ Product Assortment Analysis

🔹 Extract the entire product catalog from competitor websites 🔹 Analyze product assortment to refine your own offerings and identify gaps 🔹 Understand competitor strategies and optimize your product lineup

➤ Marketplaces & Aggregators

🔹 Crawl entire product categories and track best-sellers 🔹 Monitor position changes across categories 🔹 Identify which eRetailers sell specific brands and which SKUs for better market analysis

➤ Business Website Data

🔹 Extract detailed company profiles, including financial statements, key personnel, industry reports, and market trends, enabling in-depth competitor and market analysis

🔹 Collect customer reviews and ratings from business websites to analyze brand sentiment and product performance, helping businesses refine their strategies

➤ Domain Name Data

🔹 Access comprehensive data, including domain registration details, ownership information, expiration dates, and contact information. Ideal for market research, brand monitoring, lead generation, and cybersecurity efforts

➤ Real Estate Data

🔹 Access property listings, prices, and availability 🔹 Analyze trends and opportunities for investment or sales strategies

― Data Collection & Quality ―

► Publicly Sourced Data: Altosight collects web scraping data from publicly available websites, online platforms, and industry-specific aggregators

► AI-Powered Scraping: Our technology handles dynamic content, JavaScript-heavy sites, and pagination, ensuring complete data extraction

► High Data Quality: We clean and structure unstructured data, ensuring it is reliable, accurate, and delivered in formats such as API, CSV, JSON, and more

► Industry Coverage: We serve industries including e-commerce, real estate, travel, finance, and more. Our solution supports use cases like market research, competitive analysis, and business intelligence

► Bulk Data Extraction: We support large-scale data extraction from multiple websites, allowing you to gather millions of data points across industries in a single project

► Scalable Infrastructure: Our platform is built to scale with your needs, allowing seamless extraction for projects of any size, from small pilot projects to ongoing, large-scale data extraction

― Why Choose Altosight? ―

✔ Unlimited Data Points: Altosight offers unlimited free attributes, meaning you can extract as many data points from a page as you need without extra charges

✔ Proprietary Anti-Blocking Technology: Altosight utilizes proprietary techniques to bypass blocking mechanisms, including CAPTCHAs, Cloudflare, and other obstacles. This ensures uninterrupted access to data, no matter how complex the target websites are

✔ Flexible Across Industries: Our crawlers easily adapt across industries, including e-commerce, real estate, finance, and more. We offer customized data solutions tailored to specific needs

✔ GDPR & CCPA Compliance: Your data is handled securely and ethically, ensuring compliance with GDPR, CCPA and other regulations

✔ No Setup or Infrastructure Costs: Start scraping without worrying about additional costs. We provide a hassle-free experience with fast project deployment

✔ Free Data Delivery Methods: Receive your data via API, CSV, JSON, or FTP at no extra charge. We ensure seamless integration with your systems

✔ Fast Support: Our team is always available via phone and email, resolving over 90% of support tickets within the same day

― Custom Projects & Real-Time Data ―

✦ Tailored Solutions: Every business has unique needs, which is why Altosight offers custom data projects. Contact us for a feasibility analysis, and we’ll design a solution that fits your goals

✦ Real-Time Data: Whether you need real-time data delivery or scheduled updates, we provide the flexibility to receive data when you need it. Track price changes, monitor product trends, or gather...
p
Website Designers in Tennessee, United States - 1,027 Verified Listings...
poidata.io
csv, excel, json
Updated Aug 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Website Designers in Tennessee, United States - 1,027 Verified Listings Database [Dataset]. https://www.poidata.io/report/website-designer/united-states/tennessee
Explore at:
csv, json, excelAvailable download formats
Dataset updated
Aug 2, 2025
Dataset provided by
Poidata.io
Area covered
United States, Tennessee
Description
Comprehensive dataset of 1,027 Website designers in Tennessee, United States as of August, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
n
RsiteDB- RNA binding sites database
neuinfo.org
scicrunch.org
+1more
Updated Jul 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). RsiteDB- RNA binding sites database [Dataset]. http://identifiers.org/RRID:SCR_007906
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_007906
Dataset updated
Jul 26, 2025
Description
It is a database that details the interactions of extruded, unpaired RNA nucleotide bases. It presents and classifies the protein binding pockets that accommodate them, and also allows the recognition of similar protein binding patters involved in interactions with different RNA molecules. Given an unbound structure of a target protein, it allows the prediction of its RNA nucleotide binding sites. The goal of this database is to describe, classify, and predict the interactions between protein binding sites and single-stranded RNA bases. Specifically, RsiteDB describes the protein binding pockets that accommodate extruded nucleotides not involved in RNA base pairing. RsiteDB has two modes of operation. Analysis and classification of protein-RNA interactions: Given a protein-RNA complex RsiteDB analyzes its nucleotide and dinucleotide binding sites. It details the properties of the protein binding pockets that accommodate these extruded nucleotides and presents a list of proteins with similar binding pockets. These proteins may have a totally different overall sequences and structural folds. RsiteDB details and visualizes the features shared by all the binding sites classified to the same cluster. Prediction of RNA dinucleotide binding sites: Given a target, potentially unbound, protein structure we search its surface for regions similar to the created 3-D consensus binding patterns of RNA dinucleotides. The recognized regions are predicted to serve as binding sites. Using leave-one-out tests, the success rate of these predictions was estimated to be about 80%. It must be noted that currently we do not aim to predict whether a protein can bind RNA; rather, given an unbound RNA binding protein, our goal is to predict its binding sites and their modes of interaction. In addition, due to a low number of single nucleotide clusters, currently, we do not use them for the prediction.
p
Website Designers in Netherlands - 16,048 Verified Listings Database
poidata.io
csv, excel, json
Updated Jul 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Website Designers in Netherlands - 16,048 Verified Listings Database [Dataset]. https://www.poidata.io/report/website-designer/netherlands
Explore at:
csv, excel, jsonAvailable download formats
Dataset updated
Jul 2, 2025
Dataset provided by
Poidata.io
Area covered
Netherlands
Description
Comprehensive dataset of 16,048 Website designers in Netherlands as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
f
WP-Script | Web Hosting & Domain Names | Technology Data
datastore.forage.ai
Updated Nov 20, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). WP-Script | Web Hosting & Domain Names | Technology Data [Dataset]. https://datastore.forage.ai/searchresults/?resource_keyword=web
Explore at:
Dataset updated
Nov 20, 2024
Description
WP-Script is a company that provides WordPress themes and plugins for creating adult sites. They offer a range of products, including seven customizable adult WordPress themes and thirteen powerful adult WordPress plugins. Their products are designed to be easy to use and can help entrepreneurs create professional-looking adult sites with minimal technical expertise.

With WP-Script, you can start your adult site in six easy steps. They also offer a 14-day money-back guarantee, giving you the opportunity to test their products risk-free. Additionally, they provide premium support to help you resolve any issues you may encounter. Their customers love their products, citing excellent themes, easy installation, and good customer support.
s
Data from: World Database on Protected Areas
fsm-data.sprep.org
pacificdata.org
+13more
geojson, html, jpeg +3
Updated Feb 15, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UN Environment World Conservation Monitoring Centre (UNEP-WCMC) (2022). World Database on Protected Areas [Dataset]. https://fsm-data.sprep.org/dataset/world-database-protected-areas
Explore at:
html, jpeg, pdf, zip, geojson, websiteAvailable download formats
Dataset updated
Feb 15, 2022
Dataset provided by
The Nature Conservancy
Authors
UN Environment World Conservation Monitoring Centre (UNEP-WCMC)
License
Public Domain Mark 1.0https://creativecommons.org/publicdomain/mark/1.0/
License information was derived automatically
Area covered
136.54769897461 7.3188817303668, 152.98324584961 3.995780512963, 155.88363647461 0.043945308191358, 153.42269897461 9.9255659124055, 142.61215209961 5.5722498011139, 164.23324584961 4.7844689665794, 154.38949584961 0.39550467153202, 162.91488647461 6.1842461612806, POLYGON ((136.54769897461 10.531020008465, 139.71176147461 11.135287077054)), Federated States of Micronesia
Description
The World Database on Protected Areas (WDPA) is the most comprehensive global database of marine and terrestrial protected areas, updated on a monthly basis, and is one of the key global biodiversity data sets being widely used by scientists, businesses, governments, International secretariats and others to inform planning, policy decisions and management. The WDPA is a joint project between UN Environment and the International Union for Conservation of Nature (IUCN). The compilation and management of the WDPA is carried out by UN Environment World Conservation Monitoring Centre (UNEP-WCMC), in collaboration with governments, non-governmental organisations, academia and industry. There are monthly updates of the data which are made available online through the Protected Planet website where the data is both viewable and downloadable. Data and information on the world's protected areas compiled in the WDPA are used for reporting to the Convention on Biological Diversity on progress towards reaching the Aichi Biodiversity Targets (particularly Target 11), to the UN to track progress towards the 2030 Sustainable Development Goals, to some of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services (IPBES) core indicators, and other international assessments and reports including the Global Biodiversity Outlook, as well as for the publication of the United Nations List of Protected Areas. Every two years, UNEP-WCMC releases the Protected Planet Report on the status of the world's protected areas and recommendations on how to meet international goals and targets. Many platforms are incorporating the WDPA to provide integrated information to diverse users, including businesses and governments, in a range of sectors including mining, oil and gas, and finance. For example, the WDPA is included in the Integrated Biodiversity Assessment Tool, an innovative decision support tool that gives users easy access to up-to-date information that allows them to identify biodiversity risks and opportunities within a project boundary. The reach of the WDPA is further enhanced in services developed by other parties, such as the Global Forest Watch and the Digital Observatory for Protected Areas, which provide decision makers with access to monitoring and alert systems that allow whole landscapes to be managed better. Together, these applications of the WDPA demonstrate the growing value and significance of the Protected Planet initiative.

Facebook

Twitter

Click to copy link

Link copied

Cite

L3S (2022). OER sample data-set [Dataset]. https://data.uni-hannover.de/dataset/oer-sample-data-set

OER sample data-set

Explore at:

csv(6260265)Available download formats

Dataset updated

Jan 20, 2022

Dataset authored and provided by

L3S

License

Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically

Description

This data-set includes information about a sample of 8,887 of Open Educational Resources (OERs) from SkillsCommons website. It contains title, description, URL, type, availability date, issued date, subjects, and the availability of following metadata: level, time_required to finish, and accessibility.

This data-set has been used to build a metadata scoring and quality prediction model for OERs.

Clear search

Close search

Google apps

Main menu

OER sample data-set

Global Web Data | Web Scraping Data | Job Postings Data | Source: Company...

Web Scraping Data | Key Customers Domain Name Data | Scanning Logos found on...

Data from: Afromoths, online database of Afrotropical moth species...

National Legal Database Website Instructions

Data from: CottonGen: Cotton Database Resources

Website Statistics

Website Analytics

Website Designers in Louisiana, United States - 540 Verified Listings...

RAD-IT database

Real-Time Index Database Report

Network Traffic Analysis: Data and Code

Data from: CottonGen CottonCyc Pathways Database

State of California - Data

About

Openness

Altosight | AI Custom Web Scraping Data | 100% Global | Free Unlimited Data...

Website Designers in Tennessee, United States - 1,027 Verified Listings...

RsiteDB- RNA binding sites database

Website Designers in Netherlands - 16,048 Verified Listings Database

WP-Script | Web Hosting & Domain Names | Technology Data

Data from: World Database on Protected Areas

OER sample data-setSee More Versions

OER sample data-set