This guide brings together online resources that contain U.S. government documents. Some are freely available to anyone with Internet access. Others include subscription databases accessible with a DHS device.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Ireland - Individuals using the internet for finding information about goods and services was 93.17% in December 2024, according to Eurostat. Trading Economics provides the current value, a historical data chart, and related indicators for this series, last updated from Eurostat in July 2025. Historically, the series reached a record high of 93.17% in December 2024 and a record low of 52.91% in December 2011.
According to a 2021 survey in Brazil, half of the internet users surveyed reported having searched online for information about health or health services, down from 53 percent of respondents the previous year. In March 2020, the outbreak of COVID-19 led to an increase in Google searches for items such as alcohol-based sanitizer.
The share of Hungarians researching health-related information on the internet did not increase considerably over the period observed. As of 2019, 75 percent of Hungarians used the internet to find information about health, an increase of only two percentage points compared to 2015.
https://creativecommons.org/publicdomain/zero/1.0/
This dataset collects job offers obtained by web scraping and filtered by specific keywords, locations and times. It lets users search the offers precisely and explore options that match their personal situation, skill set, and preferences for location and schedule. The columns give job titles, employer names, locations, and time frames, along with other parameters that help in choosing a next career opportunity.
This dataset is a great resource for those looking to find an optimal work solution based on keywords, location and time parameters. With this information, users can quickly and easily search through job offers that best fit their needs. Here are some tips on how to use this dataset to its fullest potential:
Start by identifying what type of job offer you want to find. The keyword column helps you narrow down your search to job postings that contain the word or phrase you are looking for.
Next, consider where the job is located. The Location column (Ubicació) tells you where each posting is from, so make sure it is somewhere that suits your needs.
Finally, consider when the position is available. The Time frame column (Temps_Oferta) indicates when each posting was made and whether the role is full-time, part-time, or casual/temporary, so check that it meets your requirements before applying.
Additionally, if hours per week or further schedule details are important criteria, see the Horari and Temps_Oferta columns. Once all three criteria (keywords, location and time frame) are covered, look at the Empresa (company name) and Nom_Oferta (offer name) columns to get an idea of who would be employing you.
Together, these fields should give a motivated job seeker what they need to find a suitable position.
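The filtering steps above can be sketched in plain Python. This is a minimal illustration, not the dataset author's method: the sample rows are invented, the location column is assumed to be spelled Ubicació, and `filter_offers` is a hypothetical helper that applies a case-insensitive substring match to the offer name, location and schedule columns.

```python
import csv
import io

# Invented sample rows for illustration only. The real file is
# web_scraping_information_offers.csv; column names follow its column table.
SAMPLE_CSV = """Nom_Oferta,Empresa,Ubicació,Temps_Oferta,Horari
Data Analyst,Acme SL,Barcelona,Fa 2 dies,Jornada completa
Cambrer/a,Bar Example,Girona,Fa 5 dies,Jornada parcial
Junior Data Engineer,Example Tech,Barcelona,Fa 1 dia,Jornada completa
"""


def filter_offers(rows, keyword=None, location=None, schedule=None):
    """Keep rows whose offer name, location and schedule contain the given text.

    Matching is a case-insensitive substring test; a filter left as None is skipped.
    """
    def contains(value, needle):
        return needle is None or needle.lower() in (value or "").lower()

    return [
        row for row in rows
        if contains(row.get("Nom_Oferta"), keyword)
        and contains(row.get("Ubicació"), location)
        and contains(row.get("Horari"), schedule)
    ]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
matches = filter_offers(rows, keyword="data", location="barcelona")
for row in matches:
    print(row["Nom_Oferta"], "|", row["Empresa"], "|", row["Horari"])
```

To run this against the real file, replace the `io.StringIO(SAMPLE_CSV)` source with the opened CSV file, assuming it is UTF-8 encoded.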
- Machine learning can be used to group job offers in order to identify similarities and differences between them. This could allow users to target their search for a work solution more specifically.
- The data can be used to compare job offerings across different areas or types of jobs, enabling users to make better-informed decisions about their career options and goals.
- It may also provide insight into the local job market, enabling companies and employers to identify potential new opportunities or trends that may previously have gone unnoticed.
If you use this dataset in your research, please credit the original authors.
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: web_scraping_information_offers.csv

| Column name  | Description                         |
|:-------------|:------------------------------------|
| Nom_Oferta   | Name of the job offer. (String)     |
| Empresa      | Company offering the job. (String)  |
| Ubicació     | Location of the job offer. (String) |
| Temps_Oferta | Time of the job offer. (String)     |
| Horari       | Schedule of the job offer. (String) |
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Malta - Individuals using the internet for finding information about goods and services was 80.67% in December 2024, according to Eurostat. Trading Economics provides the current value, a historical data chart, and related indicators for this series, last updated from Eurostat in July 2025. Historically, the series reached a record high of 80.67% in December 2024 and a record low of 50.10% in December 2013.
https://whoisdatacenter.com/terms-of-use/
Explore the historical Whois records related to com-device-find.info (Domain). Get insights into ownership history and changes over time.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Data from the Interactive Social Book Search Track Series 2014-2016
This data table contains a list of all hospitals that have been registered with Medicare. This list includes addresses, phone numbers, hospital types and quality of care information. The quality of care data is provided for over 4,000 Medicare-certified hospitals, including over 130 Veterans Administration (VA) medical centers, across the country. You can use this data to find hospitals and compare the quality of their care. This data was created through the efforts of the Centers for Medicare & Medicaid Services (CMS) in collaboration with organizations representing consumers, hospitals, doctors, employers, accrediting organizations, and other federal agencies. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset.
U.S. Government Works https://www.usa.gov/government-works
License information was derived automatically
The Washington State Department of Health presents this information as a service to the public. True and correct copies of legal disciplinary actions taken after July 1998 are available on our Provider Credential Search site. These records are considered certified by the Department of Health.
This includes information on health care providers.
Please contact our Customer Service Center at 360-236-4700 for information about actions before July 1998. The information on this site comes directly from our database and is updated daily at 10:00 a.m. This data is a primary source for verification of credentials and is extracted from the primary database at 2:00 a.m. daily.
News releases about disciplinary actions taken against Washington State healthcare providers, agencies or facilities are on the agency's Newsroom webpage.
Disclaimer: The absence of information in the Provider Credential Search system doesn't imply any recommendation, endorsement or guarantee of competence of any healthcare professional. The presence of information in this system doesn't imply a provider isn't competent or qualified to practice. The reader is encouraged to carefully evaluate any information found in this data set.
Search for a business by name. You can obtain business information and then proceed to purchase a certificate of good standing or other documents. The purpose of this search is simply to determine whether a company/entity exists and to provide basic information on the company/entity.
The share of internet users informing themselves about goods and services online in Romania increased by 5.6 percentage points in 2022 compared with the previous year. The share thus reached a peak of 49.28 percent in 2022. The EU survey on the use of Information and Communication Technologies (ICT) in households and by individuals is an annual survey, conducted since 2002, that aims to collect and disseminate harmonised and comparable information on the use of ICT in households and by individuals. Data presented in this domain are collected on a yearly basis by the National Statistical Institutes and are based on Eurostat's annual model questionnaire. This questionnaire is updated each year to reflect the evolving situation of information and communication technologies. Find more statistics on other topics about Romania with key insights such as share of daily internet users, share of internet users seeking health information online, share of internet users looking for and applying for jobs online, share of internet users reading news online, share of internet users engaging in online learning activities, and share of people that upload self-created content.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In this repository, we include code to prepare the dataset, train the GemNet model, build the faiss index, search the faiss index, and visualize the search results in the notebook faiss-gemnet-qm9-mp.ipynb. It reproduces the examples in our manuscript for the QM9 and Materials Project datasets. We did not include the data for the OC20 dataset here because of its large size (> 50 GB); the code to process the OC20 dataset is almost the same as the code included in the notebook for the QM9 dataset.
We include the intermediate data (GemNet checkpoints, lmdb, faiss index and the search results for the QM9 and Materials Project datasets) in the directory example-data. We also put the GemNet checkpoint for the OC20 dataset in this directory. The training and evaluation of the Gaussian process regression model using the searched molecules for the query benzene are demonstrated in the ben-gp-data directory, in which the notebook qm9-gp-gemnet-morgan-random-nrg.ipynb can be run on Colab.
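To illustrate the index-and-search step in general terms, the sketch below runs an exact nearest-neighbour search by Euclidean distance in plain Python. It is not the repository's code and does not use the faiss library; it only shows the idea that an exhaustive (flat) similarity index implements. The three-dimensional vectors and molecule names are invented for illustration, and learned embeddings would normally have many more dimensions.

```python
import heapq
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def search(index, query, k=2):
    """Return the k (distance, key) pairs closest to query, nearest first.

    `index` is a plain dict mapping a key to its embedding vector. Every
    stored vector is compared with the query, so the result is exact.
    """
    scored = ((l2_distance(vec, query), key) for key, vec in index.items())
    return heapq.nsmallest(k, scored)


# Hypothetical embeddings, invented for this example.
index = {
    "benzene": [0.9, 0.1, 0.0],
    "toluene": [0.8, 0.2, 0.1],
    "water": [0.0, 0.9, 0.4],
}

results = search(index, [0.9, 0.1, 0.0], k=2)
for distance, key in results:
    print(f"{key}: {distance:.4f}")
```

The exhaustive scan costs one distance computation per stored vector, which is why dedicated libraries become attractive once a collection grows large.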
The Texas Department of Insurance, Division of Workers' Compensation (DWC) maintains a database of professional medical billing services (SV1). It contains charges, payments, and treatments billed on a CMS-1500 form by doctors and other health care professionals who treat injured employees, including ambulatory surgical centers, with dates of service more than five years old going back to 2010. For datasets from the past five years, see professional medical billing services (SV1) detail information. The detail contains information to identify insurance carriers, injured employees, employers, place of service, and diagnostic information. The bill details are individual line items that are grouped in the header section of a single bill. The bill selection date and bill ID must be used to group individual line items into a single bill. Find more information in our professional medical billing services (SV1) detail data dictionary. See professional medical billing services (SV1) header information – historical for the corresponding header records related to this dataset. Go to our page on DWC medical state reporting public use data file (PUDF) to learn more about using this information.
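The grouping rule described above (bill selection date plus bill ID identify one bill) can be sketched as follows. The field names `bill_selection_date`, `bill_id`, `line_number` and `charge` are hypothetical stand-ins, not names taken from the DWC data dictionary, and the rows are invented. The sample deliberately reuses one bill ID under two selection dates to show why the key has two parts.

```python
from collections import defaultdict

# Invented line items with hypothetical field names.
line_items = [
    {"bill_selection_date": "2012-03-01", "bill_id": "A1", "line_number": 1, "charge": 120.0},
    {"bill_selection_date": "2012-03-01", "bill_id": "A1", "line_number": 2, "charge": 80.0},
    {"bill_selection_date": "2012-04-01", "bill_id": "A1", "line_number": 1, "charge": 50.0},
    {"bill_selection_date": "2012-03-01", "bill_id": "B7", "line_number": 1, "charge": 200.0},
]


def group_into_bills(items):
    """Group line items by the composite key (bill_selection_date, bill_id)."""
    bills = defaultdict(list)
    for item in items:
        key = (item["bill_selection_date"], item["bill_id"])
        bills[key].append(item)
    return dict(bills)


bills = group_into_bills(line_items)

# Total charge per bill, computed from its grouped line items.
totals = {key: sum(item["charge"] for item in items) for key, items in bills.items()}

for key, total in sorted(totals.items()):
    print(key, total)
```

Grouping on the bill ID alone would merge the two "A1" entries in this sample into one bill, which is the error the composite key avoids.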
Please note: updates to this dataset are paused while the Find-It API is being revamped (2025-04-14)
This dataset lists information about public programs and events offered by community-based organizations, city agencies, and educational institutions that serve people who live or work in Cambridge, MA. The information is created by the organizations that input their information into the Find It Cambridge website. This dataset includes useful information about their events and programs.
DWP Jobcentre Plus office data that covers Jobcentre name, address and contact number, the postcodes it deals with and where each benefit is dealt with.
This data includes information on arsenic violations in the US, including time patterns and spatial patterns in arsenic violations, and people served by systems in violation. Most of the data is from the Safe Drinking Water Information System. This dataset is associated with the following publication: Foster, S., M. Pennino, J. Compton, S. Leibowitz, and M. Kile. Arsenic Drinking Water Violations Decreased Across the United States Following Revision of the Maximum Contaminant Level. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 53(19): 11478-11485, (2019).
The Texas Department of Insurance, Division of Workers' Compensation (DWC) maintains a database of institutional medical billing services (SV2). It contains charges, payments, and treatments billed on a CMS-1450 form (UB-92, UB-04) by hospitals and medical facilities that treat injured employees, excluding ambulatory surgical centers, with dates of service for the last five years. For datasets going back to 2010, see institutional medical billing services (SV2) header information – historical. The header identifies insurance carriers, injured employees, employers, place of service, and diagnostic information. The bill header information groups individual line items reported in the detail section. The bill selection date and bill ID must be used to group individual line items into a single bill. Find more information in our institutional medical billing services (SV2) header data dictionary. See institutional medical billing services (SV2) detail information for the corresponding detail records related to this dataset. Go to our page on DWC medical state reporting public use data file (PUDF) to learn more about using this information.
Access to information (ATI) policies are often praised for strengthening transparency, accountability, and trust in public institutions, yet evidence that they improve institutional performance is mixed. We argue that an important impediment to the effective operation of such policies is the failure of bureaucrats to comply with information requests that could expose poor performance. Analyzing a new dataset on the performance of approximately 20,000 aid projects financed by 12 donor agencies in 183 countries, we find that enforcement matters: the adoption of ATI policies by agencies is associated with better project outcomes when these policies include independent appeals processes for denied information requests but with no improvement when they do not. We also recover evidence that project staff adjust their behavior in anticipation of ATI appeals, and that the performance dividends of appeals processes increase when bottom-up collective action is easier and mechanisms of project oversight are weak.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The fact that Internet companies may record our personal data and track our online behavior for commercial or political purposes has drawn attention to online privacy. This has also led to the development of search engines that promise privacy and no tracking. Search engines also have a major role in spreading low-quality health information such as that of anti-vaccine websites. This study investigates the relationship between search engines' approach to privacy and the scientific quality of the information they return. We analyzed the first 30 webpages returned when searching “vaccines autism” in English, Spanish, Italian, and French. The results show that not only “alternative” search engines (Duckduckgo, Ecosia, Qwant, Swisscows, and Mojeek) but also other commercial engines (Bing, Yahoo) often return more anti-vaccine pages (10–53%) than Google.com (0%). Some localized versions of Google, however, returned more anti-vaccine webpages (up to 10%) than Google.com. Health information returned by search engines has an impact on public health and, specifically, on the acceptance of vaccines. The issue of information quality when seeking information for making health-related decisions also impacts the ethical aspect represented by the right to informed consent. Our study suggests that designing a search engine that is privacy savvy and avoids the filter bubbles that can result from user tracking is necessary but insufficient; instead, mechanisms should be developed to test search engines from the perspective of information quality (particularly for health-related webpages) before they can be deemed trustworthy providers of public health information.