100+ datasets found
  1. B

    Google Data Search Exercises

    • borealisdata.ca
    • search.dataone.org
    Updated Aug 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julie Marcoux (2024). Google Data Search Exercises [Dataset]. http://doi.org/10.5683/SP3/MW7BKH
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 26, 2024
    Dataset provided by
    Borealis
    Authors
    Julie Marcoux
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Google data search exercises can be used to practice finding data or statistics on a topic of interest, including using Google's own internal tools and by using advanced operators.

  2. d

    Google SERP Data, Web Search Data, Google Images Data | Real-Time API

    • datarade.ai
    .json, .csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenWeb Ninja, Google SERP Data, Web Search Data, Google Images Data | Real-Time API [Dataset]. https://datarade.ai/data-products/openweb-ninja-google-data-google-image-data-google-serp-d-openweb-ninja
    Explore at:
    .json, .csvAvailable download formats
    Dataset authored and provided by
    OpenWeb Ninja
    Area covered
    Ireland, Tokelau, Burundi, South Georgia and the South Sandwich Islands, Panama, Uganda, Grenada, Barbados, Virgin Islands (U.S.), Uruguay
    Description

    OpenWeb Ninja's Google Images Data (Google SERP Data) API provides real-time image search capabilities for images sourced from all public sources on the web.

    The API enables you to search and access more than 100 billion images from across the web including advanced filtering capabilities as supported by Google Advanced Image Search. The API provides Google Images Data (Google SERP Data) including details such as image URL, title, size information, thumbnail, source information, and more data points. The API supports advanced filtering and options such as file type, image color, usage rights, creation time, and more. In addition, any Advanced Google Search operators can be used with the API.

    OpenWeb Ninja's Google Images Data & Google SERP Data API common use cases:

    • Creative Media Production: Enhance digital content with a vast array of real-time images, ensuring engaging and brand-aligned visuals for blogs, social media, and advertising.

    • AI Model Enhancement: Train and refine AI models with diverse, annotated images, improving object recognition and image classification accuracy.

    • Trend Analysis: Identify emerging market trends and consumer preferences through real-time visual data, enabling proactive business decisions.

    • Innovative Product Design: Inspire product innovation by exploring current design trends and competitor products, ensuring market-relevant offerings.

    • Advanced Search Optimization: Improve search engines and applications with enriched image datasets, providing users with accurate, relevant, and visually appealing search results.

    OpenWeb Ninja's Annotated Imagery Data & Google SERP Data Stats & Capabilities:

    • 100B+ Images: Access an extensive database of over 100 billion images.

    • Images Data from all Public Sources (Google SERP Data): Benefit from a comprehensive aggregation of image data from various public websites, ensuring a wide range of sources and perspectives.

    • Extensive Search and Filtering Capabilities: Utilize advanced search operators and filters to refine image searches by file type, color, usage rights, creation time, and more, making it easy to find exactly what you need.

    • Rich Data Points: Each image comes with more than 10 data points, including URL, title (annotation), size information, thumbnail, and source information, providing a detailed context for each image.

  3. Great Places to Find Free Datasets for Your Next

    • kaggle.com
    zip
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nimaklkhan (2024). Great Places to Find Free Datasets for Your Next [Dataset]. https://www.kaggle.com/datasets/nimaklkhan/great-places-to-find-free-datasets-for-your-next
    Explore at:
    zip(5654 bytes)Available download formats
    Dataset updated
    Aug 6, 2024
    Authors
    nimaklkhan
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    if you’re looking for a job in data analytics, you’ll need a portfolio to demonstrate your expertise. Of course, if you’re new to data analytics, you probably don’t have much expertise! Not to worry. The fact you might not have worked on a paid project yet doesn’t mean you can’t whip up a compelling portfolio using some practice datasets.

    Fortunately, the Internet is awash with these, most of which are completely free to download (thanks to the open data initiative). In this post, we’ll highlight a few first-rate repositories where you can find data on everything from business to finance, planetary science and crime.

    Prefer to watch this information over reading it? Check out this video on dataset resources, presented by our very own in-house data scientist, Tom!

    It seems we turn to Google for everything these days, and data is no exception. Launched in 2018, Google Dataset Search is like Google’s standard search engine, but strictly for data.

    While it’s not the best tool if you prefer to browse, if you have a particular topic or keyword in mind, it won’t disappoint. Google Dataset Search aggregates data from external sources, providing a clear summary of what’s available, a description of the data, who it’s provided by, and when it was last updated. It’s an excellent place to start.

  4. h

    google_search_results_dataset_azerbaijan

    • huggingface.co
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LocalDoc (2024). google_search_results_dataset_azerbaijan [Dataset]. https://huggingface.co/datasets/LocalDoc/google_search_results_dataset_azerbaijan
    Explore at:
    Dataset updated
    Aug 18, 2024
    Dataset authored and provided by
    LocalDoc
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Azerbaijani Google Search Results URLs Dataset

      Overview
    

    The dataset includes multiple entries for each keyword, capturing different URLs and titles that were returned by Google. This allows researchers and developers to easily collect URLs for scraping content related to specific Azerbaijani keywords.

      Structure
    

    The dataset is structured as follows:

    Column Name Description

    keyword The search term entered into Google.

    title The title of the webpage… See the full description on the dataset page: https://huggingface.co/datasets/LocalDoc/google_search_results_dataset_azerbaijan.

  5. Recipes Search Engine Results Data

    • kaggle.com
    zip
    Updated Mar 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elias Dabbas (2019). Recipes Search Engine Results Data [Dataset]. https://www.kaggle.com/datasets/eliasdabbas/recipes-search-engine-results-data
    Explore at:
    zip(6875244 bytes)Available download formats
    Dataset updated
    Mar 30, 2019
    Authors
    Elias Dabbas
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Recipe keywords' positions on search; Google and YouTube.
    These datasets can be interesting for SEO research for the recipes industry.

    Content

    243 national recipes (based on Wikipedia's national dish list)
    2 keyword versions dish recipe and how to make dish
    Total 486 queries (10 results each)

    Google: 4,860 rows (defaults to 10 per result, and some missing)
    YouTube: 1,455 rows (defaults to 5 per result, and some missing)

    Acknowledgements

    Google CSE API, YouTube API, Python, requests, pandas, advertools.

    Inspiration

    It's interesting to know about how things are visible from a search engine perspective, and compare Google and YouTube as well.
    National dishes are mostly delicious as well!

  6. d

    DataForSEO Google Keyword Database, historical and current

    • datarade.ai
    .json, .csv
    Updated Mar 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DataForSEO (2023). DataForSEO Google Keyword Database, historical and current [Dataset]. https://datarade.ai/data-products/dataforseo-google-keyword-database-historical-and-current-dataforseo
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Mar 14, 2023
    Dataset authored and provided by
    DataForSEO
    Area covered
    Spain, Bangladesh, Bolivia (Plurinational State of), Canada, Turkey, Cyprus, Uruguay, Singapore, Bahrain, El Salvador
    Description

    You can check the fields description in the documentation: current Keyword database: https://docs.dataforseo.com/v3/databases/google/keywords/?bash; Historical Keyword database: https://docs.dataforseo.com/v3/databases/google/history/keywords/?bash. You don’t have to download fresh data dumps in JSON or CSV – we can deliver data straight to your storage or database. We send terrabytes of data to dozens of customers every month using Amazon S3, Google Cloud Storage, Microsoft Azure Blob, Eleasticsearch, and Google Big Query. Let us know if you’d like to get your data to any other storage or database.

  7. d

    DataForSEO Google Full (Keywords+SERP) database, historical data available

    • datarade.ai
    .json, .csv
    Updated Aug 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DataForSEO (2023). DataForSEO Google Full (Keywords+SERP) database, historical data available [Dataset]. https://datarade.ai/data-products/dataforseo-google-full-keywords-serp-database-historical-d-dataforseo
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Aug 17, 2023
    Dataset authored and provided by
    DataForSEO
    Area covered
    Paraguay, Sweden, Burkina Faso, United Kingdom, Côte d'Ivoire, Cyprus, South Africa, Portugal, Bolivia (Plurinational State of), Costa Rica
    Description

    You can check the fields description in the documentation: current Full database: https://docs.dataforseo.com/v3/databases/google/full/?bash; Historical Full database: https://docs.dataforseo.com/v3/databases/google/history/full/?bash.

    Full Google Database is a combination of the Advanced Google SERP Database and Google Keyword Database.

    Google SERP Database offers millions of SERPs collected in 67 regions with most of Google’s advanced SERP features, including featured snippets, knowledge graphs, people also ask sections, top stories, and more.

    Google Keyword Database encompasses billions of search terms enriched with related Google Ads data: search volume trends, CPC, competition, and more.

    This database is available in JSON format only.

    You don’t have to download fresh data dumps in JSON – we can deliver data straight to your storage or database. We send terrabytes of data to dozens of customers every month using Amazon S3, Google Cloud Storage, Microsoft Azure Blob, Eleasticsearch, and Google Big Query. Let us know if you’d like to get your data to any other storage or database.

  8. c

    ckanext-dcat

    • catalog.civicdataecosystem.org
    Updated Mar 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). ckanext-dcat [Dataset]. https://catalog.civicdataecosystem.org/dataset/ckanext-dcat
    Explore at:
    Dataset updated
    Mar 13, 2025
    Description

    The DCAT extension for CKAN enhances data portals by enabling the exposure and consumption of metadata using the DCAT vocabulary, facilitating interoperability with other data catalogs. It provides tools for serializing CKAN datasets as RDF documents and harvesting RDF data from external sources, promoting data sharing and reuse. The extension supports various DCAT Application Profiles, and includes features for adapting schemas, validating data, and integrating with search engines like Google Dataset Search. Key Features: DCAT Schemas: Offers pre-built CKAN schemas for common Application Profiles (DCAT AP v1, v2, and v3), which can be customized to align with site-specific requirements. These schemas include tailored form fields and validation rules to ensure DCAT compatibility. DCAT Endpoints: Exposes catalog datasets in different RDF serializations, allowing external systems to easily consume CKAN metadata in a standardized format. RDF Harvester: Enables the import of RDF serializations from other catalogs, automatically creating CKAN datasets based on the harvested metadata. This promotes data aggregation and discovery across different data sources. DCAT-CKAN Mapping: Establishes a base mapping between DCAT and CKAN datasets, facilitating bidirectional transformation of metadata. The mapping is compatible with DCAT-AP v1.1, v2.1, and v3. RDF Parser and Serializer: Includes an RDF parser for extracting CKAN dataset dictionaries from RDF serializations and an RDF serializer for transforming CKAN dataset metadata into different semantic formats. Both components are customizable through profiles. Command Line Interface (CLI): Provides a command-line interface for managing and interacting with the extension's features, such as harvesting and data transformation tasks. Google Dataset Search Integration: Offers support for indexing datasets in Google Dataset Search, improving the visibility of CKAN datasets to a wider audience. Technical Integration: The ckanext-dcat extension extends CKAN's functionality by adding new plugins for RDF harvesting and serialization, allowing users to expose and consume DCAT metadata through the portal and enabling dataset enrichment from external sources. This integration can be customized through profiles that define custom data mappings. Benefits & Impact: By implementing the DCAT extension, CKAN-based data portals can significantly improve their interoperability with other data catalogs and data repositories that support DCAT. This facilitates data sharing, reuse, and discovery, as well as improves the visibility of datasets through indexing in services like Google Dataset Search. The extension's built-in schemas and validation rules ensure that CKAN metadata conforms to DCAT standards, while the RDF harvester simplifies the process of importing data from external sources. Funded by organizations like the Government of Sweden, Vinnova, and FIWARE, the extension has been developed for production use cases and promotes a data-driven ecosystem.

  9. Dataset Metadata for CORD-19

    • kaggle.com
    zip
    Updated May 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google AI (2020). Dataset Metadata for CORD-19 [Dataset]. https://www.kaggle.com/datasets/googleai/dataset-metadata-for-cord19/data
    Explore at:
    zip(6172304 bytes)Available download formats
    Dataset updated
    May 1, 2020
    Dataset provided by
    Googlehttp://google.com/
    Authors
    Google AI
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Context

    This “dataset of metadata” contains paper--dataset pairs of datasets mentioned or referenced in papers comprising the CORD-19 dataset. CORD-19 is an open research dataset on COVID-19 produced by multiple institutions, and Google’s Dataset Search team has enhanced this dataset with additional metadata. Specifically, the metadata for these datasets was collected from their descriptions in schema.org mark-up across various data repositories on the Web.

    Content

    Each row of the table is a paper-dataset pair, with cord_uid, paper title and url from the CORD-19 dataset and the metadata for a dataset.

    Limitations

    • Only datasets that have schema.org metadata on their pages are included
    • Because we identify the paper--dataset correspondences automatically, some correspondences may be missing and some may be spurious.

    Next steps

    Does the linked data provide additional insights into the content of the papers?

    About Dataset Search

    Google's Dataset Search is a tool that makes it easier for researchers, students, and data geeks to discover datasets that they need for their work. It is built on the idea that metadata and data should be open whenever possible.

  10. Z

    Data for study "Direct Answers in Google Search Results"

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Strzelecki, Artur; Rutecka, Paulina (2020). Data for study "Direct Answers in Google Search Results" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3541091
    Explore at:
    Dataset updated
    Jun 9, 2020
    Dataset provided by
    University of Economics in Katowice
    Authors
    Strzelecki, Artur; Rutecka, Paulina
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The goal of this research is to examine direct answers in Google web search engine. Dataset was collected using Senuto (https://www.senuto.com/). Senuto is as an online tool, that extracts data on websites visibility from Google search engine.

    Dataset contains the following elements:

    keyword,

    number of monthly searches,

    featured domain,

    featured main domain,

    featured position,

    featured type,

    featured url,

    content,

    content length.

    Dataset with visibility structure has 743 798 keywords that were resulting in SERPs with direct answer.

  11. Z

    Dataset: A Systematic Literature Review on the topic of High-value datasets

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anastasija Nikiforova; Nina Rizun; Magdalena Ciesielska; Charalampos Alexopoulos; Andrea Miletič (2023). Dataset: A Systematic Literature Review on the topic of High-value datasets [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7944424
    Explore at:
    Dataset updated
    Jun 23, 2023
    Dataset provided by
    Gdańsk University of Technology
    University of Tartu
    University of Zagreb
    University of the Aegean
    Authors
    Anastasija Nikiforova; Nina Rizun; Magdalena Ciesielska; Charalampos Alexopoulos; Andrea Miletič
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains data collected during a study ("Towards High-Value Datasets determination for data-driven development: a systematic literature review") conducted by Anastasija Nikiforova (University of Tartu), Nina Rizun, Magdalena Ciesielska (Gdańsk University of Technology), Charalampos Alexopoulos (University of the Aegean) and Andrea Miletič (University of Zagreb) It being made public both to act as supplementary data for "Towards High-Value Datasets determination for data-driven development: a systematic literature review" paper (pre-print is available in Open Access here -> https://arxiv.org/abs/2305.10234) and in order for other researchers to use these data in their own work.

    The protocol is intended for the Systematic Literature review on the topic of High-value Datasets with the aim to gather information on how the topic of High-value datasets (HVD) and their determination has been reflected in the literature over the years and what has been found by these studies to date, incl. the indicators used in them, involved stakeholders, data-related aspects, and frameworks. The data in this dataset were collected in the result of the SLR over Scopus, Web of Science, and Digital Government Research library (DGRL) in 2023.

    Methodology

    To understand how HVD determination has been reflected in the literature over the years and what has been found by these studies to date, all relevant literature covering this topic has been studied. To this end, the SLR was carried out to by searching digital libraries covered by Scopus, Web of Science (WoS), Digital Government Research library (DGRL).

    These databases were queried for keywords ("open data" OR "open government data") AND ("high-value data*" OR "high value data*"), which were applied to the article title, keywords, and abstract to limit the number of papers to those, where these objects were primary research objects rather than mentioned in the body, e.g., as a future work. After deduplication, 11 articles were found unique and were further checked for relevance. As a result, a total of 9 articles were further examined. Each study was independently examined by at least two authors.

    To attain the objective of our study, we developed the protocol, where the information on each selected study was collected in four categories: (1) descriptive information, (2) approach- and research design- related information, (3) quality-related information, (4) HVD determination-related information.

    Test procedure Each study was independently examined by at least two authors, where after the in-depth examination of the full-text of the article, the structured protocol has been filled for each study. The structure of the survey is available in the supplementary file available (see Protocol_HVD_SLR.odt, Protocol_HVD_SLR.docx) The data collected for each study by two researchers were then synthesized in one final version by the third researcher.

    Description of the data in this data set

    Protocol_HVD_SLR provides the structure of the protocol Spreadsheets #1 provides the filled protocol for relevant studies. Spreadsheet#2 provides the list of results after the search over three indexing databases, i.e. before filtering out irrelevant studies

    The information on each selected study was collected in four categories: (1) descriptive information, (2) approach- and research design- related information, (3) quality-related information, (4) HVD determination-related information

    Descriptive information
    1) Article number - a study number, corresponding to the study number assigned in an Excel worksheet 2) Complete reference - the complete source information to refer to the study 3) Year of publication - the year in which the study was published 4) Journal article / conference paper / book chapter - the type of the paper -{journal article, conference paper, book chapter} 5) DOI / Website- a link to the website where the study can be found 6) Number of citations - the number of citations of the article in Google Scholar, Scopus, Web of Science 7) Availability in OA - availability of an article in the Open Access 8) Keywords - keywords of the paper as indicated by the authors 9) Relevance for this study - what is the relevance level of the article for this study? {high / medium / low}

    Approach- and research design-related information 10) Objective / RQ - the research objective / aim, established research questions 11) Research method (including unit of analysis) - the methods used to collect data, including the unit of analy-sis (country, organisation, specific unit that has been ana-lysed, e.g., the number of use-cases, scope of the SLR etc.) 12) Contributions - the contributions of the study 13) Method - whether the study uses a qualitative, quantitative, or mixed methods approach? 14) Availability of the underlying research data- whether there is a reference to the publicly available underly-ing research data e.g., transcriptions of interviews, collected data, or explanation why these data are not shared? 15) Period under investigation - period (or moment) in which the study was conducted 16) Use of theory / theoretical concepts / approaches - does the study mention any theory / theoretical concepts / approaches? If any theory is mentioned, how is theory used in the study?

    Quality- and relevance- related information
    17) Quality concerns - whether there are any quality concerns (e.g., limited infor-mation about the research methods used)? 18) Primary research object - is the HVD a primary research object in the study? (primary - the paper is focused around the HVD determination, sec-ondary - mentioned but not studied (e.g., as part of discus-sion, future work etc.))

    HVD determination-related information
    19) HVD definition and type of value - how is the HVD defined in the article and / or any other equivalent term? 20) HVD indicators - what are the indicators to identify HVD? How were they identified? (components & relationships, “input -> output") 21) A framework for HVD determination - is there a framework presented for HVD identification? What components does it consist of and what are the rela-tionships between these components? (detailed description) 22) Stakeholders and their roles - what stakeholders or actors does HVD determination in-volve? What are their roles? 23) Data - what data do HVD cover? 24) Level (if relevant) - what is the level of the HVD determination covered in the article? (e.g., city, regional, national, international)

    Format of the file .xls, .csv (for the first spreadsheet only), .odt, .docx

    Licenses or restrictions CC-BY

    For more info, see README.txt

  12. Google Trends

    • console.cloud.google.com
    Updated Jun 11, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    https://console.cloud.google.com/marketplace/browse?filter=partner:BigQuery%20Public%20Datasets%20Program&hl=ES (2022). Google Trends [Dataset]. https://console.cloud.google.com/marketplace/product/bigquery-public-datasets/google-search-trends?hl=ES
    Explore at:
    Dataset updated
    Jun 11, 2022
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Google Searchhttp://google.com/
    Googlehttp://google.com/
    Description

    The Google Trends dataset will provide critical signals that individual users and businesses alike can leverage to make better data-driven decisions. This dataset simplifies the manual interaction with the existing Google Trends UI by automating and exposing anonymized, aggregated, and indexed search data in BigQuery. This dataset includes the Top 25 stories and Top 25 Rising queries from Google Trends. It will be made available as two separate BigQuery tables, with a set of new top terms appended daily. Each set of Top 25 and Top 25 rising expires after 30 days, and will be accompanied by a rolling five-year window of historical data in 210 distinct locations in the United States. This Google dataset is hosted in Google BigQuery as part of Google Cloud's Datasets solution and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery

  13. Google My Business: Local Search Optimization Data

    • kaggle.com
    zip
    Updated Feb 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agung Pambudi (2025). Google My Business: Local Search Optimization Data [Dataset]. https://www.kaggle.com/datasets/agungpambudi/google-my-business-local-search-optimization-data
    Explore at:
    zip(10435 bytes)Available download formats
    Dataset updated
    Feb 15, 2025
    Authors
    Agung Pambudi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Google My Business (GMB) is a platform designed to help you share detailed information about your business when it appears in search results. In addition to a URL and description, you can include photos, videos, contact numbers, operating hours, delivery zones, and links to booking services. Google My Business enables you to create eye-catching listings that enhance visibility when customers search online. It allows your in-store products to be displayed directly on your Google Business Profile. A cover photo, along with previews from Google Maps and Google Street View, gives potential customers a clear idea of what to expect when they visit. However, keep in mind that users can suggest changes to your profile, so it’s important to review it frequently to ensure accuracy.

    Google My Business also highlights key factors to consider for verifying your business presence and enhancing your local search visibility through optimization.

    Data Dictionary

    Column NameData TypeDescription
    location_idIntegerUnique identifier for each location.
    location_nameStringName of the business or location.
    addressStringFull address of the location.
    phone_numbersString/NaNContact phone number(s) for the business (if available).
    latitudeFloatGeographic coordinate (latitude) of the location.
    longitudeFloatGeographic coordinate (longitude) of the location.
    priceString/NaNPrice range of services or products offered (e.g., "SGD 1–10").
    regular_hoursDictionaryBusiness hours for each day of the week.
    service_optionsDictionaryAvailable service options (e.g., dine-in, takeout, delivery).
    average_ratingFloatCustomer rating of the business (e.g., 4.5).
    labelsStringCategory or type of business (e.g., "Halal restaurant").

    Notice on Dataset Usage and Attribution

    This dataset, created by Agung Pambudi, is entirely original and has not been shared previously. It is distributed under the CC BY 4.0 license, which permits unrestricted use, provided the author is appropriately credited. A DOI is included to ensure accurate citation. Please be aware that duplicating this work on Kaggle is prohibited.

  14. d

    DataForSEO Labs API for keyword research and search analytics, real-time...

    • datarade.ai
    .json
    Updated Jun 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DataForSEO (2021). DataForSEO Labs API for keyword research and search analytics, real-time data for all Google locations and languages [Dataset]. https://datarade.ai/data-products/dataforseo-labs-api-for-keyword-research-and-search-analytics-dataforseo
    Explore at:
    .jsonAvailable download formats
    Dataset updated
    Jun 4, 2021
    Dataset authored and provided by
    DataForSEO
    Area covered
    Tokelau, Armenia, Mauritania, Micronesia (Federated States of), Isle of Man, Morocco, Cocos (Keeling) Islands, Kenya, Azerbaijan, Korea (Democratic People's Republic of)
    Description

    DataForSEO Labs API offers three powerful keyword research algorithms and historical keyword data:

    • Related Keywords from the “searches related to” element of Google SERP. • Keyword Suggestions that match the specified seed keyword with additional words before, after, or within the seed key phrase. • Keyword Ideas that fall into the same category as specified seed keywords. • Historical Search Volume with current cost-per-click, and competition values.

    Based on in-market categories of Google Ads, you can get keyword ideas from the relevant Categories For Domain and discover relevant Keywords For Categories. You can also obtain Top Google Searches with AdWords and Bing Ads metrics, product categories, and Google SERP data.

    You will find well-rounded ways to scout the competitors:

    • Domain Whois Overview with ranking and traffic info from organic and paid search. • Ranked Keywords that any domain or URL has positions for in SERP. • SERP Competitors and the rankings they hold for the keywords you specify. • Competitors Domain with a full overview of its rankings and traffic from organic and paid search. • Domain Intersection keywords for which both specified domains rank within the same SERPs. • Subdomains for the target domain you specify along with the ranking distribution across organic and paid search. • Relevant Pages of the specified domain with rankings and traffic data. • Domain Rank Overview with ranking and traffic data from organic and paid search. • Historical Rank Overview with historical data on rankings and traffic of the specified domain from organic and paid search. • Page Intersection keywords for which the specified pages rank within the same SERP.

    All DataForSEO Labs API endpoints function in the Live mode. This means you will be provided with the results in response right after sending the necessary parameters with a POST request.

    The limit is 2000 API calls per minute, however, you can contact our support team if your project requires higher rates.

    We offer well-rounded API documentation, GUI for API usage control, comprehensive client libraries for different programming languages, free sandbox API testing, ad hoc integration, and deployment support.

    We have a pay-as-you-go pricing model. You simply add funds to your account and use them to get data. The account balance doesn't expire.

  15. Data from: Inventory of online public databases and repositories holding...

    • catalog.data.gov
    • s.cnmilf.com
    • +2more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agricultural Research Service (2025). Inventory of online public databases and repositories holding agricultural data in 2017 [Dataset]. https://catalog.data.gov/dataset/inventory-of-online-public-databases-and-repositories-holding-agricultural-data-in-2017-d4c81
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Agricultural Research Servicehttps://www.ars.usda.gov/
    Description

    United States agricultural researchers have many options for making their data available online. This dataset aggregates the primary sources of ag-related data and determines where researchers are likely to deposit their agricultural data. These data serve as both a current landscape analysis and also as a baseline for future studies of ag research data. Purpose As sources of agricultural data become more numerous and disparate, and collaboration and open data become more expected if not required, this research provides a landscape inventory of online sources of open agricultural data. An inventory of current agricultural data sharing options will help assess how the Ag Data Commons, a platform for USDA-funded data cataloging and publication, can best support data-intensive and multi-disciplinary research. It will also help agricultural librarians assist their researchers in data management and publication. The goals of this study were to establish where agricultural researchers in the United States-- land grant and USDA researchers, primarily ARS, NRCS, USFS and other agencies -- currently publish their data, including general research data repositories, domain-specific databases, and the top journals compare how much data is in institutional vs. domain-specific vs. federal platforms determine which repositories are recommended by top journals that require or recommend the publication of supporting data ascertain where researchers not affiliated with funding or initiatives possessing a designated open data repository can publish data Approach The National Agricultural Library team focused on Agricultural Research Service (ARS), Natural Resources Conservation Service (NRCS), and United States Forest Service (USFS) style research data, rather than ag economics, statistics, and social sciences data. To find domain-specific, general, institutional, and federal agency repositories and databases that are open to US research submissions and have some amount of ag data, resources including re3data, libguides, and ARS lists were analysed. Primarily environmental or public health databases were not included, but places where ag grantees would publish data were considered. Search methods We first compiled a list of known domain specific USDA / ARS datasets / databases that are represented in the Ag Data Commons, including ARS Image Gallery, ARS Nutrition Databases (sub-components), SoyBase, PeanutBase, National Fungus Collection, i5K Workspace @ NAL, and GRIN. We then searched using search engines such as Bing and Google for non-USDA / federal ag databases, using Boolean variations of “agricultural data” /“ag data” / “scientific data” + NOT + USDA (to filter out the federal / USDA results). Most of these results were domain specific, though some contained a mix of data subjects. We then used search engines such as Bing and Google to find top agricultural university repositories using variations of “agriculture”, “ag data” and “university” to find schools with agriculture programs. Using that list of universities, we searched each university web site to see if their institution had a repository for their unique, independent research data if not apparent in the initial web browser search. We found both ag specific university repositories and general university repositories that housed a portion of agricultural data. Ag specific university repositories are included in the list of domain-specific repositories. Results included Columbia University – International Research Institute for Climate and Society, UC Davis – Cover Crops Database, etc. If a general university repository existed, we determined whether that repository could filter to include only data results after our chosen ag search terms were applied. General university databases that contain ag data included Colorado State University Digital Collections, University of Michigan ICPSR (Inter-university Consortium for Political and Social Research), and University of Minnesota DRUM (Digital Repository of the University of Minnesota). We then split out NCBI (National Center for Biotechnology Information) repositories. Next we searched the internet for open general data repositories using a variety of search engines, and repositories containing a mix of data, journals, books, and other types of records were tested to determine whether that repository could filter for data results after search terms were applied. General subject data repositories include Figshare, Open Science Framework, PANGEA, Protein Data Bank, and Zenodo. Finally, we compared scholarly journal suggestions for data repositories against our list to fill in any missing repositories that might contain agricultural data. Extensive lists of journals were compiled, in which USDA published in 2012 and 2016, combining search results in ARIS, Scopus, and the Forest Service's TreeSearch, plus the USDA web sites Economic Research Service (ERS), National Agricultural Statistics Service (NASS), Natural Resources and Conservation Service (NRCS), Food and Nutrition Service (FNS), Rural Development (RD), and Agricultural Marketing Service (AMS). The top 50 journals' author instructions were consulted to see if they (a) ask or require submitters to provide supplemental data, or (b) require submitters to submit data to open repositories. Data are provided for Journals based on a 2012 and 2016 study of where USDA employees publish their research studies, ranked by number of articles, including 2015/2016 Impact Factor, Author guidelines, Supplemental Data?, Supplemental Data reviewed?, Open Data (Supplemental or in Repository) Required? and Recommended data repositories, as provided in the online author guidelines for each the top 50 journals. Evaluation We ran a series of searches on all resulting general subject databases with the designated search terms. From the results, we noted the total number of datasets in the repository, type of resource searched (datasets, data, images, components, etc.), percentage of the total database that each term comprised, any dataset with a search term that comprised at least 1% and 5% of the total collection, and any search term that returned greater than 100 and greater than 500 results. We compared domain-specific databases and repositories based on parent organization, type of institution, and whether data submissions were dependent on conditions such as funding or affiliation of some kind. Results A summary of the major findings from our data review: Over half of the top 50 ag-related journals from our profile require or encourage open data for their published authors. There are few general repositories that are both large AND contain a significant portion of ag data in their collection. GBIF (Global Biodiversity Information Facility), ICPSR, and ORNL DAAC were among those that had over 500 datasets returned with at least one ag search term and had that result comprise at least 5% of the total collection. Not even one quarter of the domain-specific repositories and datasets reviewed allow open submission by any researcher regardless of funding or affiliation. See included README file for descriptions of each individual data file in this dataset. Resources in this dataset:Resource Title: Journals. File Name: Journals.csvResource Title: Journals - Recommended repositories. File Name: Repos_from_journals.csvResource Title: TDWG presentation. File Name: TDWG_Presentation.pptxResource Title: Domain Specific ag data sources. File Name: domain_specific_ag_databases.csvResource Title: Data Dictionary for Ag Data Repository Inventory. File Name: Ag_Data_Repo_DD.csvResource Title: General repositories containing ag data. File Name: general_repos_1.csvResource Title: README and file inventory. File Name: README_InventoryPublicDBandREepAgData.txt

  16. d

    Tutorial: How to use Google Data Studio and ArcGIS Online to create an...

    • search.dataone.org
    • hydroshare.org
    • +1more
    Updated Apr 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarah Beganskas (2022). Tutorial: How to use Google Data Studio and ArcGIS Online to create an interactive data portal [Dataset]. http://doi.org/10.4211/hs.9edae0ef99224e0b85303c6d45797d56
    Explore at:
    Dataset updated
    Apr 15, 2022
    Dataset provided by
    Hydroshare
    Authors
    Sarah Beganskas
    Description

    This tutorial will teach you how to take time-series data from many field sites and create a shareable online map, where clicking on a field location brings you to a page with interactive graph(s).

    The tutorial can be completed with a sample dataset (provided via a Google Drive link within the document) or with your own time-series data from multiple field sites.

    Part 1 covers how to make interactive graphs in Google Data Studio and Part 2 covers how to link data pages to an interactive map with ArcGIS Online. The tutorial will take 1-2 hours to complete.

    An example interactive map and data portal can be found at: https://temple.maps.arcgis.com/apps/View/index.html?appid=a259e4ec88c94ddfbf3528dc8a5d77e8

  17. p

    Google Search Trends Data

    • paradoxintelligence.com
    json/csv
    Updated May 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paradox Intelligence (2025). Google Search Trends Data [Dataset]. https://www.paradoxintelligence.com/datasets
    Explore at:
    json/csvAvailable download formats
    Dataset updated
    May 3, 2025
    Dataset authored and provided by
    Paradox Intelligence
    License

    https://www.paradoxintelligence.com/termshttps://www.paradoxintelligence.com/terms

    Time period covered
    2004 - Present
    Area covered
    Global
    Description

    Real-time search volume and trend analysis across global markets with geographic and temporal granularity

  18. Google Trends(Past 7 Days) in India Dataset

    • kaggle.com
    zip
    Updated Oct 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhadra Mohit (2024). Google Trends(Past 7 Days) in India Dataset [Dataset]. https://www.kaggle.com/datasets/bhadramohit/google-trendspast-7-days-in-india-dataset
    Explore at:
    zip(64377 bytes)Available download formats
    Dataset updated
    Oct 30, 2024
    Authors
    Bhadra Mohit
    License

    https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/

    Area covered
    India
    Description

    Context

    Time-based Popularity:

    Tracks search interest over time, showing peaks and troughs in popularity for specific keywords.

    Regional Insights:

    Provides data on search trends by location, allowing for geographic comparisons of interest.

    Related Queries:

    Lists associated search terms, highlighting related topics that are frequently searched alongside the primary keywords.

    Top vs. Rising Trends:

    Distinguishes between the most popular queries and those with a sharp increase in search volume.

    Category Analysis:

    Organizes search data by category, enabling focused insights into specific industries, interests, or demographic groups.

    Search Volume Index: Uses an index score (0–100) to represent search volume relative to the highest point on the chart for the selected region and time.

    Real-time Data Availability:

    Offers access to both historical and real-time data, ideal for identifying ongoing or emerging trends.

  19. d

    ESS-DIVE Reporting Format for Dataset Package Metadata

    • search.dataone.org
    • knb.ecoinformatics.org
    • +1more
    Updated Jun 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan (2022). ESS-DIVE Reporting Format for Dataset Package Metadata [Dataset]. http://doi.org/10.15485/1866026
    Explore at:
    Dataset updated
    Jun 11, 2022
    Dataset provided by
    ESS-DIVE
    Authors
    Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan
    Time period covered
    Jan 1, 2017
    Description

    ESS-DIVE’s (Environmental Systems Science Data Infrastructure for a Virtual Ecosystem) dataset metadata reporting format is intended to compile information about a dataset (e.g., title, description, funding sources) that can enable reuse of data submitted to the ESS-DIVE data repository. The files contained in this dataset include instructions (dataset_metadata_guide.md and README.md) that can be used to understand the types of metadata ESS-DIVE collects. The data dictionary (dd.csv) follows ESS-DIVE’s file-level metadata reporting format and includes brief descriptions about each element of the dataset metadata reporting format. This dataset also includes a terminology crosswalk (dataset_metadata_crosswalk.csv) that shows how ESS-DIVE’s metadata reporting format maps onto other existing metadata standards and reporting formats. Data contributors to ESS-DIVE can provide this metadata by manual entry using a web form or programmatically via ESS-DIVE’s API (Application Programming Interface). A metadata template (dataset_metadata_template.docx or dataset_metadata_template.pdf) can be used to collaboratively compile metadata before providing it to ESS-DIVE. Since being incorporated into ESS-DIVE’s data submission user interface, ESS-DIVE’s dataset metadata reporting format, has enabled features like automated metadata quality checks, and dissemination of ESS-DIVE datasets onto other data platforms including Google Dataset Search and DataCite.

  20. B

    Dataset for “I only knew how to search Google”: students’ reflections on a...

    • borealisdata.ca
    Updated May 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Denise A Smith; Stephanie Sanger (2025). Dataset for “I only knew how to search Google”: students’ reflections on a four-year information literacy curriculum [Dataset]. http://doi.org/10.5683/SP3/XG3AW6
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 8, 2025
    Dataset provided by
    Borealis
    Authors
    Denise A Smith; Stephanie Sanger
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Survey response data to accompany an article in the Journal of Canadian Health Libraries Association (2025)

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Julie Marcoux (2024). Google Data Search Exercises [Dataset]. http://doi.org/10.5683/SP3/MW7BKH

Google Data Search Exercises

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 26, 2024
Dataset provided by
Borealis
Authors
Julie Marcoux
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Google data search exercises can be used to practice finding data or statistics on a topic of interest, including using Google's own internal tools and by using advanced operators.

Search
Clear search
Close search
Google apps
Main menu