64 datasets found

Field-wide assessment of differential HT-seq from NCBI GEO database
zenodo.org
application/gzip
Updated Jan 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp (2023). Field-wide assessment of differential HT-seq from NCBI GEO database [Dataset]. http://doi.org/10.5281/zenodo.5356064
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5356064
Dataset updated
Jan 13, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We analysed the field of expression profiling by high throughput sequencing, or HT-seq, in terms of replicability and reproducibility, using data from the NCBI GEO (Gene Expression Omnibus) repository.

- This release includes GEO series up to Dec-31, 2020;

- Fixed xlrd missing optional dependency, which affected import of some xls files, previously we were using only openpyxl (thanks to anonymous reviewer);

- All files in supplementary _RAW.tar files were checked for p values, previously _RAW.tar files were completely omitted, alas (thanks to anonymous reviewer).

Archived dataset contains following files:

- output/parsed_suppfiles.csv, p-value histograms, histogram classes, estimated number of true null hypotheses (pi0).

- output/document_summaries.csv, document summaries of NCBI GEO series

- output/publications.csv, publication info of NCBI GEO series

- output/scopus_citedbycount.csv, Scopus citation info of NCBI GEO series

- output/single-cell.csv, single cell experiments

- spots.csv, NCBI SRA sequencing run metadata

- suppfilenames.txt, list of all supplementary file names of NCBI GEO submissions. One filename per row.

- suppfilenames_filtered.txt, list of supplementary file names used for downloading files from NCBI GEO. One filename per row.
f
Putative target genes identified from rheumatoid arthritis...
figshare.com
plos.figshare.com
xls
Updated Jun 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yi-Jiang Song; Guiling Li; Jian-Hua He; Yao Guo; Li Yang (2023). Putative target genes identified from rheumatoid arthritis (RA)/osteoarthritis (OA) microarray data. [Dataset]. http://doi.org/10.1371/journal.pone.0137551.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0137551.t003
Dataset updated
Jun 4, 2023
Dataset provided by
PLOS ONE
Authors
Yi-Jiang Song; Guiling Li; Jian-Hua He; Yao Guo; Li Yang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Putative target genes identified from rheumatoid arthritis (RA)/osteoarthritis (OA) microarray data.
IKONOS ESA archive
earth.esa.int
Updated Jun 21, 2013
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Space Agency (2013). IKONOS ESA archive [Dataset]. https://earth.esa.int/eogateway/catalog/ikonos-esa-archive
Explore at:
Dataset updated
Jun 21, 2013
Dataset authored and provided by
European Space Agencyhttp://www.esa.int/
License
https://earth.esa.int/eogateway/documents/20142/1560778/ESA-Third-Party-Missions-Terms-and-Conditions.pdfhttps://earth.esa.int/eogateway/documents/20142/1560778/ESA-Third-Party-Missions-Terms-and-Conditions.pdf
Time period covered
Dec 25, 2000 - Dec 9, 2008
Description
ESA maintains an archive of IKONOS Geo Ortho Kit data previously requested through the TPM scheme and acquired between 2000 and 2008, over Europe, North Africa and the Middle East. The imagery products gathered from IKONOS are categorised according to positional accuracy, which is determined by the reliability of an object in the image to be within the specified accuracy of the actual location of the object on the ground. Within each IKONOS-derived product, location error is defined by a circular error at 90% confidence (CE90), which means that locations of objects are represented on the image within the stated accuracy 90% of the time. There are six levels of IKONOS imagery products, determined by the level of positional accuracy: Geo, Standard Ortho, Reference, Pro, Precision and PrecisionPlus. The product provided by ESA to Category-1 users is the Geo Ortho Kit, consisting of IKONOS Black-and-White images with radiometric and geometric corrections (1-metre pixels, CE90=15 metres) bundled with IKONOS multispectral images with absolute radiometry (4-metre pixels, CE90=50 metres). IKONOS collects 1m and 4m Geo Ortho Kit imagery (nominally at nadir 0.82m for panchromatic image, 3.28m for multispectral mode) at an elevation angle between 60 and 90 degrees. To increase the positional accuracy of the final orthorectified imagery, customers should select imagery with IKONOS elevation angle between 72 and 90 degrees. The Geo Ortho Kit is tailored for sophisticated users such as photogrammetrists who want to control the orthorectification process. Geo Ortho Kit images include the camera geometry obtained at the time of image collection. Applying Geo Ortho Kit imagery, customers can produce their own highly accurate orthorectified products by using commercial off the shelf software, digital elevation models (DEMs) and optional ground control. Spatial coverage: Check the spatial coverage of the collection on a map available on the Third Party Missions Dissemination Service.
f
Bioinformatics-Based Identification of MicroRNA-Regulated and Rheumatoid...
plos.figshare.com
tiff
Updated Jun 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yi-Jiang Song; Guiling Li; Jian-Hua He; Yao Guo; Li Yang (2023). Bioinformatics-Based Identification of MicroRNA-Regulated and Rheumatoid Arthritis-Associated Genes [Dataset]. http://doi.org/10.1371/journal.pone.0137551
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0137551
Dataset updated
Jun 5, 2023
Dataset provided by
PLOS ONE
Authors
Yi-Jiang Song; Guiling Li; Jian-Hua He; Yao Guo; Li Yang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MicroRNAs (miRNAs) act as epigenetic markers and regulate the expression of their target genes, including those characterized as regulators in autoimmune diseases. Rheumatoid arthritis (RA) is one of the most common autoimmune diseases. The potential roles of miRNA-regulated genes in RA pathogenesis have greatly aroused the interest of clinicians and researchers in recent years. In the current study, RA-related miRNAs records were obtained from PubMed through conditional literature retrieval. After analyzing the selected records, miRNA targeted genes were predicted. We identified 14 RA-associated miRNAs, and their sub-analysis in 5 microarray or RNA sequencing (RNA-seq) datasets was performed. The microarray and RNA-seq data of RA were also downloaded from NCBI Gene Expression Omnibus (GEO) and Sequence Read Archive (SRA), analyzed, and annotated. Using a bioinformatics approach, we identified a series of differentially expressed genes (DEGs) by comparing studies on RA and the controls. The RA-related gene expression profile was thus obtained and the expression of miRNA-regulated genes was analyzed. After functional annotation analysis, we found GO molecular function (MF) terms significantly enriched in calcium ion binding (GO: 0005509). Moreover, some novel dysregulated target genes were identified in RA through integrated analysis of miRNA/mRNA expression. The result revealed that the expression of a number of genes, including ROR2, ABI3BP, SMOC2, etc., was not only affected by dysregulated miRNAs, but also altered in RA. Our findings indicate that there is a close association between negatively correlated mRNA/miRNA pairs and RA. These findings may be applied to identify genetic markers for RA diagnosis and treatment in the future.
Madagascar - Geo-located Towns
data.amerigeoss.org
cloud.csiss.gmu.edu
+1more
geojson, shp zip
Updated Apr 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2023). Madagascar - Geo-located Towns [Dataset]. https://data.amerigeoss.org/ar/dataset/activity/madagascar-geo-located-towns-2006
Explore at:
shp zip, geojsonAvailable download formats
Dataset updated
Apr 5, 2023
Dataset provided by
World Bankhttp://worldbank.org/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset contains the geo-location info of the towns in Madagascar, but lacks town name and population. The data is curated from the Southern African Human-development Information Management Network (SAHIMS) static archive server https://web.archive.org/web/20070808004545/http://www.sahims.net:80/gis/... To view metadata, please visit https://web.archive.org/web/20070705025938/http://www.sahims.net:80/gis/...
H
Harvard CGA Geotweet IDs Archive
dataverse.harvard.edu
Updated Oct 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Devika Kakkar; Jack Hayes (2023). Harvard CGA Geotweet IDs Archive [Dataset]. http://doi.org/10.7910/DVN/KTRIJP
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/KTRIJP
Dataset updated
Oct 20, 2023
Dataset provided by
Harvard Dataverse
Authors
Devika Kakkar; Jack Hayes
License
https://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.2/customlicense?persistentId=doi:10.7910/DVN/KTRIJPhttps://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.2/customlicense?persistentId=doi:10.7910/DVN/KTRIJP
Description
Harvard CGA Geotweet IDs Archive is a subset of Harvard CGA Geotweet Archive v2.0 . It contains the user and message identification records of individual tweets for approximately 10 billion geo-tagged tweets from January 2010 to July 2023. This dataset is available to the academic community at large, unlike the Harvard CGA Geotweet Archive v2.0 which is under Twitter's redistribution policy restriction for public sharing. It could serve as cross-validation data for publications that used data from Harvard CGA Geotweet Archive v2.0 . If you are interested in accessing this archive, please fill out our Geotweet Request Form. Before requesting or receiving Tweet IDs, requestors must agree to Twitter's Terms of Service, Twitter's Privacy Policy, and Twitter's Developer Policy . Geotweets IDs data provided by CGA can only be used for not-for-profit research and academic purposes. Recipients may not share CGA provided Tweet IDs or content derived from them without written permission from the CGA. Citations: If you use the Geotweet Archive in your research please reference it: "Harvard CGA Geotweet IDs Archive". ======================================================== Schema of Geotweet IDs Archive Field name_TYPE_Description message_id----BIGINT----Tweet ID user_id ----BIGINT----User ID number
e
Zambia - Geo-located Health Facilities - Dataset - ENERGYDATA.INFO
energydata.info
Updated Oct 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Zambia - Geo-located Health Facilities - Dataset - ENERGYDATA.INFO [Dataset]. https://energydata.info/dataset/zambia-geo-located-health-facilities-2006
Explore at:
Dataset updated
Oct 28, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Zambia
Description
The dataset contains the geo-location info, name and type of the health facilities in Zambia. The data is created by Zambia central statistical office, and curated from the Southern African Human-development Information Management Network (SAHIMS) static archive server https://web.archive.org/web/20070322051956/http://www.sahims.net/gis/GIS%20input/GIS_Library_Regional.asp To view metadata, please visit https://web.archive.org/web/20070322051956/http://www.sahims.net/gis/GIS%20input/GIS_Library_Regional.asp
GeoEye-1 full archive and tasking
earth.esa.int
eocat.esa.int
+1more
Updated Oct 2, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Space Agency (2008). GeoEye-1 full archive and tasking [Dataset]. https://earth.esa.int/eogateway/catalog/geoeye-1-full-archive-and-tasking
Explore at:
Dataset updated
Oct 2, 2008
Dataset authored and provided by
European Space Agencyhttp://www.esa.int/
License
https://earth.esa.int/eogateway/documents/20142/1560778/ESA-Third-Party-Missions-Terms-and-Conditions.pdfhttps://earth.esa.int/eogateway/documents/20142/1560778/ESA-Third-Party-Missions-Terms-and-Conditions.pdf
Description
GeoEye-1 high resolution optical products are available as part of the Maxar Standard Satellite Imagery products from the QuickBird, WorldView-1/-2/-3/-4 and GeoEye-1 satellites. All details about the data provision, data access conditions and quota assignment procedure are described into the Terms of Applicability available in Resources section. In particular, GeoEye-1 offers archive and tasking panchromatic products up to 0.41 m GSD resolution and Multispectral products up to 1.65 m GSD resolution. Band Combination Data Processing Level Resolutions Panchromatic and 4-bands Standard (2A) / View Ready Standard (OR2A) 15 cm HD, 30 cm HD, 30 cm, 40 cm, 50/60 cm View Ready Stereo 30 cm, 40 cm, 50/60 cm Map-Ready (Ortho) 1:12,000 Orthorectified 15 cm HD, 30 cm HD, 30 cm, 40 cm, 50/60 cm The options for 4-Bands are the following: 4-Band Multispectral (BLUE, GREEN, RED, NIR1) 4-Band Pan-sharpened (BLUE, GREEN, RED, NIR1) 4-Band Bundle (PAN, BLUE, GREEN, RED, NIR1) 3-Bands Natural Colour (pan-sharpened BLUE, GREEN, RED) 3-Band Colored Infrared (pan-sharpened GREEN, RED, NIR1). Native 30 cm and 50/60 cm resolution products are processed with MAXAR HD Technology to generate respectively the 15 cm HD and 30 cm HD products the initial special resolution (GSD) is unchanged but the HD technique increases the number of pixels and improves the visual clarity achieving aesthetically refined imagery with precise edges and well-reconstructed details. As per ESA policy, very high-resolution imagery of conflict areas cannot be provided.
Geoindex JISC UK Web Domain Dataset (1996-2010) Sum of Year and domain by UK...
figshare.com
xlsx
Updated Jun 1, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John Kaye (2023). Geoindex JISC UK Web Domain Dataset (1996-2010) Sum of Year and domain by UK Post District [Dataset]. http://doi.org/10.6084/m9.figshare.825948.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.825948.v1
Dataset updated
Jun 1, 2023
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
John Kaye
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United Kingdom
Description
Data is aggregated to UK post area from: Geoindex JISC UK Web Domain Dataset. Counts of postcodes are summed by year of archive.org instance and sub-domain e.g. .ac.uk About the Geoindex http://dx.doi.org/10.5259/ukwa.ds.2/geo/1 The ~2.5 billion 200 OK responses in the JISC UK Web Domain Dataset (1996-2010) dataset have been scanned for geographic references - specifically postcodes. This set of postcode citations, found at particular URLs, crawled at particular times, forms an historical geoindex of the UK web. For more details about how the data was created, its format, and how to use it, see here. The geoindex is composed of some 700,641,549 lines of TSV data, each asserting that a given web page, crawled at a given data, contained one or more references to a given postcode.
Global Archiving Software Market Size By Implementation (On-Premise, Cloud),...
verifiedmarketresearch.com
Updated Jul 17, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
VERIFIED MARKET RESEARCH (2023). Global Archiving Software Market Size By Implementation (On-Premise, Cloud), By Industry (BFSI, Government), By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/archiving-software-analysis-market/
Explore at:
Dataset updated
Jul 17, 2023
Dataset provided by
Verified Market Researchhttps://www.verifiedmarketresearch.com/
Authors
VERIFIED MARKET RESEARCH
License
https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/
Time period covered
2024 - 2031
Area covered
Global
Description
Archiving Software Market size was valued at USD 8 Billion in 2024 and is projected to reach USD 16 Billion by 2031, growing at a CAGR of 10% during the forecast period 2024-2031.

Global Archiving Software Market Drivers

Explosion of Data and Growth in Volume: One of the main factors propelling the archiving software industry is the exponential rise in data generated by enterprises. As digital transformation programs gain momentum, businesses gather enormous volumes of structured and unstructured data. By securely storing and facilitating easy data retrieval, archiving software assists organizations in managing their data effectively—a critical aspect of preserving operational efficiency.

Data governance and regulatory compliance: Regulations of data management and retention are becoming more and more demanding for organizations. Policies like GDPR, HIPAA, and Sarbanes-Oxley require businesses to hold onto specific data for predetermined amounts of time. By automating data retention and destruction, archiving software helps firms stay compliant and helps them avoid the heavy fines that come with non-compliance.
d
Harvard CGA Geotweet Census Archive
search.dataone.org
Updated Dec 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hayes, Jack (2023). Harvard CGA Geotweet Census Archive [Dataset]. http://doi.org/10.7910/DVN/IAYJOC
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/IAYJOC
Dataset updated
Dec 16, 2023
Dataset provided by
Harvard Dataverse
Authors
Hayes, Jack
Description
Harvard CGA Geotweet Census Archive is a subset of Harvard CGA Geotweet Archive v2.0 enriched with nationwide census data. It contains the tweet and user identification records along with census variables for more than 2 billion geo-tagged tweets from January 2012 to July 2023. This dataset is available to the academic community at large, unlike the Harvard CGA Geotweet Archive v2.0 which is under Twitter's redistribution policy restriction for public sharing. It could serve as cross-validation data for publications that used data from Harvard CGA Geotweet Archive v2.0 . If you are interested in accessing this archive, please fill out our Geotweet Request Form. Before requesting or receiving Tweet IDs, requestors must agree to Twitter's Terms of Service, Twitter's Privacy Policy, and Twitter's Developer Policy . Geotweets IDs data provided by CGA can only be used for not-for-profit research and academic purposes. Recipients may not share CGA provided Tweet IDs or content derived from them without written permission from the CGA. Citations: If you use the Geotweet Archive in your research please reference it: "Harvard CGA Geotweet IDs Archive". ======================================================== Schema of Geotweet Census Archive Field name_TYPE_Description message_id----TEXT----Tweet ID user_id ----TEXT----User ID number fips ----FLOAT----County fips code county ----TEXT----County name state ----TEXT----State abbreviation GEOID20 ----FLOAT----Census block geoid
o
Data from: Data archiving is a good investment
explore.openaire.eu
borealisdata.ca
+1more
Updated Apr 28, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Heather A. Piwowar; Todd J. Vision; Michael C. Whitlock (2011). Data from: Data archiving is a good investment [Dataset]. http://doi.org/10.14288/1.0397839
Explore at:
Unique identifier
https://doi.org/10.14288/1.0397839
Dataset updated
Apr 28, 2011
Authors
Heather A. Piwowar; Todd J. Vision; Michael C. Whitlock
Description
PubMed Central reuse of GEO datasets deposited in 2007This is the raw data behind the analysis. It contains one row for every mention of a 2007 GEO dataset in PubMed Central. Each row identifies the mentioned GEO dataset, the PubMed Central article that mentions the dataset's accession number, whether the authors of the dataset and the attributing article overlap, and whether this is considered an instance of third-party data reuse.PMC_reuse_of_2007_GEO_datasets.csvAggregate Table DataAggregate table data behind the figures and results in the README associated with the main dataset. Includes Baseline metrics used for extrapolating PubMed Central (PMC) results to PubMed, Number of mentions of a 2007 GEO dataset by authors who submitted the dataset, and Number of mentions of a dataset by authors who DID NOT submit the dataset across 2007-2010.tables.csv Funding agencies are reluctant to support data archiving, even though large research funders such as the National Science Foundation (NSF) and the National Institutes of Health acknowledge its importance for scientific progress. Our quantitative estimates of data reuse indicate that ongoing financial investment in data-archiving infrastructure provides a high scientific return.
d
Harvard CGA Geotweet Sentiment Archive
dataone.org
dataverse.harvard.edu
Updated Dec 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jack Hayes (2023). Harvard CGA Geotweet Sentiment Archive [Dataset]. http://doi.org/10.7910/DVN/X2KJPC
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/X2KJPC
Dataset updated
Dec 16, 2023
Dataset provided by
Harvard Dataverse
Authors
Jack Hayes
Description
Harvard CGA Geotweet Sentiment Archive is a subset of Harvard CGA Geotweet Archive v2.0 enriched with a sentiment score. It contains the tweet identification records along with a sentiment score based on tweet text for about 4.3 billion geo-tagged tweets since 2019. This sentiment score was calculated using Bidirectional Encoder Representations from Transformers. More information about this methodology can be found in our Nature Paper on Twitter Sentiment Geographical Index. This dataset is available to the academic community at large, unlike the Harvard CGA Geotweet Archive v2.0 which is under Twitter's redistribution policy restriction for public sharing. It could serve as cross-validation data for publications that used data from Harvard CGA Geotweet Archive v2.0 . If you are interested in accessing this archive, please fill out our Geotweet Request Form. Before requesting or receiving Tweet IDs, requestors must agree to Twitter's Terms of Service, Twitter's Privacy Policy, and Twitter's Developer Policy . Geotweets IDs data provided by CGA can only be used for not-for-profit research and academic purposes. Recipients may not share CGA provided Tweet IDs or content derived from them without written permission from the CGA. Citations: If you use the Geotweet Archive in your research please reference it: "Harvard CGA Geotweet IDs Archive". ======================================================== Schema of Geotweet Census Archive Field name_TYPE_Description message_id----TEXT----Tweet ID score ----FLOAT----BERT sentiment score
Medicaid Opioid Prescribing Rates - by Geography - 3fp8-zi9z - Archive...
healthdata.gov
application/rdfxml +5
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Medicaid Opioid Prescribing Rates - by Geography - 3fp8-zi9z - Archive Repository [Dataset]. https://healthdata.gov/dataset/Medicaid-Opioid-Prescribing-Rates-by-Geography-3fp/sk6f-c2dk
Explore at:
xml, json, csv, application/rdfxml, tsv, application/rssxmlAvailable download formats
Dataset updated
Feb 24, 2025
Description
This dataset tracks the updates made on the dataset "Medicaid Opioid Prescribing Rates - by Geography" as a repository for previous versions of the data and metadata.
D
ARCHIVED: COVID-19 Cases by Geography Over Time
data.sfgov.org
application/rdfxml +5
Updated Oct 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Public Health - Population Health Division (2023). ARCHIVED: COVID-19 Cases by Geography Over Time [Dataset]. https://data.sfgov.org/COVID-19/ARCHIVED-COVID-19-Cases-by-Geography-Over-Time/d2ef-idww
Explore at:
csv, json, application/rssxml, xml, tsv, application/rdfxmlAvailable download formats
Dataset updated
Oct 24, 2023
Dataset authored and provided by
Department of Public Health - Population Health Division
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
A. SUMMARY This dataset contains COVID-19 positive confirmed cases aggregated by several different geographic areas and by day. COVID-19 cases are mapped to the residence of the individual and shown on the date the positive test was collected. In addition, 2016-2020 American Community Survey (ACS) population estimates are included to calculate the cumulative rate per 10,000 residents.

Dataset covers cases going back to 3/2/2020 when testing began. This data may not be immediately available for recently reported cases and data will change to reflect as information becomes available. Data updated daily.

Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas

B. HOW THE DATASET IS CREATED Addresses from the COVID-19 case data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area for a given date.

The 2016-2020 American Community Survey (ACS) population estimates provided by the Census are used to create a cumulative rate which is equal to ([cumulative count up to that date] / [acs_population]) * 10000) representing the number of total cases per 10,000 residents (as of the specified date).

COVID-19 case data undergo quality assurance and other data verification processes and are continually updated to maximize completeness and accuracy of information. This means data may change for previous days as information is updated.

C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 05:00 Pacific Time.

D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).

This dataset can be used to track the spread of COVID-19 throughout the city, in a variety of geographic areas. Note that the new cases column in the data represents the number of new cases confirmed in a certain area on the specified day, while the cumulative cases column is the cumulative total of cases in a certain area as of the specified date.

Privacy rules in effect To protect privacy, certain rules are in effect: 1. Any area with a cumulative case count less than 10 are dropped for all days the cumulative count was less than 10. These will be null values. 2. Once an area has a cumulative case count of 10 or greater, that area will have a new row of case data every day following. 3. Cases are dropped altogether for areas where acs_population < 1000 4. Deaths data are not included in this dataset for privacy reasons. The low COVID-19 death rate in San Francisco, along with other publicly available information on deaths, means that deaths data by geography and day is too granular and potentially risky. Read more in our privacy guidelines

Rate suppression in effect where counts lower than 20 Rates are not calculated unless the cumulative case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology.

A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are areal representations of routes. Read how the Census develops ZCTAs on their website.

Rows included for Citywide case counts Rows are included for the Citywide case counts and incidence rate every day. These Citywide rows can be used for comparisons. Citywide will capture all cases regardless of address quality. While some cases cannot be mapped to sub-areas like Census Tracts, ongoing data quality efforts result in improved mapping on a rolling bases.

Related dataset See the dataset of the most recent cumulative counts for all geographic areas here: https://data.sfgov.org/COVID-19/COVID-19-Cases-and-Deaths-Summarized-by-Geography/tpyr-dvnc

E. CHANGE LOG
9/11/2023 - data on COVID-19 cases by geography over time are no longer being updated. This data is currently through 9/6/2023 and will not include any new data after this date.
4/6/2023 - the State implemented system updates to improve the integrity of historical data.
2/21/2023 - system updates to improve reliability and accuracy of cases data were implemented.
1/31/2023 - updated “acs_population” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
1/31/2023 - implemented system updates to streamline and improve our geo-coded data, resulting in small shifts in our case data by geography.
1/31/2023 - renamed column “last_updated_at” to “data_as_of”.
1/31/2023 - removed the “multipolygon” column. To access the multipolygon geometry column for each geography unit, refer to COVID-19 Cases and Deaths Summarized by Geography.
1/22/2022 - system updates to improve timeliness and accuracy of cases and deaths data were implemented.
4/16/2021 - dataset updated to refresh with a five-day data lag.
Medicare Inpatient Hospitals - by Geography and Service - q269-h3u3 -...
healthdata.gov
application/rdfxml +5
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Medicare Inpatient Hospitals - by Geography and Service - q269-h3u3 - Archive Repository [Dataset]. https://healthdata.gov/dataset/Medicare-Inpatient-Hospitals-by-Geography-and-Serv/y35v-bs76
Explore at:
xml, tsv, csv, application/rssxml, application/rdfxml, jsonAvailable download formats
Dataset updated
Feb 24, 2025
Description
This dataset tracks the updates made on the dataset "Medicare Inpatient Hospitals - by Geography and Service" as a repository for previous versions of the data and metadata.
g
Drilling archive | gimi9.com
gimi9.com
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Drilling archive | gimi9.com [Dataset]. https://www.gimi9.com/dataset/eu_2b609fd3-dac9-11d2-9a86-080000507261-1/
Explore at:
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This is a geo-referenced data collection (database) with the released geoscientific drilling and profile data from the geological layer directories. The data encryption is based on the “symbol key geology (SEP1 of 1991)” of the German State Geological Services. The data will be presented in the Hamburg drilling data portal as far as a release is available. For free machine processing in accordance with the Hamburg Transparency Act, data in the format gml is provided in a rar archive. It contains the master data of all holes. Due to their size, the layer data is divided into 7 files according to the districts of Hamburg and for data protection reasons do not contain the data of private drilling for which no release is available for publication. The download files are updated as required. For a better understanding of the data, a file with field descriptions and key lists is also provided. The data will also be provided by the WFS services ¿WFS BoreholeML 3.0 Header and ¿WFS BoreholeML 3.0 for the Drilling Point Map Germany, as far as release is available. The data are not available here in the original SEP1 but in the undifferentiated BoreholeML3 format. These two services provide complex GML schemas that cannot easily be processed by standard GIS clients such as ArcMap or QGis.
d
Archive of Digitized Analog Boomer Seismic Reflection Data Collected from...
catalog.data.gov
data.usgs.gov
+5more
Updated Jul 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Archive of Digitized Analog Boomer Seismic Reflection Data Collected from the Mississippi-Alabama-Florida shelf During Cruises Onboard the R/V Kit Jones, June 1990 and July 1991 [Dataset]. https://catalog.data.gov/dataset/archive-of-digitized-analog-boomer-seismic-reflection-data-collected-from-the-mississippi-
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Mississippi
Description
In June of 1990 and July of 1991, the U.S. Geological Survey (USGS) conducted geophysical surveys to investigate the shallow geologic framework of the Mississippi-Alabama-Florida shelf in the northern Gulf of Mexico, from Mississippi Sound to the Florida Panhandle. Work was done onboard the Mississippi Mineral Resources Institute R/V Kit Jones as part of a project to study coastal erosion and offshore sand resources. This report is part of a series to digitally archive the legacy analog data collected from the Mississippi-Alabama SHelf (MASH). The MASH data rescue project is a cooperative effort by the USGS and the Minerals Management Service (MMS). This report serves as an archive of high-resolution scanned Tagged Image File Format (TIFF) and Graphics Interchange Format (GIF) images of the original boomer paper records, navigation files, trackline maps, Geographic Information System (GIS) files, cruise logs, and formal Federal Geographic Data Committee (FGDC) metadata.
E
Geographic variation of mutagenic exposures in kidney cancer genomes – copy...
ega-archive.org
Updated Feb 23, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Geographic variation of mutagenic exposures in kidney cancer genomes – copy number variants (Mutographs) [Dataset]. https://ega-archive.org/datasets/EGAD00001013727
Explore at:
Dataset updated
Feb 23, 2021
License
https://ega-archive.org/dacs/EGAC00001000000https://ega-archive.org/dacs/EGAC00001000000
Description
Geographic variation of mutagenic exposures in kidney cancer genomes – copy number variants (Mutographs)
National Archives Landmark Geographic Distribution
data.gov.tw
csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Archives Administration, National Archives Landmark Geographic Distribution [Dataset]. https://data.gov.tw/en/datasets/33642
Explore at:
csvAvailable download formats
Dataset provided by
National Archives and Records Administrationhttp://www.archives.gov/
Authors
National Archives Administration
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
Provide national archives landmark geographical distribution data.

Facebook

Twitter

Click to copy link

Link copied

Cite

Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp (2023). Field-wide assessment of differential HT-seq from NCBI GEO database [Dataset]. http://doi.org/10.5281/zenodo.5356064

Field-wide assessment of differential HT-seq from NCBI GEO database

Explore at:

application/gzipAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.5356064

Dataset updated

Jan 13, 2023

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

We analysed the field of expression profiling by high throughput sequencing, or HT-seq, in terms of replicability and reproducibility, using data from the NCBI GEO (Gene Expression Omnibus) repository.

- This release includes GEO series up to Dec-31, 2020;

- Fixed xlrd missing optional dependency, which affected import of some xls files, previously we were using only openpyxl (thanks to anonymous reviewer);

- All files in supplementary _RAW.tar files were checked for p values, previously _RAW.tar files were completely omitted, alas (thanks to anonymous reviewer).

Archived dataset contains following files:

- output/parsed_suppfiles.csv, p-value histograms, histogram classes, estimated number of true null hypotheses (pi0).

- output/document_summaries.csv, document summaries of NCBI GEO series

- output/publications.csv, publication info of NCBI GEO series

- output/scopus_citedbycount.csv, Scopus citation info of NCBI GEO series

- output/single-cell.csv, single cell experiments

- spots.csv, NCBI SRA sequencing run metadata

- suppfilenames.txt, list of all supplementary file names of NCBI GEO submissions. One filename per row.

- suppfilenames_filtered.txt, list of supplementary file names used for downloading files from NCBI GEO. One filename per row.

Clear search

Close search

Google apps

Main menu

Field-wide assessment of differential HT-seq from NCBI GEO database

Putative target genes identified from rheumatoid arthritis...

IKONOS ESA archive

Bioinformatics-Based Identification of MicroRNA-Regulated and Rheumatoid...

Madagascar - Geo-located Towns

Harvard CGA Geotweet IDs Archive

Zambia - Geo-located Health Facilities - Dataset - ENERGYDATA.INFO

GeoEye-1 full archive and tasking

Geoindex JISC UK Web Domain Dataset (1996-2010) Sum of Year and domain by UK...

Global Archiving Software Market Size By Implementation (On-Premise, Cloud),...

Harvard CGA Geotweet Census Archive

Data from: Data archiving is a good investment

Harvard CGA Geotweet Sentiment Archive

Medicaid Opioid Prescribing Rates - by Geography - 3fp8-zi9z - Archive...

ARCHIVED: COVID-19 Cases by Geography Over Time

Medicare Inpatient Hospitals - by Geography and Service - q269-h3u3 -...

Drilling archive | gimi9.com

Archive of Digitized Analog Boomer Seismic Reflection Data Collected from...

Geographic variation of mutagenic exposures in kidney cancer genomes – copy...

National Archives Landmark Geographic Distribution

Field-wide assessment of differential HT-seq from NCBI GEO databaseSee More Versions

Field-wide assessment of differential HT-seq from NCBI GEO database