99 datasets found

CSV file used in statistical analyses
data.csiro.au
researchdata.edu.au
+1more
Updated Oct 13, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CSIRO (2014). CSV file used in statistical analyses [Dataset]. http://doi.org/10.4225/08/543B4B4CA92E6
Explore at:
Unique identifier
https://doi.org/10.4225/08/543B4B4CA92E6
Dataset updated
Oct 13, 2014
Dataset authored and provided by
CSIROhttp://www.csiro.au/
License
https://research.csiro.au/dap/licences/csiro-data-licence/https://research.csiro.au/dap/licences/csiro-data-licence/
Time period covered
Mar 14, 2008 - Jun 9, 2009
Dataset funded by
CSIROhttp://www.csiro.au/
Description
A csv file containing the tidal frequencies used for statistical analyses in the paper "Estimating Freshwater Flows From Tidally-Affected Hydrographic Data" by Dan Pagendam and Don Percival.
Raw Data - CSV Files
osf.io
Updated Apr 27, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Katelyn Conn (2020). Raw Data - CSV Files [Dataset]. https://osf.io/h5wbt
Explore at:
Dataset updated
Apr 27, 2020
Dataset provided by
Center for Open Sciencehttps://cos.io/
Authors
Katelyn Conn
Description
Raw Data in .csv format for use with the R data wrangling scripts.
m
Download CSV DB
maclookup.app
json
Updated Mar 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Download CSV DB [Dataset]. https://maclookup.app/downloads/csv-database
Explore at:
jsonAvailable download formats
Dataset updated
Mar 17, 2025
Description
Free, daily updated MAC prefix and vendor CSV database. Download now for accurate device identification.
Datasets
figshare.com
zip
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bastian Eichenberger; YinXiu Zhan (2023). Datasets [Dataset]. http://doi.org/10.6084/m9.figshare.12958037.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12958037.v1
Dataset updated
May 31, 2023
Dataset provided by
figshare
Authors
Bastian Eichenberger; YinXiu Zhan
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The benchmarking datasets used for deepBlink. The npz files contain train/valid/test splits inside and can be used directly. The files belong to the following challenges / classes:- ISBI Particle tracking challenge: microtubule, vesicle, receptor- Custom synthetic (based on http://smal.ws): particle- Custom fixed cell: smfish- Custom live cell: suntagThe csv files are to determine which image in the test splits correspond to which original image, SNR, and density.
B
Residential School Locations Dataset (CSV Format)
borealisdata.ca
search.dataone.org
Updated Jun 5, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rosa Orlandini (2019). Residential School Locations Dataset (CSV Format) [Dataset]. http://doi.org/10.5683/SP2/RIYEMU
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.5683/SP2/RIYEMU
Dataset updated
Jun 5, 2019
Dataset provided by
Borealis
Authors
Rosa Orlandini
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1863 - Jun 30, 1998
Area covered
Canada
Description
The Residential School Locations Dataset [IRS_Locations.csv] contains the locations (latitude and longitude) of Residential Schools and student hostels operated by the federal government in Canada. All the residential schools and hostels that are listed in the Indian Residential School Settlement Agreement are included in this dataset, as well as several Industrial schools and residential schools that were not part of the IRRSA. This version of the dataset doesn’t include the five schools under the Newfoundland and Labrador Residential Schools Settlement Agreement. The original school location data was created by the Truth and Reconciliation Commission, and was provided to the researcher (Rosa Orlandini) by the National Centre for Truth and Reconciliation in April 2017. The dataset was created by Rosa Orlandini, and builds upon and enhances the previous work of the Truth and Reconcilation Commission, Morgan Hite (creator of the Atlas of Indian Residential Schools in Canada that was produced for the Tk'emlups First Nation and Justice for Day Scholar's Initiative, and Stephanie Pyne (project lead for the Residential Schools Interactive Map). Each individual school location in this dataset is attributed either to RSIM, Morgan Hite, NCTR or Rosa Orlandini. Many schools/hostels had several locations throughout the history of the institution. If the school/hostel moved from its’ original location to another property, then the school is considered to have two unique locations in this dataset,the original location and the new location. For example, Lejac Indian Residential School had two locations while it was operating, Stuart Lake and Fraser Lake. If a new school building was constructed on the same property as the original school building, it isn't considered to be a new location, as is the case of Girouard Indian Residential School.When the precise location is known, the coordinates of the main building are provided, and when the precise location of the building isn’t known, an approximate location is provided. For each residential school institution location, the following information is provided: official names, alternative name, dates of operation, religious affiliation, latitude and longitude coordinates, community location, Indigenous community name, contributor (of the location coordinates), school/institution photo (when available), location point precision, type of school (hostel or residential school) and list of references used to determine the location of the main buildings or sites.
P
MNAD Dataset
paperswithcode.com
Updated May 16, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). MNAD Dataset [Dataset]. https://paperswithcode.com/dataset/mnad
Explore at:
Dataset updated
May 16, 2023
Description
About the MNAD Dataset The MNAD corpus is a collection of over 1 million Moroccan news articles written in modern Arabic language. These news articles have been gathered from 11 prominent electronic news sources. The dataset is made available to the academic community for research purposes, such as data mining (clustering, classification, etc.), information retrieval (ranking, search, etc.), and other non-commercial activities.

Dataset Fields

Title: The title of the article Body: The body of the article Category: The category of the article Source: The Electronic News paper source of the article

About Version 1 of the Dataset (MNAD.v1) Version 1 of the dataset comprises 418,563 articles classified into 19 categories. The data was collected from well-known electronic news sources, namely Akhbarona.ma, Hespress.ma, Hibapress.com, and Le360.com. The articles were stored in four separate CSV files, each corresponding to the news website source. Each CSV file contains three fields: Title, Body, and Category of the news article.

The dataset is rich in Arabic vocabulary, with approximately 906,125 unique words. It has been utilized as a benchmark in the research paper: "A Moroccan News Articles Dataset (MNAD) For Arabic Text Categorization". In 2021 International Conference on Decision Aid Sciences and Application (DASA).

This dataset is available for download from the following sources: - Kaggle Datasets : MNADv1 - Huggingface Datasets: MNADv1

About Version 2 of the Dataset (MNAD.v2) Version 2 of the MNAD dataset includes an additional 653,901 articles, bringing the total number of articles to over 1 million (1,069,489), classified into the same 19 categories as in version 1. The new documents were collected from seven additional prominent Moroccan news websites, namely al3omk.com, medi1news.com, alayam24.com, anfaspress.com, alyaoum24.com, barlamane.com, and SnrtNews.com.

The newly collected articles have been merged with the articles from the previous version into a single CSV file named MNADv2.csv. This file includes an additional column called "Source" to indicate the source of each news article.

Furthermore, MNAD.v2 incorporates improved pre-processing techniques and data cleaning methods. These enhancements involve removing duplicates, eliminating multiple spaces, discarding rows with NaN values, replacing new lines with " ", excluding very long and very short articles, and removing non-Arabic articles. These additions and improvements aim to enhance the usability and value of the MNAD dataset for researchers and practitioners in the field of Arabic text analysis.

This dataset is available for download from the following sources: - Kaggle Datasets : MNADv2 - Huggingface Datasets: MNADv2

Citation If you use our data, please cite the following paper:

bibtex @inproceedings{MNAD2021, author = {Mourad Jbene and Smail Tigani and Rachid Saadane and Abdellah Chehri}, title = {A Moroccan News Articles Dataset ({MNAD}) For Arabic Text Categorization}, year = {2021}, publisher = {{IEEE}}, booktitle = {2021 International Conference on Decision Aid Sciences and Application ({DASA})} doi = {10.1109/dasa53625.2021.9682402}, url = {https://doi.org/10.1109/dasa53625.2021.9682402}, }
Industrial Park Management Bureau of the Ministry of Economic...
data.gov.tw
csv
Updated Oct 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bureau of Industrial Parks, Ministry of Economic Affairs (2024). Industrial Park Management Bureau of the Ministry of Economic Affairs_Statistics on Import and Export Trade Volume of Science and Technology Industrial Parks [Dataset]. https://data.gov.tw/en/datasets/25792
Explore at:
csvAvailable download formats
Dataset updated
Oct 14, 2024
Dataset authored and provided by
Bureau of Industrial Parks, Ministry of Economic Affairs
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
Provide "Statistics of Import and Export Trade Volume of Each Park" to let the public understand the import and export and its growth trend of each park. In addition to updating this information every month, CSV file format is also provided for free download and use by the public.The dataset includes statistics on the import and export trade volume of parks such as Nanzih, Kaohsiung, Taichung, Zhonggang, Pingtung, and other parks (Lingguang, Chenggong, Gaoruan), with main fields including "Park, Import and Export (This Month, Year-to-Date)", "Export (This Month, Year-to-Date)", "Import (This Month, Year-to-Date)", and other important information.

Ecosystem-Level Factors Affecting the Survival of Open-Source Projects: A...

zenodo.org

application/gzip, bin +2

Updated Aug 2, 2024

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

Marat Valiev; Marat Valiev; Bogdan Vasilescu; James Herbsleb; Bogdan Vasilescu; James Herbsleb (2024). Ecosystem-Level Factors Affecting the Survival of Open-Source Projects: A Case Study of the PyPI Ecosystem - the dataset [Dataset]. http://doi.org/10.5281/zenodo.1297925

Explore at:

text/x-python, zip, bin, application/gzipAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.1297925

Dataset updated

Aug 2, 2024

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Marat Valiev; Marat Valiev; Bogdan Vasilescu; James Herbsleb; Bogdan Vasilescu; James Herbsleb

License

https://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.htmlhttps://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.html

Description

Replication pack, FSE2018 submission #164:
------------------------------------------

**Working title:** Ecosystem-Level Factors Affecting the Survival of Open-Source Projects: 
A Case Study of the PyPI Ecosystem

**Note:** link to data artifacts is already included in the paper. 
Link to the code will be included in the Camera Ready version as well.


Content description
===================

- **ghd-0.1.0.zip** - the code archive. This code produces the dataset files 
 described below
- **settings.py** - settings template for the code archive.
- **dataset_minimal_Jan_2018.zip** - the minimally sufficient version of the dataset.
 This dataset only includes stats aggregated by the ecosystem (PyPI)
- **dataset_full_Jan_2018.tgz** - full version of the dataset, including project-level
 statistics. It is ~34Gb unpacked. This dataset still doesn't include PyPI packages
 themselves, which take around 2TB.
- **build_model.r, helpers.r** - R files to process the survival data 
  (`survival_data.csv` in **dataset_minimal_Jan_2018.zip**, 
  `common.cache/survival_data.pypi_2008_2017-12_6.csv` in 
  **dataset_full_Jan_2018.tgz**)
- **Interview protocol.pdf** - approximate protocol used for semistructured interviews.
- LICENSE - text of GPL v3, under which this dataset is published
- INSTALL.md - replication guide (~2 pages)

Replication guide
=================

Step 0 - prerequisites
----------------------

- Unix-compatible OS (Linux or OS X)
- Python interpreter (2.7 was used; Python 3 compatibility is highly likely)
- R 3.4 or higher (3.4.4 was used, 3.2 is known to be incompatible)

Depending on detalization level (see Step 2 for more details):
- up to 2Tb of disk space (see Step 2 detalization levels)
- at least 16Gb of RAM (64 preferable)
- few hours to few month of processing time

Step 1 - software
----------------

- unpack **ghd-0.1.0.zip**, or clone from gitlab:

   git clone https://gitlab.com/user2589/ghd.git
   git checkout 0.1.0
 
 `cd` into the extracted folder. 
 All commands below assume it as a current directory.
  
- copy `settings.py` into the extracted folder. Edit the file:
  * set `DATASET_PATH` to some newly created folder path
  * add at least one GitHub API token to `SCRAPER_GITHUB_API_TOKENS` 
- install docker. For Ubuntu Linux, the command is 
  `sudo apt-get install docker-compose`
- install libarchive and headers: `sudo apt-get install libarchive-dev`
- (optional) to replicate on NPM, install yajl: `sudo apt-get install yajl-tools`
 Without this dependency, you might get an error on the next step, 
 but it's safe to ignore.
- install Python libraries: `pip install --user -r requirements.txt` . 
- disable all APIs except GitHub (Bitbucket and Gitlab support were
 not yet implemented when this study was in progress): edit
 `scraper/init.py`, comment out everything except GitHub support
 in `PROVIDERS`.

Step 2 - obtaining the dataset
-----------------------------

The ultimate goal of this step is to get output of the Python function 
`common.utils.survival_data()` and save it into a CSV file:

  # copy and paste into a Python console
  from common import utils
  survival_data = utils.survival_data('pypi', '2008', smoothing=6)
  survival_data.to_csv('survival_data.csv')

Since full replication will take several months, here are some ways to speedup
the process:

####Option 2.a, difficulty level: easiest

Just use the precomputed data. Step 1 is not necessary under this scenario.

- extract **dataset_minimal_Jan_2018.zip**
- get `survival_data.csv`, go to the next step

####Option 2.b, difficulty level: easy

Use precomputed longitudinal feature values to build the final table.
The whole process will take 15..30 minutes.

- create a folder `

7+ Million Company Dataset
kaggle.com
zip
Updated May 10, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
People Data Labs (2019). 7+ Million Company Dataset [Dataset]. https://www.kaggle.com/datasets/peopledatalabssf/free-7-million-company-dataset
Explore at:
zip(291957415 bytes)Available download formats
Dataset updated
May 10, 2019
Authors
People Data Labs
Description
Dataset

This dataset was created by People Data Labs

Contents
Raw ERP data in csv format
osf.io
Updated Feb 23, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Daniel Baker (2019). Raw ERP data in csv format [Dataset]. https://osf.io/xu87h
Explore at:
Dataset updated
Feb 23, 2019
Dataset provided by
Center for Open Sciencehttps://cos.io/
Authors
Daniel Baker
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
No description was included in this Dataset collected from the OSF
EOD data for all Dow Jones stocks
kaggle.com
zip
Updated Jun 12, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Timo Bozsolik (2019). EOD data for all Dow Jones stocks [Dataset]. https://www.kaggle.com/timoboz/stock-data-dow-jones
Explore at:
zip(1697460 bytes)Available download formats
Dataset updated
Jun 12, 2019
Authors
Timo Bozsolik
Description
Update

Unfortunately, the API this dataset used to pull the stock data isn't free anymore. Instead of having this auto-updating, I dropped the last version of the data files in here, so at least the historic data is still usable.

Content

This dataset provides free end of day data for all stocks currently in the Dow Jones Industrial Average. For each of the 30 components of the index, there is one CSV file named by the stock's symbol (e.g. AAPL for Apple). Each file provides historically adjusted market-wide data (daily, max. 5 years back). See here for description of the columns: https://iextrading.com/developer/docs/#chart

Since this dataset uses remote URLs as files, it is automatically updated daily by the Kaggle platform and automatically represents the latest data.

Acknowledgements

List of stocks and symbols as per https://en.wikipedia.org/wiki/Dow_Jones_Industrial_Average

Thanks to https://iextrading.com for providing this data for free!

Terms of Use

Data provided for free by IEX. View IEX’s Terms of Use.
g
Demographics
health.google.com
Updated Oct 7, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Demographics [Dataset]. https://health.google.com/covid-19/open-data/raw-data
Explore at:
Dataset updated
Oct 7, 2021
Variables measured
key, population, population_male, rural_population, urban_population, population_female, population_density, clustered_population, population_age_00_09, population_age_10_19, and 11 more
Description
Various population statistics, including structured demographics data.
Gene expression csv files
figshare.com
txt
Updated Jun 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cristina Alvira (2023). Gene expression csv files [Dataset]. http://doi.org/10.6084/m9.figshare.21861975.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.21861975.v1
Dataset updated
Jun 12, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Cristina Alvira
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Csv files containing all detectable genes.
t
Soil benchmark data sets 2021 district-free city of Brandenburg an der Havel...
service.tib.eu
Updated Feb 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Soil benchmark data sets 2021 district-free city of Brandenburg an der Havel - Vdataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/govdata_94ee7f6f-d384-4499-b592-718f6027da98
Explore at:
Dataset updated
Feb 4, 2025
Area covered
Brandenburg
Description
Ground benchmark datasets are issued annually in the standard file formats Text (CSV) and XML in relation to EPSG code 25833. Depending on the file format, ground benchmark data sets are provided in full for the areas of competence of the expert committees and for the State of Brandenburg in a zipped file with a statistical indication and a description of the elements. The CSV file is based on VBORIS2. A key bridge to the old format can be extracted from the data. On request, soil benchmark datasets for municipal areas can be cut out or provided in shape format. Furthermore, the delivery of soil benchmarks in the form of web-based geoservices is possible. Ground benchmark datasets are issued annually in the standard file formats Text (CSV) and XML in relation to EPSG code 25833. Depending on the file format, ground benchmark data sets are provided in full for the areas of competence of the expert committees and for the State of Brandenburg in a zipped file with a statistical indication and a description of the elements. The CSV file is based on VBORIS2. A key bridge to the old format can be extracted from the data. On request, soil benchmark datasets for municipal areas can be cut out or provided in shape format. Furthermore, the delivery of soil benchmarks in the form of web-based geoservices is possible. Ground benchmark datasets are issued annually in the standard file formats Text (CSV) and XML in relation to EPSG code 25833. Depending on the file format, ground benchmark data sets are provided in full for the areas of competence of the expert committees and for the State of Brandenburg in a zipped file with a statistical indication and a description of the elements. The CSV file is based on VBORIS2. A key bridge to the old format can be extracted from the data. On request, soil benchmark datasets for municipal areas can be cut out or provided in shape format. Furthermore, the delivery of soil benchmarks in the form of web-based geoservices is possible. Ground benchmark datasets are issued annually in the standard file formats Text (CSV) and XML in relation to EPSG code 25833. Depending on the file format, ground benchmark data sets are provided in full for the areas of competence of the expert committees and for the State of Brandenburg in a zipped file with a statistical indication and a description of the elements. The CSV file is based on VBORIS2. A key bridge to the old format can be extracted from the data. On request, soil benchmark datasets for municipal areas can be cut out or provided in shape format. Furthermore, the delivery of soil benchmarks in the form of web-based geoservices is possible.
US Software Engineer Jobs
zenrows.com
csv
Updated Jul 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ZenRows (2021). US Software Engineer Jobs [Dataset]. https://www.zenrows.com/datasets/us-software-engineer-jobs
Explore at:
csv(57,8MB)Available download formats
Dataset updated
Jul 22, 2021
Dataset provided by
ZenRows S.L.
Authors
ZenRows
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Time period covered
Jun 22, 2021 - Jul 22, 2021
Area covered
United States
Description
High-quality, free software engineer jobs Dataset from the United States, in CSV format. Over 58.000 records relevant to investors, recruiters, agencies, and software engineers. We are working on complete datasets from a wide variety of countries. Don't hesitate to contact us for more information.
g
Land benchmark data sets 2019 district-free city of Brandenburg an der Havel...
gimi9.com
Updated May 27, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Land benchmark data sets 2019 district-free city of Brandenburg an der Havel | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_9438e6c2-6047-4db0-a05e-c501ef4766da
Explore at:
Dataset updated
May 27, 2022
Area covered
Brandenburg
Description
Ground benchmark datasets are published annually in the standard file formats Text (CSV) and XML with reference to EPSG code 25833. Depending on the file format, soil benchmark data sets are provided in full for the areas of responsibility of the expert committees and for the state of Brandenburg in a zipped file with a statistical indication and a description of the elements. The CSV file is based on VBORIS2. A key bridge to the alt format can also be found in the test data. On request, soil benchmark datasets for municipal areas can be cut out or provided in shape format. Furthermore, the submission of soil benchmarks in the form of web-based geoservices is possible.
CVEfixes Dataset
kaggle.com
Updated Jun 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Girish (2023). CVEfixes Dataset [Dataset]. https://www.kaggle.com/datasets/girish17019/cvefixes-vulnerable-and-fixed-code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 13, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Girish
Description
Context

CVEfixes is a comprehensive vulnerability dataset that is automatically collected and curated from Common Vulnerabilities and Exposures (CVE) records in the public U.S. National Vulnerability Database (NVD). The goal is to support data-driven security research based on source code and source code metrics related to fixes for CVEs in the NVD by providing detailed information at different interlinked levels of abstraction, such as the commit-, file-, and method level, as well as the repository- and CVE level.

This dataset is a preprocessed version of the CVEfixes dataset provided at the following link: https://zenodo.org/record/7029359

File Information

This dataset consists of two files: - CVEFixes.csv : The preprocessed dataset. - LICENSE.txt : The license information of this dataset.

Column Description

In the CVEFixes.csv, there are three columns: - code : The source code of the data point. - language : The programming language of the source code (c, java, php, etc) - safety : Whether the code is vulnerable or safe.
h
Inclusive Design and Dissemination in Digital Scholarly Editing: CSV Dataset...
works.hcommons.org
repository.uantwerpen.be
csv
Updated Nov 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elli Bleeker; Wout Dillen; Aodhán Kelly; Merisa Martinez; Anna-Maria Sichani; Elli Bleeker; Wout Dillen; Aodhán Kelly; Merisa Martinez; Anna-Maria Sichani (2024). Inclusive Design and Dissemination in Digital Scholarly Editing: CSV Dataset [Dataset]. http://doi.org/10.17613/c3m9-kq76
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.17613/c3m9-kq76
Dataset updated
Nov 18, 2024
Dataset provided by
unknown
Authors
Elli Bleeker; Wout Dillen; Aodhán Kelly; Merisa Martinez; Anna-Maria Sichani; Elli Bleeker; Wout Dillen; Aodhán Kelly; Merisa Martinez; Anna-Maria Sichani
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Time period covered
Apr 9, 2019
Description
In 2017, the authors designed a survey titled Inclusive Design and Dissemination in Digital Scholarly Editions. The survey was designed and hosted using SurveyMonkey (https://www.surveymonkey.com) and was open from 1 July to 31 November 2017. The survey received 219 responses, 109 of which completed every required question in the survey – resulting in a completion rate of 49,7%.

At the 2017 ADHO conference in Montreal (Canada), the authors participated in a panel discussion on the subject, where they discussed some preliminary survey results (Sichani et al. 2017). A more detailed treatment of the complete survey results will be published Variants 14 (https://journals.openedition.org/variants/), the journal of the ESTS (Martinez et al. forthcoming).

In view of this publication, the authors have deposited the survey results as data sets here. These include a CSV file of the survey's data (scrubbed of respondents' personal information), and the current PDF with graphical representations of the survey's statistics. Both files present the survey's raw, uncorrected (albeit redacted) data, as recorded and automatically analyzed by SurveyMonkey, including response rates per question and diagrams.

As the uncorrected survey results, some of the data offered in these files may differ slightly from those presented in the forthcoming Variants article. For their qualitative analysis of the survey's data in that publication, the authors corrected the data (e.g. excluding invalid answers, or reclassifying incorrectly classified answers), and interpreted them (e.g. creating categories for similar responses). Such interventions were justified in the relevant sections of the Variants article. Rather than depositing the corrected version of the survey's results in the Humanities Commons repository, the authors decided to publish the uncorrected results instead, so as not to force their interpretation of the survey's data on future research.
Example of how to manually extract incubation bouts from interactive plots...
figshare.com
txt
Updated Jan 22, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Martin Bulla (2016). Example of how to manually extract incubation bouts from interactive plots of raw data - R-CODE and DATA [Dataset]. http://doi.org/10.6084/m9.figshare.2066784.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.2066784.v1
Dataset updated
Jan 22, 2016
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Martin Bulla
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
{# General information# The script runs with R (Version 3.1.1; 2014-07-10) and packages plyr (Version 1.8.1), XLConnect (Version 0.2-9), utilsMPIO (Version 0.0.25), sp (Version 1.0-15), rgdal (Version 0.8-16), tools (Version 3.1.1) and lattice (Version 0.20-29)# --------------------------------------------------------------------------------------------------------# Questions can be directed to: Martin Bulla (bulla.mar@gmail.com)# -------------------------------------------------------------------------------------------------------- # Data collection and how the individual variables were derived is described in: #Steiger, S.S., et al., When the sun never sets: diverse activity rhythms under continuous daylight in free-living arctic-breeding birds. Proceedings of the Royal Society B: Biological Sciences, 2013. 280(1764): p. 20131016-20131016. # Dale, J., et al., The effects of life history and sexual selection on male and female plumage colouration. Nature, 2015. # Data are available as Rdata file # Missing values are NA. # --------------------------------------------------------------------------------------------------------# For better readability the subsections of the script can be collapsed # --------------------------------------------------------------------------------------------------------}{# Description of the method # 1 - data are visualized in an interactive actogram with time of day on x-axis and one panel for each day of data # 2 - red rectangle indicates the active field, clicking with the mouse in that field on the depicted light signal generates a data point that is automatically (via custom made function) saved in the csv file. For this data extraction I recommend, to click always on the bottom line of the red rectangle, as there is always data available due to a dummy variable ("lin") that creates continuous data at the bottom of the active panel. The data are captured only if greenish vertical bar appears and if new line of data appears in R console). # 3 - to extract incubation bouts, first click in the new plot has to be start of incubation, then next click depict end of incubation and the click on the same stop start of the incubation for the other sex. If the end and start of incubation are at different times, the data will be still extracted, but the sex, logger and bird_ID will be wrong. These need to be changed manually in the csv file. Similarly, the first bout for a given plot will be always assigned to male (if no data are present in the csv file) or based on previous data. Hence, whenever a data from a new plot are extracted, at a first mouse click it is worth checking whether the sex, logger and bird_ID information is correct and if not adjust it manually. # 4 - if all information from one day (panel) is extracted, right-click on the plot and choose "stop". This will activate the following day (panel) for extraction. # 5 - If you wish to end extraction before going through all the rectangles, just press "escape". }{# Annotations of data-files from turnstone_2009_Barrow_nest-t401_transmitter.RData dfr-- contains raw data on signal strength from radio tag attached to the rump of female and male, and information about when the birds where captured and incubation stage of the nest1. who: identifies whether the recording refers to female, male, capture or start of hatching2. datetime_: date and time of each recording3. logger: unique identity of the radio tag 4. signal_: signal strength of the radio tag5. sex: sex of the bird (f = female, m = male)6. nest: unique identity of the nest7. day: datetime_ variable truncated to year-month-day format8. time: time of day in hours9. datetime_utc: date and time of each recording, but in UTC time10. cols: colors assigned to "who"--------------------------------------------------------------------------------------------------------m-- contains metadata for a given nest1. sp: identifies species (RUTU = Ruddy turnstone)2. nest: unique identity of the nest3. year_: year of observation4. IDfemale: unique identity of the female5. IDmale: unique identity of the male6. lat: latitude coordinate of the nest7. lon: longitude coordinate of the nest8. hatch_start: date and time when the hatching of the eggs started 9. scinam: scientific name of the species10. breeding_site: unique identity of the breeding site (barr = Barrow, Alaska)11. logger: type of device used to record incubation (IT - radio tag)12. sampling: mean incubation sampling interval in seconds--------------------------------------------------------------------------------------------------------s-- contains metadata for the incubating parents1. year_: year of capture2. species: identifies species (RUTU = Ruddy turnstone)3. author: identifies the author who measured the bird4. nest: unique identity of the nest5. caught_date_time: date and time when the bird was captured6. recapture: was the bird capture before? (0 - no, 1 - yes)7. sex: sex of the bird (f = female, m = male)8. bird_ID: unique identity of the bird9. logger: unique identity of the radio tag --------------------------------------------------------------------------------------------------------}
d5-2-cities-database
zenodo.org
data.subak.org
bin, csv, pdf
Updated Jul 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kris Vanherle; Vera Rodrigues; Myriam Lopes; Kevin de Oliveira; Sandra Rafael; Ana Patrícia Fernandes; Iason Diafas; Carlo Trozzi; Angreine Kewo; Peter Papics; Joana Soares; Willem Himpe; Kris Vanherle; Vera Rodrigues; Myriam Lopes; Kevin de Oliveira; Sandra Rafael; Ana Patrícia Fernandes; Iason Diafas; Carlo Trozzi; Angreine Kewo; Peter Papics; Joana Soares; Willem Himpe (2024). d5-2-cities-database [Dataset]. http://doi.org/10.5281/zenodo.3931943
Explore at:
bin, csv, pdfAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3931943
Dataset updated
Jul 19, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Kris Vanherle; Vera Rodrigues; Myriam Lopes; Kevin de Oliveira; Sandra Rafael; Ana Patrícia Fernandes; Iason Diafas; Carlo Trozzi; Angreine Kewo; Peter Papics; Joana Soares; Willem Himpe; Kris Vanherle; Vera Rodrigues; Myriam Lopes; Kevin de Oliveira; Sandra Rafael; Ana Patrícia Fernandes; Iason Diafas; Carlo Trozzi; Angreine Kewo; Peter Papics; Joana Soares; Willem Himpe
Description
This data-set contains all data resources, either directly downloadable via this platform or as links to external databases, to execute the generic modeling tool as described in D5.4

0 generic model
file: d5.4-generic-model_february-2020.pdf
Report with methodology and source code for the generic model

1.1 COPERNICUS
https://land.copernicus.eu/local/urban-atlas/urban-atlas-2012?tab=download.%20
The European Urban Atlas provides reliable, inter-comparable, high-resolution land use and land cover data for 800 Functional Urban Area (FUA) for the 2012 reference year in EEA39 countries.

1.2a 1km grid shapefiles for NOx emissions in EU
The shapefiles are: nox.dbf nox.prj nox.shp nox.shx

1.2b 1km grid shapefiles for PM10 emissions in EU
The shapefiles are: pm10.dbf pm10.prj pm10.shp pm10.shx

2.1 Eurostat energy consumption data
https://ec.europa.eu/eurostat/web/energy/data/database?p_p_id=NavTreeportletprod_WAR_NavTreeportletprod_INSTANCE_QAMy7Pe6HwI1&p_p_lifecycle=0&p_p_state=normal&p_p_mode=view&p_p_col_id=column-2&p_p_col_count=1

2.2 JODI gas consumption data
https://www.jodidata.org/gas/database/data-downloads.aspx
Complete data series for all products, flows and countries, from January 2009 to one month-old can be downloaded, for free, in a beta version of .csv format.

2.3 Meteo data UK
https://www.metoffice.gov.uk/services/data/datapoint/datapoint-products

2.4 Meteo data NL
https://www.knmi.nl/nederland-nu/klimatologie/daggegevens

2.5 hourly electricity load profiles
https://data.open-power-system-data.org/time_series/

3.1 fleet data
file: 200210_summary_fleet_2016.csv
Summary of historical fleet for EU28+NO by age and vehicle type: Diesel, CNG, Petrol, Electric, PHEV, LPG

3.2 modal shares
file: 200210_summary_shares.csv
This data set holds the modal share of different modes, for EU cities at level of vehicle kilometers. Modes considered: bus car heavy duty truck >32t heavy duty truck 16-32t heavy duty truck 3_5-7_5t heavy duty truck 7_5-16t light duty truck metro/tram moped motorcycle passenger train slow mode van

3.3 emission factors
file: 200212_summary_copert_ef.xlsx
Emission factor for passenger cars (in g/km )from COPERT V, urban conditions. Data dimensions: 1. type: diesel, petrol, CNG, LPG, PHEV (in line with fleet data) 2. pollutant: NOx or PM 3. Peak/off-peak: (P/OP) 4. year of built: YOB

5.1 Crude death rates
http://apps.who.int/healthinfo/statistics/mortality/causeofdeath_query/start.php
the total number of deaths in a country per year per 1,000 people broken down by age cohort and sex

5.2 UN population data
https://population.un.org/wpp/Download/Standard/Population/
the number of inhabitants at country level broken down by age cohort and sex the time a person has lived broken down by age cohort and sex, and country

Facebook

Twitter

Click to copy link

Link copied

Cite

CSIRO (2014). CSV file used in statistical analyses [Dataset]. http://doi.org/10.4225/08/543B4B4CA92E6

CSV file used in statistical analyses

Explore at:

Unique identifier

https://doi.org/10.4225/08/543B4B4CA92E6

Dataset updated

Oct 13, 2014

Dataset authored and provided by

CSIROhttp://www.csiro.au/

License

https://research.csiro.au/dap/licences/csiro-data-licence/https://research.csiro.au/dap/licences/csiro-data-licence/

Time period covered

Mar 14, 2008 - Jun 9, 2009

Dataset funded by

CSIROhttp://www.csiro.au/

Description

A csv file containing the tidal frequencies used for statistical analyses in the paper "Estimating Freshwater Flows From Tidally-Affected Hydrographic Data" by Dan Pagendam and Don Percival.

Clear search

Close search

Google apps

Main menu

CSV file used in statistical analyses

Raw Data - CSV Files

Download CSV DB

Datasets

Residential School Locations Dataset (CSV Format)

MNAD Dataset

Industrial Park Management Bureau of the Ministry of Economic...

Ecosystem-Level Factors Affecting the Survival of Open-Source Projects: A...

7+ Million Company Dataset

Dataset

Contents

Raw ERP data in csv format

EOD data for all Dow Jones stocks

Update

Content

Acknowledgements

Terms of Use

Demographics

Gene expression csv files

Soil benchmark data sets 2021 district-free city of Brandenburg an der Havel...

US Software Engineer Jobs

Land benchmark data sets 2019 district-free city of Brandenburg an der Havel...

CVEfixes Dataset

Context

File Information

Column Description

Inclusive Design and Dissemination in Digital Scholarly Editing: CSV Dataset...

Example of how to manually extract incubation bouts from interactive plots...

d5-2-cities-database

CSV file used in statistical analysesSee More Versions

CSV file used in statistical analyses