https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10293677%2Fa6d81c06dc03412bfd063941bd1dfa18%2Fspacex-falcon9-reaching-orbit-wide.jpg?generation=1672337964521833&alt=media" alt="">
Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
License information was derived automatically
Please note: this archive requires support for dangling symlinks, which excludes the Windows operating system.
To use this dataset, you will need to download the MS COCO 2017 detection images and expand them to a folder called coco17 in the train_val_combined directory. The download can be found here: https://cocodataset.org/#download You will also need to download the AI2D image description dataset and expand them to a folder called ai2d in the train_val_combined directory. The download can be found here: https://prior.allenai.org/projects/diagram-understanding
License Notes for Train and Val: Since the images in this dataset come from different sources, they are bound by different licenses.
Images for bar charts, x-y plots, maps, pie charts, tables, and technical drawings were downloaded directly from wikimedia commons. License and authorship information is stored independently for each image in these categories in the wikimedia_commons_licenses.csv file. Each row (note: some rows are multi-line) is formatted so:
Images in the slides category were taken from presentations which were downloaded from Wikimedia Commons. The names of the presentations on Wikimedia Commons omits the trailing underscore, number, and file extension, and ends with .pdf instead. The source materials' licenses are shown in source_slices_licenses.csv.
Wikimedia commons photos' information page can be found at "https://commons.wikimedia.org/wiki/File:
License Notes for Testing: The testing images have been uploaded to SlideWiki by SlideWiki users. The image authorship and copyright information is available in authors.csv.
Further information can be found for each image using the SlideWiki file service. Documentation is available at https://fileservice.slidewiki.org/documentation#/ and in particular: metadata is available at "https://fileservice.slidewiki.org/metadata/
This is the SlideImages dataset, which has been assembled for the SlideImages paper. If you find the dataset useful, please cite our paper: https://doi.org/10.1007/978-3-030-45442-5_36
Download the current cigarette use among youth slides. These slides are available in PDF and PowerPoint formats. The PDF version can be found at: https://chronicdata.cdc.gov/Survey-Data/Current-Cigarette-Use-Among-Youth-YRBSS-PDF-Slides/rpbm-bfkm
For those who are actively looking for data scientist jobs in the U.S., the best news this month is the LinkedIn Workforce Report August 2018. According to the report, there is a shortage of 151,717 people with data science skills, with particularly acute shortages in New York City, San Francisco Bay Area and Los Angeles.
To help job hunters (including me) to better understand the job market, I scraped Indeed website and collected information of 7,000 data scientist jobs around the U.S. on August 3rd. The information that I collected are: Company Name, Position Name, Location, Job Description, and Number of Reviews of the Company.
Special thanks to Indeed for not blocking me : )
Possible Questions:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set contains the .wav sound files, .trs Transcriber files, .txt Toolbox-compatible Notepad files and .pdf files with the completely transcribed, glossed, parsed and translated examples of the recordings that belong to the following publication:
Bodt, Timotheus Adrianus. 2020. Grammar of Duhumbi. Leiden: Brill. ISBN 978-90-04-40947-7. https://brill.com/view/title/55767
The explanation of all the grammatical features that occur in these sound files can be found in the Grammar of Duhumbi.
The main Toolbox files can be found in the zip file “Settings”, this includes the IPA keys for Duhumbi, the entire setup of the Toolbox database, and the Duhumbi dictionary and Parsing dictionary.
The .wav, .txt and .trs files combined in the same folder will enable to open Toolbox and work with the recordings, e.g. play them sentence for sentence and see the transcriptions and translations.
Transcriber version 1.5.1: http://trans.sourceforge.net/en/presentation.php or https://osdn.net/projects/sfnet_trans/downloads/transcriber/1.5.1/Transcriber-1.5.1-Windows.exe/
Toolbox version 1.6.1: https://software.sil.org/toolbox/download/
This data set contains the files belonging to the sound files as mentioned in the pdf file “Duhumbi Grammar All Files Upload 2”. The S/N code corresponds to the code used in the Grammar to identify the text from which an example was taken. The name of the file refers to the name of the .wav, .trs, .txt and .pdf files in this upload. The subject is a short description of the topic of the text. The duration is the duration of the recording.
For the metadata of the sound files in this data set, I refer to Chapter 13 Texts in the Grammar of Duhumbi. This Chapter has a complete listing of the texts, their topics, the speakers and their background etc.
This material is made freely available to everyone for informative or scientific purposes as long as the source (this DOI) / the collectors are properly credited. Please note that use of the material for commercial purposes of any kind, which includes conversion into commercial audio-visual media (documentaries etc.), storage and dissemination through sites that require registration & payment for access, or sites that rely on advertisement (including YouTube) is not permitted without specific written consent from the speakers and their community, obtained through the collector of the material. By downloading this material, you agree to these restrictions.
This data set falls under the Attribution-NonCommercial-ShareAlike (CC BY-NC-SA) license. This license lets you remix, tweak, and build upon this work non-commercially, as long as you credit us and license your new creations under the identical terms. License Deed on https://creativecommons.org/licenses/by-nc-sa/4.0/. Legal Code on https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode.
Tim Bodt: monpasang (at) gmail (dot) com
Download the current cigarette excise tax rates on packs of cigarettes slides. These slides are available in PDF and PowerPoint formats. The PDF version can be found at: https://chronicdata.cdc.gov/Legislation/Excise-Tax-Rates-On-Packs-Of-Cigarettes-PDF-Slides/i9js-434w
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10293677%2Fa6d81c06dc03412bfd063941bd1dfa18%2Fspacex-falcon9-reaching-orbit-wide.jpg?generation=1672337964521833&alt=media" alt="">