92 datasets found

TikTok Trending Metadata
kaggle.com
Updated Feb 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Brad Culbertson (2023). TikTok Trending Metadata [Dataset]. https://www.kaggle.com/datasets/vbradculbertson/tiktok-trending-metadata
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 24, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Brad Culbertson
Description
The dataset was originally obtained from TikTok's trending API by a GitHub user named Ivan Tran. It contains metadata on engagement with user-created videos and user profile data. The original create time is in Unix timecode format and is extracted directly from the video id number. TikTok's API has become much more difficult to access recently, so more current data is harder to obtain. The hashtags column contains lists.
MeDAL Dataset
kaggle.com
opendatalab.com
+1more
zip
Updated Nov 16, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
xhlulu (2020). MeDAL Dataset [Dataset]. https://www.kaggle.com/xhlulu/medal-emnlp
Explore at:
zip(7324382521 bytes)Available download formats
Dataset updated
Nov 16, 2020
Authors
xhlulu
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2352583%2F868a18fb09d7a1d3da946d74a9857130%2FLogo.PNG?generation=1604973725053566&alt=media" alt="">

Medical Dataset for Abbreviation Disambiguation for Natural Language Understanding (MeDAL) is a large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain. It was published at the ClinicalNLP workshop at EMNLP.

💻 Code 🤗 Dataset (Hugging Face) 💾 Dataset (Kaggle) 💽 Dataset (Zenodo) 📜 Paper (ACL) 📝 Paper (Arxiv) ⚡ Pre-trained ELECTRA (Hugging Face)

Downloading the data

We recommend downloading from Kaggle if you can authenticate through their API. The advantage to Kaggle is that the data is compressed, so it will be faster to download. Links to the data can be found at the top of the readme.

First, you will need to create an account on kaggle.com. Afterwards, you will need to install the kaggle API: pip install kaggle

Then, you will need to follow the instructions here to add your username and key. Once that's done, you can run: kaggle datasets download xhlulu/medal-emnlp

Now, unzip everything and place them inside the data directory: unzip -nq crawl-300d-2M-subword.zip -d data mv data/pretrain_sample/* data/

Loading FastText Embeddings

For the LSTM models, we will need to use the fastText embeddings. To do so, first download and extract the weights: wget -nc -P data/ https://dl.fbaipublicfiles.com/fasttext/vectors-english/crawl-300d-2M-subword.zip unzip -nq data/crawl-300d-2M-subword.zip -d data/

Model Quickstart

Using Torch Hub

You can directly load LSTM and LSTM-SA with torch.hub: ```python import torch

lstm = torch.hub.load("BruceWen120/medal", "lstm") lstm_sa = torch.hub.load("BruceWen120/medal", "lstm_sa") ```

If you want to use the Electra model, you need to first install transformers: pip install transformers Then, you can load it with torch.hub: python import torch electra = torch.hub.load("BruceWen120/medal", "electra")

Using Huggingface transformers

If you are only interested in the pre-trained ELECTRA weights (without the disambiguation head), you can load it directly from the Hugging Face Repository:

from transformers import AutoModel, AutoTokenizer model = AutoModel.from_pretrained("xhlu/electra-medal") tokenizer = AutoTokenizer.from_pretrained("xhlu/electra-medal")

Citation

Download the bibtex here, or copy the text below: @inproceedings{wen-etal-2020-medal, title = "{M}e{DAL}: Medical Abbreviation Disambiguation Dataset for Natural Language Understanding Pretraining", author = "Wen, Zhi and Lu, Xing Han and Reddy, Siva", booktitle = "Proceedings of the 3rd Clinical Natural Language Processing Workshop", month = nov, year = "2020", address = "Online", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2020.clinicalnlp-1.15", pages = "130--135", }

License, Terms and Conditions

The ELECTRA model is licensed under Apache 2.0. The license for the libraries used in this project (transformers, pytorch, etc.) can be found in their respective GitHub repository. Our model is released under a MIT license.

The original dataset was retrieved and modified from the NLM website. By using this dataset, you are bound by the terms and conditions specified by NLM:

INTRODUCTION

Downloading data from the National Library of Medicine FTP servers indicates your acceptance of the following Terms and Conditions: No charges, usage fees or royalties are paid to NLM for this data.

MEDLINE/PUBMED SPECIFIC TERMS

NLM freely provides PubMed/MEDLINE data. Please note some PubMed/MEDLINE abstracts may be protected by copyright.

GENERAL TERMS AND CONDITIONS

Users of the data agree to:

acknowledge NLM as the source of the data by including the phrase "Courtesy of the U.S. National Library of Medicine" in a clear and conspicuous manner,

properly use registration and/or trademark symbols when referring to NLM products, and

not indicate or imply that NLM has endorsed its products/services/applications.

Users who republish or redistribute the data (services, products or raw data) agree to:

maintain the most current version of all distributed data, or

make known in a clear and conspicuous manner that the products/services/applications do not reflect the most current/accurate data available from NLM.

These data are produced with a reasonable standard of care, but NLM makes no warranties express or implied, including no warranty of merchantability or fitness for particular purpose, regarding the accuracy or completeness of the data. Users agree to hold NLM and the U.S. Government harmless from any liability resulting from errors in the data. NLM disclaims any liability for any consequences due to use, misuse, or interpretation of information contained or not contained in the data.

NLM does not provide legal advice regarding copyright, fair use, or other aspects of intellectual property rights. See the NLM Copyright page.

NLM reserves the right to change the type and format of its machine-readable data. NLM will take reasonable steps to inform users of any changes to the format of the data before the data are distributed via the announcement section or subscription to email and RSS updates.
E-Commerce Retail Sales Series Data Collection
kaggle.com
Updated Dec 7, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US Census Bureau (2019). E-Commerce Retail Sales Series Data Collection [Dataset]. https://www.kaggle.com/datasets/census/e-commerce-retail-sales-series-data-collection
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 7, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
US Census Bureau
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the U.S. Census Bureau hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according the amount of data that is brought in. Explore the U.S. Census Bureau using Kaggle and all of the data sources available through the U.S. Census Bureau organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

FiveThirtyEight Daily Show Guests Dataset

kaggle.com

zip

Updated Jan 13, 2019

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

FiveThirtyEight (2019). FiveThirtyEight Daily Show Guests Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-daily-show-guests-dataset

Explore at:

zip(37571 bytes)Available download formats

Dataset updated

Jan 13, 2019

Dataset authored and provided by

FiveThirtyEighthttps://abcnews.go.com/538

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Content

Daily Show Guests

This folder contains data behind the story Every Guest Jon Stewart Ever Had On ‘The Daily Show’.

Header	Definition
`YEAR`	The year the episode aired
`GoogleKnowlege_Occupation`	Their occupation or office, according to Google's Knowledge Graph or, if they're not in there, how Stewart introduced them on the program.
`Show`	Air date of episode. Not unique, as some shows had more than one guest
`Group`	A larger group designation for the occupation. For instance, us senators, us presidents, and former presidents are all under "politicians"
`Raw_Guest_List`	The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row.

Source: Google Knowlege Graph, The Daily Show clip library, Wikipedia.

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

Cover photo by Oscar Nord on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

League of Legends Master+ Players
kaggle.com
Updated Sep 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ignacio Guillermo Martinez (2021). League of Legends Master+ Players [Dataset]. https://www.kaggle.com/jasperan/league-of-legends-master-players/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 22, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ignacio Guillermo Martinez
Description
GitHub repository

Click Here

Why?

I am writing articles on League of Legends and Machine Learning. You can find the full repository where this information is stored here.
FiveThirtyEight NBA Winprobs Dataset
kaggle.com
Updated Jan 13, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FiveThirtyEight (2019). FiveThirtyEight NBA Winprobs Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-nba-winprobs-dataset/versions/69
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 13, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
FiveThirtyEight
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
FiveThirtyEight Bob Ross Dataset
kaggle.com
Updated Jan 10, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FiveThirtyEight (2019). FiveThirtyEight Bob Ross Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-bob-ross-dataset/versions/105
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 10, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
FiveThirtyEight
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Content

Bob Ross

This folder contains data behind the story A Statistical Analysis of the Work of Bob Ross.

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

Cover photo by Alex Kotomanov on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

FiveThirtyEight Goose Dataset

kaggle.com

Updated Apr 26, 2019

Facebook

Twitter

Click to copy link

Link copied

Cite

FiveThirtyEight (2019). FiveThirtyEight Goose Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-goose-dataset/code

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Apr 26, 2019

Dataset provided by

Kagglehttp://kaggle.com/

Authors

FiveThirtyEight

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Content

Goose

This folder contains data behind the stories: * The Save Ruined Relief Pitching. The Goose Egg Can Fix It * Kenley Jansen Is The Model Of A Modern Reliever

Header	Definition
`name`	Pitcher name
`year`	Start year of season
`team`	Retrosheet team code
`league`	NL or AL
`goose_eggs`	Goose eggs
`broken_eggs`	Broken eggs
`mehs`	Mehs
`league_average_gpct`	League-average goose percentage
`ppf`	Pitcher park factor
`replacement_gpct`	Replacement-level goose pecentage
`gwar`	Goose Wins Above Replacement
`key_retro`	Retrosheet unique player identifier

Source: Retrosheet

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

Canada / U.S. Foreign Exchange Rate
kaggle.com
zip
Updated Dec 24, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Reserve (2019). Canada / U.S. Foreign Exchange Rate [Dataset]. https://kaggle.com/federalreserve/canada--u.s.-foreign-exchange-rate
Explore at:
zip(66892 bytes)Available download formats
Dataset updated
Dec 24, 2019
Dataset authored and provided by
Federal Reserve
Area covered
Canada
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Ben Waardenburg on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
FiveThirtyEight Obama Commutations Dataset
kaggle.com
zip
Updated Jan 4, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FiveThirtyEight (2019). FiveThirtyEight Obama Commutations Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-obama-commutations-dataset
Explore at:
zip(87648 bytes)Available download formats
Dataset updated
Jan 4, 2019
Dataset authored and provided by
FiveThirtyEight
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Content

Obama Commutation

This folder contains data behind the story Obama Granted Clemency Unlike Any Other President In History.

The data in obama_commutations.csv is copied from the Justice Department website. The python script parses it by looking at the first column to figure out what is contained in the second column.

Source: Department of Justice

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
World Development Indicators
kaggle.com
zip
Updated Apr 10, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2019). World Development Indicators [Dataset]. https://www.kaggle.com/theworldbank/world-development-indicators
Explore at:
zip(134125679 bytes)Available download formats
Dataset updated
Apr 10, 2019
Dataset authored and provided by
World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
License
https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Description
Content

The primary World Bank collection of development indicators, compiled from officially-recognized international sources. It presents the most current and accurate global development data available, and includes national, regional and global estimates.

Context

This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using the World Bank's APIs and Kaggle's API.

Cover photo by Alex Block on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
US Treasury securities held by the Federal Reserve
kaggle.com
zip
Updated Dec 10, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Reserve (2019). US Treasury securities held by the Federal Reserve [Dataset]. https://www.kaggle.com/federalreserve/us-treasury-securities-held-by-the-federal-reserve
Explore at:
zip(26923 bytes)Available download formats
Dataset updated
Dec 10, 2019
Dataset provided by
Federal Reserve Systemhttp://www.federalreserve.gov/
Authors
Federal Reserve
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

FiveThirtyEight Mad Men Dataset

kaggle.com

zip

Updated Dec 13, 2018

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

FiveThirtyEight (2018). FiveThirtyEight Mad Men Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-mad-men-dataset

Explore at:

zip(16691 bytes)Available download formats

Dataset updated

Dec 13, 2018

Dataset authored and provided by

FiveThirtyEighthttps://abcnews.go.com/538

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Content

Mad Men

This directory contains the data behind the story ‘Mad Men’ Is Ending. What’s Next For The Cast?

The primary file show-data.csv contains data of actors who appeared on at least half the episodes of television shows that were nominated for an Emmy for Outstanding Drama since the year 2000. It contains the following variables:

Header	Definition
`Performer`	The name of the actor, according to IMDb. This is not a unique identifier - two performers appeared in more than one program
`Show`	The television show where this actor appeared in more than half the episodes
`Show Start`	The year the television show began
`Show End`	The year the television show ended, "PRESENT" if the show remains on the air as of May 10.
`Status?`	Why the actor is no longer on the program: "END" if the show has concluded, "LEFT" if the show remains on the air.
`CharEnd`	The year the character left the show. Equal to "Show End" if the performer stayed on until the final season.
`Years Since`	2015 minus CharEnd
`#LEAD`	The number of leading roles in films the performer has appeared in since and including "CharEnd", according to OpusData
`#SUPPORT`	The number of leading roles in films the performer has appeared in since and including "CharEnd", according to OpusData
`#Shows`	The number of seasons of television of which the performer appeared in at least half the episodes since and including "CharEnd", according to OpusData
`Score`	#LEAD + #Shows + 0.25*(#SUPPORT)
`Score/Y`	"Score" divided by "Years Since"
`lead_notes`	The list of films counted in #LEAD
`support_notes`	The list of films counted in #SUPPORT
`show_notes`	The seasons of shows counted in #Shows

The supplemental file performer-scores.csv is the consolidated data from show-data.csv made into a pivot table.

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

FiveThirtyEight Antiquities Act Dataset

kaggle.com

zip

Updated Feb 1, 2019

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

FiveThirtyEight (2019). FiveThirtyEight Antiquities Act Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-antiquities-act-dataset

Explore at:

zip(8324 bytes)Available download formats

Dataset updated

Feb 1, 2019

Dataset authored and provided by

FiveThirtyEighthttps://abcnews.go.com/538

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Content

Antiquities Act

This folder contains the data behind the story Trump Might Be The First President To Scrap A National Monument.

This data was compiled by the National Parks Conservation Association and includes national monuments that were created by presidents by under the Antiquities Act. It does not include national monuments created by Congress.

Header	Definition
`current_name`	Current name of piece of land designated under the Antiquities Act
`states`	State(s) or territory where land is located
`original_name`	If included, original name of piece of land designated under the Antiquities Act
`current_agency`	Current land management agency. NPS = National Parks Service, BLM = Bureau of Land Management, USFS = US Forest Service, FWS = US Fish and Wildlife Service, NOAA = National Oceanic and National Oceanic and Atmospheric Administration
`action`	Type of action taken on land
`date`	Date of action
`year`	Year of action
`pres_or_congress`	President or congress that issued action
`acres_affected`	Acres affected by action. Note that total current acreage is not included. National monuments that cover ocean are listed in square miles.

Sources: National Parks Conservation Association and National Parks Service Archeology Program

Context

This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using GitHub's API and Kaggle's API.

This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

Cover photo by Nick Tiemeyer on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

Mortgage Debt Outstanding
kaggle.com
Updated Dec 24, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Reserve (2019). Mortgage Debt Outstanding [Dataset]. https://www.kaggle.com/federalreserve/mortgage-debt-outstanding/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 24, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Federal Reserve
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Andrew Neel on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
Federal Debt as Percent of GDP
kaggle.com
zip
Updated Dec 12, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
St. Louis Fed (2019). Federal Debt as Percent of GDP [Dataset]. https://www.kaggle.com/datasets/stlouisfed/federal-debt-as-percent-of-gdp
Explore at:
zip(4872 bytes)Available download formats
Dataset updated
Dec 12, 2019
Dataset provided by
Federal Reserve Bank Of St. Louishttps://www.stlouisfed.org/
Authors
St. Louis Fed
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve Bank of St. Louis hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve Bank of St. Louis using Kaggle and all of the data sources available through the St. Louis Fed organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Noah Silliman on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
Commercial Paper Time Series
kaggle.com
zip
Updated Dec 13, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Reserve (2019). Commercial Paper Time Series [Dataset]. https://www.kaggle.com/federalreserve/commercial-paper-time-series
Explore at:
zip(25741 bytes)Available download formats
Dataset updated
Dec 13, 2019
Dataset provided by
Federal Reserve Systemhttp://www.federalreserve.gov/
Authors
Federal Reserve
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Copper and Wild on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
1-Year Treasury Constant Maturity Rate
kaggle.com
zip
Updated Dec 25, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Reserve (2019). 1-Year Treasury Constant Maturity Rate [Dataset]. https://www.kaggle.com/federalreserve/1-year-treasury-constant-maturity-rate
Explore at:
zip(81448 bytes)Available download formats
Dataset updated
Dec 25, 2019
Dataset authored and provided by
Federal Reserve
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Amruth Pillai on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
World Bank Quarterly External Debt Statistics
kaggle.com
zip
Updated May 4, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2019). World Bank Quarterly External Debt Statistics [Dataset]. https://www.kaggle.com/theworldbank/world-bank-quarterly-external-debt-statistics
Explore at:
zip(11652734 bytes)Available download formats
Dataset updated
May 4, 2019
Dataset authored and provided by
World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using the World Bank's APIs and Kaggle's API.

Cover photo by Markus Spiske on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
Total Revenue Time Series Collection
kaggle.com
Updated Dec 6, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US Census Bureau (2019). Total Revenue Time Series Collection [Dataset]. https://www.kaggle.com/census/total-revenue-time-series-collection/tasks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 6, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
US Census Bureau
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset from the U.S. Census Bureau hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according the amount of data that is brought in. Explore the U.S. Census Bureau using Kaggle and all of the data sources available through the U.S. Census Bureau organization page!

Update Frequency: This dataset is updated daily.

Acknowledgements

This dataset is maintained using FRED's API and Kaggle's API.

Cover photo by Nathan Dumlao on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

Facebook

Twitter

Click to copy link

Link copied

Cite

Brad Culbertson (2023). TikTok Trending Metadata [Dataset]. https://www.kaggle.com/datasets/vbradculbertson/tiktok-trending-metadata

TikTok Trending Metadata

Video metadata scraped from TikTok's API, obtained from Ivan Tran on GitHub.

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Feb 24, 2023

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Brad Culbertson

Description

The dataset was originally obtained from TikTok's trending API by a GitHub user named Ivan Tran. It contains metadata on engagement with user-created videos and user profile data. The original create time is in Unix timecode format and is extracted directly from the video id number. TikTok's API has become much more difficult to access recently, so more current data is harder to obtain. The hashtags column contains lists.

Clear search

Close search

Google apps

Main menu

TikTok Trending Metadata

MeDAL Dataset

Downloading the data

Loading FastText Embeddings

Model Quickstart

Using Torch Hub

Using Huggingface transformers

Citation

License, Terms and Conditions

E-Commerce Retail Sales Series Data Collection

Content

Context

Acknowledgements

FiveThirtyEight Daily Show Guests Dataset

Content

Daily Show Guests

Context

Acknowledgements

League of Legends Master+ Players

GitHub repository

Why?

FiveThirtyEight NBA Winprobs Dataset

Context

Acknowledgements

FiveThirtyEight Bob Ross Dataset

Content

Bob Ross

Context

Acknowledgements

FiveThirtyEight Goose Dataset

Content

Goose

Context

Acknowledgements

Canada / U.S. Foreign Exchange Rate

Content

Context

Acknowledgements

FiveThirtyEight Obama Commutations Dataset

Content

Obama Commutation

Context

Acknowledgements

World Development Indicators

Content

Context

Acknowledgements

US Treasury securities held by the Federal Reserve

Content

Context

Acknowledgements

FiveThirtyEight Mad Men Dataset

Content

Mad Men

Context

Acknowledgements

FiveThirtyEight Antiquities Act Dataset

Content

Antiquities Act

Context

Acknowledgements

Mortgage Debt Outstanding

Content

Context

Acknowledgements

Federal Debt as Percent of GDP

Content

Context

Acknowledgements

Commercial Paper Time Series

Content

Context

Acknowledgements

1-Year Treasury Constant Maturity Rate

Content

Context

Acknowledgements

World Bank Quarterly External Debt Statistics

Content

Context

Using Huggingface `transformers`