92 datasets found
  1. TikTok Trending Metadata

    • kaggle.com
    Updated Feb 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brad Culbertson (2023). TikTok Trending Metadata [Dataset]. https://www.kaggle.com/datasets/vbradculbertson/tiktok-trending-metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 24, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Brad Culbertson
    Description

    The dataset was originally obtained from TikTok's trending API by a GitHub user named Ivan Tran. It contains metadata on engagement with user-created videos and user profile data. The original create time is in Unix timecode format and is extracted directly from the video id number. TikTok's API has become much more difficult to access recently, so more current data is harder to obtain. The hashtags column contains lists.

  2. MeDAL Dataset

    • kaggle.com
    • opendatalab.com
    • +1more
    zip
    Updated Nov 16, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    xhlulu (2020). MeDAL Dataset [Dataset]. https://www.kaggle.com/xhlulu/medal-emnlp
    Explore at:
    zip(7324382521 bytes)Available download formats
    Dataset updated
    Nov 16, 2020
    Authors
    xhlulu
    Description

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2352583%2F868a18fb09d7a1d3da946d74a9857130%2FLogo.PNG?generation=1604973725053566&alt=media" alt="">

    Medical Dataset for Abbreviation Disambiguation for Natural Language Understanding (MeDAL) is a large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain. It was published at the ClinicalNLP workshop at EMNLP.

    💻 Code 🤗 Dataset (Hugging Face) 💾 Dataset (Kaggle) 💽 Dataset (Zenodo) 📜 Paper (ACL) 📝 Paper (Arxiv)Pre-trained ELECTRA (Hugging Face)

    Downloading the data

    We recommend downloading from Kaggle if you can authenticate through their API. The advantage to Kaggle is that the data is compressed, so it will be faster to download. Links to the data can be found at the top of the readme.

    First, you will need to create an account on kaggle.com. Afterwards, you will need to install the kaggle API: pip install kaggle

    Then, you will need to follow the instructions here to add your username and key. Once that's done, you can run: kaggle datasets download xhlulu/medal-emnlp

    Now, unzip everything and place them inside the data directory: unzip -nq crawl-300d-2M-subword.zip -d data mv data/pretrain_sample/* data/

    Loading FastText Embeddings

    For the LSTM models, we will need to use the fastText embeddings. To do so, first download and extract the weights: wget -nc -P data/ https://dl.fbaipublicfiles.com/fasttext/vectors-english/crawl-300d-2M-subword.zip unzip -nq data/crawl-300d-2M-subword.zip -d data/

    Model Quickstart

    Using Torch Hub

    You can directly load LSTM and LSTM-SA with torch.hub: ```python import torch

    lstm = torch.hub.load("BruceWen120/medal", "lstm") lstm_sa = torch.hub.load("BruceWen120/medal", "lstm_sa") ```

    If you want to use the Electra model, you need to first install transformers: pip install transformers Then, you can load it with torch.hub: python import torch electra = torch.hub.load("BruceWen120/medal", "electra")

    Using Huggingface transformers

    If you are only interested in the pre-trained ELECTRA weights (without the disambiguation head), you can load it directly from the Hugging Face Repository:

    from transformers import AutoModel, AutoTokenizer
    
    model = AutoModel.from_pretrained("xhlu/electra-medal")
    tokenizer = AutoTokenizer.from_pretrained("xhlu/electra-medal")
    

    Citation

    Download the bibtex here, or copy the text below: @inproceedings{wen-etal-2020-medal, title = "{M}e{DAL}: Medical Abbreviation Disambiguation Dataset for Natural Language Understanding Pretraining", author = "Wen, Zhi and Lu, Xing Han and Reddy, Siva", booktitle = "Proceedings of the 3rd Clinical Natural Language Processing Workshop", month = nov, year = "2020", address = "Online", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2020.clinicalnlp-1.15", pages = "130--135", }

    License, Terms and Conditions

    The ELECTRA model is licensed under Apache 2.0. The license for the libraries used in this project (transformers, pytorch, etc.) can be found in their respective GitHub repository. Our model is released under a MIT license.

    The original dataset was retrieved and modified from the NLM website. By using this dataset, you are bound by the terms and conditions specified by NLM:

    INTRODUCTION

    Downloading data from the National Library of Medicine FTP servers indicates your acceptance of the following Terms and Conditions: No charges, usage fees or royalties are paid to NLM for this data.

    MEDLINE/PUBMED SPECIFIC TERMS

    NLM freely provides PubMed/MEDLINE data. Please note some PubMed/MEDLINE abstracts may be protected by copyright.

    GENERAL TERMS AND CONDITIONS

    • Users of the data agree to:

      • acknowledge NLM as the source of the data by including the phrase "Courtesy of the U.S. National Library of Medicine" in a clear and conspicuous manner,
      • properly use registration and/or trademark symbols when referring to NLM products, and
      • not indicate or imply that NLM has endorsed its products/services/applications.
    • Users who republish or redistribute the data (services, products or raw data) agree to:

      • maintain the most current version of all distributed data, or
      • make known in a clear and conspicuous manner that the products/services/applications do not reflect the most current/accurate data available from NLM.
    • These data are produced with a reasonable standard of care, but NLM makes no warranties express or implied, including no warranty of merchantability or fitness for particular purpose, regarding the accuracy or completeness of the data. Users agree to hold NLM and the U.S. Government harmless from any liability resulting from errors in the data. NLM disclaims any liability for any consequences due to use, misuse, or interpretation of information contained or not contained in the data.

    • NLM does not provide legal advice regarding copyright, fair use, or other aspects of intellectual property rights. See the NLM Copyright page.

    • NLM reserves the right to change the type and format of its machine-readable data. NLM will take reasonable steps to inform users of any changes to the format of the data before the data are distributed via the announcement section or subscription to email and RSS updates.

  3. E-Commerce Retail Sales Series Data Collection

    • kaggle.com
    Updated Dec 7, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US Census Bureau (2019). E-Commerce Retail Sales Series Data Collection [Dataset]. https://www.kaggle.com/datasets/census/e-commerce-retail-sales-series-data-collection
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 7, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    US Census Bureau
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the U.S. Census Bureau hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according the amount of data that is brought in. Explore the U.S. Census Bureau using Kaggle and all of the data sources available through the U.S. Census Bureau organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

  4. FiveThirtyEight Daily Show Guests Dataset

    • kaggle.com
    zip
    Updated Jan 13, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Daily Show Guests Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-daily-show-guests-dataset
    Explore at:
    zip(37571 bytes)Available download formats
    Dataset updated
    Jan 13, 2019
    Dataset authored and provided by
    FiveThirtyEighthttps://abcnews.go.com/538
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Daily Show Guests

    This folder contains data behind the story Every Guest Jon Stewart Ever Had On ‘The Daily Show’.

    HeaderDefinition
    YEARThe year the episode aired
    GoogleKnowlege_OccupationTheir occupation or office, according to Google's Knowledge Graph or, if they're not in there, how Stewart introduced them on the program.
    ShowAir date of episode. Not unique, as some shows had more than one guest
    GroupA larger group designation for the occupation. For instance, us senators, us presidents, and former presidents are all under "politicians"
    Raw_Guest_ListThe person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row.

    Source: Google Knowlege Graph, The Daily Show clip library, Wikipedia.

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

    Cover photo by Oscar Nord on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  5. League of Legends Master+ Players

    • kaggle.com
    Updated Sep 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ignacio Guillermo Martinez (2021). League of Legends Master+ Players [Dataset]. https://www.kaggle.com/jasperan/league-of-legends-master-players/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 22, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ignacio Guillermo Martinez
    Description

    GitHub repository

    Click Here

    Why?

    I am writing articles on League of Legends and Machine Learning. You can find the full repository where this information is stored here.

  6. FiveThirtyEight NBA Winprobs Dataset

    • kaggle.com
    Updated Jan 13, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight NBA Winprobs Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-nba-winprobs-dataset/versions/69
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 13, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    FiveThirtyEight
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

  7. FiveThirtyEight Bob Ross Dataset

    • kaggle.com
    Updated Jan 10, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Bob Ross Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-bob-ross-dataset/versions/105
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 10, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    FiveThirtyEight
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Bob Ross

    This folder contains data behind the story A Statistical Analysis of the Work of Bob Ross.

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

    Cover photo by Alex Kotomanov on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  8. FiveThirtyEight Goose Dataset

    • kaggle.com
    Updated Apr 26, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Goose Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-goose-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 26, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    FiveThirtyEight
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Goose

    This folder contains data behind the stories: * The Save Ruined Relief Pitching. The Goose Egg Can Fix It * Kenley Jansen Is The Model Of A Modern Reliever

    HeaderDefinition
    namePitcher name
    yearStart year of season
    teamRetrosheet team code
    leagueNL or AL
    goose_eggsGoose eggs
    broken_eggsBroken eggs
    mehsMehs
    league_average_gpctLeague-average goose percentage
    ppfPitcher park factor
    replacement_gpctReplacement-level goose pecentage
    gwarGoose Wins Above Replacement
    key_retroRetrosheet unique player identifier

    Source: Retrosheet

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

  9. Canada / U.S. Foreign Exchange Rate

    • kaggle.com
    zip
    Updated Dec 24, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Reserve (2019). Canada / U.S. Foreign Exchange Rate [Dataset]. https://kaggle.com/federalreserve/canada--u.s.-foreign-exchange-rate
    Explore at:
    zip(66892 bytes)Available download formats
    Dataset updated
    Dec 24, 2019
    Dataset authored and provided by
    Federal Reserve
    Area covered
    Canada
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Ben Waardenburg on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  10. FiveThirtyEight Obama Commutations Dataset

    • kaggle.com
    zip
    Updated Jan 4, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Obama Commutations Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-obama-commutations-dataset
    Explore at:
    zip(87648 bytes)Available download formats
    Dataset updated
    Jan 4, 2019
    Dataset authored and provided by
    FiveThirtyEight
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Obama Commutation

    This folder contains data behind the story Obama Granted Clemency Unlike Any Other President In History.

    The data in obama_commutations.csv is copied from the Justice Department website. The python script parses it by looking at the first column to figure out what is contained in the second column.

    Source: Department of Justice

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

  11. World Development Indicators

    • kaggle.com
    zip
    Updated Apr 10, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). World Development Indicators [Dataset]. https://www.kaggle.com/theworldbank/world-development-indicators
    Explore at:
    zip(134125679 bytes)Available download formats
    Dataset updated
    Apr 10, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets

    Description

    Content

    The primary World Bank collection of development indicators, compiled from officially-recognized international sources. It presents the most current and accurate global development data available, and includes national, regional and global estimates.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by Alex Block on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  12. US Treasury securities held by the Federal Reserve

    • kaggle.com
    zip
    Updated Dec 10, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Reserve (2019). US Treasury securities held by the Federal Reserve [Dataset]. https://www.kaggle.com/federalreserve/us-treasury-securities-held-by-the-federal-reserve
    Explore at:
    zip(26923 bytes)Available download formats
    Dataset updated
    Dec 10, 2019
    Dataset provided by
    Federal Reserve Systemhttp://www.federalreserve.gov/
    Authors
    Federal Reserve
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

  13. FiveThirtyEight Mad Men Dataset

    • kaggle.com
    zip
    Updated Dec 13, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2018). FiveThirtyEight Mad Men Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-mad-men-dataset
    Explore at:
    zip(16691 bytes)Available download formats
    Dataset updated
    Dec 13, 2018
    Dataset authored and provided by
    FiveThirtyEighthttps://abcnews.go.com/538
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Mad Men

    This directory contains the data behind the story ‘Mad Men’ Is Ending. What’s Next For The Cast?

    The primary file show-data.csv contains data of actors who appeared on at least half the episodes of television shows that were nominated for an Emmy for Outstanding Drama since the year 2000. It contains the following variables:

    HeaderDefinition
    PerformerThe name of the actor, according to IMDb. This is not a unique identifier - two performers appeared in more than one program
    ShowThe television show where this actor appeared in more than half the episodes
    Show StartThe year the television show began
    Show EndThe year the television show ended, "PRESENT" if the show remains on the air as of May 10.
    Status?Why the actor is no longer on the program: "END" if the show has concluded, "LEFT" if the show remains on the air.
    CharEndThe year the character left the show. Equal to "Show End" if the performer stayed on until the final season.
    Years Since2015 minus CharEnd
    #LEADThe number of leading roles in films the performer has appeared in since and including "CharEnd", according to OpusData
    #SUPPORTThe number of leading roles in films the performer has appeared in since and including "CharEnd", according to OpusData
    #ShowsThe number of seasons of television of which the performer appeared in at least half the episodes since and including "CharEnd", according to OpusData
    Score#LEAD + #Shows + 0.25*(#SUPPORT)
    Score/Y"Score" divided by "Years Since"
    lead_notesThe list of films counted in #LEAD
    support_notesThe list of films counted in #SUPPORT
    show_notesThe seasons of shows counted in #Shows

    The supplemental file performer-scores.csv is the consolidated data from show-data.csv made into a pivot table.

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

  14. FiveThirtyEight Antiquities Act Dataset

    • kaggle.com
    zip
    Updated Feb 1, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Antiquities Act Dataset [Dataset]. https://www.kaggle.com/fivethirtyeight/fivethirtyeight-antiquities-act-dataset
    Explore at:
    zip(8324 bytes)Available download formats
    Dataset updated
    Feb 1, 2019
    Dataset authored and provided by
    FiveThirtyEighthttps://abcnews.go.com/538
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Antiquities Act

    This folder contains the data behind the story Trump Might Be The First President To Scrap A National Monument.

    This data was compiled by the National Parks Conservation Association and includes national monuments that were created by presidents by under the Antiquities Act. It does not include national monuments created by Congress.

    HeaderDefinition
    current_nameCurrent name of piece of land designated under the Antiquities Act
    statesState(s) or territory where land is located
    original_nameIf included, original name of piece of land designated under the Antiquities Act
    current_agencyCurrent land management agency. NPS = National Parks Service, BLM = Bureau of Land Management, USFS = US Forest Service, FWS = US Fish and Wildlife Service, NOAA = National Oceanic and National Oceanic and Atmospheric Administration
    actionType of action taken on land
    dateDate of action
    yearYear of action
    pres_or_congressPresident or congress that issued action
    acres_affectedAcres affected by action. Note that total current acreage is not included. National monuments that cover ocean are listed in square miles.

    Sources: National Parks Conservation Association and National Parks Service Archeology Program

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

    Cover photo by Nick Tiemeyer on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  15. Mortgage Debt Outstanding

    • kaggle.com
    Updated Dec 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Reserve (2019). Mortgage Debt Outstanding [Dataset]. https://www.kaggle.com/federalreserve/mortgage-debt-outstanding/activity
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 24, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Federal Reserve
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Andrew Neel on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  16. Federal Debt as Percent of GDP

    • kaggle.com
    zip
    Updated Dec 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    St. Louis Fed (2019). Federal Debt as Percent of GDP [Dataset]. https://www.kaggle.com/datasets/stlouisfed/federal-debt-as-percent-of-gdp
    Explore at:
    zip(4872 bytes)Available download formats
    Dataset updated
    Dec 12, 2019
    Dataset provided by
    Federal Reserve Bank Of St. Louishttps://www.stlouisfed.org/
    Authors
    St. Louis Fed
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve Bank of St. Louis hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve Bank of St. Louis using Kaggle and all of the data sources available through the St. Louis Fed organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Noah Silliman on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  17. Commercial Paper Time Series

    • kaggle.com
    zip
    Updated Dec 13, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Reserve (2019). Commercial Paper Time Series [Dataset]. https://www.kaggle.com/federalreserve/commercial-paper-time-series
    Explore at:
    zip(25741 bytes)Available download formats
    Dataset updated
    Dec 13, 2019
    Dataset provided by
    Federal Reserve Systemhttp://www.federalreserve.gov/
    Authors
    Federal Reserve
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Copper and Wild on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  18. 1-Year Treasury Constant Maturity Rate

    • kaggle.com
    zip
    Updated Dec 25, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Reserve (2019). 1-Year Treasury Constant Maturity Rate [Dataset]. https://www.kaggle.com/federalreserve/1-year-treasury-constant-maturity-rate
    Explore at:
    zip(81448 bytes)Available download formats
    Dataset updated
    Dec 25, 2019
    Dataset authored and provided by
    Federal Reserve
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the Federal Reserve hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according to the frequency that the data updates. Explore the Federal Reserve using Kaggle and all of the data sources available through the Federal Reserve organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Amruth Pillai on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  19. World Bank Quarterly External Debt Statistics

    • kaggle.com
    zip
    Updated May 4, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). World Bank Quarterly External Debt Statistics [Dataset]. https://www.kaggle.com/theworldbank/world-bank-quarterly-external-debt-statistics
    Explore at:
    zip(11652734 bytes)Available download formats
    Dataset updated
    May 4, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by Markus Spiske on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  20. Total Revenue Time Series Collection

    • kaggle.com
    Updated Dec 6, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US Census Bureau (2019). Total Revenue Time Series Collection [Dataset]. https://www.kaggle.com/census/total-revenue-time-series-collection/tasks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    US Census Bureau
    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset from the U.S. Census Bureau hosted by the Federal Reserve Economic Database (FRED). FRED has a data platform found here and they update their information according the amount of data that is brought in. Explore the U.S. Census Bureau using Kaggle and all of the data sources available through the U.S. Census Bureau organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using FRED's API and Kaggle's API.

    Cover photo by Nathan Dumlao on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Brad Culbertson (2023). TikTok Trending Metadata [Dataset]. https://www.kaggle.com/datasets/vbradculbertson/tiktok-trending-metadata
Organization logo

TikTok Trending Metadata

Video metadata scraped from TikTok's API, obtained from Ivan Tran on GitHub.

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 24, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Brad Culbertson
Description

The dataset was originally obtained from TikTok's trending API by a GitHub user named Ivan Tran. It contains metadata on engagement with user-created videos and user profile data. The original create time is in Unix timecode format and is extracted directly from the video id number. TikTok's API has become much more difficult to access recently, so more current data is harder to obtain. The hashtags column contains lists.

Search
Clear search
Close search
Google apps
Main menu