16 datasets found
  1. 70,000 Real Faces 5

    • kaggle.com
    zip
    Updated Jan 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). 70,000 Real Faces 5 [Dataset]. https://www.kaggle.com/datasets/tunguz/70000-real-faces-5
    Explore at:
    zip(20526725556 bytes)Available download formats
    Dataset updated
    Jan 8, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  2. 1 Million Fake Faces - 3

    • kaggle.com
    zip
    Updated Nov 15, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2019). 1 Million Fake Faces - 3 [Dataset]. https://www.kaggle.com/tunguz/1-million-fake-faces-3
    Explore at:
    zip(16656441943 bytes)Available download formats
    Dataset updated
    Nov 15, 2019
    Authors
    Bojan Tunguz
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Released under Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

    Contents

  3. Major Cities of the World

    • kaggle.com
    Updated Apr 2, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Major Cities of the World [Dataset]. https://www.kaggle.com/tunguz/major-cities-of-the-world/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 2, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bojan Tunguz
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Area covered
    World
    Description

    List of major cities in the world

    Data

    The data is extracted from geonames, a very exhaustive list of worldwide toponyms.

    This datapackage only list cities above 15,000 inhabitants. Each city is associated with its country and subcountry to reduce the number of ambiguities. Subcountry can be the name of a state (eg in United Kingdom or the United States of America) or the major administrative section (eg ''region'' in France''). See admin1 field on geonames website for further info about subcountry.

    Notice that : * some cities like Vatican city or Singapore are a whole state so they don't belong to any subcountry. Therefore subcountry is N/A. * There is no guaranty that a city has a unique name in a country and subcountry (At the time of writing, there are about 60 ambiguities). But for each city, the source data primary key geonameid is provided.

    Preparation

    You can run the script yourself to update the data and publish them to github : see scripts README

    License

    All data is licensed under the Creative Common Attribution License as is the original data from geonames. This means you have to credit geonames when using the data. And while no credit is formally required a link back or credit to Lexman and the Open Knowledge Foundation is much appreciated.

    All source code is licensed under the MIT licence.

  4. Motor Carrier Census Information

    • kaggle.com
    zip
    Updated Nov 13, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2019). Motor Carrier Census Information [Dataset]. https://www.kaggle.com/datasets/tunguz/motor-carrier-census-information/suggestions
    Explore at:
    zip(116869181 bytes)Available download formats
    Dataset updated
    Nov 13, 2019
    Authors
    Bojan Tunguz
    License

    https://www.usa.gov/government-works/https://www.usa.gov/government-works/

    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Released under U.S. Government Works

    Contents

  5. EfficientNet PyTorch 0.7.1

    • kaggle.com
    zip
    Updated Apr 18, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). EfficientNet PyTorch 0.7.1 [Dataset]. https://www.kaggle.com/datasets/tunguz/efficientnet-pytorch-071/discussion
    Explore at:
    zip(28651 bytes)Available download formats
    Dataset updated
    Apr 18, 2021
    Authors
    Bojan Tunguz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Released under CC0: Public Domain

    Contents

  6. Melanoma Resized Images Train 1024 2

    • kaggle.com
    zip
    Updated Aug 7, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). Melanoma Resized Images Train 1024 2 [Dataset]. https://www.kaggle.com/datasets/tunguz/melanoma-resized-images-train-1024-2/data
    Explore at:
    zip(32323205385 bytes)Available download formats
    Dataset updated
    Aug 7, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  7. Melanoma Resized Images 256

    • kaggle.com
    zip
    Updated Aug 6, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). Melanoma Resized Images 256 [Dataset]. https://www.kaggle.com/tunguz/melanoma-resized-images-256
    Explore at:
    zip(6418280293 bytes)Available download formats
    Dataset updated
    Aug 6, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  8. Population Estimates And Projections

    • kaggle.com
    Updated Apr 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Population Estimates And Projections [Dataset]. https://www.kaggle.com/tunguz/population-estimates-and-projections/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 20, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bojan Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    About the Dataset

    This database presents population and other demographic estimates and projections from 1960 to 2050, covering more than 200 economies. It includes population data by various age groups, sex, urban/rural; fertility data; mortality data; and migration data.

  9. Kidney EfficientUNet B7 512x512 1

    • kaggle.com
    zip
    Updated Dec 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). Kidney EfficientUNet B7 512x512 1 [Dataset]. https://www.kaggle.com/tunguz/kidney-efficientunet-b7-512x512-1
    Explore at:
    zip(3331911376 bytes)Available download formats
    Dataset updated
    Dec 14, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  10. Melanoma Resized Images 384

    • kaggle.com
    zip
    Updated Aug 6, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). Melanoma Resized Images 384 [Dataset]. https://www.kaggle.com/datasets/tunguz/melanoma-resized-images-384
    Explore at:
    zip(13959571805 bytes)Available download formats
    Dataset updated
    Aug 6, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  11. Country, Regional and World GDP

    • kaggle.com
    Updated Mar 29, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Country, Regional and World GDP [Dataset]. https://www.kaggle.com/tunguz/country-regional-and-world-gdp/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 29, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bojan Tunguz
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Area covered
    World
    Description

    Read me

    Country, regional and world GDP in current US Dollars ($). Regional means collections of countries e.g. Europe & Central Asia.

    Data

    The data is sourced from the World Bank, which in turn lists as sources: World Bank national accounts data, and OECD National Accounts data files.

  12. Melanoma Resized Images 768 Train

    • kaggle.com
    zip
    Updated Aug 7, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). Melanoma Resized Images 768 Train [Dataset]. https://www.kaggle.com/tunguz/melanoma-resized-images-768-train
    Explore at:
    zip(30082368266 bytes)Available download formats
    Dataset updated
    Aug 7, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  13. Residential property price - different countries

    • kaggle.com
    Updated Mar 31, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Residential property price - different countries [Dataset]. https://www.kaggle.com/tunguz/residential-property-price-different-countries/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 31, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bojan Tunguz
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Read me

    Residential property price statistics from different countries. Contains property price indicators (real series are the nominal price series deflated by the consumer price index), both in levels and in growth rates. Can be used for property market analysis.

    Data

    This data comes from Bank For International Settlements BIS.

  14. Data from: Comic Characters

    • kaggle.com
    Updated Apr 21, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Comic Characters [Dataset]. https://www.kaggle.com/tunguz/comic-characters/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 21, 2021
    Dataset provided by
    Kaggle
    Authors
    Bojan Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comic Characters

    This folder contains data behind the story Comic Books Are Still Made By Men, For Men And About Men.

    The data comes from Marvel Wikia and DC Wikia. Characters were scraped on August 24. Appearance counts were scraped on September 2. The month and year of the first issue each character appeared in was pulled on October 6.

    The data is split into two files, for DC and Marvel, respectively: dc-wikia-data.csv and marvel-wikia-data.csv. Each file has the following variables:

    VariableDefinition
    page_idThe unique identifier for that characters page within the wikia
    nameThe name of the character
    urlslugThe unique url within the wikia that takes you to the character
    IDThe identity status of the character (Secret Identity, Public identity, [on marvel only: No Dual Identity])
    ALIGNIf the character is Good, Bad or Neutral
    EYEEye color of the character
    HAIRHair color of the character
    SEXSex of the character (e.g. Male, Female, etc.)
    GSMIf the character is a gender or sexual minority (e.g. Homosexual characters, bisexual characters)
    ALIVEIf the character is alive or deceased
    APPEARANCESThe number of appareances of the character in comic books (as of Sep. 2, 2014. Number will become increasingly out of date as time goes on.)
    FIRST APPEARANCEThe month and year of the character's first appearance in a comic book, if available
    YEARThe year of the character's first appearance in a comic book, if available
  15. SIIM-ISIC Melanoma Resized Images

    • kaggle.com
    zip
    Updated May 30, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2020). SIIM-ISIC Melanoma Resized Images [Dataset]. https://www.kaggle.com/tunguz/siimisic-melanoma-resized-images
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    May 30, 2020
    Authors
    Bojan Tunguz
    Description

    Dataset

    This dataset was created by Bojan Tunguz

    Contents

  16. COVID-19 Death Counts in the US by County

    • kaggle.com
    Updated Nov 22, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). COVID-19 Death Counts in the US by County [Dataset]. https://www.kaggle.com/tunguz/covid19-death-counts-in-the-us-by-county/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 22, 2021
    Dataset provided by
    Kaggle
    Authors
    Bojan Tunguz
    License

    https://www.usa.gov/government-works/https://www.usa.gov/government-works/

    Area covered
    United States
    Description

    Context

    Provisional count of deaths involving coronavirus disease 2019 (COVID-19) by county of occurrence, in the United States, 2020-2021.

    Contact Name

    National Center for Health Statistics

    Footnotes

    Deaths with confirmed or presumed COVID-19, coded to ICD–10 code U07.1. Counties included in this table have more than one (1) death overall at the time of analysis. Number of deaths reported in this table are the total number of deaths received and coded as of the date of analysis and do not represent all deaths that occurred in that period. Data during this period are incomplete because of the lag in time between when the death occurred and when the death certificate is completed, submitted to NCHS and processed for reporting purposes.

  17. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bojan Tunguz (2020). 70,000 Real Faces 5 [Dataset]. https://www.kaggle.com/datasets/tunguz/70000-real-faces-5
Organization logo

70,000 Real Faces 5

Explore at:
zip(20526725556 bytes)Available download formats
Dataset updated
Jan 8, 2020
Authors
Bojan Tunguz
Description

Dataset

This dataset was created by Bojan Tunguz

Contents

Search
Clear search
Close search
Google apps
Main menu