4 datasets found
  1. o

    Meta_Album_INS_Extended

    • openml.org
    Updated Nov 8, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ihsan Ullah (2022). Meta_Album_INS_Extended [Dataset]. https://www.openml.org/d/44340
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 8, 2022
    Authors
    Ihsan Ullah
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Meta-Album Insects Dataset (Extended)

    The original Insects dataset is created by the National Museum of Natural History, Paris (https://www.mnhn.fr/fr). It has more than 290 000 images in different sizes and orientations. The dataset has hierarchical classes which are listed from top to bottom as Order, Super-Family, Family, and Texa. Each image contains an insect in its natural environment or habitat, i.e, either on a flower or near to vegetation. The images are collected by the researchers and hundreds of volunteers from SPIPOLL Science project(https://www.spipoll.org/). The images are uploaded to a centralized server either by using the SPIPOLL website, Android application or IOS application. The preprocessed insect dataset is prepared from the original Insects dataset by carefully preprocessing the images, i.e., cropping the images from either side to make squared images. These cropped images are then resized into 128x128 using Open-CV with an anti-aliasing filter.

    Dataset Details

    https://meta-album.github.io/assets/img/samples/INS.png" alt="">

    Meta Album ID: SM_AM.INS
    Meta Album URL: https://meta-album.github.io/datasets/INS.html
    Domain ID: SM_AM
    Domain Name: Small Aninamls
    Dataset ID: INS
    Dataset Name: Insects
    Short Description: Insects dataset from Science Project SPIPOLL
    # Classes: 117
    # Images: 170506
    Keywords: insects, ecology
    Data Format: images
    Image size: 128x128

    License (original data release): CC BY-NC 2.0
    License URL(original data release): https://www.spipoll.org/mentions-legales

    License (Meta-Album data release): CC BY-NC 2.0
    License URL (Meta-Album data release): https://creativecommons.org/licenses/by-nc/2.0/

    Source: SPIPOLL; National Museum of Natural History, Paris
    Source URL: https://www.spipoll.org/

    Original Author: Gregoire Lois, Colin Fontaine, Jean-Francois Julien
    Original contact: contact@spipoll.org

    Meta Album author: Ihsan Ullah
    Created Date: 01 March 2022
    Contact Name: Ihsan Ullah
    Contact Email: meta-album@chalearn.org
    Contact URL: https://meta-album.github.io/

    Cite this dataset

    @article{insects, 
      title={Data quality and participant engagement in citizen science: comparing two approaches for monitoring pollinators in France and South Korea}, 
      author={Serret, Hortense and Deguines, Nicolas and Jang, Yikweon and Lois, Gregoire and Julliard, Romain}, 
      journal={Citizen Science: Theory and Practice}, 
      volume={4}, 
      number={1}, 
      pages={22}, 
      year={2019} 
    }
    

    Cite Meta-Album

    @inproceedings{meta-album-2022,
        title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
        author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
        booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
        url = {https://meta-album.github.io/},
        year = {2022}
      }
    

    More

    For more information on the Meta-Album dataset, please see the [NeurIPS 2022 paper]
    For details on the dataset preprocessing, please see the [supplementary materials]
    Supporting code can be found on our [GitHub repo]
    Meta-Album on Papers with Code [Meta-Album]

    Other versions of this dataset

    [Micro] [Mini]

  2. Metal Album Art by Subgenre

    • kaggle.com
    zip
    Updated Jul 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fraser Watt (2023). Metal Album Art by Subgenre [Dataset]. https://www.kaggle.com/datasets/fraserwtt/metal-album-art-by-subgenre
    Explore at:
    zip(375639527 bytes)Available download formats
    Dataset updated
    Jul 2, 2023
    Authors
    Fraser Watt
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Scraped this data from the Spotify API, searching for bands by genre, then cycling through those bands and saving the image from the album art URL. Put it together as my first lesson project for Fast.ai's Practical Deep Learning for Coders.

    Have gone for high level sub-genre classes (e.g. for these purposes am not interested that Blind Guardian were influenced by Thrash Metal, or that Fleshgod Apocalypse's use of orchestral elements might make them "Symphonic Death Metal"). There are also a lot of exclusions of bands being labelled incorrectly, or the first band that comes up in Spotify's API query clearly being a totally different band.

    Just working as a proof of concept, then I'll add in Thrash Metal and Death Metal. Feel free to fork / improve the web scraper I've got in the notebook here: https://github.com/fraserwat/metal-album-subgenre-classifier

    Once I've done that I won't be maintaining this as it's just for a fun little project to get me used to basic NN stuff.

  3. Metal Albums Artwork

    • kaggle.com
    zip
    Updated Feb 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benjamín Machín (2021). Metal Albums Artwork [Dataset]. https://www.kaggle.com/benjamnmachn/metal-albums-artwork
    Explore at:
    zip(1664251992 bytes)Available download formats
    Dataset updated
    Feb 15, 2021
    Authors
    Benjamín Machín
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Context

    This dataset was gathered in order to train a GAN to be able to generate heavy metal artwork.

    Content

    In each row you'll find basic artist info (artist name, country, status, main and second subgenres), an album name and album artwork URL. Note that since the artist info and the albums list were gathered from different sources, there are a lot of empty values in the artist info fields.

    Acknowledgements

    To build this dataset, I used: * https://pypi.org/project/metalparser/ * https://pypi.org/project/coverpy/ * https://github.com/jonchar/ma-scraper

    The header photo is a sample taken during training of the GAN.

  4. Nightwish Lyrics (1996-2020)

    • kaggle.com
    zip
    Updated Apr 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuan Meng (2020). Nightwish Lyrics (1996-2020) [Dataset]. https://www.kaggle.com/crazyrichbayesians/nightwish-lyrics
    Explore at:
    zip(94244 bytes)Available download formats
    Dataset updated
    Apr 21, 2020
    Authors
    Yuan Meng
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    Nightwish is a symphonic power metal band from Finland and one of the biggest names in the European metal scene. Since 1996, they have released 9 studio albums and numerous singles, EP's, and live albums. The most recent album Human. :II: Nature. just came out this April. Throughout their career, Nightwish has explored many different themes, ranging from love and sorrow to science and nature. It might be super interesting to look at how the lyrical themes of Nightwish have evolved over the past 24 years.

    I scraped all the lyrics of Nightwish from the metal lyrics archive Dark Lyrics using GitHub user medakk's script.

    Content

    The raw data scraped from Dark Lyrics is stored in a plain text file (nightwish_lyrics.txt). To increase data usability, I extracted the following information and stored it in a CSV file (nightwish_lyrics.csv):

    • lyric: Each row is one line of lyric
    • album_title: The title of the album in which the lyric was from
    • year: The year in which the album was released
    • track_title: The title of the corresponding song track
    • track_number: The track number of the song in an album

    You can use the cleaned CSV file or start from the plain text file and do your own text mining!

    Inspiration

    I was inspired by the Taylor Swift lyrics dataset and thought we could perform similar analyses on Nightwish. Some example questions to explore:

    1. What are the most common themes (or topics) in Nightwish's lyrics?
    2. How have Nightwish's lyrical themes changed over the years?
    3. Which songs/albums are most similar/dissimilar to one another?
    4. Can you predict the Nightwish album based on lyrics?
    5. Can you generate new lyrics in the style of Nightwish?
  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ihsan Ullah (2022). Meta_Album_INS_Extended [Dataset]. https://www.openml.org/d/44340

Meta_Album_INS_Extended

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 8, 2022
Authors
Ihsan Ullah
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Meta-Album Insects Dataset (Extended)

The original Insects dataset is created by the National Museum of Natural History, Paris (https://www.mnhn.fr/fr). It has more than 290 000 images in different sizes and orientations. The dataset has hierarchical classes which are listed from top to bottom as Order, Super-Family, Family, and Texa. Each image contains an insect in its natural environment or habitat, i.e, either on a flower or near to vegetation. The images are collected by the researchers and hundreds of volunteers from SPIPOLL Science project(https://www.spipoll.org/). The images are uploaded to a centralized server either by using the SPIPOLL website, Android application or IOS application. The preprocessed insect dataset is prepared from the original Insects dataset by carefully preprocessing the images, i.e., cropping the images from either side to make squared images. These cropped images are then resized into 128x128 using Open-CV with an anti-aliasing filter.

Dataset Details

https://meta-album.github.io/assets/img/samples/INS.png" alt="">

Meta Album ID: SM_AM.INS
Meta Album URL: https://meta-album.github.io/datasets/INS.html
Domain ID: SM_AM
Domain Name: Small Aninamls
Dataset ID: INS
Dataset Name: Insects
Short Description: Insects dataset from Science Project SPIPOLL
# Classes: 117
# Images: 170506
Keywords: insects, ecology
Data Format: images
Image size: 128x128

License (original data release): CC BY-NC 2.0
License URL(original data release): https://www.spipoll.org/mentions-legales

License (Meta-Album data release): CC BY-NC 2.0
License URL (Meta-Album data release): https://creativecommons.org/licenses/by-nc/2.0/

Source: SPIPOLL; National Museum of Natural History, Paris
Source URL: https://www.spipoll.org/

Original Author: Gregoire Lois, Colin Fontaine, Jean-Francois Julien
Original contact: contact@spipoll.org

Meta Album author: Ihsan Ullah
Created Date: 01 March 2022
Contact Name: Ihsan Ullah
Contact Email: meta-album@chalearn.org
Contact URL: https://meta-album.github.io/

Cite this dataset

@article{insects, 
  title={Data quality and participant engagement in citizen science: comparing two approaches for monitoring pollinators in France and South Korea}, 
  author={Serret, Hortense and Deguines, Nicolas and Jang, Yikweon and Lois, Gregoire and Julliard, Romain}, 
  journal={Citizen Science: Theory and Practice}, 
  volume={4}, 
  number={1}, 
  pages={22}, 
  year={2019} 
}

Cite Meta-Album

@inproceedings{meta-album-2022,
    title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
    author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
    booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
    url = {https://meta-album.github.io/},
    year = {2022}
  }

More

For more information on the Meta-Album dataset, please see the [NeurIPS 2022 paper]
For details on the dataset preprocessing, please see the [supplementary materials]
Supporting code can be found on our [GitHub repo]
Meta-Album on Papers with Code [Meta-Album]

Other versions of this dataset

[Micro] [Mini]

Search
Clear search
Close search
Google apps
Main menu