4 datasets found
  1. The Items Dataset

    • zenodo.org
    Updated Nov 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick Egan; Patrick Egan (2024). The Items Dataset [Dataset]. http://doi.org/10.5281/zenodo.10964134
    Explore at:
    Dataset updated
    Nov 13, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Patrick Egan; Patrick Egan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset originally created 03/01/2019 UPDATE: Packaged on 04/18/2019 UPDATE: Edited README on 04/18/2019

    I. About this Data Set This data set is a snapshot of work that is ongoing as a collaboration between Kluge Fellow in Digital Studies, Patrick Egan and an intern at the Library of Congress in the American Folklife Center. It contains a combination of metadata from various collections that contain audio recordings of Irish traditional music. The development of this dataset is iterative, and it integrates visualizations that follow the key principles of trust and approachability. The project, entitled, “Connections In Sound” invites you to use and re-use this data.

    The text available in the Items dataset is generated from multiple collections of audio material that were discovered at the American Folklife Center. Each instance of a performance was listed and “sets” or medleys of tunes or songs were split into distinct instances in order to allow machines to read each title separately (whilst still noting that they were part of a group of tunes). The work of the intern was then reviewed before publication, and cross-referenced with the tune index at www.irishtune.info. The Items dataset consists of just over 1000 rows, with new data being added daily in a separate file.

    The collections dataset contains at least 37 rows of collections that were located by a reference librarian at the American Folklife Center. This search was complemented by searches of the collections by the scholar both on the internet at https://catalog.loc.gov and by using card catalogs.

    Updates to these datasets will be announced and published as the project progresses.

    II. What’s included? This data set includes:

    • The Items Dataset – a .CSV containing Media Note, OriginalFormat, On Website, Collection Ref, Missing In Duplication, Collection, Outside Link, Performer, Solo/multiple, Sub-item, type of tune, Tune, Position, Location, State, Date, Notes/Composer, Potential Linked Data, Instrument, Additional Notes, Tune Cleanup. This .CSV is the direct export of the Items Google Spreadsheet

    III. How Was It Created? These data were created by a Kluge Fellow in Digital Studies and an intern on this program over the course of three months. By listening, transcribing, reviewing, and tagging audio recordings, these scholars improve access and connect sounds in the American Folklife Collections by focusing on Irish traditional music. Once transcribed and tagged, information in these datasets is reviewed before publication.

    IV. Data Set Field Descriptions

    IV

    a) Collections dataset field descriptions

    • ItemId – this is the identifier for the collection that was found at the AFC
    • Viewed – if the collection has been viewed, or accessed in any way by the researchers.
    • On LOC – whether or not there are audio recordings of this collection available on the Library of Congress website.
    • On Other Website – if any of the recordings in this collection are available elsewhere on the internet
    • Original Format – the format that was used during the creation of the recordings that were found within each collection
    • Search – this indicates the type of search that was performed in order that resulted in locating recordings and collections within the AFC
    • Collection – the official title for the collection as noted on the Library of Congress website
    • State – The primary state where recordings from the collection were located
    • Other States – The secondary states where recordings from the collection were located
    • Era / Date – The decade or year associated with each collection
    • Call Number – This is the official reference number that is used to locate the collections, both in the urls used on the Library website, and in the reference search for catalog cards (catalog cards can be searched at this address: https://memory.loc.gov/diglib/ihas/html/afccards/afccards-home.html)
    • Finding Aid Online? – Whether or not a finding aid is available for this collection on the internet

    b) Items dataset field descriptions

    • id – the specific identification of the instance of a tune, song or dance within the dataset
    • Media Note – Any information that is included with the original format, such as identification, name of physical item, additional metadata written on the physical item
    • Original Format – The physical format that was used when recording each specific performance. Note: this field is used in order to calculate the number of physical items that were created in each collection such as 32 wax cylinders.
    • On Webste? – Whether or not each instance of a performance is available on the Library of Congress website
    • Collection Ref – The official reference number of the collection
    • Missing In Duplication – This column marks if parts of some recordings had been made available on other websites, but not all of the recordings were included in duplication (see recordings from Philadelphia Céilí Group on Villanova University website)
    • Collection – The official title of the collection given by the American Folklife Center
    • Outside Link – If recordings are available on other websites externally
    • Performer – The name of the contributor(s)
    • Solo/multiple – This field is used to calculate the amount of solo performers vs group performers in each collection
    • Sub-item – In some cases, physical recordings contained extra details, the sub-item column was used to denote these details
    • Type of item – This column describes each individual item type, as noted by performers and collectors
    • Item – The item title, as noted by performers and collectors. If an item was not described, it was entered as “unidentified”
    • Position – The position on the recording (in some cases during playback, audio cassette player counter markers were used)
    • Location – Local address of the recording
    • State – The state where the recording was made
    • Date – The date that the recording was made
    • Notes/Composer – The stated composer or source of the item recorded
    • Potential Linked Data – If items may be linked to other recordings or data, this column was used to provide examples of potential relationships between them
    • Instrument – The instrument(s) that was used during the performance
    • Additional Notes – Notes about the process of capturing, transcribing and tagging recordings (for researcher and intern collaboration purposes)
    • Tune Cleanup – This column was used to tidy each item so that it could be read by machines, but also so that spelling mistakes from the Item column could be corrected, and as an aid to preserving iterations of the editing process

    V. Rights statement The text in this data set was created by the researcher and intern and can be used in many different ways under creative commons with attribution. All contributions to Connections In Sound are released into the public domain as they are created. Anyone is free to use and re-use this data set in any way they want, provided reference is given to the creators of these datasets.

    VI. Creator and Contributor Information

    Creator: Connections In Sound

    Contributors: Library of Congress Labs

    VII. Contact Information Please direct all questions and comments to Patrick Egan via www.twitter.com/drpatrickegan or via his website at www.patrickegan.org. You can also get in touch with the Library of Congress Labs team via LC-Labs@loc.gov.

  2. K

    A Survey of Irish Writers in The New Yorker, 1940-1980

    • rdr.kuleuven.be
    csv, txt
    Updated Sep 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yen-Chi Wu; Yen-Chi Wu (2023). A Survey of Irish Writers in The New Yorker, 1940-1980 [Dataset]. http://doi.org/10.48804/P3WWQR
    Explore at:
    csv(14395), txt(853)Available download formats
    Dataset updated
    Sep 27, 2023
    Dataset provided by
    KU Leuven RDR
    Authors
    Yen-Chi Wu; Yen-Chi Wu
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    Ireland
    Dataset funded by
    European Commission
    Description

    This dataset provides a survey of Irish writers publishing in The New Yorker magazine from 1940 to 1980. Methodology I conduct the survey through archival research and secondary references. The primary sources are The New Yorker's digital archive and the New Yorker Records housed in the New York Public Library. Parameters The timeframe of the survey concerns The New Yorker’s international expansion in the middle decades of the twentieth century. It starts from 1940 and ends in 1980, when the magazine industry’s cultural impact was eclipsed by the popularity of TV. For the purpose of the project, I focus on "fiction" contributions. Verse and shorter writings (such as column pieces and book reviews) are not included. Therefore, Maeve Brennan's shorter pieces under her alias "the long-winded lady" and Patricia Collinge's shorter contributions are not included in the quantatative survey. This survey includes both Irish and Irish-American writers. One key criterium of the selection is the writer’s connection with Ireland and Irish culture. Irish-American writers whose works are more concerned about (Irish-)America rather than Ireland itself are excluded from the survey. Therefore, Elizabeth Cullinan and J.P. Donleavy are included, while John O’Hara and Mary McCarthy are not. Notes for Users The list is presented in the chronological order of the contributions’ appearance in the magazine. The date format follows the international convention (ISO8601), thus: year/month/day. The date refers to the publication of The New Yorker issues. The New Yorker is a weekly, and the timeframe of the project spans four decades. This means that there are thousands of back issues under examination. I acknowledge the possibility that there are Irish writers whose contributions in the magazine escaped my attention. If there is any omission, I would appreciate the user’s input to update the survey. It is hoped that this survey will help researchers investigate the Irish connections with one of America’s most influential publications. Teachers, students, and the general public may also use this list as a guide to better appreciate these fascinating Irish stories.

  3. A transnational newspaper dataset covering Spenceanism

    • zenodo.org
    csv
    Updated Nov 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anselm Küsters; Anselm Küsters; Matilde Cazzola; Matilde Cazzola (2023). A transnational newspaper dataset covering Spenceanism [Dataset]. http://doi.org/10.5281/zenodo.7696185
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 30, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anselm Küsters; Anselm Küsters; Matilde Cazzola; Matilde Cazzola
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To analyse the media legacy of English radical thinker Thomas Spence (1750–1814), his revolutionary "Plan", and his disciples (the "Spencean Philanthropists") in global press circuits, a dataset consisting of 275 Spencean-related articles in newspapers from Ireland (106 articles), the British West Indies (35), British India (29), the Australian colonies (4), Canada (1), and the United States (100) was created. The corpus consists of either the full text of each article or relevant extracts, as well as additional metadata such as source, date, title (if applicable), keyword, and region. In particular, the following databases have been relied upon: the Irish Newspaper Archives for Ireland, the Caribbean Newspapers: Digital Library of the Caribbean and Caribbean Newspapers 1718–1876 for the British West Indies, Newspapers & Gazettes – Trove for Australia, America's Historical Newspapers and Chronicling America: Historic American Newspapers for the US, Newspapers.com by Ancestry for Canada, Ireland, and the US, and the British Newspaper Archive for British India, Ireland, and the Caribbean. These databases were searched using the following keywords: "Thomas Spence" [1750–1814], "Spence's Plan", "Spencean" / "Spenceans", and "Spenceanism"; "swinish multitude", "people's farm", and "pigs' meat" were also used. All databases checked for Australia, the Caribbean, India, and Canada have been exhausted, and many Irish and US-American articles have also been grabbed. Due to the low scan quality of databases, most articles needed to be copied and corrected manually. The 275 articles of the corpus consist of 167,515 tokens. In addition, 157 articles on Spence and the Spenceans from British newspapers have been downloaded, too, and were added to this dataset for qualitative analysis and comparative purposes. Overall, the dataset made available here thus consists of 432 articles. The results from analysing this corpus are shown and discussed in a paper entitled "Transnational Echoes of Spenceanism: A Text Mining Exploration in English-Language Newspapers (1790–1850)", which is accepted for publication in the International Review of Social History (IRSH).

  4. n

    Mick Moloney Irish-American Music and Popular Culture Commercial Recordings...

    • ultraviolet.library.nyu.edu
    bin, pdf
    Updated Apr 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Josephine Jenks; Josephine Jenks; Kimberly Tarr; Kimberly Tarr (2025). Mick Moloney Irish-American Music and Popular Culture Commercial Recordings Collection FTIR dataset (AIA.031.001, Conservation ID 22_060) [Dataset]. http://doi.org/10.58153/fbspx-gaw98
    Explore at:
    bin, pdfAvailable download formats
    Dataset updated
    Apr 25, 2025
    Dataset provided by
    New York University
    Authors
    Josephine Jenks; Josephine Jenks; Kimberly Tarr; Kimberly Tarr
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Time period covered
    Jun 2, 2022
    Description

    This is a technical analysis dataset for cultural heritage materials that are in the collection of New York University Libraries and were examined by the NYU Barbara Goldsmith Preservation & Conservation Department. The materials were examined on June 2, 2022 and are part of the Mick Moloney Irish-American Music and Popular Culture Commercial Recordings Collection held by the NYU Special Collections (AIA.031.001). The dataset includes a conservation report, FTIR (Fourier Transform Infrared) spectra and, if applicable, a standard visible light image of the object. For more information about this object or its FTIR spectra, please contact the Barbara Goldsmith Preservation & Conservation Department at lib-preservation@nyu.edu

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Patrick Egan; Patrick Egan (2024). The Items Dataset [Dataset]. http://doi.org/10.5281/zenodo.10964134
Organization logo

The Items Dataset

Explore at:
Dataset updated
Nov 13, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Patrick Egan; Patrick Egan
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset originally created 03/01/2019 UPDATE: Packaged on 04/18/2019 UPDATE: Edited README on 04/18/2019

I. About this Data Set This data set is a snapshot of work that is ongoing as a collaboration between Kluge Fellow in Digital Studies, Patrick Egan and an intern at the Library of Congress in the American Folklife Center. It contains a combination of metadata from various collections that contain audio recordings of Irish traditional music. The development of this dataset is iterative, and it integrates visualizations that follow the key principles of trust and approachability. The project, entitled, “Connections In Sound” invites you to use and re-use this data.

The text available in the Items dataset is generated from multiple collections of audio material that were discovered at the American Folklife Center. Each instance of a performance was listed and “sets” or medleys of tunes or songs were split into distinct instances in order to allow machines to read each title separately (whilst still noting that they were part of a group of tunes). The work of the intern was then reviewed before publication, and cross-referenced with the tune index at www.irishtune.info. The Items dataset consists of just over 1000 rows, with new data being added daily in a separate file.

The collections dataset contains at least 37 rows of collections that were located by a reference librarian at the American Folklife Center. This search was complemented by searches of the collections by the scholar both on the internet at https://catalog.loc.gov and by using card catalogs.

Updates to these datasets will be announced and published as the project progresses.

II. What’s included? This data set includes:

  • The Items Dataset – a .CSV containing Media Note, OriginalFormat, On Website, Collection Ref, Missing In Duplication, Collection, Outside Link, Performer, Solo/multiple, Sub-item, type of tune, Tune, Position, Location, State, Date, Notes/Composer, Potential Linked Data, Instrument, Additional Notes, Tune Cleanup. This .CSV is the direct export of the Items Google Spreadsheet

III. How Was It Created? These data were created by a Kluge Fellow in Digital Studies and an intern on this program over the course of three months. By listening, transcribing, reviewing, and tagging audio recordings, these scholars improve access and connect sounds in the American Folklife Collections by focusing on Irish traditional music. Once transcribed and tagged, information in these datasets is reviewed before publication.

IV. Data Set Field Descriptions

IV

a) Collections dataset field descriptions

  • ItemId – this is the identifier for the collection that was found at the AFC
  • Viewed – if the collection has been viewed, or accessed in any way by the researchers.
  • On LOC – whether or not there are audio recordings of this collection available on the Library of Congress website.
  • On Other Website – if any of the recordings in this collection are available elsewhere on the internet
  • Original Format – the format that was used during the creation of the recordings that were found within each collection
  • Search – this indicates the type of search that was performed in order that resulted in locating recordings and collections within the AFC
  • Collection – the official title for the collection as noted on the Library of Congress website
  • State – The primary state where recordings from the collection were located
  • Other States – The secondary states where recordings from the collection were located
  • Era / Date – The decade or year associated with each collection
  • Call Number – This is the official reference number that is used to locate the collections, both in the urls used on the Library website, and in the reference search for catalog cards (catalog cards can be searched at this address: https://memory.loc.gov/diglib/ihas/html/afccards/afccards-home.html)
  • Finding Aid Online? – Whether or not a finding aid is available for this collection on the internet

b) Items dataset field descriptions

  • id – the specific identification of the instance of a tune, song or dance within the dataset
  • Media Note – Any information that is included with the original format, such as identification, name of physical item, additional metadata written on the physical item
  • Original Format – The physical format that was used when recording each specific performance. Note: this field is used in order to calculate the number of physical items that were created in each collection such as 32 wax cylinders.
  • On Webste? – Whether or not each instance of a performance is available on the Library of Congress website
  • Collection Ref – The official reference number of the collection
  • Missing In Duplication – This column marks if parts of some recordings had been made available on other websites, but not all of the recordings were included in duplication (see recordings from Philadelphia Céilí Group on Villanova University website)
  • Collection – The official title of the collection given by the American Folklife Center
  • Outside Link – If recordings are available on other websites externally
  • Performer – The name of the contributor(s)
  • Solo/multiple – This field is used to calculate the amount of solo performers vs group performers in each collection
  • Sub-item – In some cases, physical recordings contained extra details, the sub-item column was used to denote these details
  • Type of item – This column describes each individual item type, as noted by performers and collectors
  • Item – The item title, as noted by performers and collectors. If an item was not described, it was entered as “unidentified”
  • Position – The position on the recording (in some cases during playback, audio cassette player counter markers were used)
  • Location – Local address of the recording
  • State – The state where the recording was made
  • Date – The date that the recording was made
  • Notes/Composer – The stated composer or source of the item recorded
  • Potential Linked Data – If items may be linked to other recordings or data, this column was used to provide examples of potential relationships between them
  • Instrument – The instrument(s) that was used during the performance
  • Additional Notes – Notes about the process of capturing, transcribing and tagging recordings (for researcher and intern collaboration purposes)
  • Tune Cleanup – This column was used to tidy each item so that it could be read by machines, but also so that spelling mistakes from the Item column could be corrected, and as an aid to preserving iterations of the editing process

V. Rights statement The text in this data set was created by the researcher and intern and can be used in many different ways under creative commons with attribution. All contributions to Connections In Sound are released into the public domain as they are created. Anyone is free to use and re-use this data set in any way they want, provided reference is given to the creators of these datasets.

VI. Creator and Contributor Information

Creator: Connections In Sound

Contributors: Library of Congress Labs

VII. Contact Information Please direct all questions and comments to Patrick Egan via www.twitter.com/drpatrickegan or via his website at www.patrickegan.org. You can also get in touch with the Library of Congress Labs team via LC-Labs@loc.gov.

Search
Clear search
Close search
Google apps
Main menu