4 datasets found

The Items Dataset
zenodo.org
Updated Nov 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Patrick Egan; Patrick Egan (2024). The Items Dataset [Dataset]. http://doi.org/10.5281/zenodo.10964134
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.10964134
Dataset updated
Nov 13, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Patrick Egan; Patrick Egan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset originally created 03/01/2019 UPDATE: Packaged on 04/18/2019 UPDATE: Edited README on 04/18/2019

I. About this Data Set This data set is a snapshot of work that is ongoing as a collaboration between Kluge Fellow in Digital Studies, Patrick Egan and an intern at the Library of Congress in the American Folklife Center. It contains a combination of metadata from various collections that contain audio recordings of Irish traditional music. The development of this dataset is iterative, and it integrates visualizations that follow the key principles of trust and approachability. The project, entitled, “Connections In Sound” invites you to use and re-use this data.

The text available in the Items dataset is generated from multiple collections of audio material that were discovered at the American Folklife Center. Each instance of a performance was listed and “sets” or medleys of tunes or songs were split into distinct instances in order to allow machines to read each title separately (whilst still noting that they were part of a group of tunes). The work of the intern was then reviewed before publication, and cross-referenced with the tune index at www.irishtune.info. The Items dataset consists of just over 1000 rows, with new data being added daily in a separate file.

The collections dataset contains at least 37 rows of collections that were located by a reference librarian at the American Folklife Center. This search was complemented by searches of the collections by the scholar both on the internet at https://catalog.loc.gov and by using card catalogs.

Updates to these datasets will be announced and published as the project progresses.

II. What’s included? This data set includes:

The Items Dataset – a .CSV containing Media Note, OriginalFormat, On Website, Collection Ref, Missing In Duplication, Collection, Outside Link, Performer, Solo/multiple, Sub-item, type of tune, Tune, Position, Location, State, Date, Notes/Composer, Potential Linked Data, Instrument, Additional Notes, Tune Cleanup. This .CSV is the direct export of the Items Google Spreadsheet

III. How Was It Created? These data were created by a Kluge Fellow in Digital Studies and an intern on this program over the course of three months. By listening, transcribing, reviewing, and tagging audio recordings, these scholars improve access and connect sounds in the American Folklife Collections by focusing on Irish traditional music. Once transcribed and tagged, information in these datasets is reviewed before publication.

IV. Data Set Field Descriptions

IV

a) Collections dataset field descriptions

ItemId – this is the identifier for the collection that was found at the AFC

Viewed – if the collection has been viewed, or accessed in any way by the researchers.

On LOC – whether or not there are audio recordings of this collection available on the Library of Congress website.

On Other Website – if any of the recordings in this collection are available elsewhere on the internet

Original Format – the format that was used during the creation of the recordings that were found within each collection

Search – this indicates the type of search that was performed in order that resulted in locating recordings and collections within the AFC

Collection – the official title for the collection as noted on the Library of Congress website

State – The primary state where recordings from the collection were located

Other States – The secondary states where recordings from the collection were located

Era / Date – The decade or year associated with each collection

Call Number – This is the official reference number that is used to locate the collections, both in the urls used on the Library website, and in the reference search for catalog cards (catalog cards can be searched at this address: https://memory.loc.gov/diglib/ihas/html/afccards/afccards-home.html)

Finding Aid Online? – Whether or not a finding aid is available for this collection on the internet

b) Items dataset field descriptions

id – the specific identification of the instance of a tune, song or dance within the dataset

Media Note – Any information that is included with the original format, such as identification, name of physical item, additional metadata written on the physical item

Original Format – The physical format that was used when recording each specific performance. Note: this field is used in order to calculate the number of physical items that were created in each collection such as 32 wax cylinders.

On Webste? – Whether or not each instance of a performance is available on the Library of Congress website

Collection Ref – The official reference number of the collection

Missing In Duplication – This column marks if parts of some recordings had been made available on other websites, but not all of the recordings were included in duplication (see recordings from Philadelphia Céilí Group on Villanova University website)

Collection – The official title of the collection given by the American Folklife Center

Outside Link – If recordings are available on other websites externally

Performer – The name of the contributor(s)

Solo/multiple – This field is used to calculate the amount of solo performers vs group performers in each collection

Sub-item – In some cases, physical recordings contained extra details, the sub-item column was used to denote these details

Type of item – This column describes each individual item type, as noted by performers and collectors

Item – The item title, as noted by performers and collectors. If an item was not described, it was entered as “unidentified”

Position – The position on the recording (in some cases during playback, audio cassette player counter markers were used)

Location – Local address of the recording

State – The state where the recording was made

Date – The date that the recording was made

Notes/Composer – The stated composer or source of the item recorded

Potential Linked Data – If items may be linked to other recordings or data, this column was used to provide examples of potential relationships between them

Instrument – The instrument(s) that was used during the performance

Additional Notes – Notes about the process of capturing, transcribing and tagging recordings (for researcher and intern collaboration purposes)

Tune Cleanup – This column was used to tidy each item so that it could be read by machines, but also so that spelling mistakes from the Item column could be corrected, and as an aid to preserving iterations of the editing process

V. Rights statement The text in this data set was created by the researcher and intern and can be used in many different ways under creative commons with attribution. All contributions to Connections In Sound are released into the public domain as they are created. Anyone is free to use and re-use this data set in any way they want, provided reference is given to the creators of these datasets.

VI. Creator and Contributor Information

Creator: Connections In Sound

Contributors: Library of Congress Labs

VII. Contact Information Please direct all questions and comments to Patrick Egan via www.twitter.com/drpatrickegan or via his website at www.patrickegan.org. You can also get in touch with the Library of Congress Labs team via LC-Labs@loc.gov.
K
A Survey of Irish Writers in The New Yorker, 1940-1980
rdr.kuleuven.be
csv, txt
Updated Sep 27, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yen-Chi Wu; Yen-Chi Wu (2023). A Survey of Irish Writers in The New Yorker, 1940-1980 [Dataset]. http://doi.org/10.48804/P3WWQR
Explore at:
csv(14395), txt(853)Available download formats
Unique identifier
https://doi.org/10.48804/P3WWQR
Dataset updated
Sep 27, 2023
Dataset provided by
KU Leuven RDR
Authors
Yen-Chi Wu; Yen-Chi Wu
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Area covered
Ireland
Dataset funded by
European Commission
Description
This dataset provides a survey of Irish writers publishing in The New Yorker magazine from 1940 to 1980. Methodology I conduct the survey through archival research and secondary references. The primary sources are The New Yorker's digital archive and the New Yorker Records housed in the New York Public Library. Parameters The timeframe of the survey concerns The New Yorker’s international expansion in the middle decades of the twentieth century. It starts from 1940 and ends in 1980, when the magazine industry’s cultural impact was eclipsed by the popularity of TV. For the purpose of the project, I focus on "fiction" contributions. Verse and shorter writings (such as column pieces and book reviews) are not included. Therefore, Maeve Brennan's shorter pieces under her alias "the long-winded lady" and Patricia Collinge's shorter contributions are not included in the quantatative survey. This survey includes both Irish and Irish-American writers. One key criterium of the selection is the writer’s connection with Ireland and Irish culture. Irish-American writers whose works are more concerned about (Irish-)America rather than Ireland itself are excluded from the survey. Therefore, Elizabeth Cullinan and J.P. Donleavy are included, while John O’Hara and Mary McCarthy are not. Notes for Users The list is presented in the chronological order of the contributions’ appearance in the magazine. The date format follows the international convention (ISO8601), thus: year/month/day. The date refers to the publication of The New Yorker issues. The New Yorker is a weekly, and the timeframe of the project spans four decades. This means that there are thousands of back issues under examination. I acknowledge the possibility that there are Irish writers whose contributions in the magazine escaped my attention. If there is any omission, I would appreciate the user’s input to update the survey. It is hoped that this survey will help researchers investigate the Irish connections with one of America’s most influential publications. Teachers, students, and the general public may also use this list as a guide to better appreciate these fascinating Irish stories.
A transnational newspaper dataset covering Spenceanism
zenodo.org
csv
Updated Nov 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anselm Küsters; Anselm Küsters; Matilde Cazzola; Matilde Cazzola (2023). A transnational newspaper dataset covering Spenceanism [Dataset]. http://doi.org/10.5281/zenodo.7696185
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7696185
Dataset updated
Nov 30, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Anselm Küsters; Anselm Küsters; Matilde Cazzola; Matilde Cazzola
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
To analyse the media legacy of English radical thinker Thomas Spence (1750–1814), his revolutionary "Plan", and his disciples (the "Spencean Philanthropists") in global press circuits, a dataset consisting of 275 Spencean-related articles in newspapers from Ireland (106 articles), the British West Indies (35), British India (29), the Australian colonies (4), Canada (1), and the United States (100) was created. The corpus consists of either the full text of each article or relevant extracts, as well as additional metadata such as source, date, title (if applicable), keyword, and region. In particular, the following databases have been relied upon: the Irish Newspaper Archives for Ireland, the Caribbean Newspapers: Digital Library of the Caribbean and Caribbean Newspapers 1718–1876 for the British West Indies, Newspapers & Gazettes – Trove for Australia, America's Historical Newspapers and Chronicling America: Historic American Newspapers for the US, Newspapers.com by Ancestry for Canada, Ireland, and the US, and the British Newspaper Archive for British India, Ireland, and the Caribbean. These databases were searched using the following keywords: "Thomas Spence" [1750–1814], "Spence's Plan", "Spencean" / "Spenceans", and "Spenceanism"; "swinish multitude", "people's farm", and "pigs' meat" were also used. All databases checked for Australia, the Caribbean, India, and Canada have been exhausted, and many Irish and US-American articles have also been grabbed. Due to the low scan quality of databases, most articles needed to be copied and corrected manually. The 275 articles of the corpus consist of 167,515 tokens. In addition, 157 articles on Spence and the Spenceans from British newspapers have been downloaded, too, and were added to this dataset for qualitative analysis and comparative purposes. Overall, the dataset made available here thus consists of 432 articles. The results from analysing this corpus are shown and discussed in a paper entitled "Transnational Echoes of Spenceanism: A Text Mining Exploration in English-Language Newspapers (1790–1850)", which is accepted for publication in the International Review of Social History (IRSH).
n
Mick Moloney Irish-American Music and Popular Culture Commercial Recordings...
ultraviolet.library.nyu.edu
bin, pdf
Updated Apr 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Josephine Jenks; Josephine Jenks; Kimberly Tarr; Kimberly Tarr (2025). Mick Moloney Irish-American Music and Popular Culture Commercial Recordings Collection FTIR dataset (AIA.031.001, Conservation ID 22_060) [Dataset]. http://doi.org/10.58153/fbspx-gaw98
Explore at:
bin, pdfAvailable download formats
Unique identifier
https://doi.org/10.58153/fbspx-gaw98
Dataset updated
Apr 25, 2025
Dataset provided by
New York University
Authors
Josephine Jenks; Josephine Jenks; Kimberly Tarr; Kimberly Tarr
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Time period covered
Jun 2, 2022
Description
This is a technical analysis dataset for cultural heritage materials that are in the collection of New York University Libraries and were examined by the NYU Barbara Goldsmith Preservation & Conservation Department. The materials were examined on June 2, 2022 and are part of the Mick Moloney Irish-American Music and Popular Culture Commercial Recordings Collection held by the NYU Special Collections (AIA.031.001). The dataset includes a conservation report, FTIR (Fourier Transform Infrared) spectra and, if applicable, a standard visible light image of the object. For more information about this object or its FTIR spectra, please contact the Barbara Goldsmith Preservation & Conservation Department at lib-preservation@nyu.edu
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Patrick Egan; Patrick Egan (2024). The Items Dataset [Dataset]. http://doi.org/10.5281/zenodo.10964134

The Items Dataset

Explore at:

Unique identifier

https://doi.org/10.5281/zenodo.10964134

Dataset updated

Nov 13, 2024

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Patrick Egan; Patrick Egan

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset originally created 03/01/2019 UPDATE: Packaged on 04/18/2019 UPDATE: Edited README on 04/18/2019

I. About this Data Set This data set is a snapshot of work that is ongoing as a collaboration between Kluge Fellow in Digital Studies, Patrick Egan and an intern at the Library of Congress in the American Folklife Center. It contains a combination of metadata from various collections that contain audio recordings of Irish traditional music. The development of this dataset is iterative, and it integrates visualizations that follow the key principles of trust and approachability. The project, entitled, “Connections In Sound” invites you to use and re-use this data.

The text available in the Items dataset is generated from multiple collections of audio material that were discovered at the American Folklife Center. Each instance of a performance was listed and “sets” or medleys of tunes or songs were split into distinct instances in order to allow machines to read each title separately (whilst still noting that they were part of a group of tunes). The work of the intern was then reviewed before publication, and cross-referenced with the tune index at www.irishtune.info. The Items dataset consists of just over 1000 rows, with new data being added daily in a separate file.

The collections dataset contains at least 37 rows of collections that were located by a reference librarian at the American Folklife Center. This search was complemented by searches of the collections by the scholar both on the internet at https://catalog.loc.gov and by using card catalogs.

Updates to these datasets will be announced and published as the project progresses.

II. What’s included? This data set includes:

The Items Dataset – a .CSV containing Media Note, OriginalFormat, On Website, Collection Ref, Missing In Duplication, Collection, Outside Link, Performer, Solo/multiple, Sub-item, type of tune, Tune, Position, Location, State, Date, Notes/Composer, Potential Linked Data, Instrument, Additional Notes, Tune Cleanup. This .CSV is the direct export of the Items Google Spreadsheet

III. How Was It Created? These data were created by a Kluge Fellow in Digital Studies and an intern on this program over the course of three months. By listening, transcribing, reviewing, and tagging audio recordings, these scholars improve access and connect sounds in the American Folklife Collections by focusing on Irish traditional music. Once transcribed and tagged, information in these datasets is reviewed before publication.

IV. Data Set Field Descriptions

a) Collections dataset field descriptions

ItemId – this is the identifier for the collection that was found at the AFC
Viewed – if the collection has been viewed, or accessed in any way by the researchers.
On LOC – whether or not there are audio recordings of this collection available on the Library of Congress website.
On Other Website – if any of the recordings in this collection are available elsewhere on the internet
Original Format – the format that was used during the creation of the recordings that were found within each collection
Search – this indicates the type of search that was performed in order that resulted in locating recordings and collections within the AFC
Collection – the official title for the collection as noted on the Library of Congress website
State – The primary state where recordings from the collection were located
Other States – The secondary states where recordings from the collection were located
Era / Date – The decade or year associated with each collection
Call Number – This is the official reference number that is used to locate the collections, both in the urls used on the Library website, and in the reference search for catalog cards (catalog cards can be searched at this address: https://memory.loc.gov/diglib/ihas/html/afccards/afccards-home.html)
Finding Aid Online? – Whether or not a finding aid is available for this collection on the internet

b) Items dataset field descriptions

id – the specific identification of the instance of a tune, song or dance within the dataset
Media Note – Any information that is included with the original format, such as identification, name of physical item, additional metadata written on the physical item
Original Format – The physical format that was used when recording each specific performance. Note: this field is used in order to calculate the number of physical items that were created in each collection such as 32 wax cylinders.
On Webste? – Whether or not each instance of a performance is available on the Library of Congress website
Collection Ref – The official reference number of the collection
Missing In Duplication – This column marks if parts of some recordings had been made available on other websites, but not all of the recordings were included in duplication (see recordings from Philadelphia Céilí Group on Villanova University website)
Collection – The official title of the collection given by the American Folklife Center
Outside Link – If recordings are available on other websites externally
Performer – The name of the contributor(s)
Solo/multiple – This field is used to calculate the amount of solo performers vs group performers in each collection
Sub-item – In some cases, physical recordings contained extra details, the sub-item column was used to denote these details
Type of item – This column describes each individual item type, as noted by performers and collectors
Item – The item title, as noted by performers and collectors. If an item was not described, it was entered as “unidentified”
Position – The position on the recording (in some cases during playback, audio cassette player counter markers were used)
Location – Local address of the recording
State – The state where the recording was made
Date – The date that the recording was made
Notes/Composer – The stated composer or source of the item recorded
Potential Linked Data – If items may be linked to other recordings or data, this column was used to provide examples of potential relationships between them
Instrument – The instrument(s) that was used during the performance
Additional Notes – Notes about the process of capturing, transcribing and tagging recordings (for researcher and intern collaboration purposes)
Tune Cleanup – This column was used to tidy each item so that it could be read by machines, but also so that spelling mistakes from the Item column could be corrected, and as an aid to preserving iterations of the editing process

V. Rights statement The text in this data set was created by the researcher and intern and can be used in many different ways under creative commons with attribution. All contributions to Connections In Sound are released into the public domain as they are created. Anyone is free to use and re-use this data set in any way they want, provided reference is given to the creators of these datasets.

VI. Creator and Contributor Information

Creator: Connections In Sound

Contributors: Library of Congress Labs

VII. Contact Information Please direct all questions and comments to Patrick Egan via www.twitter.com/drpatrickegan or via his website at www.patrickegan.org. You can also get in touch with the Library of Congress Labs team via LC-Labs@loc.gov.

Clear search

Close search

Google apps

Main menu

The Items Dataset

A Survey of Irish Writers in The New Yorker, 1940-1980

A transnational newspaper dataset covering Spenceanism

Mick Moloney Irish-American Music and Popular Culture Commercial Recordings...

The Items Dataset