58 datasets found

P
WikiBio Dataset
paperswithcode.com
Updated Nov 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Remi Lebret; David Grangier; Michael Auli (2021). WikiBio Dataset [Dataset]. https://paperswithcode.com/dataset/wikibio
Explore at:
Dataset updated
Nov 16, 2021
Authors
Remi Lebret; David Grangier; Michael Auli
Description
This dataset gathers 728,321 biographies from English Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).
H
Data from: Pantheon 1.0, A Manually Verified Dataset of Globally Famous...
dataverse.harvard.edu
Updated Jan 4, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2016). Pantheon 1.0, A Manually Verified Dataset of Globally Famous Biographies [Dataset]. http://doi.org/10.7910/DVN/28201
Explore at:
tsv(2176393), text/plain; charset=utf-8(13938718), text/plain; charset=us-ascii(149252802)Available download formats
Unique identifier
https://doi.org/10.7910/DVN/28201
Dataset updated
Jan 4, 2016
Dataset provided by
Harvard Dataverse
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
We present the Pantheon 1.0 dataset: a manually verified dataset of individuals that have transcended linguistic, temporal, and geographic boundaries. The Pantheon 1.0 dataset includes the 11,341 biographies present in more than 25 languages in Wikipedia and is enriched with: (i) manually verified demographic information (place and date of birth, gender) (ii) a taxonomy of occupations classifying each biography at three levels of aggregation and (iii) two measures of global popularity including the number of languages in which a biography is present in Wikipedia (L), and the Historical Popularity Index (HPI) a metric that combines information on L, time since birth, and page-views (2008-2013). We compare the Pantheon 1.0 dataset to data from the 2003 book, Human Accomplishments, and also to external measures of accomplishment in individual games and sports: Tennis, Swimming, Car Racing, and Chess. In all of these cases we find that measures of popularity (L and HPI) correlate highly with individual accomplishment, suggesting that measures of global popularity proxy the historical impact of individuals.
Data from: Member biographies
gov.uk
s3.amazonaws.com
Updated Oct 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Biometrics and Forensics Ethics Group (2024). Member biographies [Dataset]. https://www.gov.uk/government/publications/member-biographies
Explore at:
Dataset updated
Oct 11, 2024
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Biometrics and Forensics Ethics Group
Description
Full biographies of the members of the Biometrics and Forensics Ethics Group.
f
Database_biographies of the Sverdlovsk oblast officials.xlsx
figshare.com
xlsx
Updated Dec 1, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kirill Melnikov (2020). Database_biographies of the Sverdlovsk oblast officials.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.13313045.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.13313045.v1
Dataset updated
Dec 1, 2020
Dataset provided by
figshare
Authors
Kirill Melnikov
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Sverdlovsk Oblast
Description
This dataset contains the biographies of the Sverdlovsk Oblast officials (2004-2005; 2019-2020)
f
Data from: Short fictional biography. Posibility of a reader's literary...
scielo.figshare.com
figshare.com
jpeg
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rafael Andugar Sousa (2023). Short fictional biography. Posibility of a reader's literary genre [Dataset]. http://doi.org/10.6084/m9.figshare.7101119.v1
Explore at:
jpegAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7101119.v1
Dataset updated
Jun 3, 2023
Dataset provided by
SciELO journals
Authors
Rafael Andugar Sousa
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract: In this article, we begin from the theoretical implications vislumbrated by J. M. Schaeffer with the porpose to create a new literary genre based in the task of the reader to compare diverse literary works which maybe don't belong to the same tradition. The object of our interest is the existence of tales and narrations of biographies which are invented by an author interested in real historical characters (or even also invented). To explore the limits of the genre is necessary to know deeply the field of biografphy and the relations with literary writing and the relations with historiographical discourse too.
f
Biographies of literature writers written in English language
figshare.com
application/gzip
Updated Mar 17, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Javier Gomez; Cesar Alfaro; Felipe Ortega; Javier M. Moguerza; Maria Jesus Algar; Raul Moreno (2023). Biographies of literature writers written in English language [Dataset]. http://doi.org/10.6084/m9.figshare.13551467.v4
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.13551467.v4
Dataset updated
Mar 17, 2023
Dataset provided by
figshare
Authors
Javier Gomez; Cesar Alfaro; Felipe Ortega; Javier M. Moguerza; Maria Jesus Algar; Raul Moreno
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains 1000 biographies of literature writers retrieved from the english version of Wikipedia. There is a total of 500 biographies of women writers extracted from the category entitled “19th-century_women_writers” (https://en.wikipedia.org/wiki/Category:19th-century_women_writers) and 500 male biographies extracted from the category “19th-century_male_writers” (https://en.wikipedia.org/wiki/Category:19th-century_male_writers)
h
bio-mcp-data
huggingface.co
Updated Jun 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Longevity Genie (2025). bio-mcp-data [Dataset]. https://huggingface.co/datasets/longevity-genie/bio-mcp-data
Explore at:
Dataset updated
Jun 16, 2025
Dataset authored and provided by
Longevity Genie
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Bio-MCP-Data

A repository containing biological datasets that will be used by BIO-MCP MCP (Model Context Protocol) standard.

About

This repository hosts biological data assets formatted to be compatible with the Model Context Protocol, enabling AI models to efficiently access and process biological information. The data is managed using Git Large File Storage (LFS) to handle large biological datasets.

Purpose

Provide standardized biological datasets for AI… See the full description on the dataset page: https://huggingface.co/datasets/longevity-genie/bio-mcp-data.
Biography wear corp Import Company US
seair.co.in
Updated Nov 5, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Seair Exim (2017). Biography wear corp Import Company US [Dataset]. https://www.seair.co.in
Explore at:
.bin, .xml, .csv, .xlsAvailable download formats
Dataset updated
Nov 5, 2017
Dataset provided by
Seair Exim Solutions
Authors
Seair Exim
Area covered
United States
Description
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
m
ZH-preview Dataset
data.mendeley.com
Updated Jun 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
曾昊 (2024). ZH-preview Dataset [Dataset]. http://doi.org/10.17632/nx8hknrgfz.1
Explore at:
Unique identifier
https://doi.org/10.17632/nx8hknrgfz.1
Dataset updated
Jun 27, 2024
Authors
曾昊
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Preview Dataset for editorial evaluation and review.
R
Big Data V3 No Bio Dataset
universe.roboflow.com
zip
Updated Jun 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
graduationproject (2023). Big Data V3 No Bio Dataset [Dataset]. https://universe.roboflow.com/graduationproject-aqm0w/big-data-v3-no-bio
Explore at:
zipAvailable download formats
Dataset updated
Jun 6, 2023
Dataset authored and provided by
graduationproject
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Trash Bounding Boxes
Description
Big Data V3 No Bio

## Overview Big Data V3 No Bio is a dataset for object detection tasks - it contains Trash annotations for 8,825 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
APIS Dataset artists
zenodo.org
data.niaid.nih.gov
bin
Updated Jun 2, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maximilian Kaiser; Maximilian Kaiser (2020). APIS Dataset artists [Dataset]. http://doi.org/10.5281/zenodo.3865451
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3865451
Dataset updated
Jun 2, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Maximilian Kaiser; Maximilian Kaiser
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains biographical data produced in course of the digital humanities project “Mapping historical networks: Building the new Austrian Prosopographical/Biographical Information System (APIS)” at the Austrian Academy of Sciences. It was funded by the Austrian National Fonds for Research, Technology and Development. The biographies were manually annotated by the author via a web application (apis.acdh.oewa.ac.at) which was developed at the Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH).

The starting point of the dataset (cl Kuenstlerhaus) were 506 annotated artists’ biographies from the Austrian Biographical Encyclopaedia 1815–1950 (ÖBL). For these persons, the membership in the Association of Fine Artists Vienna (Genossenschaft der bildenden Künstler Wiens) was confirmed by the comparison of the yearly published membership lists with the lemmas of the ÖBL. The data were collected primarily to enable a) statistics b) historical network analyses and c) cartographic analyses.

The data is provided as graphml files:

relations between persons (kinship, pupil/teacher)
cl_kuenstlerhaus_person-person_v1-01

relations between persons and institutions (education, career, social networks)
cl_kuenstlerhaus_person-institution_v1-01

relations between persons and places (mobility)
cl_kuenstlerhaus_person-place_v1-01

The datset was last reviewed in January 2020.
c
Data from: Destined for Success? Educational Biographies of Academically...
datacatalogue.cessda.eu
beta.ukdataservice.ac.uk
Updated Nov 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Power, S., University of London, Institute of Education; Whitty, G., University of London, Institute of Education; Edwards, T., University of Newcastle upon Tyne (2024). Destined for Success? Educational Biographies of Academically Able Pupils, 1981-1997 [Dataset]. http://doi.org/10.5255/UKDA-SN-3827-1
Explore at:
Unique identifier
https://doi.org/10.5255/UKDA-SN-3827-1
Dataset updated
Nov 28, 2024
Dataset provided by
Policy Studies
School of Education
Authors
Power, S., University of London, Institute of Education; Whitty, G., University of London, Institute of Education; Edwards, T., University of Newcastle upon Tyne
Time period covered
Jan 1, 1995 - Jan 1, 1997
Area covered
England
Variables measured
Individuals, National, Young people
Measurement technique
Face-to-face interview, Telephone interview, Postal survey, Self-completion
Description
Abstract copyright UK Data Service and data collection copyright owner.

This is a mixed methods data collection.

This project made use of a sample drawn for an earlier research project to explore the different ways in which 'academically able' students attending different types of secondary school at age 11 in the mid 1980s realised and experienced their subsequent educational and career opportunities. It involved four groups of academically able pupils: assisted place holders in independent schools, full fee paying pupils in the same schools, pupils at maintained grammar schools and those attending comprehensive schools. The findings provide important insights into the experiences, qualifications, attitudes and values of new recruits to middle class occupations in the 1990s.

The broad aim of Destined for Success? Educational Biographies of Academically Able Pupils, 1981-1997 was to explore the different ways in which academically able students realise and experience educational opportunities. The study had the following specific objectives:
to compare the dimensions and directions along which different forms of schooling and sponsorship had impacted upon the educational careers of 'academically able' students
to investigate the extent to which students had been able to translate their educational promise at age 11 into subsequent school achievements, further educational opportunities and occupational locations
to explore the ways in which their experiences have resulted in the continuity or transformation of social identities in terms of family, friendship or work
The research was conducted by means of a postal survey and semi-structured interviews. A sample of questionnaire respondents was selected for interview to ensure that all sectors, schools and modes of sponsorship were represented.

A follow-up to this study is available under SN 6501 - Success Sustained? A Follow-up Survey of the 'Destined for Success' Cohort, 2004. This quantitative study revisits the respondents in their early thirties.

Further information is available from the Destined for Success? Educational Biographies of Academically Able Pupils ESRC Award web page.

For the second edition (May 2011), transcripts of qualitative interviews conducted with 34 of the original respondents were added to the quantitative data, making the study a mixed methods data collection.

Main Topics:

The following topics are covered: education; school types; academic ability; school achievements; higher education; transition from school to work and subsequent careers; social identities; basic socio-economic indicators; cultural and political dispositions.
Data from: A short biography of Hubert Ludwig and a note on the publication...
search.datacite.org
gbif.org
Updated Apr 27, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Plazi (2016). A short biography of Hubert Ludwig and a note on the publication dates of his monograph Die Seewalzen (1889 – 1892) [Dataset]. http://doi.org/10.15468/qf39mc
Explore at:
Unique identifier
https://doi.org/10.15468/qf39mc
Dataset updated
Apr 27, 2016
Dataset provided by
DataCitehttps://www.datacite.org/
Plazi.org taxonomic treatments database
Authors
Plazi
Description
This dataset contains the digitized treatments in Plazi based on the original journal article Reich, Mike (2015): A short biography of Hubert Ludwig and a note on the publication dates of his monograph Die Seewalzen (1889 – 1892). Zootaxa 4052 (2): 332-344, DOI: http://dx.doi.org/10.11646/zootaxa.4052.3.3
m
phbdataset
data.mendeley.com
Updated Jul 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Phillip Dangaiso (2023). phbdataset [Dataset]. http://doi.org/10.17632/5pyf6bm36g.1
Explore at:
Unique identifier
https://doi.org/10.17632/5pyf6bm36g.1
Dataset updated
Jul 13, 2023
Authors
Phillip Dangaiso
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
data collected from rural communities in Zimbabwe to evaluate preventive health behavior based on the health belief model.
Topic Model for English Wikipedia's Biographies with list of all 1.8M...
zenodo.org
data.niaid.nih.gov
bin, csv, txt, zip
Updated Jan 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Mandiberg; Michael Mandiberg; Danara Sarıoğlu; Danara Sarıoğlu (2023). Topic Model for English Wikipedia's Biographies with list of all 1.8M articles linked to Wikidata [Dataset]. http://doi.org/10.5281/zenodo.5747336
Explore at:
zip, bin, csv, txtAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5747336
Dataset updated
Jan 28, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Michael Mandiberg; Michael Mandiberg; Danara Sarıoğlu; Danara Sarıoğlu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
A Genism LDA Topic Model of English Wikipedia biographical articles with list of all 1.8M articles, and some associated Wikidata information

The model has 150 Topics.

This model was developed in the process of isolating a set of visual arts biographical articles, as described in "Clowns in the Visual Artists: Topic Modeling Wikipedia and Wikidata" in the Spring 2022 issue of Art Documentation - https://doi.org/10.1086/719999

Because names, nationalities, and birthdays are so prominent in biographies, the stopwords list removed 170,000 names, surnames, city names, place names, countries, days, months and other time related words (https://github.com/mandiberg/Names-Surnames-and-Countries-for-Stopwords). We also directly removed each article subject’s given and surname, which were almost always the most frequently occurring words in any given article. Otherwise, the model just produced topics based on nationality, and common names and surnames.

Files:

all_enwiki_bios_from_wikidata.csv
The list of all Wikidata items for humans with an enwiki page (e.g biographical article) was extracted from Wikidata JSON dump; list includes gender, occupation, and nationality. This was joined with the converted plaintext from an English Wikipedia dump. This data was downloaded in March 2021.

Wikipedia Biographies LDA Topic Model human readable summary.csv
A human readable file with the 150 topics ranked by count of articles per topic from the 1.8M corpus. The most popular topics have categorical descriptions of the occupations of each cluster. Some are marked as not an occupation cluster.

BoW_corpus.mm*
model_lda_full_Sep2_150Tv2*
These six files comprise the topic model. The code to load them is present in the python files.

dict_full_Aug-28-2021
processed_docs_full_Aug-28-2021.txt
processed_docs_1000_Aug-18-2021.txt
These are the dictionary and processed corpuses required to build and implement the model using this code. The corpus with the first 1000 items is meant to be used for testing, as the full one is quite large and takes a long time to complete.

topic-model-wikipedia-sept2021.zip
The code and settings used for creating and implementing this model are included in this zip and are also available here: https://github.com/mandiberg/topic-model-wikipedia

All-Wikipedia-Biographies-with-topic1.csv
All-Wikipedia-Biographies-with-topic1and2.csv
These are the list of 1.8M biographies matched to topics. The "topic1" file just includes the first topic, this is a slightly larger list. The "topic1and2" file is slightly smaller because about 2% articles do not match to a second topic.

Analysis-for-Clowns-Visual-Arts.zip
These are the raw data and final data produced for the "Clowns in the Visual Artists." Please see the article for context.
w
Data from: Biographical Directory of the United States Congress
data.wu.ac.at
api/sparql +2
Updated Oct 10, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DataFAQs (2013). Biographical Directory of the United States Congress [Dataset]. https://data.wu.ac.at/schema/datahub_io/NDM5Y2EzMTYtZjJhMS00NzdkLTk5N2UtODg0MTBmZTM1MjE2
Explore at:
api/sparql, example/turtle, meta/void(60.0)Available download formats
Dataset updated
Oct 10, 2013
Dataset provided by
DataFAQs
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Names, positions, state, party, and congress number of members of US Congress 1774-present.

Scraped from http://bioguide.congress.gov/biosearch/biosearch.asp by https://scraperwiki.com/scrapers/biographical_directory_usc/#
H
U.S. District Court Judges Merge File
dataverse.harvard.edu
Updated Aug 16, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maya Sen (2011). U.S. District Court Judges Merge File [Dataset]. http://doi.org/10.7910/DVN/J1A6RW
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/J1A6RW
Dataset updated
Aug 16, 2011
Dataset provided by
Harvard Dataverse
Authors
Maya Sen
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
United States
Description
This merge file can be used to combine detailed biographical data on U.S. District Court Judges from the Federal Judicial Center (http://www.fjc.gov/history/home.nsf/page/export.html) with data on cases form the U.S. Court of Appeals Database Project (http://www.wmich.edu/nsf-coa/). The file includes the unique identifiers used by each group to make it easy for researchers to combine the two data sources together. Note that this is a merge file for U.S. District Court Judges only.
P
BiasBios Dataset
paperswithcode.com
Updated Jan 26, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maria De-Arteaga; Alexey Romanov; Hanna Wallach; Jennifer Chayes; Christian Borgs; Alexandra Chouldechova; Sahin Geyik; Krishnaram Kenthapadi; Adam Tauman Kalai (2019). BiasBios Dataset [Dataset]. https://paperswithcode.com/dataset/biasbios
Explore at:
Dataset updated
Jan 26, 2019
Authors
Maria De-Arteaga; Alexey Romanov; Hanna Wallach; Jennifer Chayes; Christian Borgs; Alexandra Chouldechova; Sahin Geyik; Krishnaram Kenthapadi; Adam Tauman Kalai
Description
The purpose of this dataset was to study gender bias in occupations. Online biographies, written in English, were collected to find the names, pronouns, and occupations. Twenty-eight most frequent occupations were identified based on their appearances. The resulting dataset consists of 397,340 biographies spanning twenty-eight different occupations. Of these occupations, the professor is the most frequent, with 118,400 biographies, while the rapper is the least frequent, with 1,406 biographies. Important information about the biographies: 1. The longest biography is 194 tokens, while the shortest is eighteen; the median biography length is seventy-two tokens. 2. It should be noted that the demographics of online biographies’ subjects differ from those of the overall workforce and that this dataset does not contain all biographies on the Internet.
Z
Early Members of the Leopoldina (1652-1818): Biographical Data
data.niaid.nih.gov
zenodo.org
Updated Oct 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Münnich, Fanny (2024). Early Members of the Leopoldina (1652-1818): Biographical Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13818617
Explore at:
Dataset updated
Oct 1, 2024
Dataset provided by
Schilling, Jacob
Splinter, Susan
Münnich, Fanny
Gassner, Sebastian
Rehbein, Malte
Doppler, Tobias
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This data set encompasses a collection of detailed biographical data about 850 of the first members of the German National Academy of Sciences Leopoldina (Deutsche Akademie der Naturforscher Leopoldina – Nationale Akademie der Wissenschaften) from 1652 to 1818. The data includes information about the members themselves, their family, their membership in the Leopoldina, academic and professional positions held, as well as works, portraits, and associated sources.
B
Data from: Yellow Nineties 2.0
borealisdata.ca
search.dataone.org
+1more
Updated May 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lorraine Janzen Kooistra; MJ Suhonos; Alison F. Hedley; Reg Beatty; Marion Tempest Grant; Linked Infrastructure for Networked Cultural Scholarship (LINCS) (2025). Yellow Nineties 2.0 [Dataset]. http://doi.org/10.5683/SP3/2FTQXM
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.5683/SP3/2FTQXM
Dataset updated
May 31, 2025
Dataset provided by
Borealis
Authors
Lorraine Janzen Kooistra; MJ Suhonos; Alison F. Hedley; Reg Beatty; Marion Tempest Grant; Linked Infrastructure for Networked Cultural Scholarship (LINCS)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
1889 - 1905
Description
Yellow Nineties 2.0 uses digital tools to advance knowledge of eight late-Victorian little magazines and the people who contributed to their production between 1889 and 1905: Pagan Review (1 volume, 1892) Yellow Book (13 volumes, 1894–1897) The Dial (5 volumes, 1889–1897) The Evergreen: A Northern Seasonal (4 volumes, 1895–1897) The Green Sheaf (13 issues, 1903–1904) The Pageant (2 volumes, 1896–1897) The Savoy (2 quarterly and 6 monthly issues, 1896) The Venture: An Annual of Art and Literature (2 volumes, 1903 and 1905) The data document the communities of production responsible for these little magazines, particularly by recovering the social networks of and biographical information about women and marginalized persons in those communities. The dataset enables users to query, visualize, and analyze the relationships, connections, and social networks of magazine contributors. The Yellow Nineties project site (https://1890s.ca) includes two biographical tools, one discursive and the other data-driven. Essays on the life and work of a select group of magazine contributors are available in Y90s Biographies. Biographical data for all magazine contributors are available in the Y90s Personography (https://personography.1890s.ca). The data has been transformed into Linked Open Data via the LINCS conversion toolkit of the the Linked Infrastructure for Networked Cultural Scholarship (LINCS) project. The data is assembled as a single text file in text/turtle (.ttl) and contains descriptive metadata that has been reconciled into triples using established linked data vocabularies. The Yellow Nineties 2.0 has been supported by funding from SSHRC.

Facebook

Twitter

Click to copy link

Link copied

Cite

Remi Lebret; David Grangier; Michael Auli (2021). WikiBio Dataset [Dataset]. https://paperswithcode.com/dataset/wikibio

WikiBio Dataset

Wikipedia Biography Dataset

Explore at:

402 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Nov 16, 2021

Authors

Remi Lebret; David Grangier; Michael Auli

Description

This dataset gathers 728,321 biographies from English Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).

Clear search

Close search

Google apps

Main menu

WikiBio Dataset

Data from: Pantheon 1.0, A Manually Verified Dataset of Globally Famous...

Data from: Member biographies

Database_biographies of the Sverdlovsk oblast officials.xlsx

Data from: Short fictional biography. Posibility of a reader's literary...

Biographies of literature writers written in English language

bio-mcp-data

Biography wear corp Import Company US

ZH-preview Dataset

Big Data V3 No Bio Dataset

Big Data V3 No Bio

APIS Dataset artists

Data from: Destined for Success? Educational Biographies of Academically...

Data from: A short biography of Hubert Ludwig and a note on the publication...

phbdataset

Topic Model for English Wikipedia's Biographies with list of all 1.8M...

Data from: Biographical Directory of the United States Congress

U.S. District Court Judges Merge File

BiasBios Dataset

Early Members of the Leopoldina (1652-1818): Biographical Data

Data from: Yellow Nineties 2.0

WikiBio Dataset

Wikipedia Biography Dataset