58 datasets found
  1. WikiBio Dataset

    • paperswithcode.com
    Updated Nov 16, 2021
    Cite
    Remi Lebret; David Grangier; Michael Auli (2021). WikiBio Dataset [Dataset]. https://paperswithcode.com/dataset/wikibio
    Explore at:
    Dataset updated
    Nov 16, 2021
    Authors
    Remi Lebret; David Grangier; Michael Auli
    Description

    This dataset gathers 728,321 biographies from English Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).

  2. Data from: Pantheon 1.0, A Manually Verified Dataset of Globally Famous...

    • dataverse.harvard.edu
    Updated Jan 4, 2016
    Cite
    Harvard Dataverse (2016). Pantheon 1.0, A Manually Verified Dataset of Globally Famous Biographies [Dataset]. http://doi.org/10.7910/DVN/28201
    Explore at:
    Available download formats: tsv(2176393), text/plain; charset=utf-8(13938718), text/plain; charset=us-ascii(149252802)
    Dataset updated
    Jan 4, 2016
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    We present the Pantheon 1.0 dataset: a manually verified dataset of individuals that have transcended linguistic, temporal, and geographic boundaries. The Pantheon 1.0 dataset includes the 11,341 biographies present in more than 25 languages in Wikipedia and is enriched with: (i) manually verified demographic information (place and date of birth, gender) (ii) a taxonomy of occupations classifying each biography at three levels of aggregation and (iii) two measures of global popularity including the number of languages in which a biography is present in Wikipedia (L), and the Historical Popularity Index (HPI) a metric that combines information on L, time since birth, and page-views (2008-2013). We compare the Pantheon 1.0 dataset to data from the 2003 book, Human Accomplishments, and also to external measures of accomplishment in individual games and sports: Tennis, Swimming, Car Racing, and Chess. In all of these cases we find that measures of popularity (L and HPI) correlate highly with individual accomplishment, suggesting that measures of global popularity proxy the historical impact of individuals.

  3. Data from: Member biographies

    • gov.uk
    • s3.amazonaws.com
    Updated Oct 11, 2024
    Cite
    Biometrics and Forensics Ethics Group (2024). Member biographies [Dataset]. https://www.gov.uk/government/publications/member-biographies
    Explore at:
    Dataset updated
    Oct 11, 2024
    Dataset provided by
    GOV.UK (http://gov.uk/)
    Authors
    Biometrics and Forensics Ethics Group
    Description

    Full biographies of the members of the Biometrics and Forensics Ethics Group.

  4. Database_biographies of the Sverdlovsk oblast officials.xlsx

    • figshare.com
    xlsx
    Updated Dec 1, 2020
    Cite
    Kirill Melnikov (2020). Database_biographies of the Sverdlovsk oblast officials.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.13313045.v1
    Explore at:
    Available download formats: xlsx
    Dataset updated
    Dec 1, 2020
    Dataset provided by
    figshare
    Authors
    Kirill Melnikov
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Area covered
    Sverdlovsk Oblast
    Description

    This dataset contains the biographies of the Sverdlovsk Oblast officials (2004-2005; 2019-2020)

  5. Data from: Short fictional biography. Posibility of a reader's literary...

    • scielo.figshare.com
    • figshare.com
    jpeg
    Updated Jun 3, 2023
    Cite
    Rafael Andugar Sousa (2023). Short fictional biography. Posibility of a reader's literary genre [Dataset]. http://doi.org/10.6084/m9.figshare.7101119.v1
    Explore at:
    Available download formats: jpeg
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    SciELO journals
    Authors
    Rafael Andugar Sousa
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Abstract: In this article, we start from the theoretical implications glimpsed by J. M. Schaeffer, with the purpose of creating a new literary genre based on the reader's task of comparing diverse literary works that may not belong to the same tradition. The object of our interest is the existence of tales and narrations of biographies invented by an author interested in real (or even invented) historical characters. To explore the limits of the genre, it is necessary to know deeply the field of biography and its relations with both literary writing and historiographical discourse.

  6. Biographies of literature writers written in English language

    • figshare.com
    application/gzip
    Updated Mar 17, 2023
    Cite
    Javier Gomez; Cesar Alfaro; Felipe Ortega; Javier M. Moguerza; Maria Jesus Algar; Raul Moreno (2023). Biographies of literature writers written in English language [Dataset]. http://doi.org/10.6084/m9.figshare.13551467.v4
    Explore at:
    Available download formats: application/gzip
    Dataset updated
    Mar 17, 2023
    Dataset provided by
    figshare
    Authors
    Javier Gomez; Cesar Alfaro; Felipe Ortega; Javier M. Moguerza; Maria Jesus Algar; Raul Moreno
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset contains 1000 biographies of literature writers retrieved from the English version of Wikipedia: 500 biographies of women writers extracted from the category “19th-century_women_writers” (https://en.wikipedia.org/wiki/Category:19th-century_women_writers) and 500 biographies of male writers extracted from the category “19th-century_male_writers” (https://en.wikipedia.org/wiki/Category:19th-century_male_writers).

  7. bio-mcp-data

    • huggingface.co
    Updated Jun 16, 2025
    Cite
    Longevity Genie (2025). bio-mcp-data [Dataset]. https://huggingface.co/datasets/longevity-genie/bio-mcp-data
    Explore at:
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    Longevity Genie
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Bio-MCP-Data

    A repository containing biological datasets that will be used by BIO-MCP, following the MCP (Model Context Protocol) standard.

      About
    

    This repository hosts biological data assets formatted to be compatible with the Model Context Protocol, enabling AI models to efficiently access and process biological information. The data is managed using Git Large File Storage (LFS) to handle large biological datasets.

      Purpose
    

    Provide standardized biological datasets for AI… See the full description on the dataset page: https://huggingface.co/datasets/longevity-genie/bio-mcp-data.

  8. Biography wear corp Import Company US

    • seair.co.in
    Updated Nov 5, 2017
    Cite
    Seair Exim (2017). Biography wear corp Import Company US [Dataset]. https://www.seair.co.in
    Explore at:
    Available download formats: .bin, .xml, .csv, .xls
    Dataset updated
    Nov 5, 2017
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can look up export and import data for 23 countries by HS code or product name. This demo is helpful for market analysis.

  9. ZH-preview Dataset

    • data.mendeley.com
    Updated Jun 27, 2024
    Cite
    曾 昊 (2024). ZH-preview Dataset [Dataset]. http://doi.org/10.17632/nx8hknrgfz.1
    Explore at:
    Dataset updated
    Jun 27, 2024
    Authors
    曾 昊
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Preview Dataset for editorial evaluation and review.

  10. Big Data V3 No Bio Dataset

    • universe.roboflow.com
    zip
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    graduationproject (2023). Big Data V3 No Bio Dataset [Dataset]. https://universe.roboflow.com/graduationproject-aqm0w/big-data-v3-no-bio
    Explore at:
    Available download formats: zip
    Dataset updated
    Jun 6, 2023
    Dataset authored and provided by
    graduationproject
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Variables measured
    Trash Bounding Boxes
    Description

    Big Data V3 No Bio

    ## Overview
    
    Big Data V3 No Bio is a dataset for object detection tasks - it contains Trash annotations for 8,825 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  11. APIS Dataset artists

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Jun 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maximilian Kaiser (2020). APIS Dataset artists [Dataset]. http://doi.org/10.5281/zenodo.3865451
    Explore at:
    Available download formats: bin
    Dataset updated
    Jun 2, 2020
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Maximilian Kaiser
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset contains biographical data produced in the course of the digital humanities project “Mapping historical networks: Building the new Austrian Prosopographical/Biographical Information System (APIS)” at the Austrian Academy of Sciences. It was funded by the Austrian National Fonds for Research, Technology and Development. The biographies were manually annotated by the author via a web application (apis.acdh.oewa.ac.at) which was developed at the Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH).

    The starting point of the dataset (cl Kuenstlerhaus) were 506 annotated artists’ biographies from the Austrian Biographical Encyclopaedia 1815–1950 (ÖBL). For these persons, the membership in the Association of Fine Artists Vienna (Genossenschaft der bildenden Künstler Wiens) was confirmed by the comparison of the yearly published membership lists with the lemmas of the ÖBL. The data were collected primarily to enable a) statistics b) historical network analyses and c) cartographic analyses.

    The data is provided as graphml files:

    • relations between persons (kinship, pupil/teacher)
      cl_kuenstlerhaus_person-person_v1-01
    • relations between persons and institutions (education, career, social networks)
      cl_kuenstlerhaus_person-institution_v1-01
    • relations between persons and places (mobility)
      cl_kuenstlerhaus_person-place_v1-01
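
    GraphML is an XML format, so files like those listed above can be inspected with a standard XML parser. The following is a minimal sketch in Python using only the standard library. The sample document, node ids, and helper names are invented for illustration and do not reflect the actual contents or attribute schema of the APIS files.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# GraphML elements live in this XML namespace (per the GraphML specification).
GRAPHML_NS = {"g": "http://graphml.graphdrawing.org/xmlns"}

# Tiny stand-in document; the ids and edges are hypothetical, not APIS data.
SAMPLE = """<graphml xmlns="http://graphml.graphdrawing.org/xmlns">
  <graph id="demo" edgedefault="directed">
    <node id="person_1"/>
    <node id="person_2"/>
    <node id="person_3"/>
    <edge source="person_1" target="person_2"/>
    <edge source="person_1" target="person_3"/>
  </graph>
</graphml>
"""


def read_edges(graphml_text):
    """Return (node_ids, edges) parsed from a GraphML string."""
    root = ET.fromstring(graphml_text)
    nodes = [n.get("id") for n in root.iterfind(".//g:node", GRAPHML_NS)]
    edges = [
        (e.get("source"), e.get("target"))
        for e in root.iterfind(".//g:edge", GRAPHML_NS)
    ]
    return nodes, edges


def out_degree(edges):
    """Count outgoing relations per source node."""
    return Counter(source for source, _ in edges)


nodes, edges = read_edges(SAMPLE)
print(len(nodes), "nodes,", len(edges), "edges")
print(out_degree(edges).most_common(1))
```

    For a real file, the text could be read from disk and passed to `read_edges`; graph libraries that support GraphML are an alternative when network analysis is the goal.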

    The dataset was last reviewed in January 2020.

  12. Data from: Destined for Success? Educational Biographies of Academically...

    • datacatalogue.cessda.eu
    • beta.ukdataservice.ac.uk
    Updated Nov 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Power, S., University of London, Institute of Education; Whitty, G., University of London, Institute of Education; Edwards, T., University of Newcastle upon Tyne (2024). Destined for Success? Educational Biographies of Academically Able Pupils, 1981-1997 [Dataset]. http://doi.org/10.5255/UKDA-SN-3827-1
    Explore at:
    Dataset updated
    Nov 28, 2024
    Dataset provided by
    Policy Studies
    School of Education
    Authors
    Power, S., University of London, Institute of Education; Whitty, G., University of London, Institute of Education; Edwards, T., University of Newcastle upon Tyne
    Time period covered
    Jan 1, 1995 - Jan 1, 1997
    Area covered
    England
    Variables measured
    Individuals, National, Young people
    Measurement technique
    Face-to-face interview, Telephone interview, Postal survey, Self-completion
    Description

    Abstract copyright UK Data Service and data collection copyright owner.


    This is a mixed methods data collection.

    This project made use of a sample drawn for an earlier research project to explore the different ways in which 'academically able' students attending different types of secondary school at age 11 in the mid 1980s realised and experienced their subsequent educational and career opportunities. It involved four groups of academically able pupils: assisted place holders in independent schools, full fee paying pupils in the same schools, pupils at maintained grammar schools and those attending comprehensive schools. The findings provide important insights into the experiences, qualifications, attitudes and values of new recruits to middle class occupations in the 1990s.

    The broad aim of Destined for Success? Educational Biographies of Academically Able Pupils, 1981-1997 was to explore the different ways in which academically able students realise and experience educational opportunities. The study had the following specific objectives:
    • to compare the dimensions and directions along which different forms of schooling and sponsorship had impacted upon the educational careers of 'academically able' students
    • to investigate the extent to which students had been able to translate their educational promise at age 11 into subsequent school achievements, further educational opportunities and occupational locations
    • to explore the ways in which their experiences have resulted in the continuity or transformation of social identities in terms of family, friendship or work
    The research was conducted by means of a postal survey and semi-structured interviews. A sample of questionnaire respondents was selected for interview to ensure that all sectors, schools and modes of sponsorship were represented.

    A follow-up to this study is available under SN 6501 - Success Sustained? A Follow-up Survey of the 'Destined for Success' Cohort, 2004. This quantitative study revisits the respondents in their early thirties.

    Further information is available from the Destined for Success? Educational Biographies of Academically Able Pupils ESRC Award web page.

    For the second edition (May 2011), transcripts of qualitative interviews conducted with 34 of the original respondents were added to the quantitative data, making the study a mixed methods data collection.

    Main Topics:

    The following topics are covered: education; school types; academic ability; school achievements; higher education; transition from school to work and subsequent careers; social identities; basic socio-economic indicators; cultural and political dispositions.

  13. Data from: A short biography of Hubert Ludwig and a note on the publication...

    • search.datacite.org
    • gbif.org
    Updated Apr 27, 2016
    Cite
    Plazi (2016). A short biography of Hubert Ludwig and a note on the publication dates of his monograph Die Seewalzen (1889 – 1892) [Dataset]. http://doi.org/10.15468/qf39mc
    Explore at:
    Dataset updated
    Apr 27, 2016
    Dataset provided by
    DataCite (https://www.datacite.org/)
    Plazi.org taxonomic treatments database
    Authors
    Plazi
    Description

    This dataset contains the digitized treatments in Plazi based on the original journal article Reich, Mike (2015): A short biography of Hubert Ludwig and a note on the publication dates of his monograph Die Seewalzen (1889 – 1892). Zootaxa 4052 (2): 332-344, DOI: http://dx.doi.org/10.11646/zootaxa.4052.3.3

  14. phbdataset

    • data.mendeley.com
    Updated Jul 13, 2023
    Cite
    Phillip Dangaiso (2023). phbdataset [Dataset]. http://doi.org/10.17632/5pyf6bm36g.1
    Explore at:
    Dataset updated
    Jul 13, 2023
    Authors
    Phillip Dangaiso
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Data collected from rural communities in Zimbabwe to evaluate preventive health behavior based on the health belief model.

  15. Topic Model for English Wikipedia's Biographies with list of all 1.8M...

    • zenodo.org
    • data.niaid.nih.gov
    bin, csv, txt, zip
    Updated Jan 28, 2023
    Cite
    Michael Mandiberg; Danara Sarıoğlu (2023). Topic Model for English Wikipedia's Biographies with list of all 1.8M articles linked to Wikidata [Dataset]. http://doi.org/10.5281/zenodo.5747336
    Explore at:
    Available download formats: zip, bin, csv, txt
    Dataset updated
    Jan 28, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Michael Mandiberg; Danara Sarıoğlu
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    A Gensim LDA Topic Model of English Wikipedia biographical articles, with a list of all 1.8M articles and some associated Wikidata information

    The model has 150 Topics.

    This model was developed in the process of isolating a set of visual arts biographical articles, as described in "Clowns in the Visual Artists: Topic Modeling Wikipedia and Wikidata" in the Spring 2022 issue of Art Documentation - https://doi.org/10.1086/719999

    Because names, nationalities, and birthdays are so prominent in biographies, the stopwords list removed 170,000 names, surnames, city names, place names, countries, days, months and other time related words (https://github.com/mandiberg/Names-Surnames-and-Countries-for-Stopwords). We also directly removed each article subject’s given and surname, which were almost always the most frequently occurring words in any given article. Otherwise, the model just produced topics based on nationality, and common names and surnames.
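
    The filtering described above can be sketched roughly as follows. This is an illustrative Python example, not the project's actual code (which is in the linked repository): the function name, the tiny stopword set, and the sample sentence are all assumptions made for the sketch.

```python
import re


def preprocess(text, subject_name, stopwords):
    """Lowercase and tokenize `text`, dropping stopwords and the subject's own names."""
    blocked = {word.lower() for word in stopwords}
    # Also block every part of the article subject's name (given name and surname).
    blocked.update(part.lower() for part in subject_name.split())
    # Keep alphabetic tokens only, so years and other numbers fall away.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    return [token for token in tokens if token not in blocked]


# Hypothetical stand-in for the much larger names/places/dates stopword list.
STOPWORDS = {"the", "a", "an", "in", "was", "born", "march", "vienna", "austrian"}

doc = "Anna Example was an Austrian painter born in Vienna in March 1850."
print(preprocess(doc, "Anna Example", STOPWORDS))  # ['painter']
```

    With names, places, and dates removed, the remaining tokens lean toward occupation-related vocabulary, which matches the stated goal of keeping topics from collapsing into nationality and name clusters.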

    Files:

    all_enwiki_bios_from_wikidata.csv
    The list of all Wikidata items for humans with an enwiki page (e.g., a biographical article) was extracted from a Wikidata JSON dump; the list includes gender, occupation, and nationality. This was joined with the converted plaintext from an English Wikipedia dump. This data was downloaded in March 2021.

    Wikipedia Biographies LDA Topic Model human readable summary.csv
    A human readable file with the 150 topics ranked by count of articles per topic from the 1.8M corpus. The most popular topics have categorical descriptions of the occupations of each cluster. Some are marked as not an occupation cluster.

    BoW_corpus.mm*
    model_lda_full_Sep2_150Tv2*
    These six files comprise the topic model. The code to load them is present in the python files.

    dict_full_Aug-28-2021
    processed_docs_full_Aug-28-2021.txt
    processed_docs_1000_Aug-18-2021.txt
    These are the dictionary and processed corpora required to build and implement the model using this code. The corpus with the first 1000 items is meant to be used for testing, as the full one is quite large and takes a long time to process.

    topic-model-wikipedia-sept2021.zip
    The code and settings used for creating and implementing this model are included in this zip and are also available here: https://github.com/mandiberg/topic-model-wikipedia

    All-Wikipedia-Biographies-with-topic1.csv
    All-Wikipedia-Biographies-with-topic1and2.csv
    These are the lists of 1.8M biographies matched to topics. The "topic1" file includes only the first topic and is the slightly larger list. The "topic1and2" file is slightly smaller because about 2% of articles do not match a second topic.

    Analysis-for-Clowns-Visual-Arts.zip
    These are the raw data and final data produced for "Clowns in the Visual Artists." Please see the article for context.

  16. Data from: Biographical Directory of the United States Congress

    • data.wu.ac.at
    api/sparql +2
    Updated Oct 10, 2013
    Cite
    DataFAQs (2013). Biographical Directory of the United States Congress [Dataset]. https://data.wu.ac.at/schema/datahub_io/NDM5Y2EzMTYtZjJhMS00NzdkLTk5N2UtODg0MTBmZTM1MjE2
    Explore at:
    Available download formats: api/sparql, example/turtle, meta/void(60.0)
    Dataset updated
    Oct 10, 2013
    Dataset provided by
    DataFAQs
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Names, positions, state, party, and congress number of members of US Congress 1774-present.

    Scraped from http://bioguide.congress.gov/biosearch/biosearch.asp by https://scraperwiki.com/scrapers/biographical_directory_usc/#

  17. U.S. District Court Judges Merge File

    • dataverse.harvard.edu
    Updated Aug 16, 2011
    Cite
    Maya Sen (2011). U.S. District Court Judges Merge File [Dataset]. http://doi.org/10.7910/DVN/J1A6RW
    Explore at:
    Croissant. Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 16, 2011
    Dataset provided by
    Harvard Dataverse
    Authors
    Maya Sen
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Area covered
    United States
    Description

    This merge file can be used to combine detailed biographical data on U.S. District Court Judges from the Federal Judicial Center (http://www.fjc.gov/history/home.nsf/page/export.html) with data on cases from the U.S. Court of Appeals Database Project (http://www.wmich.edu/nsf-coa/). The file includes the unique identifiers used by each group, making it easy for researchers to combine the two data sources. Note that this is a merge file for U.S. District Court Judges only.
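
    A merge file of this kind is typically used as a crosswalk between two identifier schemes. The sketch below shows the general pattern in Python with only the standard library. Every column name (`fjc_id`, `coa_id`, `case_id`, and so on) and every value is a hypothetical placeholder, not the actual layout of the merge file or of either source.

```python
import csv
import io

# Hypothetical stand-ins for the three tables; real column names will differ.
MERGE_FILE = "fjc_id,coa_id\n101,A7\n102,B3\n"
FJC_BIOS = "fjc_id,name,appointed\n101,Judge One,1975\n102,Judge Two,1988\n"
COA_CASES = "case_id,coa_id\nc1,A7\nc2,A7\nc3,B3\n"


def rows(text):
    """Parse CSV text into a list of dicts keyed by header."""
    return list(csv.DictReader(io.StringIO(text)))


def attach_bios(cases, merge, bios):
    """Attach a judge's biographical record to each case via the crosswalk."""
    coa_to_fjc = {m["coa_id"]: m["fjc_id"] for m in merge}
    bio_by_fjc = {b["fjc_id"]: b for b in bios}
    joined = []
    for case in cases:
        fjc_id = coa_to_fjc.get(case["coa_id"])
        # Cases whose judge is missing from the crosswalk keep only their own fields.
        bio = bio_by_fjc.get(fjc_id, {})
        joined.append({**case, **bio})
    return joined


joined = attach_bios(rows(COA_CASES), rows(MERGE_FILE), rows(FJC_BIOS))
for record in joined:
    print(record["case_id"], record.get("name"))
```

    Keeping the crosswalk as its own lookup step means unmatched cases surface as records without biographical fields rather than being silently dropped.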

  18. BiasBios Dataset

    • paperswithcode.com
    Updated Jan 26, 2019
    Cite
    Maria De-Arteaga; Alexey Romanov; Hanna Wallach; Jennifer Chayes; Christian Borgs; Alexandra Chouldechova; Sahin Geyik; Krishnaram Kenthapadi; Adam Tauman Kalai (2019). BiasBios Dataset [Dataset]. https://paperswithcode.com/dataset/biasbios
    Explore at:
    Dataset updated
    Jan 26, 2019
    Authors
    Maria De-Arteaga; Alexey Romanov; Hanna Wallach; Jennifer Chayes; Christian Borgs; Alexandra Chouldechova; Sahin Geyik; Krishnaram Kenthapadi; Adam Tauman Kalai
    Description

    The purpose of this dataset was to study gender bias in occupations. Online biographies, written in English, were collected to find the names, pronouns, and occupations. Twenty-eight most frequent occupations were identified based on their appearances. The resulting dataset consists of 397,340 biographies spanning twenty-eight different occupations. Of these occupations, the professor is the most frequent, with 118,400 biographies, while the rapper is the least frequent, with 1,406 biographies. Important information about the biographies: 1. The longest biography is 194 tokens, while the shortest is eighteen; the median biography length is seventy-two tokens. 2. It should be noted that the demographics of online biographies’ subjects differ from those of the overall workforce and that this dataset does not contain all biographies on the Internet.

  19. Early Members of the Leopoldina (1652-1818): Biographical Data

    • data.niaid.nih.gov
    • zenodo.org
    Updated Oct 1, 2024
    Cite
    Münnich, Fanny (2024). Early Members of the Leopoldina (1652-1818): Biographical Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13818617
    Explore at:
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Schilling, Jacob
    Splinter, Susan
    Münnich, Fanny
    Gassner, Sebastian
    Rehbein, Malte
    Doppler, Tobias
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This data set encompasses a collection of detailed biographical data about 850 of the first members of the German National Academy of Sciences Leopoldina (Deutsche Akademie der Naturforscher Leopoldina – Nationale Akademie der Wissenschaften) from 1652 to 1818. The data includes information about the members themselves, their family, their membership in the Leopoldina, academic and professional positions held, as well as works, portraits, and associated sources.

  20. Data from: Yellow Nineties 2.0

    • borealisdata.ca
    • search.dataone.org
    Updated May 31, 2025
    Cite
    Lorraine Janzen Kooistra; MJ Suhonos; Alison F. Hedley; Reg Beatty; Marion Tempest Grant; Linked Infrastructure for Networked Cultural Scholarship (LINCS) (2025). Yellow Nineties 2.0 [Dataset]. http://doi.org/10.5683/SP3/2FTQXM
    Explore at:
    Croissant. Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 31, 2025
    Dataset provided by
    Borealis
    Authors
    Lorraine Janzen Kooistra; MJ Suhonos; Alison F. Hedley; Reg Beatty; Marion Tempest Grant; Linked Infrastructure for Networked Cultural Scholarship (LINCS)
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Time period covered
    1889 - 1905
    Description

    Yellow Nineties 2.0 uses digital tools to advance knowledge of eight late-Victorian little magazines and the people who contributed to their production between 1889 and 1905:

    • Pagan Review (1 volume, 1892)
    • Yellow Book (13 volumes, 1894–1897)
    • The Dial (5 volumes, 1889–1897)
    • The Evergreen: A Northern Seasonal (4 volumes, 1895–1897)
    • The Green Sheaf (13 issues, 1903–1904)
    • The Pageant (2 volumes, 1896–1897)
    • The Savoy (2 quarterly and 6 monthly issues, 1896)
    • The Venture: An Annual of Art and Literature (2 volumes, 1903 and 1905)

    The data document the communities of production responsible for these little magazines, particularly by recovering the social networks of and biographical information about women and marginalized persons in those communities. The dataset enables users to query, visualize, and analyze the relationships, connections, and social networks of magazine contributors.

    The Yellow Nineties project site (https://1890s.ca) includes two biographical tools, one discursive and the other data-driven. Essays on the life and work of a select group of magazine contributors are available in Y90s Biographies. Biographical data for all magazine contributors are available in the Y90s Personography (https://personography.1890s.ca).

    The data has been transformed into Linked Open Data via the LINCS conversion toolkit of the Linked Infrastructure for Networked Cultural Scholarship (LINCS) project. The data is assembled as a single text file in text/turtle (.ttl) and contains descriptive metadata that has been reconciled into triples using established linked data vocabularies. The Yellow Nineties 2.0 has been supported by funding from SSHRC.

Cite
Remi Lebret; David Grangier; Michael Auli (2021). WikiBio Dataset [Dataset]. https://paperswithcode.com/dataset/wikibio

WikiBio Dataset

Wikipedia Biography Dataset

Explore at:
402 scholarly articles cite this dataset (View in Google Scholar)