100+ datasets found
  1. Best Books Ever Dataset

    • zenodo.org
    csv
    Updated Nov 10, 2020
    Cite
    Lorena Casanova Lozano; Sergio Costa Planells (2020). Best Books Ever Dataset [Dataset]. http://doi.org/10.5281/zenodo.4265096
    Explore at:
    Available download formats: csv
    Dataset updated
    Nov 10, 2020
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Lorena Casanova Lozano; Sergio Costa Planells
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The dataset was collected as part of Prac1 for the subject Typology and Data Life Cycle of the Master's Degree in Data Science at the Universitat Oberta de Catalunya (UOC).

    The dataset contains 25 variables and 52478 records corresponding to books on the Goodreads Best Books Ever list (the largest list on the site).

    The original code used to retrieve the dataset can be found in the GitHub repository: github.com/scostap/goodreads_bbe_dataset

    The data was retrieved in two sets: the first 30000 books and then the remaining 22478. Dates were not parsed and reformatted in the second chunk, so publishDate and firstPublishDate are represented in mm/dd/yyyy format for the first 30000 records and in Month Day Year format for the rest.
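Because the two chunks store dates differently, a consumer would probably want to normalize both layouts before any date arithmetic. The following is only a minimal sketch, not the authors' code: the function name `parse_book_date` is hypothetical, and the exact spelling of the "Month Day Year" layout (full or abbreviated month names, optional ordinal suffixes such as "14th", optional commas) is an assumption that should be checked against the actual CSV.

```python
import re
from datetime import date, datetime
from typing import Optional

# Assumption: the second chunk may carry ordinal suffixes ("14th", "1st").
_ORDINAL = re.compile(r"(\d+)(st|nd|rd|th)\b")

# First chunk layout (mm/dd/yyyy), then assumed spelled-out month layouts.
_FORMATS = ("%m/%d/%Y", "%B %d %Y", "%b %d %Y")


def parse_book_date(raw: Optional[str]) -> Optional[date]:
    """Return a date for either layout, or None when nothing matches."""
    if not raw:
        return None
    # Drop ordinal suffixes and commas, then collapse repeated whitespace.
    text = _ORDINAL.sub(r"\1", raw.strip()).replace(",", "")
    text = " ".join(text.split())
    for fmt in _FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


if __name__ == "__main__":
    for sample in ("09/14/2008", "September 14th 2008", "not a date"):
        print(repr(sample), "->", parse_book_date(sample))
```

Returning `None` instead of raising keeps partially filled columns (firstPublishDate is listed as 59% complete) easy to process row by row.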

    Book cover images can optionally be downloaded from the URL in the 'coverImg' field. Python code for doing so, along with an example, can be found in the GitHub repo.

    The 25 fields of the dataset are:

    | Attribute | Definition | Completeness (%) |
    | ------------- | ------------- | ------------- |
    | bookId | Book identifier as in goodreads.com | 100 |
    | title | Book title | 100 |
    | series | Series name | 45 |
    | author | Book's author | 100 |
    | rating | Global Goodreads rating | 100 |
    | description | Book's description | 97 |
    | language | Book's language | 93 |
    | isbn | Book's ISBN | 92 |
    | genres | Book's genres | 91 |
    | characters | Main characters | 26 |
    | bookFormat | Type of binding | 97 |
    | edition | Type of edition (e.g. Anniversary Edition) | 9 |
    | pages | Number of pages | 96 |
    | publisher | Publisher | 93 |
    | publishDate | Publication date | 98 |
    | firstPublishDate | Publication date of first edition | 59 |
    | awards | List of awards | 20 |
    | numRatings | Number of total ratings | 100 |
    | ratingsByStars | Number of ratings by stars | 97 |
    | likedPercent | Derived field, percent of ratings over 2 stars (as in Goodreads) | 99 |
    | setting | Story setting | 22 |
    | coverImg | URL to cover image | 99 |
    | bbeScore | Score in Best Books Ever list | 100 |
    | bbeVotes | Number of votes in Best Books Ever list | 100 |
    | price | Book's price (extracted from Iberlibro) | 73 |
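The Completeness column and the derived likedPercent field in the table above can both be reproduced with simple counting. The sketch below is illustrative only: it assumes completeness means "percent of rows with a non-empty value", and it assumes ratings are available as a mapping from star value to count (the real storage format of ratingsByStars is not described here). The function names are hypothetical.

```python
import csv
import io
from typing import Dict, Iterable, List, Mapping, Optional


def completeness(rows: Iterable[Mapping[str, str]], fields: List[str]) -> Dict[str, int]:
    """Percent of rows holding a non-empty value, per field (rounded)."""
    filled = {name: 0 for name in fields}
    total = 0
    for row in rows:
        total += 1
        for name in fields:
            if (row.get(name) or "").strip():
                filled[name] += 1
    if total == 0:
        return {name: 0 for name in fields}
    return {name: round(100 * count / total) for name, count in filled.items()}


def liked_percent(counts_by_star: Mapping[int, int]) -> Optional[int]:
    """Percent of ratings above 2 stars, or None when there are no ratings."""
    total = sum(counts_by_star.values())
    if total == 0:
        return None
    liked = sum(count for star, count in counts_by_star.items() if star > 2)
    return round(100 * liked / total)


if __name__ == "__main__":
    # Tiny in-memory CSV standing in for the real file (made-up rows).
    sample = (
        "bookId,series,edition\n"
        "1,Saga #1,\n"
        "2,,\n"
        "3,Saga #2,Anniversary Edition\n"
        "4,,\n"
    )
    rows = list(csv.DictReader(io.StringIO(sample)))
    print(completeness(rows, ["bookId", "series", "edition"]))
    print(liked_percent({5: 60, 4: 20, 3: 10, 2: 5, 1: 5}))
```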

  2. Books Dataset

    • figshare.com
    txt
    Updated Jan 19, 2016
    Cite
    Giuseppe Mendola (2016). Books Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.1441255.v1
    Explore at:
    Available download formats: txt
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Giuseppe Mendola
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This database contains information about books gathered with the help of the Google Books API. It contains 7 tables, 3 of which serve only to relate the other tables to one another. Tables: Books contains 1062 records, Authors 1595 records, Categories 109 records, and Metadata 37 records. MD5 (GBooks_2015-06-09.sql) = bfd09094d0e123e668b2e58332b1a98b
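The published MD5 value can be used to confirm that a downloaded copy of the SQL dump is intact. A minimal sketch using only the standard library follows; the helper names are hypothetical, and the local file path is whatever the file was saved as. MD5 is suitable here only as an integrity check, not as a security guarantee.

```python
import hashlib
from pathlib import Path

# Checksum quoted in the dataset description for GBooks_2015-06-09.sql.
EXPECTED_MD5 = "bfd09094d0e123e668b2e58332b1a98b"


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large dumps do not need to fit in memory."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path: Path, expected: str = EXPECTED_MD5) -> bool:
    """True when the file's MD5 equals the published value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    dump = Path("GBooks_2015-06-09.sql")  # assumed local download location
    if dump.exists():
        print("checksum ok" if matches_published_md5(dump) else "checksum mismatch")
    else:
        print("file not found:", dump)
```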

  3. Books data

    • kaggle.com
    Updated Aug 14, 2021
    Cite
    Carlos Heryhelder (2021). Books data [Dataset]. https://www.kaggle.com/datasets/heryhelder/books-data
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 14, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Carlos Heryhelder
    Description

    Dataset

    This dataset was created by Carlos Heryhelder

  4. Dataset of books called Advanced database techniques

    • workwithdata.com
    Updated Apr 17, 2025
    Cite
    Work With Data (2025). Dataset of books called Advanced database techniques [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=Advanced+database+techniques
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row, filtered to the book titled Advanced database techniques. It features 7 columns, including author, publication date, language, and book publisher.
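The filter described above is visible in the query string of the dataset URL. The sketch below decodes it with the standard library; reading `fcol0`, `fop0`, and `fval0` as "filter column", "filter operator", and "filter value" is an inference from the URL and the description, not documented behavior of the site, and `decode_filter` is a hypothetical helper.

```python
from typing import Tuple
from urllib.parse import parse_qs, urlsplit

URL = (
    "https://www.workwithdata.com/datasets/books"
    "?f=1&fcol0=book&fop0=%3D&fval0=Advanced+database+techniques"
)


def decode_filter(url: str, index: int = 0) -> Tuple[str, str, str]:
    """Return (column, operator, value) for the filter with the given index."""
    query = parse_qs(urlsplit(url).query)
    # parse_qs percent-decodes values and turns "+" into spaces.
    column = query[f"fcol{index}"][0]
    operator = query[f"fop{index}"][0]
    value = query[f"fval{index}"][0]
    return column, operator, value


if __name__ == "__main__":
    print(decode_filter(URL))
```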

  5. Books Wholesalers in Louisiana, United States - 5 Verified Listings Database...

    • poidata.io
    csv, excel, json
    Updated Jul 18, 2025
    Cite
    Poidata.io (2025). Books Wholesalers in Louisiana, United States - 5 Verified Listings Database [Dataset]. https://www.poidata.io/report/books-wholesaler/united-states/louisiana
    Explore at:
    Available download formats: json, excel, csv
    Dataset updated
    Jul 18, 2025
    Dataset provided by
    Poidata.io
    Area covered
    Louisiana, United States
    Description

    Comprehensive dataset of 5 book wholesalers in Louisiana, United States, as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.

  6. PiCoBoo database: Aunt Mavor's Picture Books for Little Readers [Second...

    • data.ncl.ac.uk
    pdf
    Updated May 31, 2023
    + more versions
    Cite
    Francesca Tancini (2023). PiCoBoo database: Aunt Mavor's Picture Books for Little Readers [Second Series] [Dataset]. http://doi.org/10.25405/data.ncl.15181071.v1
    Explore at:
    Available download formats: pdf
    Dataset updated
    May 31, 2023
    Dataset provided by
    Newcastle University
    Authors
    Francesca Tancini
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Aunt Mavor's Picture Books for Little Readers [Second Series]

  7. Print book unit sales in the U.S. 2004-2024

    • statista.com
    Updated Jun 24, 2025
    Cite
    Statista (2025). Print book unit sales in the U.S. 2004-2024 [Dataset]. https://www.statista.com/statistics/422595/print-book-sales-usa/
    Explore at:
    Dataset updated
    Jun 24, 2025
    Dataset authored and provided by
    Statista (http://statista.com/)
    Area covered
    United States
    Description

    Data showing how many books were sold in 2024 revealed that the printed book market remains healthy: a total of ***** million units were sold that year among outlets which reported to the source. Whilst this marked a small jump from the previous year, the figure peaked in 2021 and has not surpassed *** million since. Trade paperbacks remained the dominant format.

    Book sales statistics

    Looking at book sales by year, 2005 to 2010 were the most lucrative for the printed book market, with well over *** million units sold annually during that five-year period. After dropping below *** million in 2012, gradual and consistent increases can be seen each year, with the exception of between the years 2018 and 2019. For bookstores though, how many books are sold each year depends on the success of key months across a twelve-month period. Bookstore sales in the United States are at their highest in December, January, and August, but figures for December are consistently higher than other months. Books are popular holiday gifts, with around ** to ** percent of consumers responding to annual surveys in each year from 2012 to 2020 saying that they planned to purchase books as presents during the festive season.

  8. Psy-Data-books

    • huggingface.co
    Updated Jun 19, 2025
    Cite
    Ammar (2025). Psy-Data-books [Dataset]. https://huggingface.co/datasets/Daemontatox/Psy-Data-books
    Explore at:
    Dataset updated
    Jun 19, 2025
    Authors
    Ammar
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    🧠 Psy-Data-Books: Synthetic Medical & Psychology Conversation Dataset

    Psy-Data-Books is one of the largest synthetic datasets of psychology and medical conversations, generated from verified medical and psychology literature. It is designed for building and training powerful conversational AI systems for healthcare, therapy, and mental health applications.

    📊 Dataset Summary

    Domain: Psychology, Psychiatry, Mental Health, General Medicine
    Data Type: Synthetic… See the full description on the dataset page: https://huggingface.co/datasets/Daemontatox/Psy-Data-books.

  9. Dataset of books series that contain Logical database design principles

    • workwithdata.com
    Updated Nov 25, 2024
    Cite
    Work With Data (2024). Dataset of books series that contain Logical database design principles [Dataset]. https://www.workwithdata.com/datasets/book-series?f=1&fcol0=j0-book&fop0=%3D&fval0=Logical+database+design+principles&j=1&j0=books
    Explore at:
    Dataset updated
    Nov 25, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book series. It has 2 rows, filtered to series that contain the book Logical database design principles. It features 10 columns, including number of authors, number of books, earliest publication date, and latest publication date.

  10. Amazon Bestselling Books & Customer Reviews

    • opendatabay.com
    Updated Jul 2, 2025
    Cite
    Datasimple (2025). Amazon Bestselling Books & Customer Reviews [Dataset]. https://www.opendatabay.com/data/ai-ml/1639fb85-1580-4646-8216-326b2fac3437
    Explore at:
    Dataset updated
    Jul 2, 2025
    Dataset authored and provided by
    Datasimple
    Area covered
    Reviews & Ratings
    Description

    This dataset provides an in-depth look into Amazon's top 100 bestselling books along with their customer reviews, ratings, and pricing information. It offers a window into the world of popular reading and customer sentiment. The dataset was collected in November 2023, making it suitable for analysing recent literary trends and consumer behaviour.

    Columns

    The dataset includes the following fields:

    • Book Rank: The ranking of the book among the top 100 bestselling books on Amazon.
    • Book Title: The title of the book. Examples include "The Ballad of Songbirds and Snakes" and "Iron Flame".
    • Price: The price of the book in USD.
    • Rating: The overall rating of the book, on a scale of 1 to 5.
    • Author: The author of the book. Notable authors include Sarah J. Maas and Adam Wallace.
    • Year of Publication: The year in which the book was published.
    • Genre: The category to which the book belongs. Popular genres include Nonfiction and Childrens, literature.
    • URL: The direct URL link to the book on Amazon's platform.
    • Review Title: The title of the customer review.
    • Reviewer: The name of the person who wrote the review.
    • Reviewer Rating: The rating given by the reviewer for the book, on a scale of 1 to 5.
    • Review Description: The textual content of the review.
    • Is_verified: Indicates whether the review is a verified customer purchase.
    • Date: The date when the review was posted.
    • Timestamp: The timestamp indicating when the review was posted.
    • ASIN: Amazon Standard Identification Number assigned to products on Amazon.

    Distribution

    The dataset focuses on the top 100 bestselling books.

    • Price: Book prices range from 1.00 USD to 100.00 USD. There are approximately 10 books within each 9.90 USD price band across this range.
    • Rating: Overall book ratings are generally high, ranging from 4.10 to 5.00. A notable number of books have ratings between 4.73 and 4.82.
    • Year of Publication: Books in the dataset were published between 1947 and 2024. A significant portion, 64 books, were published between 2016 and 2024, indicating a strong presence of recent titles.
    • Genre: While diverse, Nonfiction and Childrens, literature are among the more prominent genres.
    • Authors/Titles: "The Ballad of Songbirds and Snakes" and "Iron Flame" are among the top-ranked titles. Sarah J. Maas and Adam Wallace are featured authors.

    The dataset covers review data for each of the top 100 books, though the exact number of reviews per book is not specified.

    Usage

    This dataset is ideal for:

    • Market analysis: Identifying bestselling trends, pricing strategies, and popular authors.
    • Sentiment analysis: Analysing customer reviews to understand public perception and extract insights.
    • Recommender systems: Building or improving book recommendation engines.
    • Natural Language Processing (NLP): Training models for text classification, entity recognition, or summarisation based on review content.
    • Data visualisation: Creating visualisations of literary trends, rating distributions, or reviewer behaviour.

    Coverage

    • Geographic Scope: The data pertains to the global Amazon marketplace.
    • Time Range: Book publication years span from 1947 to 2024. Review data was collected up to November 2023.

    License

    CC-BY

    Who Can Use It

    • Data scientists and analysts: For machine learning projects, statistical analysis, and predictive modelling.
    • Book enthusiasts and literary researchers: To explore popular reading habits and genre trends.
    • Publishers and authors: To gain insights into market demand and reader feedback.
    • Students and educators: For academic projects related to data science, literature, or consumer studies.

    Dataset Name Suggestions

    • Amazon Bestselling Books & Customer Reviews
    • Top 100 Amazon Books Data 2023
    • Amazon Literary Trends Dataset
    • Bestselling Book Reviews on Amazon

    Attributes

    Original Data Source: Top 100 Bestselling Book Reviews on Amazon

  11. Books Wholesalers in Nevada, United States - 1 Verified Listings Database

    • poidata.io
    csv, excel, json
    Updated Jul 17, 2025
    Cite
    Poidata.io (2025). Books Wholesalers in Nevada, United States - 1 Verified Listings Database [Dataset]. https://www.poidata.io/report/books-wholesaler/united-states/nevada
    Explore at:
    Available download formats: json, csv, excel
    Dataset updated
    Jul 17, 2025
    Dataset provided by
    Poidata.io
    Area covered
    Nevada, United States
    Description

    Comprehensive dataset of 1 book wholesaler in Nevada, United States, as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.

  12. Database Index to Teacher Record Books

    • researchdata.edu.au
    Updated Dec 4, 2014
    + more versions
    Cite
    Public Record Office Victoria (2014). Database Index to Teacher Record Books [Dataset]. https://researchdata.edu.au/database-index-teacher-books/494334?source=suggested_datasets
    Explore at:
    Dataset updated
    Dec 4, 2014
    Dataset provided by
    Public Record Office Victoria
    Time period covered
    1863 - 1959
    Description

    This Microsoft Access database was created under the direction of Public Record Office Victoria and the Department of Education and Training's Education History Unit to provide access to VPRS 13579, Teacher Record Books. Consult the text for this series for further information on how to use a teacher record number to access a teacher's service history.

    The displayed fields are:

    Surname (SName) including for example, "nee Smith"
    Given name (GName1)
    Other given name (GnameOth)
    Extra (other text appearing on the card; this may include "PR", indicating a record in the VPRS 14440 Register of Professional Officers)
    DEETID (the teacher record number, for use with VPRS 13579 or VPRS 13718)

  13. Dataset of book subjects that contain Database management on the Sinclair QL...

    • workwithdata.com
    Updated Nov 7, 2024
    Cite
    Work With Data (2024). Dataset of book subjects that contain Database management on the Sinclair QL [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=j0-book&fop0=%3D&fval0=Database+management+on+the+Sinclair+QL&j=1&j0=books
    Explore at:
    Dataset updated
    Nov 7, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects. It has 2 rows, filtered to subjects that contain the book Database management on the Sinclair QL. It features 10 columns, including number of authors, number of books, earliest publication date, and latest publication date.

  14. Butte Stope Books Data

    • mbmg.mtech.edu
    Updated Mar 9, 2025
    Cite
    Montana Bureau of Mines and Geology (2025). Butte Stope Books Data [Dataset]. https://mbmg.mtech.edu/Information/Collections/Butte-Stope-Books/data.html
    Explore at:
    Dataset updated
    Mar 9, 2025
    Dataset authored and provided by
    Montana Bureau of Mines and Geology
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Butte
    Description

    Metadata and data derived from Butte Stope Books. The underground mine openings are generally referred to as stopes; detailed maps of the stopes were recorded from field books to a master Stope Book for each mine.

  15. Orange Book

    • catalog.data.gov
    • data.virginia.gov
    • +3 more
    Updated Jul 11, 2025
    + more versions
    Cite
    U.S. Food and Drug Administration (2025). Orange Book [Dataset]. https://catalog.data.gov/dataset/orange-book
    Explore at:
    Dataset updated
    Jul 11, 2025
    Dataset provided by
    Food and Drug Administration (http://www.fda.gov/)
    Description

    The Approved Drug Products with Therapeutic Equivalence (Orange Book or OB) is a list of drugs approved under Section 505 of the Federal Food, Drug and Cosmetic Act and provides consumers timely updates on these products. In addition to these products (fo

  16. United States Imports from Norway of Printed books, newspapers, pictures

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Feb 6, 2020
    Cite
    TRADING ECONOMICS (2020). United States Imports from Norway of Printed books, newspapers, pictures [Dataset]. https://tradingeconomics.com/united-states/imports/norway/printed-books-newspapers-pictures
    Explore at:
    Available download formats: excel, csv, json, xml
    Dataset updated
    Feb 6, 2020
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1990 - Dec 31, 2025
    Area covered
    United States
    Description

    United States imports from Norway of printed books, newspapers, pictures were US$1.46 million during 2024, according to the United Nations COMTRADE database on international trade. United States Imports from Norway of Printed books, newspapers, pictures (data, historical chart and statistics) was last updated in July 2025.

  17. Representation in Children's Literature

    • kaggle.com
    Updated Nov 24, 2020
    Cite
    BENJAMIN WRIGHT (2020). Representation in Children's Literature [Dataset]. https://www.kaggle.com/benjaminwright/representation/tasks
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Nov 24, 2020
    Dataset provided by
    Kaggle
    Authors
    BENJAMIN WRIGHT
    Description

    I wanted to find good data about representation and diversity in literature, which brought me to the following page of the Cooperative Children's Book Center (CCBC): https://ccbc.education.wisc.edu/literature-resources/ccbc-diversity-statistics/. The following is data on books by and about Black, Indigenous and People of Color published for children and teens compiled by the Cooperative Children’s Book Center, School of Education, University of Wisconsin-Madison.

    There are two .csv files in the data set. One shows books received by the CCBC from US publishers per year that are authored and/or illustrated by a Black/African/Indigenous/Asian/Pacific Islander/Latinx person, and the other shows books received by the CCBC from US publishers per year that feature a BIPOC character. Further explanation can be found at the CCBC FAQ page.

    Please note that for 2018 and 2019, the .csv files below represent Asian/Pacific Islander people as one column, which is how the CCBC published the data between 2002 and 2017. Also note that the attached data are not the entire data collected by the CCBC. The CCBC also collects books from international publishers, and since 2018, the CCBC has been publishing data about books by/about Arabs.

    All data was collected by the CCBC. Please see the following page (with the complete data) about how to cite the data in your publications/blogs/notebooks: https://ccbc.education.wisc.edu/literature-resources/ccbc-diversity-statistics/books-by-about-poc-fnn/.

    I am curious to see what sorts of visualizations people can make in exploratory analysis of this data! Also, can you predict how many BIPOC books the CCBC will receive in 2020? What happens when you study against US population data?

  18. United States Imports from Bermuda of Printed books, newspapers, pictures

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Jul 15, 2017
    Cite
    TRADING ECONOMICS (2017). United States Imports from Bermuda of Printed books, newspapers, pictures [Dataset]. https://tradingeconomics.com/united-states/imports/bermuda/printed-books-newspapers-pictures
    Explore at:
    Available download formats: excel, xml, json, csv
    Dataset updated
    Jul 15, 2017
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1990 - Dec 31, 2025
    Area covered
    United States
    Description

    United States imports from Bermuda of printed books, newspapers, pictures were US$17.56 thousand during 2023, according to the United Nations COMTRADE database on international trade. United States Imports from Bermuda of Printed books, newspapers, pictures (data, historical chart and statistics) was last updated in July 2025.

  19. Books in Spain - 2 Verified Listings Database

    • poidata.io
    csv, excel, json
    Updated Jul 8, 2025
    + more versions
    Cite
    Poidata.io (2025). Books in Spain - 2 Verified Listings Database [Dataset]. https://www.poidata.io/report/books/spain
    Explore at:
    Available download formats: csv, excel, json
    Dataset updated
    Jul 8, 2025
    Dataset provided by
    Poidata.io
    Area covered
    Spain
    Description

    Comprehensive dataset of 2 Books in Spain as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.

  20. Dataset of books called UK Marine Pressures-Activities Database "PAD" :...

    • workwithdata.com
    Updated Apr 17, 2025
    Cite
    Work With Data (2025). Dataset of books called UK Marine Pressures-Activities Database "PAD" : methods report [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=UK+Marine+Pressures-Activities+Database+%22PAD%22+%3A+methods+report
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United Kingdom
    Description

    This dataset is about books. It has 1 row, filtered to the book titled UK Marine Pressures-Activities Database "PAD" : methods report. It features 7 columns, including author, publication date, language, and book publisher.

Best Books Ever Dataset

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)