Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The dataset has been collected in the frame of the Prac1 of the subject Tipology and Data Life Cycle of the Master's Degree in Data Science of the Universitat Oberta de Catalunya (UOC).
The dataset contains 25 variables and 52478 records corresponding to books on the GoodReads Best Books Ever list (the larges list on the site).
Original code used to retrieve the dataset can be found on github repository: github.com/scostap/goodreads_bbe_dataset
The data was retrieved in two sets, the first 30000 books and then the remainig 22478. Dates were not parsed and reformated on the second chunk so publishDate and firstPublishDate are representet in a mm/dd/yyyy format for the first 30000 records and Month Day Year for the rest.
Book cover images can be optionally downloaded from the url in the 'coverImg' field. Python code for doing so and an example can be found on the github repo.
The 25 fields of the dataset are:
| Attributes | Definition | Completeness |
| ------------- | ------------- | ------------- |
| bookId | Book Identifier as in goodreads.com | 100 |
| title | Book title | 100 |
| series | Series Name | 45 |
| author | Book's Author | 100 |
| rating | Global goodreads rating | 100 |
| description | Book's description | 97 |
| language | Book's language | 93 |
| isbn | Book's ISBN | 92 |
| genres | Book's genres | 91 |
| characters | Main characters | 26 |
| bookFormat | Type of binding | 97 |
| edition | Type of edition (ex. Anniversary Edition) | 9 |
| pages | Number of pages | 96 |
| publisher | Editorial | 93 |
| publishDate | publication date | 98 |
| firstPublishDate | Publication date of first edition | 59 |
| awards | List of awards | 20 |
| numRatings | Number of total ratings | 100 |
| ratingsByStars | Number of ratings by stars | 97 |
| likedPercent | Derived field, percent of ratings over 2 starts (as in GoodReads) | 99 |
| setting | Story setting | 22 |
| coverImg | URL to cover image | 99 |
| bbeScore | Score in Best Books Ever list | 100 |
| bbeVotes | Number of votes in Best Books Ever list | 100 |
| price | Book's price (extracted from Iberlibro) | 73 |
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This database contains information about books gathered with help of Google Books API. The database contains 7 different tables where 3 of them are only to relate the other tables together. Tables: Books contains 1062 records. Authors contains 1595 records. Categories 109 records. Metadata 37 records. MD5 (GBooks_2015-06-09.sql) = bfd09094d0e123e668b2e58332b1a98b
This dataset was created by Carlos Heryhelder
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 1 row and is filtered where the book is Advanced database techniques. It features 7 columns including author, publication date, language, and book publisher.
Comprehensive dataset of 5 Books wholesalers in Louisiana, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Aunt Mavor's Picture Books for Little Readers [Second Series]
Data showing how many books were sold in 2024 revealed that the printed book market remains healthy: a total of ***** million units were sold that year among outlets which reported to the source. Whilst this marked a small jump from the previous year, the figure peaked in 2021 and has not surpassed *** million since. Trade paperbacks remained the dominant format. Book sales statistics Looking at book sales by year, 2005 to 2010 were the most lucrative for the printed book market, with well over *** million units sold annually during that five-year period. After dropping below *** million in 2012, gradual and consistent increases can be seen each year, with the exception of between the years 2018 and 2019. For bookstores though, how many books are sold each year depends on the success of key months across a twelve-month period. Bookstore sales in the United States are at their highest in December, January, and August, but figures for December are consistently higher than other months. Books are popular holiday gifts, with around ** to ** percent of consumers responding to annual surveys in each year from 2012 to 2020 saying that they planned to purchase books as presents during the festive season.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
🧠Psy-Data-Books: Synthetic Medical & Psychology Conversation Dataset
Psy-Data-Books is one of the largest synthetic datasets of psychology and medical conversations, generated from verified medical and psychology literature. It is designed for building and training powerful conversational AI systems for healthcare, therapy, and mental health applications.
📊 Dataset Summary
Domain: Psychology, Psychiatry, Mental Health, General Medicine Data Type: Synthetic… See the full description on the dataset page: https://huggingface.co/datasets/Daemontatox/Psy-Data-books.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book series. It has 2 rows and is filtered where the books is Logical database design principles. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
This dataset provides an in-depth look into Amazon's top 100 bestselling books along with their customer reviews, ratings, and pricing information. It offers a window into the world of popular reading and customer sentiment. The dataset was collected in November 2023, making it suitable for analysing recent literary trends and consumer behaviour.
The dataset includes the following fields: * Book Rank: The ranking of the book among the top 100 bestselling books on Amazon. * Book Title: The title of the book. Examples include "The Ballad of Songbirds and Snakes" and "Iron Flame". * Price: The price of the book in USD. * Rating: The overall rating of the book, on a scale of 1 to 5. * Author: The author of the book. Notable authors include Sarah J. Maas and Adam Wallace. * Year of Publication: The year in which the book was published. * Genre: The category to which the book belongs. Popular genres include Nonfiction and Childrens, literature. * URL: The direct URL link to the book on Amazon's platform. * Review Title: The title of the customer review. * Reviewer: The name of the person who wrote the review. * Reviewer Rating: The rating given by the reviewer for the book, on a scale of 1 to 5. * Review Description: The textual content of the review. * Is_verified: Indicates whether the review is a verified customer purchase. * Date: The date when the review was posted. * Timestamp: The timestamp indicating when the review was posted. * ASIN: Amazon Standard Identification Number assigned to products on Amazon.
The dataset focuses on the top 100 bestselling books. * Price: Book prices range from 1.00 USD to 100.00 USD. There are approximately 10 books within each 9.90 USD price band across this range. * Rating: Overall book ratings are generally high, ranging from 4.10 to 5.00. A notable number of books have ratings between 4.73 and 4.82. * Year of Publication: Books in the dataset were published between 1947 and 2024. A significant portion, 64 books, were published between 2016 and 2024, indicating a strong presence of recent titles. * Genre: While diverse, Nonfiction and Childrens, literature are among the more prominent genres. * Authors/Titles: "The Ballad of Songbirds and Snakes" and "Iron Flame" are among the top-ranked titles. Sarah J. Maas and Adam Wallace are featured authors. The dataset covers review data for each of the top 100 books, though the exact number of reviews per book is not specified.
This dataset is ideal for: * Market analysis: Identifying bestselling trends, pricing strategies, and popular authors. * Sentiment analysis: Analysing customer reviews to understand public perception and extract insights. * Recommender systems: Building or improving book recommendation engines. * Natural Language Processing (NLP): Training models for text classification, entity recognition, or summarisation based on review content. * Data visualisation: Creating visualisations of literary trends, rating distributions, or reviewer behaviour.
CC-BY
Original Data Source: Top 100 Bestselling Book Reviews on Amazon
Comprehensive dataset of 1 Books wholesalers in Nevada, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
This Microsoft Access database was created under the direction of Public Record Office Victoria and the Department of Eduction and Training's Education History Unit to provide access to VPRS 13579, Teacher Record Books. Consult the text for this series for further information on how to use a teacher record number to access a teacher's service history.
The displayed fields are:
Surname (SName) including for example, "nee Smith"
Given name (GName1)
Other given name (GnameOth)
Extra (other text appearing on the card; this may include "PR", indicating a record in the VPRS 14440 Register of Professional Officers)
DEETID (the teacher record number, for use with VPRS 13579 or VPRS 13718)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book subjects. It has 2 rows and is filtered where the books is Database management on the Sinclair QL. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Metadata and data derived from Butte Stope Books. The underground mine openings are generally referred to as stopes; detailed maps of the stopes were recorded from field books to a master Stope Book for each mine.
The Approved Drug Products with Therapeutic Equivalence (Orange Book or OB) is a list of drugs approved under Section 505 of the Federal Food, Drug and Cosmetic Act and provides consumers timely updates on these products. In addition to these products (fo
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Imports from Norway of Printed books, newspapers, pictures was US$1.46 Million during 2024, according to the United Nations COMTRADE database on international trade. United States Imports from Norway of Printed books, newspapers, pictures - data, historical chart and statistics - was last updated on July of 2025.
I wanted to find good data about representation and diversity in literature, which brought me to the following page of the Cooperative Children's Book Center (CCBC): https://ccbc.education.wisc.edu/literature-resources/ccbc-diversity-statistics/. The following is data on books by and about Black, Indigenous and People of Color published for children and teens compiled by the Cooperative Children’s Book Center, School of Education, University of Wisconsin-Madison.
There are two .csv files in the data set. One shows books received by the CCBC from US publishers per year that are authored and/or illustrated by a Black/African/Indigenous/Asian/Pacific Islander/Latinx person, and the other shows books received by the CCBC from US publishers per year that feature a BIPOC character. Further explanation can be found at the CCBC FAQ page.
Please note that for 2018 and 2019, the below .csv represent Asian/Pacific Islander people as one column, which is how the CCBC published the data between 2002-2017. Also note that the attached data are not the entire data collected by the CCBC. The CCBC also collects books from international publishers, and since 2018, the CCBC has been publishing data about books by/about Arabs.
All data was collected by the CCBC. Please see the following page (with the complete data) about how to cite the data in your publications/blogs/notebooks: https://ccbc.education.wisc.edu/literature-resources/ccbc-diversity-statistics/books-by-about-poc-fnn/.
I am curious to see what sorts of visualizations people can make in exploratory analysis of this data! Also, can you predict how many BIPOC books the CCBC will receive in 2020? What happens when you study against US population data?
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Imports from Bermuda of Printed books, newspapers, pictures was US$17.56 Thousand during 2023, according to the United Nations COMTRADE database on international trade. United States Imports from Bermuda of Printed books, newspapers, pictures - data, historical chart and statistics - was last updated on July of 2025.
Comprehensive dataset of 2 Books in Spain as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 1 row and is filtered where the book is UK Marine Pressures-Activities Database "PAD" : methods report. It features 7 columns including author, publication date, language, and book publisher.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The dataset has been collected in the frame of the Prac1 of the subject Tipology and Data Life Cycle of the Master's Degree in Data Science of the Universitat Oberta de Catalunya (UOC).
The dataset contains 25 variables and 52478 records corresponding to books on the GoodReads Best Books Ever list (the larges list on the site).
Original code used to retrieve the dataset can be found on github repository: github.com/scostap/goodreads_bbe_dataset
The data was retrieved in two sets, the first 30000 books and then the remainig 22478. Dates were not parsed and reformated on the second chunk so publishDate and firstPublishDate are representet in a mm/dd/yyyy format for the first 30000 records and Month Day Year for the rest.
Book cover images can be optionally downloaded from the url in the 'coverImg' field. Python code for doing so and an example can be found on the github repo.
The 25 fields of the dataset are:
| Attributes | Definition | Completeness |
| ------------- | ------------- | ------------- |
| bookId | Book Identifier as in goodreads.com | 100 |
| title | Book title | 100 |
| series | Series Name | 45 |
| author | Book's Author | 100 |
| rating | Global goodreads rating | 100 |
| description | Book's description | 97 |
| language | Book's language | 93 |
| isbn | Book's ISBN | 92 |
| genres | Book's genres | 91 |
| characters | Main characters | 26 |
| bookFormat | Type of binding | 97 |
| edition | Type of edition (ex. Anniversary Edition) | 9 |
| pages | Number of pages | 96 |
| publisher | Editorial | 93 |
| publishDate | publication date | 98 |
| firstPublishDate | Publication date of first edition | 59 |
| awards | List of awards | 20 |
| numRatings | Number of total ratings | 100 |
| ratingsByStars | Number of ratings by stars | 97 |
| likedPercent | Derived field, percent of ratings over 2 starts (as in GoodReads) | 99 |
| setting | Story setting | 22 |
| coverImg | URL to cover image | 99 |
| bbeScore | Score in Best Books Ever list | 100 |
| bbeVotes | Number of votes in Best Books Ever list | 100 |
| price | Book's price (extracted from Iberlibro) | 73 |