Facebook
TwitterBy Josh Murrey [source]
The Books Dataset: Sales, Ratings, and Publication provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The Publishing Year column indicates the year in which each book was published. This information helps in understanding the chronological distribution of books in the dataset.
The Book Name column contains the titles of the books. Each book has a unique name that distinguishes it from others in the dataset.
The Author column specifies the name(s) of the author(s) responsible for creating each book. This information is crucial for understanding different authors' contributions and analyzing their impact on sales and ratings.
The language_code column represents a specific code assigned to indicate the language in which each book is written. This code serves as a reference point for language-based analysis within the dataset.
Each author's rating is captured in the Author_Rating column. This rating is based on their previous works and serves as an indicator of their reputation or acclaim among readers.
The average rating given by readers for each book is recorded in the Book_average_rating column. This value reflects how well-received a particular book is by its audience.
The number of ratings given to each book by readers can be found in the Book_ratings_count column. This metric helps gauge reader engagement and provides insights into popular or widely-discussed books within this dataset.
Books are classified into different genres or categories which are mentioned under the genre column. Genre classification allows for analyzing trends across specific literary genres or identifying patterns related to certain types of books.
Sales-related data includes both gross sales revenue (gross sales) generated by each book and publisher revenue (publisher revenue) earned from these sales transactions. These numeric values provide insights into financial performance aspects associated with the book market.
The sale price column denotes the specific price at which each book is sold. This information helps evaluate pricing strategies and their potential impact on sales figures.
Sales performance is further quantified through the sales rank column, which assigns a numerical rank to each book based on its sales performance. This ranking system aids in identifying high-performing books within the dataset.
Lastly, the units sold column captures the number of units of each book that have been sold. This data highlights popular books based on reader demand and serves as a crucial measure of commercial success within the dataset.
Overall, this expansive and comprehensive Books Dataset
Introduction:
Getting Familiar with the Columns: The dataset contains multiple columns that provide different kinds of information:
Book Name: The title of each book.
Author: The name of the author who wrote the book.
language_code: The code representing the language in which the book is written.
Author_Rating: The rating assigned to the author based on their previous works.
Book_average_rating: The average rating given to the book by readers.
Book_ratings_count: The number of ratings given to the book by readers.
genre: The genre or category to which the book belongs.
gross sales: The total sales revenue generated by each book.
publisher revenue: The revenue earned by publishers from selling each book.
sale price: The price at which each copy of a book is sold.
sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre).
units sold : Total number of copies sold for each specific title.
Understanding Numeric and Textual Data: Numeric columns in this dataset include Publishing Year, Author_Rating, Book_average_rating, Book_ratings_count,gross sales,publisher revenue,sale price,sales rank and units sold; these provide quantitative insights that can be used for statistical analysis and comparisons.
Additionally,the columns 'Author','Book Name',and 'genre' contain textual data that provides descriptive elements such as authors' names and categorization genres.
- Exploring Relationships Between Data Points: By combining different co...
Facebook
TwitterData showing how many books were sold in 2024 revealed that the printed book market remains healthy: a total of ***** million units were sold that year among outlets which reported to the source. Whilst this marked a small jump from the previous year, the figure peaked in 2021 and has not surpassed *** million since. Trade paperbacks remained the dominant format. Book sales statistics Looking at book sales by year, 2005 to 2010 were the most lucrative for the printed book market, with well over *** million units sold annually during that five-year period. After dropping below *** million in 2012, gradual and consistent increases can be seen each year, with the exception of between the years 2018 and 2019. For bookstores though, how many books are sold each year depends on the success of key months across a twelve-month period. Bookstore sales in the United States are at their highest in December, January, and August, but figures for December are consistently higher than other months. Books are popular holiday gifts, with around ** to ** percent of consumers responding to annual surveys in each year from 2012 to 2020 saying that they planned to purchase books as presents during the festive season.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
About Discription:Usefulness of Dataset:
Facebook
TwitterAdult fiction book sales increased by *** percent between H1 2024 and H1 2025, with 2025 unit sales hitting ***** million between January and June that year. Adult nonfiction sales saw the biggest decline year over year with a drop of *** percent.
Facebook
TwitterBook sales revenue in the United States in 2024 amounted to *** billion U.S. dollars, of which *** billion was derived from hardback books. Meanwhile, paperbacks reached *** billion U.S. dollars, in line with trends showing consistent growth in this segment since 2017. The source reported e-book revenue as *** billion U.S. dollars, a drop from the previous two years, whereas downloaded audiobook revenue continued to grow, and in 2024 surpassed *****illion U.S. dollars.
Facebook
TwitterNet value of book sales by customer category, includes all members under Net value of book sales by customer category, for Book publishers, for Canada and regions, for one year of data.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Books Dataset contains 1070 rows of information relating to Sales, Ratings, and Publication of various types of books. It provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The dataset contains multiple columns that provide different kinds of information: - Index โ a sequential row index for the data - Publishing Year - the year in which each book was published - Book Name: The title of each book. - Author: The name of the author who wrote the book. - language_code: The code representing the language in which the book is written. - Author_Rating: The rating assigned to the author based on their previous works. - Book_average_rating: The average rating given to the book by readers. - Book_ratings_count: The number of ratings given to the book by readers. - genre: The genre or category to which the book belongs. - gross sales: The total sales revenue generated by each book. - publisher revenue: The revenue earned by publishers from selling each book. - sale price: The price at which each copy of a book is sold. - sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre). - units sold : Total number of copies sold for each specific title.
Dataset created by Josh Murray: https://data.world/josh-nbu
Type: Linear Target variable: units sold, sales rank or publisher revenue
Facebook
TwitterIn 2022, data representing ** national associations of publishers in Europe found that **** percent of books were sold in bookstores and specialized stores. Online sales accounted for **** percent of the total, whilst **** percent of books were sold through supermarkets or other stores.
Facebook
TwitterThe sales of books by language of printing , by North American Industry Classification System (NAICS) 511130 book publishers, which include all members under sales, (dollars X 1,000,000), every 2 years, for five years of data.
Facebook
TwitterRetail sales data for bookstores in the United States showed that the sales of U.S. book retailers amounted to *** million U.S. dollars in April 2025, marking an increase from the *** million recorded in the same month of the previous year. Book formats in the United States Print books still generate the majority of book sales revenue in the United States, with hardback books alone pulling in *** billion U.S. dollars of the *** billion total in 2021. Meanwhile, e-book sales growth is sluggish. The number of e-books sold each year is notoriously difficult to track and results depend on the data provided by publishers, and official figures show that annual e-book revenue has yet to reach *** billion U.S. dollars. When it comes to audiobooks, physical audiobook revenue has fallen consistently since 2017, whereas downloaded audio revenue is increasing. Audiobook consumption Audiobook sales revenue is climbing in tandem with the share of U.S. adults listening to audiobooks in general. One of the advantages of audiobooks is the ability to enjoy them whilst on the move, but data shows that more and more consumers have begun listening to them at home. Whilst the COVID-19 pandemic led to increased in-home media consumption, the home was already a popular location for audiobook listening prior to the outbreak, with a survey from 2019 revealing that the same share of respondents listened to audiobooks at home as those who did so in the car. Given their unique appeal, audiobooks will continue to gather ground. Print, however, is still the preferred book format, and new print books outperform used books in terms of popularity which is good news for the book market.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States - Retail Sales: Book Stores was 610.00000 Mil. of $ in July of 2025, according to the United States Federal Reserve. Historically, United States - Retail Sales: Book Stores reached a record high of 2370.00000 in August of 2008 and a record low of 164.00000 in April of 2020. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Retail Sales: Book Stores - last updated from the United States Federal Reserve on December of 2025.
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Breakdown of Revenue by Media Type: Books - Print Books for Book Publishers, All Establishments, Employer Firms (RPCMPBEF51113ALLEST) from 2013 to 2022 about book, printing, employer firms, accounting, revenue, establishments, services, and USA.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains detailed information about a wide range of books available for purchase on an online retailer's website. It includes data such as book titles, authors, categories, prices, stock status, number of copies left, book length in pages, edition details, publication information, and customer engagement metrics like wished users counts and discount offers. This dataset is ideal for data analysis projects focusing on book sales trends, customer preferences, and market insights within the online retail book industry. Whether you're exploring pricing strategies, customer behavior, or genre popularity, this dataset provides a rich resource for data-driven exploration and analysis in the domain of online book retailing. Content:
Book Title: Title of the book.
Author: Author(s) of the book.
Category: Category or genre of the book.
Price (TK): Price of the book in TK (local currency).
Stock Status: Availability status of the book (In Stock/Out of Stock).
Copies Left: Number of copies currently available.
Book Length (Pages): Number of pages in the book.
Edition: Edition details of the book.
Publication: Publisher or publication details.
Wished Users: Number of users who have added this book to their wish list.
Discount Offer: Any available discount or promotional offer on the book.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F17551034%2F8d16bae9e2eb4046322d2423daaaad97%2F949302f2-fb09-4fbf-95d6-338388c4a753.png?generation=1719996201879469&alt=media" alt="">
Facebook
Twitterhttps://www.ibisworld.com/about/termsofuse/https://www.ibisworld.com/about/termsofuse/
Market Size statistics on the Online Book Sales industry in the US
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Retail Sales: Book Stores (MRTSMPCSM451211USN) from Feb 1992 to Aug 2025 about book, retail trade, sales, retail, and USA.
Facebook
TwitterAfter reaching its highest number of the past decade with a total of more than *** million books sold in 2021 in France, that value dropped to reach about *** million euros in 2023. That corresponds to a decrease of about two percent in comparison to the previous year.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Retail Sales: Book Stores data was reported at 1.320 USD bn in Aug 2018. This records an increase from the previous number of 661.000 USD mn for Jul 2018. United States Retail Sales: Book Stores data is updated monthly, averaging 1.022 USD bn from Jan 1992 (Median) to Aug 2018, with 320 observations. The data reached an all-time high of 2.425 USD bn in Aug 2008 and a record low of 523.000 USD mn in Apr 1992. United States Retail Sales: Book Stores data remains active status in CEIC and is reported by US Census Bureau. The data is categorized under Global Databaseโs USA โ Table US.H001: Retail Sales: By NAIC System.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 2 rows and is filtered where the book is Statistics in market research. It features 7 columns including author, publication date, language, and book publisher.
Facebook
TwitterE-commerce sales by North American Industry Classification System (NAICS) include all members under distribution of sales, for book publishers, for Canada, for two years of data.
Facebook
TwitterTrade e-book revenue amounted to ****billion U.S. dollars in 2023. The source noted that e-book sales revenue grew only marginally by *** percent between 2022 and 2023, whereas digital audio revenue was up by almost ** percent.
Facebook
TwitterBy Josh Murrey [source]
The Books Dataset: Sales, Ratings, and Publication provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The Publishing Year column indicates the year in which each book was published. This information helps in understanding the chronological distribution of books in the dataset.
The Book Name column contains the titles of the books. Each book has a unique name that distinguishes it from others in the dataset.
The Author column specifies the name(s) of the author(s) responsible for creating each book. This information is crucial for understanding different authors' contributions and analyzing their impact on sales and ratings.
The language_code column represents a specific code assigned to indicate the language in which each book is written. This code serves as a reference point for language-based analysis within the dataset.
Each author's rating is captured in the Author_Rating column. This rating is based on their previous works and serves as an indicator of their reputation or acclaim among readers.
The average rating given by readers for each book is recorded in the Book_average_rating column. This value reflects how well-received a particular book is by its audience.
The number of ratings given to each book by readers can be found in the Book_ratings_count column. This metric helps gauge reader engagement and provides insights into popular or widely-discussed books within this dataset.
Books are classified into different genres or categories which are mentioned under the genre column. Genre classification allows for analyzing trends across specific literary genres or identifying patterns related to certain types of books.
Sales-related data includes both gross sales revenue (gross sales) generated by each book and publisher revenue (publisher revenue) earned from these sales transactions. These numeric values provide insights into financial performance aspects associated with the book market.
The sale price column denotes the specific price at which each book is sold. This information helps evaluate pricing strategies and their potential impact on sales figures.
Sales performance is further quantified through the sales rank column, which assigns a numerical rank to each book based on its sales performance. This ranking system aids in identifying high-performing books within the dataset.
Lastly, the units sold column captures the number of units of each book that have been sold. This data highlights popular books based on reader demand and serves as a crucial measure of commercial success within the dataset.
Overall, this expansive and comprehensive Books Dataset
Introduction:
Getting Familiar with the Columns: The dataset contains multiple columns that provide different kinds of information:
Book Name: The title of each book.
Author: The name of the author who wrote the book.
language_code: The code representing the language in which the book is written.
Author_Rating: The rating assigned to the author based on their previous works.
Book_average_rating: The average rating given to the book by readers.
Book_ratings_count: The number of ratings given to the book by readers.
genre: The genre or category to which the book belongs.
gross sales: The total sales revenue generated by each book.
publisher revenue: The revenue earned by publishers from selling each book.
sale price: The price at which each copy of a book is sold.
sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre).
units sold : Total number of copies sold for each specific title.
Understanding Numeric and Textual Data: Numeric columns in this dataset include Publishing Year, Author_Rating, Book_average_rating, Book_ratings_count,gross sales,publisher revenue,sale price,sales rank and units sold; these provide quantitative insights that can be used for statistical analysis and comparisons.
Additionally,the columns 'Author','Book Name',and 'genre' contain textual data that provides descriptive elements such as authors' names and categorization genres.
- Exploring Relationships Between Data Points: By combining different co...