Facebook
TwitterBy Josh Murrey [source]
The Books Dataset: Sales, Ratings, and Publication provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The Publishing Year column indicates the year in which each book was published. This information helps in understanding the chronological distribution of books in the dataset.
The Book Name column contains the titles of the books. Each book has a unique name that distinguishes it from others in the dataset.
The Author column specifies the name(s) of the author(s) responsible for creating each book. This information is crucial for understanding different authors' contributions and analyzing their impact on sales and ratings.
The language_code column represents a specific code assigned to indicate the language in which each book is written. This code serves as a reference point for language-based analysis within the dataset.
Each author's rating is captured in the Author_Rating column. This rating is based on their previous works and serves as an indicator of their reputation or acclaim among readers.
The average rating given by readers for each book is recorded in the Book_average_rating column. This value reflects how well-received a particular book is by its audience.
The number of ratings given to each book by readers can be found in the Book_ratings_count column. This metric helps gauge reader engagement and provides insights into popular or widely-discussed books within this dataset.
Books are classified into different genres or categories which are mentioned under the genre column. Genre classification allows for analyzing trends across specific literary genres or identifying patterns related to certain types of books.
Sales-related data includes both gross sales revenue (gross sales) generated by each book and publisher revenue (publisher revenue) earned from these sales transactions. These numeric values provide insights into financial performance aspects associated with the book market.
The sale price column denotes the specific price at which each book is sold. This information helps evaluate pricing strategies and their potential impact on sales figures.
Sales performance is further quantified through the sales rank column, which assigns a numerical rank to each book based on its sales performance. This ranking system aids in identifying high-performing books within the dataset.
Lastly, the units sold column captures the number of units of each book that have been sold. This data highlights popular books based on reader demand and serves as a crucial measure of commercial success within the dataset.
Overall, this expansive and comprehensive Books Dataset
Introduction:
Getting Familiar with the Columns: The dataset contains multiple columns that provide different kinds of information:
Book Name: The title of each book.
Author: The name of the author who wrote the book.
language_code: The code representing the language in which the book is written.
Author_Rating: The rating assigned to the author based on their previous works.
Book_average_rating: The average rating given to the book by readers.
Book_ratings_count: The number of ratings given to the book by readers.
genre: The genre or category to which the book belongs.
gross sales: The total sales revenue generated by each book.
publisher revenue: The revenue earned by publishers from selling each book.
sale price: The price at which each copy of a book is sold.
sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre).
units sold : Total number of copies sold for each specific title.
Understanding Numeric and Textual Data: Numeric columns in this dataset include Publishing Year, Author_Rating, Book_average_rating, Book_ratings_count,gross sales,publisher revenue,sale price,sales rank and units sold; these provide quantitative insights that can be used for statistical analysis and comparisons.
Additionally,the columns 'Author','Book Name',and 'genre' contain textual data that provides descriptive elements such as authors' names and categorization genres.
- Exploring Relationships Between Data Points: By combining different co...
Facebook
TwitterData showing how many books were sold in 2024 revealed that the printed book market remains healthy: a total of ***** million units were sold that year among outlets which reported to the source. Whilst this marked a small jump from the previous year, the figure peaked in 2021 and has not surpassed *** million since. Trade paperbacks remained the dominant format. Book sales statistics Looking at book sales by year, 2005 to 2010 were the most lucrative for the printed book market, with well over *** million units sold annually during that five-year period. After dropping below *** million in 2012, gradual and consistent increases can be seen each year, with the exception of between the years 2018 and 2019. For bookstores though, how many books are sold each year depends on the success of key months across a twelve-month period. Bookstore sales in the United States are at their highest in December, January, and August, but figures for December are consistently higher than other months. Books are popular holiday gifts, with around ** to ** percent of consumers responding to annual surveys in each year from 2012 to 2020 saying that they planned to purchase books as presents during the festive season.
Facebook
TwitterBook sales revenue in the United States in 2024 amounted to *** billion U.S. dollars, of which *** billion was derived from hardback books. Meanwhile, paperbacks reached *** billion U.S. dollars, in line with trends showing consistent growth in this segment since 2017. The source reported e-book revenue as *** billion U.S. dollars, a drop from the previous two years, whereas downloaded audiobook revenue continued to grow, and in 2024 surpassed *****illion U.S. dollars.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
About Discription:Usefulness of Dataset:
Facebook
TwitterNet value of book sales by customer category, includes all members under Net value of book sales by customer category, for Book publishers, for Canada and regions, for one year of data.
Facebook
TwitterAdult fiction book sales increased by *** percent between H1 2024 and H1 2025, with 2025 unit sales hitting ***** million between January and June that year. Adult nonfiction sales saw the biggest decline year over year with a drop of *** percent.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Books Dataset contains 1070 rows of information relating to Sales, Ratings, and Publication of various types of books. It provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The dataset contains multiple columns that provide different kinds of information: - Index โ a sequential row index for the data - Publishing Year - the year in which each book was published - Book Name: The title of each book. - Author: The name of the author who wrote the book. - language_code: The code representing the language in which the book is written. - Author_Rating: The rating assigned to the author based on their previous works. - Book_average_rating: The average rating given to the book by readers. - Book_ratings_count: The number of ratings given to the book by readers. - genre: The genre or category to which the book belongs. - gross sales: The total sales revenue generated by each book. - publisher revenue: The revenue earned by publishers from selling each book. - sale price: The price at which each copy of a book is sold. - sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre). - units sold : Total number of copies sold for each specific title.
Dataset created by Josh Murray: https://data.world/josh-nbu
Type: Linear Target variable: units sold, sales rank or publisher revenue
Facebook
Twitterhttps://www.ibisworld.com/about/termsofuse/https://www.ibisworld.com/about/termsofuse/
Online Book Sales are growing modestly in 2025, with revenue up 2.9% to reach $10.2 billion. Faster fulfillment, polished mobile checkout and app upgrades make buying books online feel immediate and effortless โ especially for younger, mobile-first readers. Convenience continues to win the cart, with streamlined delivery windows, transparent returns and personalized recommendations lifting conversion and repeat purchases. Meanwhile, direct-to-consumer and subscription models support profit through recurring engagement and lower acquisition costs. The industry has expanded at a CAGR of 1.2% over the past five years, as on-the-go browsing and one-tap payments shifted discovery and purchase to phones. AI-driven recommendations and dynamic promotions nudged shoppers from browsing to buying in a single session. Independent book stores narrowed the tech gap via IndieCommerce 2.0, mapping enterprise-grade personalization and modern storefront tools to local businesses, with reported average sales per store up 15.0% in 2024. Amazon's scale and algorithmic merchandising amplified review and price signals into front-page placement, compressing rivals' pricing power. However, e-book unit economics lifted profit by sidestepping print, freight and returns. Industry revenue is projected to climb at a 2.7% CAGR to $11.6 billion by 2030, as convenience, lower prices and rapid delivery keep nudging purchases to digital carts. Subscriptions, family plans and loyalty bundles will help stabilize spending across print, e-book and audiobook formats. AI-sharpened discovery, voice assistants, read-and-listen sync and wallet-native checkout will raise mobile conversion and basket size. Social media platforms and creator feeds funnel low-cost customer acquisition traffic, boosting preorders and backlist sales. At the same time, they also heighten week-to-week volatility around viral titles, necessitating dynamic pricing and nimble inventory management. Amazon will likely retain its dominant position, yet IndieCommerce-enabled independents can defend their share through curated editions, signed runs and coordinated two- to three-day fulfillment. They can balance price transparency with community and data-led personalization to sustain earnings as digital formats deepen engagement.
Facebook
TwitterTrade e-book revenue amounted to ****billion U.S. dollars in 2023. The source noted that e-book sales revenue grew only marginally by *** percent between 2022 and 2023, whereas digital audio revenue was up by almost ** percent.
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Retail Sales: Book Stores (MRTSSM451211USN) from Jan 1992 to Jul 2025 about book, retail trade, sales, retail, and USA.
Facebook
Twitterhttps://www.ibisworld.com/about/termsofuse/https://www.ibisworld.com/about/termsofuse/
Market Size statistics on the Online Book Sales industry in the US
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States - Retail Sales: Book Stores was 610.00000 Mil. of $ in July of 2025, according to the United States Federal Reserve. Historically, United States - Retail Sales: Book Stores reached a record high of 2370.00000 in August of 2008 and a record low of 164.00000 in April of 2020. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Retail Sales: Book Stores - last updated from the United States Federal Reserve on December of 2025.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains detailed information about a wide range of books available for purchase on an online retailer's website. It includes data such as book titles, authors, categories, prices, stock status, number of copies left, book length in pages, edition details, publication information, and customer engagement metrics like wished users counts and discount offers. This dataset is ideal for data analysis projects focusing on book sales trends, customer preferences, and market insights within the online retail book industry. Whether you're exploring pricing strategies, customer behavior, or genre popularity, this dataset provides a rich resource for data-driven exploration and analysis in the domain of online book retailing. Content:
Book Title: Title of the book.
Author: Author(s) of the book.
Category: Category or genre of the book.
Price (TK): Price of the book in TK (local currency).
Stock Status: Availability status of the book (In Stock/Out of Stock).
Copies Left: Number of copies currently available.
Book Length (Pages): Number of pages in the book.
Edition: Edition details of the book.
Publication: Publisher or publication details.
Wished Users: Number of users who have added this book to their wish list.
Discount Offer: Any available discount or promotional offer on the book.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F17551034%2F8d16bae9e2eb4046322d2423daaaad97%2F949302f2-fb09-4fbf-95d6-338388c4a753.png?generation=1719996201879469&alt=media" alt="">
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Breakdown of Revenue by Media Type: Books - Print Books for Book Publishers, All Establishments, Employer Firms (RPCMPBEF51113ALLEST) from 2013 to 2022 about book, printing, employer firms, accounting, revenue, establishments, services, and USA.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Sales for comic books and graphic novels through the direct market and bookstore channels in 2016. Direct market sales derived from John Jackson Miller's Diamond sales estimates. Bookstore sales derived from Brian Hibbs's analysis of Nielsen BookScan data.
Facebook
TwitterRetail sales data for bookstores in the United States showed that the sales of U.S. book retailers amounted to *** million U.S. dollars in April 2025, marking an increase from the *** million recorded in the same month of the previous year. Book formats in the United States Print books still generate the majority of book sales revenue in the United States, with hardback books alone pulling in *** billion U.S. dollars of the *** billion total in 2021. Meanwhile, e-book sales growth is sluggish. The number of e-books sold each year is notoriously difficult to track and results depend on the data provided by publishers, and official figures show that annual e-book revenue has yet to reach *** billion U.S. dollars. When it comes to audiobooks, physical audiobook revenue has fallen consistently since 2017, whereas downloaded audio revenue is increasing. Audiobook consumption Audiobook sales revenue is climbing in tandem with the share of U.S. adults listening to audiobooks in general. One of the advantages of audiobooks is the ability to enjoy them whilst on the move, but data shows that more and more consumers have begun listening to them at home. Whilst the COVID-19 pandemic led to increased in-home media consumption, the home was already a popular location for audiobook listening prior to the outbreak, with a survey from 2019 revealing that the same share of respondents listened to audiobooks at home as those who did so in the car. Given their unique appeal, audiobooks will continue to gather ground. Print, however, is still the preferred book format, and new print books outperform used books in terms of popularity which is good news for the book market.
Facebook
TwitterIn 2022, data representing ** national associations of publishers in Europe found that **** percent of books were sold in bookstores and specialized stores. Online sales accounted for **** percent of the total, whilst **** percent of books were sold through supermarkets or other stores.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Retail Sales: Book Stores data was reported at 1.320 USD bn in Aug 2018. This records an increase from the previous number of 661.000 USD mn for Jul 2018. United States Retail Sales: Book Stores data is updated monthly, averaging 1.022 USD bn from Jan 1992 (Median) to Aug 2018, with 320 observations. The data reached an all-time high of 2.425 USD bn in Aug 2008 and a record low of 523.000 USD mn in Apr 1992. United States Retail Sales: Book Stores data remains active status in CEIC and is reported by US Census Bureau. The data is categorized under Global Databaseโs USA โ Table US.H001: Retail Sales: By NAIC System.
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
The net value of book sales for Canadian publishers by customer category by North American Industry Classification System (NAICS) 511130 book publishers, which include all members under net value of book sales by customer category, (dollars X 1,000,000), every 2 years, for five years of data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 4 rows and is filtered where the book publisher is Book Sales Ltd. It features 7 columns including author, publication date, language, and book publisher.
Facebook
TwitterBy Josh Murrey [source]
The Books Dataset: Sales, Ratings, and Publication provides comprehensive information on various aspects of books, including their publishing year, author details, ratings given by readers, sales performance data, and genre classification. The dataset consists of several key columns that capture important attributes related to each book.
The Publishing Year column indicates the year in which each book was published. This information helps in understanding the chronological distribution of books in the dataset.
The Book Name column contains the titles of the books. Each book has a unique name that distinguishes it from others in the dataset.
The Author column specifies the name(s) of the author(s) responsible for creating each book. This information is crucial for understanding different authors' contributions and analyzing their impact on sales and ratings.
The language_code column represents a specific code assigned to indicate the language in which each book is written. This code serves as a reference point for language-based analysis within the dataset.
Each author's rating is captured in the Author_Rating column. This rating is based on their previous works and serves as an indicator of their reputation or acclaim among readers.
The average rating given by readers for each book is recorded in the Book_average_rating column. This value reflects how well-received a particular book is by its audience.
The number of ratings given to each book by readers can be found in the Book_ratings_count column. This metric helps gauge reader engagement and provides insights into popular or widely-discussed books within this dataset.
Books are classified into different genres or categories which are mentioned under the genre column. Genre classification allows for analyzing trends across specific literary genres or identifying patterns related to certain types of books.
Sales-related data includes both gross sales revenue (gross sales) generated by each book and publisher revenue (publisher revenue) earned from these sales transactions. These numeric values provide insights into financial performance aspects associated with the book market.
The sale price column denotes the specific price at which each book is sold. This information helps evaluate pricing strategies and their potential impact on sales figures.
Sales performance is further quantified through the sales rank column, which assigns a numerical rank to each book based on its sales performance. This ranking system aids in identifying high-performing books within the dataset.
Lastly, the units sold column captures the number of units of each book that have been sold. This data highlights popular books based on reader demand and serves as a crucial measure of commercial success within the dataset.
Overall, this expansive and comprehensive Books Dataset
Introduction:
Getting Familiar with the Columns: The dataset contains multiple columns that provide different kinds of information:
Book Name: The title of each book.
Author: The name of the author who wrote the book.
language_code: The code representing the language in which the book is written.
Author_Rating: The rating assigned to the author based on their previous works.
Book_average_rating: The average rating given to the book by readers.
Book_ratings_count: The number of ratings given to the book by readers.
genre: The genre or category to which the book belongs.
gross sales: The total sales revenue generated by each book.
publisher revenue: The revenue earned by publishers from selling each book.
sale price: The price at which each copy of a book is sold.
sales rank: A numeric value indicating a book's rank based on its sales performance in comparison to other books within its category (genre).
units sold : Total number of copies sold for each specific title.
Understanding Numeric and Textual Data: Numeric columns in this dataset include Publishing Year, Author_Rating, Book_average_rating, Book_ratings_count,gross sales,publisher revenue,sale price,sales rank and units sold; these provide quantitative insights that can be used for statistical analysis and comparisons.
Additionally,the columns 'Author','Book Name',and 'genre' contain textual data that provides descriptive elements such as authors' names and categorization genres.
- Exploring Relationships Between Data Points: By combining different co...