Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The dataset was collected as part of Prac1 of the course Typology and Data Life Cycle in the Master's Degree in Data Science at the Universitat Oberta de Catalunya (UOC).
The dataset contains 25 variables and 52478 records corresponding to books on the GoodReads Best Books Ever list (the largest list on the site).
The original code used to retrieve the dataset can be found in the GitHub repository: github.com/scostap/goodreads_bbe_dataset
The data was retrieved in two sets: the first 30000 books and then the remaining 22478. Dates were not parsed and reformatted in the second chunk, so publishDate and firstPublishDate are represented in mm/dd/yyyy format for the first 30000 records and in Month Day Year format for the rest.
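Because the two chunks use different date layouts, anyone loading the file needs to normalize publishDate and firstPublishDate before comparing them. The following is a minimal sketch of one way to do that, not the authors' code. The function name `parse_publish_date` is hypothetical, and the exact spelling of the "Month Day Year" values (full or abbreviated month names, ordinal suffixes such as "14th", optional commas) is an assumption that should be checked against the actual file.

```python
import re
from datetime import date, datetime
from typing import Optional

# Assumption: day numbers in the second chunk may carry ordinal suffixes ("14th").
_ORDINAL = re.compile(r"(?<=\d)(st|nd|rd|th)\b", re.IGNORECASE)

# Candidate layouts: mm/dd/yyyy (first chunk) and Month Day Year (second chunk).
_FORMATS = ("%m/%d/%Y", "%B %d %Y", "%b %d %Y")


def parse_publish_date(raw: Optional[str]) -> Optional[date]:
    """Return a date for either layout, or None when the value is missing or unparseable."""
    if raw is None or not raw.strip():
        return None
    # Drop ordinal suffixes and commas, then collapse repeated whitespace.
    text = _ORDINAL.sub("", raw.strip()).replace(",", " ")
    text = " ".join(text.split())
    for fmt in _FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None
```

Values that match neither layout come back as None rather than raising, so they can be counted and inspected separately instead of silently distorting the results.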
Book cover images can optionally be downloaded from the URL in the 'coverImg' field. Python code for doing so, along with an example, can be found in the GitHub repository.
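For readers who only want the general idea, here is a minimal sketch of such a download loop. It is not the code from the repository. The function names, the `.jpg` extension, and the use of bookId as the file name are all assumptions made for illustration; only the 'coverImg' and 'bookId' field names come from the dataset description.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable
from urllib.request import urlopen


def fetch_bytes(url: str) -> bytes:
    """Download one URL and return its body (hypothetical helper)."""
    with urlopen(url, timeout=30) as response:
        return response.read()


def download_covers(
    rows: Iterable[Dict[str, str]],
    out_dir: str,
    fetch: Callable[[str], bytes] = fetch_bytes,
) -> int:
    """Save the cover of every row that has both a bookId and a coverImg URL.

    Returns the number of files written. Rows with missing values or failed
    downloads are skipped.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = 0
    for row in rows:
        url = (row.get("coverImg") or "").strip()
        book_id = (row.get("bookId") or "").strip()
        if not url or not book_id:
            continue
        # Replace characters that are unsafe in file names.
        safe = "".join(c if c.isalnum() or c in "-_." else "_" for c in book_id)
        try:
            data = fetch(url)
        except OSError:  # urllib's URLError and HTTPError are OSError subclasses
            continue
        # Assumption: covers are stored as JPEG; adjust the extension if not.
        (out / f"{safe}.jpg").write_bytes(data)
        saved += 1
    return saved
```

The rows can come from `csv.DictReader` over the dataset file. Passing a different `fetch` callable makes it possible to add retries or rate limiting without changing the loop.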
The 25 fields of the dataset are:
| Attribute | Definition | Completeness (%) |
| ------------- | ------------- | ------------- |
| bookId | Book Identifier as in goodreads.com | 100 |
| title | Book title | 100 |
| series | Series Name | 45 |
| author | Book's Author | 100 |
| rating | Global goodreads rating | 100 |
| description | Book's description | 97 |
| language | Book's language | 93 |
| isbn | Book's ISBN | 92 |
| genres | Book's genres | 91 |
| characters | Main characters | 26 |
| bookFormat | Type of binding | 97 |
| edition | Type of edition (e.g. Anniversary Edition) | 9 |
| pages | Number of pages | 96 |
| publisher | Publisher | 93 |
| publishDate | Publication date | 98 |
| firstPublishDate | Publication date of first edition | 59 |
| awards | List of awards | 20 |
| numRatings | Number of total ratings | 100 |
| ratingsByStars | Number of ratings by stars | 97 |
| likedPercent | Derived field, percent of ratings over 2 stars (as in GoodReads) | 99 |
| setting | Story setting | 22 |
| coverImg | URL to cover image | 99 |
| bbeScore | Score in Best Books Ever list | 100 |
| bbeVotes | Number of votes in Best Books Ever list | 100 |
| price | Book's price (extracted from Iberlibro) | 73 |
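Two of the figures in the table are simple ratios and can be checked against the raw data. The sketch below assumes that "Completeness" means the percentage of records with a non-empty value and that likedPercent is the share of ratings with 3, 4 or 5 stars among all ratings. Neither definition is spelled out in the description beyond the table, and both function names are hypothetical. The storage format of ratingsByStars is not described, so the example takes an explicit mapping from star value to count.

```python
from typing import Dict, Mapping, Optional, Sequence


def completeness(rows: Sequence[Mapping[str, Optional[str]]], field: str) -> float:
    """Percentage of rows whose `field` is present and non-empty (assumed definition)."""
    if not rows:
        return 0.0
    filled = sum(1 for row in rows if (row.get(field) or "").strip())
    return 100.0 * filled / len(rows)


def liked_percent(counts_by_star: Dict[int, int]) -> Optional[float]:
    """Percentage of ratings above 2 stars; None when there are no ratings."""
    total = sum(counts_by_star.values())
    if total == 0:
        return None
    liked = sum(n for stars, n in counts_by_star.items() if stars > 2)
    return 100.0 * liked / total
```

For example, a book with 10, 10, 20, 30 and 30 ratings for 1 to 5 stars has 80 of 100 ratings above 2 stars, so `liked_percent` returns 80.0.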
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book subjects. It has 1 row and is filtered where the book is The one percent edge : small changes that guarantee relevance and build sustainable success. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book subjects. It has 3 rows and is filtered where the book is All done with mirrors : an exploration of measure, proportion, ratio and number : opus 2. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book series. It has 1 row and is filtered where the book is Geometry of design : studies in proportion and composition. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
Summary statistics by North American Industry Classification System (NAICS) for book publishers (NAICS 511130), annual, for five years of data. The statistics include operating revenue (dollars x 1,000,000), operating expenses (dollars x 1,000,000), salaries, wages and benefits (dollars x 1,000,000), and operating profit margin (percent).
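The operating profit margin is derived from two of the other measures. The sketch below assumes the usual definition, operating revenue minus operating expenses, expressed as a percentage of operating revenue. The function name and the sample figures are hypothetical and are not taken from the table.

```python
def operating_profit_margin(revenue: float, expenses: float) -> float:
    """Operating profit as a percentage of operating revenue (assumed definition).

    Both arguments must be in the same unit, e.g. dollars x 1,000,000.
    """
    if revenue == 0:
        raise ValueError("operating revenue must be non-zero")
    return 100.0 * (revenue - expenses) / revenue
```

With a hypothetical operating revenue of 200.0 and operating expenses of 180.0 (both in dollars x 1,000,000), the margin is 10.0 percent.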
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In the Excel file, the “all models” tab lists the properties of all analyzed gene models: the book, the page, the epistemological features (named as in the article), the context in which the model was situated, the model, and the hybridization percentage. The “Proportion of historical models” tab counts the total number and proportion of historical models. The “Structure of the books” tab shows the sequence of contents in all analyzed books. The “Definitions of key concepts” tab shows the original passages that include definitions of genes, alleles, dominance, recessiveness, or environmental effects on phenotype, together with English translations, book details, and page numbers. The “Examples” tab shows all the examples used in the books, and the “Named scientist” tab lists all the scientists named in the textbooks. The numbers refer to page numbers in the books. The article is currently in press in NorDiNa - Nordic Studies in Science Education.
My ArcGIS StoryMap is centered around The Green Book, an annual travel guide that allowed African Americans to travel safely during the height of the Jim Crow Era in the United States. More specifically, The Green Book listed establishments, such as hotels and restaurants, that would openly accept and welcome black customers into their businesses. As someone who is interested in the intersection between STEM and the humanities, I wanted to utilize The Science of Where to formulate a project that would reveal important historical implications to the public. Therefore, my overarching goal was to map each location in The Green Book in order to draw significant conclusions regarding racial segregation in one of the largest cities in the entire world.

Although a more detailed methodology of my work can be found in the project itself, the following is a step by step walkthrough of my overall scientific process:

1. Develop a question in relation to The Green Book to be solved through the completion of the project.
2. Perform background research on The Green Book to gain a more comprehensive understanding of the subject matter.
3. Formulate a hypothesis that answers the proposed question based on the background research.
4. Transcribe names and addresses for each of the hotel listings in The Green Book into a comma separated values file.
5. Transcribe names and addresses for each of the restaurant listings in The Green Book into a comma separated values file.
6. Repeat Steps 4 and 5 for the 1940, 1950, 1960, and 1966 publications of The Green Book. In total, there should be eight unique database files (1940 New York City Hotels, 1940 New York City Restaurants, 1950 New York City Hotels, 1950 New York City Restaurants, 1960 New York City Hotels, 1960 New York City Restaurants, 1966 New York City Hotels, and 1966 New York City Restaurants).
7. Construct an address locator that references a New York City street base map to plot the information from the databases in Step 6 as points on a map.
8. Manually plot locations that the address locator did not automatically match on the map.
9. Repeat Steps 7 and 8 for all eight database files.
10. Find and match the point locations for each listing in The Green Book with historical photographs.
11. Generate a map tour using the geotagged images for each point from Step 10.
12. Create a point density heat map for the locations in all eight database files.
13. Research and obtain professional and historically accurate racial demographic data for New York City during the same time period as when The Green Book was published.
14. Generate a hot spot map of the black population percentage using the demographic data.
15. Analyze any geospatial trends between the point density heat maps for The Green Book and the black population percentage hot spot maps from the demographic data.
16. Research and obtain professional and historically accurate redlining data for New York City during the same time period as when The Green Book was published.
17. Overlay the points from The Green Book listings from Step 9 on top of the redlining shapefile.
18. Count the number of point features completely located within each redlining zone ranking utilizing the spatial join tool.
19. Plot the data recorded from Step 18 in the form of graphs.
20. Analyze any geospatial trends between the listings for The Green Book and their location relative to the redlining ranking zones.
21. Draw conclusions from the analyses in Steps 15 and 20 to present a justifiable rationale for the results.

Student Generated Maps:

- New York City Pin Location Map: https://arcg.is/15i4nj
- 1940 New York City Hotels Map: https://arcg.is/WuXeq
- 1940 New York City Restaurants Map: https://arcg.is/L4aqq
- 1950 New York City Hotels Map: https://arcg.is/1CvTGj
- 1950 New York City Restaurants Map: https://arcg.is/0iSG4r
- 1960 New York City Hotels Map: https://arcg.is/1DOzeT
- 1960 New York City Restaurants Map: https://arcg.is/1rWKTj
- 1966 New York City Hotels Map: https://arcg.is/4PjOK
- 1966 New York City Restaurants Map: https://arcg.is/1zyDTv1
- 1930s Manhattan Black Population Percentage Enumeration District Map: https://arcg.is/1rKSzz
- 1930s Manhattan Black Population Percentage Hot Spot Map (Same as Previous): https://arcg.is/1rKSzz
- 1940 Hotels Point Density Heat Map: https://arcg.is/jD1Ki
- 1940 Restaurants Point Density Heat Map: https://arcg.is/1aKbTS
- 1940 Hotels Redlining Map: https://arcg.is/8b10y
- 1940 Restaurants Redlining Map: https://arcg.is/9WrXv
- 1950 Hotels Redlining Map: https://arcg.is/ruGiP
- 1950 Restaurants Redlining Map: https://arcg.is/0qzfvC0
- 1960 Hotels Redlining Map: https://arcg.is/1KTHLK0
- 1960 Restaurants Redlining Map: https://arcg.is/0jiu9q
- 1966 Hotels Redlining Map: https://arcg.is/PXKn4
- 1966 Restaurants Redlining Map: https://arcg.is/uCD05

Bibliography:

Image Credits (In Order of Appearance)

- Header/Thumbnail Image: Student Generated Collage (Created Using Pictures from the Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library, https://digitalcollections.nypl.org/collections/the-green-book#/?tab=about.)
- Mob Violence Image: Kelley, Robert W. “A Mob Rocks an out of State Car Passing.” Life Magazine, www.life.com/history/school-integration-clinton-history.
- The Green Book Example Image: Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library Digital Collections, https://images.nypl.org/index.php?id=5207583&t=w.
- 1940s Borough of Manhattan Hotels and Restaurants Photographs: “Manhattan 1940s Tax Photos.” NYC Municipal Archives Collections, The New York City Department of Records & Information Services, https://nycma.lunaimaging.com/luna/servlet/NYCMA~5~5?cic=NYCMA~5~5.
- Figure 1: Student Generated Graph
- Figure 2: Student Generated Graph
- Figure 3: Student Generated Graph

GIS Data

- The Green Book Database: Student Generated (See Above)
- The Green Book Listings Maps: Student Generated (See Above)
- The Green Book Point Density Heat Maps: Student Generated (See Above)
- The Green Book Road Trip Map: Student Generated
- LION New York City Single Line Street Base Map: https://www1.nyc.gov/site/planning/data-maps/open-data/dwn-lion.page
- 1930s Manhattan Census Data: https://s4.ad.brown.edu/Projects/UTP2/ncities.htm
- Mapping Inequality Redlining Data: https://dsl.richmond.edu/panorama/redlining/#loc=12/40.794/-74.072&city=manhattan-ny&text=downloads
- 1940 The Green Book Document: Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library. "The Negro Motorist Green-Book: 1940" The New York Public Library Digital Collections, 1940, https://digitalcollections.nypl.org/items/dc858e50-83d3-0132-2266-58d385a7b928.
- 1950 The Green Book Document: Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library. "The Negro Motorist Green-Book: 1950" The New York Public Library Digital Collections, 1950, https://digitalcollections.nypl.org/items/283a7180-87c6-0132-13e6-58d385a7b928.
- 1960 The Green Book Document: Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library. "The Travelers' Green Book: 1960" The New York Public Library Digital Collections, 1960, https://digitalcollections.nypl.org/items/a7bf74e0-9427-0132-17bf-58d385a7b928.
- 1966 The Green Book Document: Schomburg Center for Research in Black Culture, Manuscripts, Archives and Rare Books Division, The New York Public Library. "Travelers' Green Book: 1966-67 International Edition" The New York Public Library Digital Collections, 1966, https://digitalcollections.nypl.org/items/27516920-8308-0132-5063-58d385a7bbd0.

Hyperlink Credits (In Order of Appearance)

- Referenced Hyperlink #1: Coen, Ross. “Sundown Towns.” Black Past, 23 Aug. 2020, blackpast.org/african-american-history/sundown-towns.
- Referenced Hyperlink #2: Foster, Mark S. “In the Face of ‘Jim Crow’: Prosperous Blacks and Vacations, Travel and Outdoor Leisure, 1890-1945.” The Journal of Negro History, vol. 84, no. 2, 1999, pp. 130–149., doi:10.2307/2649043.
- Referenced Hyperlink #3: Driskell, Jay. “An Atlas of Self-Reliance: The Negro Motorist's Green Book (1937-1964).” National Museum of American History, Smithsonian Institution, 30 July 2015, americanhistory.si.edu/blog/negro-motorists-green-book.
- Referenced Hyperlink #4: Kahn, Eve M. “The 'Green Book' Legacy, a Beacon for Black Travelers.” The New York Times, The New York Times, 6 Aug. 2015, www.nytimes.com/2015/08/07/arts/design/the-green-book-legacy-a-beacon-for-black-travelers.html.
- Referenced Hyperlink #5: Giorgis, Hannah. “The Documentary Highlighting the Real 'Green Book'.” The Atlantic, Atlantic Media Company, 25 Feb. 2019, www.theatlantic.com/entertainment/archive/2019/02/real-green-book-preserving-stories-of-jim-crow-era-travel/583294/.
- Referenced Hyperlink #6: Staples, Brent. “Traveling While Black: The Green Book's Black History.” The New York Times, The New York Times, 25 Jan. 2019, www.nytimes.com/2019/01/25/opinion/green-book-black-travel.html.
- Referenced Hyperlink #7: Pollak, Michael. “How Official Is Official?” The New York Times, The New York Times, 15 Oct. 2010, www.nytimes.com/2010/10/17/nyregion/17fyi.html.
- Referenced Hyperlink #8: “New Name: Avenue Becomes a Boulevard.” The New York Times, The New York Times, 22 Oct. 1987, www.nytimes.com/1987/10/22/nyregion/new-name-avenue-becomes-a-boulevard.html.
- Referenced Hyperlink #9: Norris, Frank. “Racial Dynamism in Los Angeles, 1900–1964.” Southern California Quarterly, vol. 99, no. 3, 2017, pp. 251–289., doi:10.1525/scq.2017.99.3.251.
- Referenced Hyperlink #10: Shertzer, Allison, et al. Urban Transition Historical GIS Project, 2016, https://s4.ad.brown.edu/Projects/UTP2/ncities.htm.
- Referenced Hyperlink #11: Mitchell, Bruce. “HOLC ‘Redlining’ Maps: The Persistent Structure Of Segregation And Economic Inequality.” National Community Reinvestment Coalition, 20 Mar. 2018,
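The counting in Step 18 of the walkthrough above (how many Green Book points fall inside each redlining zone ranking) was done with the ArcGIS spatial join tool. As a rough illustration of what that operation computes, here is a pure-Python sketch using the ray casting point-in-polygon test. It is not the project's method. It assumes planar coordinates and simple polygons without holes, and the function names, zone labels and coordinates are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Sequence, Tuple

Point = Tuple[float, float]


def point_in_polygon(point: Point, polygon: Sequence[Point]) -> bool:
    """Ray casting test: True when the point lies strictly inside the polygon."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point matter.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def count_points_by_zone(
    points: Iterable[Point],
    zones: List[Tuple[str, Sequence[Point]]],
) -> Counter:
    """Count points per zone label; points outside every zone are not counted."""
    counts: Counter = Counter()
    for p in points:
        for label, polygon in zones:
            if point_in_polygon(p, polygon):
                counts[label] += 1
                break  # assign each point to at most one zone
    return counts
```

The resulting counts per zone label are the kind of figures that Step 19 plots as graphs. Real redlining shapefiles use geographic coordinates and multipart polygons, so a GIS library or the spatial join tool itself remains the appropriate choice for actual analysis.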
The data is cleaned: all inconsistencies and erroneous records have been removed. These two datasets are used to examine how the composition of emergent users' contact-books differs from that of traditional users in aspects such as size, the prevalence of special symbols, the proportion of contacts dialed through the phone-book, and the percentage of unintelligible contact names. Aggregated data for 30 emergent users and 30 traditional users is provided as CSV files so that the data analysis results can be replicated. To reproduce the graphs for the usability analysis, R scripts are also provided in the same repository. These scripts contain the required data vectors. The graphs show the efficiency, effectiveness, and satisfaction of emergent users on conventional contact-book interfaces.