Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains the San Francisco Police Department’s (SFPD) incident reports from 2018 to present. The dataset will be updated daily.
More than 500,000 rows and 34 columns. Column descriptions are listed below.
Data from DataSF. Image from Thales.
https://creativecommons.org/publicdomain/zero/1.0/
From: http://kavita-ganesan.com/entity-ranking-data
Downloads: - Dataset - Only reviews (~98MB) [ readme ]
OpinRank Dataset - Reviews from TripAdvisor and Edmunds
Dataset Type: Text
Format: Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)
Domain: hotels, cars
How to cite dataset: [ bib ]
Citing Dataset
If you use this dataset in your own research, please cite the following:
Ganesan, K. A., and C. X. Zhai, "Opinion-Based Entity Ranking", Information Retrieval.
@article{ganesan2012opinion,
  title={Opinion-based entity ranking},
  author={Ganesan, Kavita and Zhai, ChengXiang},
  journal={Information retrieval},
  volume={15},
  number={2},
  pages={116--150},
  year={2012},
  publisher={Springer}
}
Dataset Overview
This dataset contains full reviews of cars and hotels, collected from Edmunds (~42,230 reviews) and Tripadvisor (~259,000 reviews) respectively.
Car Reviews Dataset Description
Full reviews of cars for model years 2007, 2008, and 2009
There are about 140-250 cars for each model year
Extracted fields include dates, author names, favorites and the full textual review
Total number of reviews: ~42,230
Year 2007 - 18,903 reviews
Year 2008 - 15,438 reviews
Year 2009 - 7,947 reviews
Format
There are three folders (2007, 2008, 2009), one per model year. Each file within these folders contains all reviews for a particular car, and the filename is the name of the car. Within each car file, reviews appear in the following format:
Note that each review is enclosed within a element as shown above and all the extracted items are within this element.
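The folder layout described above (one folder per model year, one file per car) can be indexed before any review parsing is attempted. The following is a minimal sketch, not part of the original dataset tooling: the function name `index_review_files` and the root path passed to it are hypothetical, and it only lists files without reading the review markup.

```python
from pathlib import Path


def index_review_files(root):
    """Map each sub-folder name (e.g. a model year) to the sorted file names inside it.

    `root` is assumed to be the directory where the archive was unpacked;
    the actual location depends on how the dataset was downloaded.
    """
    root = Path(root)
    index = {}
    # Each immediate sub-folder is treated as one group (2007, 2008, 2009 for cars).
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        # Each file in the group holds all reviews for one entity (one car).
        index[folder.name] = sorted(f.name for f in folder.iterdir() if f.is_file())
    return index
```

Calling `index_review_files("path/to/cars")` (a placeholder path) would return a dictionary keyed by model year, which makes it easy to check the per-year car counts against the figures quoted above. Because the function only relies on the folder-per-group convention, it should also work for other datasets organized the same way.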
Hotel Reviews Dataset Description
Full reviews of hotels in 10 different cities (Dubai, Beijing, London, New York City, New Delhi, San Francisco, Shanghai, Montreal, Las Vegas, Chicago)
There are about 80-700 hotels in each city
Extracted fields include date, review title and the full review
Total number of reviews: ~259,000
Format
There are 10 folders, one for each of the cities listed above. Each file within these folders contains all reviews for a particular hotel, and the filename is the name of the hotel. Within each file, reviews appear in the following format:
Date1