This dataset was created by Jeetendra Dhall
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by pooriaTaj
Released under Apache 2.0
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset was created by Yueming
Released under Database: Open Database, Contents: Database Contents
Large Movie Review Dataset. This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.
To use this dataset:
import tensorflow_datasets as tfds
ds = tfds.load('imdb_reviews', split='train')
for ex in ds.take(4):
print(ex)
See the guide for more informations on tensorflow_datasets.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by love-seeker
Released under Apache 2.0
This dataset was created by Naim Mhedhbi
This dataset was created by Jacob
This dataset was created by NIKHIL PRATAP SINGH
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The 3 datasets used for fine-tuning is available on Kaggle. You can download it using below links:
IMDB dataset (Sentiment analysis) in CSV format link
Sentiment Analysis Dataset link
Stock News Sentiment Analysis(Massive Dataset) link
π Data Preprocessing & Visualizationlink
The dataset is cleaned, preprocessed, and visualized using Pandas, Matplotlib, and Seaborn. Open and run the notebook:
This dataset was created by Ktr123
This dataset was created by adityaghuse
Data Source: https://www.kaggle.com/datasets/gufukuro/movie-scripts-corpus Data Description : Movie Scripts Corpus This corpus was collected to use for screenplay analysis with machine learning methods. Corpus includes movie scripts, crawled from different sources, their annotations by script structural elements and movies metadata. Corpus description Screenplay data consists of: Movie scripts TXT-documents with raw full text (2858 docs) Movie scripts TXT-documents with full text lemmas (2858 docs) Manual annotation TXT-documents for some movie scripts (33 docs, more than 6000 annotated rows) Movie scripts annotations TXT-documents obtained by BERT Movie scripts annotations json-documents obtained by rule-based annotator ScreenPy Movies metadata consists of: Cut versions of movie reviews and scores from metacritic: Number of reviews: 21025 Number of movies with reviews: 2038 Metadata for movies, including: title, akas, launch year, score from metacritic, imdb user rating and number of votes from imdb.com, movie awards, opening weekend, producers, budget, script department, production companies, writers, directors, cast info, countries involved in production, age restrict, plot (with outline), keywords, genres, taglines, critics' synopsis Screenplay awards information: Academy Awards adapted screenplay, Academy Awards original screenplay, BAFTA, Golden Globe Award for Best Screenplay, Writers Guild Awards Winners & Nominees 2020-2013 nominations information for 462 movies in total. Movie characters data consists of: Script text fragments with dialogs and scene descriptions for characters, gathered with annotators: 2153 movies and text fragments for 32114 characters in total Gender labels for 4792 characters
This dataset was created by salil dethe
This dataset was created by Engr Wajid Rehman
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was created by anishkd
Released under CC0: Public Domain
This dataset was created by klin059
This dataset was created by Samuel Givand
This dataset was created by Kalina_yan
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Muhammad Hadi13
Released under Apache 2.0
This dataset was created by Jeetendra Dhall