Advanced models like CNNs , LSTMs, and Transformers are frequently tested on this dataset.
The raw data is hosted by Stanford University and is also available on Kaggle . IMDb Sentiment Analysis Using Naive Bayes - IJFMR
This paper introduced a dataset of , specifically balanced with 25,000 positive and 25,000 negative samples. It has since become the benchmark for testing various machine learning and deep learning models, including: jada-imdb
Often used as a baseline for binary classification performance.
The original paper that established the large-scale IMDb movie review dataset used widely in Natural Language Processing (NLP) is: Learning Word Vectors for Sentiment Analysis Advanced models like CNNs , LSTMs, and Transformers
International Conference on Machine Learning (ICML), 2011. Dataset Details
While there is no single academic paper titled exactly "jada-imdb," the query most likely refers to the foundational paper that introduced the for sentiment analysis, which is the most cited work associated with this data. Foundational IMDb Dataset Paper It has since become the benchmark for testing
Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts.