This project explores sentiment analysis and text generation using Long Short-Term Memory (LSTM) neural networks.
For sentiment analysis, we train a many-to-one LSTM on airline customer reviews: each review is converted into numerical form, and the model predicts a sentiment label (0 or 1).
Text generation is also explored by training an LSTM on "Alice's Adventures in Wonderland." The model learns to predict the next word in a sequence, producing coherent and contextually relevant sentences. Challenges such as language variability and context dependence are addressed with entropy scaling and softmax-temperature sampling.
This project's objectives are twofold:
- Develop a sentiment detection model using many-to-one LSTMs to predict sentiment labels (0 or 1) based on airline text reviews.
- Utilize many-to-one LSTMs for text generation, training on "Alice's Adventures in Wonderland" and predicting the next word in a sequence.
Two datasets are used:
- airline_sentiment.csv: A CSV file with two columns, "airline_sentiment" (the sentiment label, 0 or 1) and "text" (the customer review).
- alice.txt: Project Gutenberg's eBook of "Alice's Adventures in Wonderland" by Lewis Carroll, provided as a plain-text file and used as training data for text generation with many-to-one LSTMs.
- Language: Python
- Libraries: pandas, numpy, keras, tensorflow, collections, nltk
- Obtain the airline sentiment dataset with sentiment labels and text reviews.
- Perform data preprocessing, including text cleaning, tokenization, and stop word removal.
- Convert text reviews into a bag-of-words representation.
- Utilize many-to-one LSTM architecture for sentiment detection.
- Feed the bag-of-words representation as input to the LSTM.
- Split the dataset into training and testing sets.
- Train the LSTM model using the training set.
- Evaluate model performance on the testing set; a minimal end-to-end sketch follows this list.
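The sketch below walks these steps end to end. It is illustrative rather than the notebook's exact code: the file path, vocabulary size, sequence length, and layer sizes are assumptions, and the reviews are encoded as padded integer sequences (which preserve word order for the LSTM) rather than a literal bag-of-words.

```python
import numpy as np
import pandas as pd
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dense

MAX_WORDS, MAX_LEN = 10000, 60                  # assumed vocabulary and review length

df = pd.read_csv("data/airline_sentiment.csv")  # path assumed from the repo layout
texts = df["text"].astype(str).tolist()
labels = df["airline_sentiment"].values

# Tokenize the reviews and pad them to one fixed length.
tokenizer = Tokenizer(num_words=MAX_WORDS)
tokenizer.fit_on_texts(texts)
X = pad_sequences(tokenizer.texts_to_sequences(texts), maxlen=MAX_LEN)

# Shuffle and split into training and testing sets (80/20).
idx = np.random.default_rng(42).permutation(len(X))
split = int(0.8 * len(X))
X_train, X_test = X[idx[:split]], X[idx[split:]]
y_train, y_test = labels[idx[:split]], labels[idx[split:]]

# Many-to-one LSTM: only the final hidden state feeds the sigmoid output.
model = Sequential([
    Embedding(MAX_WORDS, 64),
    LSTM(64),
    Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X_train, y_train, epochs=3, batch_size=64, validation_split=0.1)
print(model.evaluate(X_test, y_test))
```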
- Obtain "Alice's Adventures in Wonderland" text dataset.
- Preprocess the text data by cleaning, tokenizing, and structuring sentences and phrases.
- Create fixed-length training sequences, pairing each window of words with the word that follows it.
- Implement many-to-one LSTM architecture for text generation.
- Train the LSTM model using the prepared dataset.
- Recognize challenges such as language variability and context dependence in text generation.
- Understand that natural language uses diverse words with similar meanings, requiring careful consideration during generation.
- Explore entropy scaling to introduce controlled randomness into text generation.
- Introduce softmax temperature as a hyperparameter to control prediction randomness in LSTMs and neural networks.
- Generate text by sampling predictions with a softmax temperature; see the sketches after this list.
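A minimal sketch of sequence creation and the next-word model follows, assuming alice.txt sits at data/alice.txt; the window length, vocabulary handling, and layer sizes are illustrative choices, not the notebook's exact configuration.

```python
import numpy as np
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dense

SEQ_LEN = 10                                    # assumed context window (in words)

text = open("data/alice.txt", encoding="utf-8").read().lower()  # path assumed
tokenizer = Tokenizer()
tokenizer.fit_on_texts([text])
tokens = tokenizer.texts_to_sequences([text])[0]
vocab_size = len(tokenizer.word_index) + 1

# Slide a window over the token stream: SEQ_LEN words in, the next word out.
X = np.array([tokens[i:i + SEQ_LEN] for i in range(len(tokens) - SEQ_LEN)])
y = np.array([tokens[i + SEQ_LEN] for i in range(len(tokens) - SEQ_LEN)])

model = Sequential([
    Embedding(vocab_size, 64),
    LSTM(128),
    Dense(vocab_size, activation="softmax"),    # distribution over the next word
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
model.fit(X, y, epochs=5, batch_size=128)
```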
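Softmax temperature rescales the predicted distribution before sampling: a temperature below 1 sharpens it (safer, more repetitive text), while a temperature above 1 flattens it (more varied, riskier text). This sketch assumes the `model` and `SEQ_LEN` from the sketch above; the function names are hypothetical.

```python
import numpy as np

def sample_with_temperature(probs, temperature=1.0):
    # Rescale log-probabilities by the temperature, renormalize, and sample.
    # temperature < 1 sharpens the distribution; temperature > 1 flattens it.
    logits = np.log(np.asarray(probs, dtype="float64") + 1e-9) / temperature
    scaled = np.exp(logits)
    scaled /= scaled.sum()
    return np.random.choice(len(scaled), p=scaled)

def generate(model, seed_tokens, n_words=20, temperature=0.8):
    # Extend a seed of SEQ_LEN token ids one sampled word at a time.
    out = list(seed_tokens)
    for _ in range(n_words):
        probs = model.predict(np.array([out[-SEQ_LEN:]]), verbose=0)[0]
        out.append(int(sample_with_temperature(probs, temperature)))
    return out
```

Entropy scaling is the same idea viewed from the output side: lowering the temperature lowers the entropy of the distribution the model samples from, trading diversity for coherence.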
- data: Contains input data files.
- lib: Includes a Jupyter Notebook with code and documentation.
- output: Stores generated files and results.
- Readme.md: Provides project information.
- requirements.txt: Lists required packages.
- Engine.py: The main driver script for the pipeline.
- ML_Pipeline: Contains modules for different steps in the machine learning pipeline.