Researcher ORCID Identifier
Date of Submission
Campus Only Senior Thesis
Bachelor of Arts
2023 Olivia J Renfro
Sentiment analysis is widely used in various industries, and both lexicon labeling and machine learning approaches have been extensively compared. While machine learning models have shown higher accuracy, they require manual labeling of training data, which is time-consuming and costly. This study compares the performance of eight hybrid sentiment analysis models on Twitter data using SentiWordNet and VADER polarity lexicons. Different transformation techniques were used to create a numerical feature map and fed into Linear SVM and Random Forest models. SVM with TF-IDF outperformed RF for most hybrid models, while RF performed better than SVM in almost all word-embedding variations. RF performed exceptionally well for both lexicons, even though it is less cited in sentiment analysis literature, and only SVM with TF-IDF transformations were competitive with RF. SVM consistently performed the worst with word-embedding transformations for both polarity lexicons.
Renfro, Olivia, "SentiWordNet and VADER: Comparative Analysis of the Efficacy of Hybrid Sentiment Analysis Models" (2023). CMC Senior Theses. 3317.
This thesis is restricted to the Claremont Colleges current faculty, students, and staff.