As a college senior affected by the coronavirus pandemic, I, like more than 75% of the students at the University of Texas at Austin, am attending Zoom University this fall. Being away from campus during my penultimate semester has made me feel a little homesick. Fueled by curiosity and a little bit of FOMO, I decided to check up on my fellow Longhorns with the help of a few natural language processing (NLP) techniques.

The code for this project is on my GitHub repo.

Table of Contents

  1. UT Data
  2. Bag of Words
  3. Topic Modeling
  4. Sentiment Analysis
     1. Overall Sentiment
     2. Sentiment by Topic
  5. Comparison with Texas A&M
     1. Topics in Common
     2. Sentiment Comparison
     3. School Reputation
  6. Conclusion

UT Austin Data

I collected 1,348 text entries from the most recent posts on the UT Austin subreddit using Joseph Lai’s Universal Reddit Scraper. Each entry contains a post's title, body, and comments. I then cleaned the text with a VBA script in Excel to remove non-ASCII characters.
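The cleaning step itself was done with a VBA script, which is not reproduced here. For readers who would rather stay in Python, the same idea can be sketched in a few lines. This is only an illustration under my own assumptions: the function name `strip_non_ascii` and the sample entries are hypothetical and are not taken from the project's repo.

```python
def strip_non_ascii(text: str) -> str:
    """Return `text` with every non-ASCII character dropped."""
    # Encoding to ASCII with errors="ignore" silently discards any
    # character that has no ASCII representation (emoji, curly quotes, ...).
    return text.encode("ascii", errors="ignore").decode("ascii")


# Hypothetical sample entries, written with escapes so the file stays ASCII.
entries = [
    "Hook \u2019em \U0001F918",            # curly apostrophe and an emoji
    "Zoom University \u2013 fall 2020",    # en dash
]

cleaned = [strip_non_ascii(entry) for entry in entries]
```

Note that this approach deletes characters rather than replacing them, so a curly apostrophe simply disappears instead of becoming a straight one. If that matters for later tokenization, mapping common punctuation to ASCII equivalents before stripping would be a reasonable variation.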

