Workshop: Analyzing BarbieHeimer texts with R

Much of the data proliferating today is unstructured and text-heavy.

Author

Mauricio “Pachá” Vargas S.

Published

July 24, 2023

Book here: https://www.buymeacoffee.com/pacha/e/153247.

Price: 5 USD regular / 3 USD members.

Much of the data proliferating today is unstructured and text-heavy. We can use different data sources, such as scraped Barbie and Oppenheimer reviews to conduct natural language processing.

This workshop will introduce you to the basics of text analysis in R, using the tidytext package. We will use the BarbieHeimer corpus, a collection of texts I collected for this activity.

We will learn how to clean and prepare the data, and how to perform basic text analysis tasks such as term frequency, inverse document frecuency, sentiment analysis, and topic modeling.

When:

July 26, 2021, 05:30 AM - 06:30 PM
July 27, 2021, 05:30 AM - 06:30 PM
July 28, 2021, 05:30 AM - 06:30 PM

All hours are expressed in GMT-4 (New York time).

Where: Zoom

I have a YouTube channel, Data Science Pacha (https://www.youtube.com/@datasciencepacha), where you’ll find some of my explanations for a general audience.

Join me on this epic learning adventure 🚀🚀🚀