Workshop: Analyzing BarbieHeimer texts with R

Much of the data proliferating today is unstructured and text-heavy.
Author

Mauricio โ€œPachรกโ€ Vargas S.

Published

July 24, 2023

Book here: https://www.buymeacoffee.com/pacha/e/153247.

Price: 5 USD regular / 3 USD members.

Much of the data proliferating today is unstructured and text-heavy. We can use different data sources, such as scraped Barbie and Oppenheimer reviews to conduct natural language processing.

This workshop will introduce you to the basics of text analysis in R, using the tidytext package. We will use the BarbieHeimer corpus, a collection of texts I collected for this activity.

We will learn how to clean and prepare the data, and how to perform basic text analysis tasks such as term frequency, inverse document frecuency, sentiment analysis, and topic modeling.

When:

All hours are expressed in GMT-4 (New York time).

Where: Zoom

I have a YouTube channel, Data Science Pacha (https://www.youtube.com/@datasciencepacha), where youโ€™ll find some of my explanations for a general audience.

Join me on this epic learning adventure ๐Ÿš€๐Ÿš€๐Ÿš€