“Those who find beautiful meanings in beautiful things are the cultivated. For these there is hope. They are the elect to whom beautiful things mean only beauty.” Oscar Wilde.
This package is a fork of janeaustenr and provides access to the full texts of Oscar Wilde’s 7 completed, presented plays. The UTF-8 plain text for each novel was sourced from Project Gutenberg, processed a bit, and is ready for text analysis. Each text is in a character vector with elements of about 70 characters. The package contains:
vera_or_the_nihilists
: Vera; Or, The Nihilists, published in 1880salome
: Salome, published in 1881the_dutchess_of_padua
: The Dutchess of Padua, published in 1881lady_windermeres_fan
: Lady Windermere’s Fan, published in 1882a_woman_of_no_importance
: A Woman of No Importance, published in 1883an_ideal_husband
: An Ideal Husband, published in 1895the_importance_of_being_earnest
: The Importance of Being Earnest, published in 1895There is also a function wilde_plays()
that returns a tidy data frame of all 7 plays.
Users should be aware that there are some differences in usage between the novels as made available by Project Gutenberg. For example, “anything” vs. “any thing”, “Mr” vs. “Mr.”, and using underscores vs. all caps to indicate italics/emphasis.
The texts were obtained by using gutenbergr::
and some additional text cleaning was made by using stringr::
.
You can install the released version of lp from Github with:
source("https://install-github.me/pachamaltese/oscarwilder")
For some ideas on getting started with analyzing these texts, see Julia Silge’s excellent post blog post on sentiment analysis of Austen’s novels. For help within R, try ?salome
or similar for getting started with the data sets.
This project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.