Clio 2: Text analysis

Your assignment is to apply a text analysis method to a corpus of your choice. You may use one of the provided corpora (the ATS corpus, the WPA former slave narratives, or the Tracts for the Times), or you may find your own. You do not need a very large corpus for this assignment.

You should start by applying a term-frequency and TF-IDF analysis to your corpus (see the quanteda package in particular). Then you should also analyze the corpus using either topic modeling, named entity recognition (the cleanNLP package), sentiment analysis, word vectors, or text reuse. The quanteda package also contains a number of sample analyses.
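As a starting point, the term-frequency and TF-IDF step might look roughly like the sketch below. It assumes a recent quanteda (version 3 or later) and uses three invented sentences in place of a real corpus; the document names and texts are hypothetical and are not drawn from the files in ats/.

```r
library(quanteda)

# Hypothetical toy documents standing in for a real corpus
texts <- c(
  doc1 = "The sinner must repent and seek grace.",
  doc2 = "Grace is offered freely to every sinner.",
  doc3 = "The tract was printed and distributed in New York."
)

# Build a corpus, tokenize, and drop punctuation and English stopwords
corp <- corpus(texts)
toks <- tokens(corp, remove_punct = TRUE)
toks <- tokens_remove(toks, stopwords("en"))

# Document-feature matrix of raw term counts (lowercased by default)
dfmat <- dfm(toks)

# Most frequent terms across the whole corpus
print(topfeatures(dfmat, n = 5))

# Reweight counts by inverse document frequency, so that terms
# concentrated in few documents score higher than widespread ones
tfidf <- dfm_tfidf(dfmat)

# Highest-weighted terms for a single document
print(topfeatures(tfidf["doc3", ], n = 5))
```

With a real corpus you would replace `texts` with the documents you have read in. Comparing the raw counts with the TF-IDF weights for the same document is often a quick way to see which terms are distinctive rather than merely common.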

For this assignment, you can turn in an analysis notebook that explains your process and what you have found.

Files to know

install.R will help you install the key packages
ats/ contains 20 sample tracts printed by the American Tract Society
The files named *-example.R show examples of doing different kinds of analysis.

