DHC Weekly 4/19: Corpora Works of Mercy

Last week on the blog, I wrote about Voyant, a text anaysis tool that can be used to discover all sorts of stats about a text or a corpus of texts — what words are used most frequently, in what combinations, in what contexts, and so on. I used The Adventures of Sherlock Holmes as my test corpus, because its consistent tone and easily accessible public domain status makes it an ideal example for the sorts of questions textual analysis tools like Voyant can prompt one to ask of a literary text. But what other sorts of corpora are out there, and what sorts of projects does analyzing them lead to? Today, I want to write about a publically accessible collection of English language corpora amassed by Mark Davies, a professor of linguistics at Brigham Young University. 

Continue reading “DHC Weekly 4/19: Corpora Works of Mercy”

DHC Weekly 4/12: Voyant and Text Analysis

Hello DH fans!

This week we’re leaving mapping behind us and turning to a category of DH tools oft-utilized in the classroom: text analysis! I’m going to be taking a look at one of the most oft-used text analysis tools, Voyant! Voyant is so popular because it’s quite out-of-the-box easy to use, with no coding necessary. In practice, I have found this to mean that Voyant is a little idiosyncratic and difficult  — but I’m going to try to break down its basics for you all this week!

Continue reading “DHC Weekly 4/12: Voyant and Text Analysis”