A corpus of novels written in the late 19th and early 20th century, built using the texts collected by the 100 English Novels Project on GitHub.
The corpus has been lemmatized and tagged with TreeTagger, using the BNC tagset.