This website is a common source of corpora for literary text mining, since it hosts around 50,000 books in plain-text format. For 10 points each:
[10e] Name this large online archive of public-domain books named for a German printer.
ANSWER: Project Gutenberg [prompt on Gutenberg]
[10m] A paper by Reagan et al. used sentiment analysis on a Project Gutenberg corpus to support this author’s rejected MA thesis, which proposed that all stories conform to eight basic emotional shapes like “Rags to Riches,” “Man in a Hole,” and “Cinderella.”
ANSWER: Kurt Vonnegut [or Kurt Vonnegut Jr.] (The University of Chicago rejected his thesis.)
[10h] Many digital humanities scholars follow a 1997 book by Franco Moretti that performs this task for 19th-century novels. This task is a central tool for a field of criticism pioneered by Robert Tally and Bertrand Westphal.
ANSWER: mapping literature [or mapping books; or literary cartography; or creating maps of books or equivalents; accept creating a literary atlas; accept literary geography; accept geocriticism; accept Atlas of the European Novel]
<American Literature>