Data Downloads

This page contains data files that you can use while working through Macroscope lessons. We have left the URL names below.

Also, as you move forward, it can be helpful to practice or explore other collections of material. An excellent place to get started is Alan Liu’s compilation of demonstration corpora for text analysis http://dhresourcesforprojectbuilding.pbworks.com/w/page/69244469/Data%20Collections%20and%20Datasets

‘Demo corpora are sample or toy collections of texts that are ready-to-go for demonstration purposes or hands-on tutorials–e.g., for teaching text analysis, topic modeling, etc.  Ideal collections for this purpose are public domain or open access, plain-text, relatively modest in number of files, organized neatly in a folder(s), and downloadable as a zip file.’