Computational analysis of historical texts

I’m interested in developing effective strategies to locate, access in suitable formats, preprocess, and apply computational tools to historical texts available at multiple repositories including archive.org, Library of Congress, Hathi Trust etc. There are OCRC, format, and analysis challenges galore to overcome but I think this approach can provide students with very useable skills and build historical understanding.