-
-
100,000+ Books in Plain Text Format — This is targeted at people with machine learning use cases but could be ideal as a dataset for all sorts of things. It’s a 37GB download though. There’s also a 100GB file containing source code for similar purposes. It’s all explained in more detail here.