Resources

This page contains links to resources, tools, courses, blogs, newsletters and other interesting things relating to text mining for research.

Tools and software

See full list of 80+ tools (web apps, software, packages in R and python) with an overview of type and classification of free vs charged.

Books and Textbooks

Course materials

Newsletters

  • Sebastian Ruder’s NLP News, probably the most comprehensive newsletter out there, covering deep dives in the technical and the economics of mining.

  • The Gradient are overviews, essays and perspectives on Artificial Intelligence, recent developments and long-term impacts. It is a publication ran by volunteers and open to submissions.

Other interesting resources

  • An overview of text mining in the social sciences and humanities by Dong Nguyen, arxiv preprint

  • A critique of computational text analysis in the humanities

  • An academic paper from the Workshop on Computational Humanities Research 2020, discussing the history of quantitative and computational research in the humanities, and especially the quantitative methods in history before computers; by Michael Piotrowski and Mateusz Fafinsky

  • Estimating the degree of similarity between two texts, a blog by Adrien Sieg 2018

  • Masakhane is a grassroots NLP community for Africans, by Africans. It brings people together to work on challenging research problems for African languages. Their recent EMNLP 2020 findings paper demonstrates the impact grassroots efforts can have.

  • A super long and comprehensive list of great NLP resources by Keon