Skip to main content

Digital Humanities

Data and Text Mining

   

What is text and data mining?

Text and data mining (TDM) are often used together although they are slightly different.  Both use computational analysis of vast quantities of digital information.  However text mining looks more at  natural language text  and data mining at structured data. 

Researchers use specialized tools to extract data, identify trends, look for patterns and better understand the relationships of terms within and between documents. Analysis might focus on word frequency, words that frequently appear near each other, contextual information for key words, common phrases and other patterns. 

Analyzes can happen from websites (such as publicly available Facebook posts),  to 16th C. manuscripts,  to DNA sequences,  as well as old newspapers

Databases that allow mining

These resources allow for some kind of mining. Please contact a Subject Librarian for more details and to investigate options if you do not see the database you want listed here.

University of Alberta Libraries - University of Alberta, Edmonton, AB, Canada T6G 2R3 - We are located on Treaty 6 / Métis Territory.