TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

etcML

etcML (Easy Text Classification with Machine Learning) is a free text analysis tool from Stanford University that uses machine learning to identify positive and negative sentiments in texts. Users can analyze their own dataset, use a dataset provided ...
etcML
etcML

Voyant ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another. Once you arrive to ScatterPlot, insert / upload your content and let the tool perform its analysis. You may hover over these dots and click on them for ...
Voyant ScatterPlot
Voyant ScatterPlot

WATCON

WATCON was a program used extensively at the University of Waterloo for humanities research in the 1970s. It was developed by Philip H. Smith Jr. and WATCHUM (Waterloo Computing in the Humanities) as a text-handling package with concordance applications, ...
WATCON
WATCON

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.
Voyant Corpus Term Frequencies
Voyant Corpus Term Frequencies

Voyant Mandala

Mandala is a visualization tool that imports “textual” files to perform analysis on the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.  
Voyant Mandala
Voyant Mandala

BookLamp Labs: Sentiment Viewer

BookLamp's Sentiment Viewer is a visualization tool that graphs out a book's sentiments according to intensity and location in the text. Books to graph can be searched by title or author. BookLamp and its tools have been retired as of April 2014 ...
BookLamp Labs: Sentiment Viewer
BookLamp Labs: Sentiment Viewer

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

SimpleTCT

SimpleTCT (Simple Text Comparison Tool) is a free Java-based text comparison tool offered by Open Digital Arts & Humanities Tools (OpenDAHT). It offers a simplified management environment that enables users to display .rtf files, in which they may ...
SimpleTCT
SimpleTCT

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...
Laurence Anthony: AntWordProfiler
Laurence Anthony: AntWordProfiler

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...
Visual Browser
Visual Browser

Voyant Bubbles

Bubbles reads the words in a document (or corpus) and displays the highest frequency words within proportionately large bubbles. Once you arrive to Bubbles, insert / upload your content and let the tool perform its analysis.
Voyant Bubbles
Voyant Bubbles

ARRAS

ARRAS is a historically important tool for analyzing and concording text. It notably provided inspiration for the TACT system.
ARRAS
ARRAS

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

Community Contributed Collection (CoCoCo)

Community Contributed Collection (CoCoCo) is web software developed by the RunCoCo Project at the University of Oxford for collecting, cataloguing and managing web content such as text or uploaded files contributed by a community of users. It enables ...
Community Contributed Collection (CoCoCo)
Community Contributed Collection (CoCoCo)

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

Textometrica

Textometrica is a free, web based text analysis tool offered by HUMlab at Umeå University. Users can upload a plain-text file and examine its word frequencies, see co-occurrences, and generate visualizations and graphs.
Textometrica
Textometrica

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)

Gephi

Gephi is a free, open source interactive visualization and data exploration tool. Users can manipulate the display to uncover new facets of the data, enabling intutive exploration.
Gephi
Gephi

Typical

Typical was a tool for language-independent corpus exploration. It assessed the significance of co-occuring words in a line and evaluated the significance of the whole line, to help in disambiguating words and finding characteristic example lines.
Typical
Typical
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: