TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

etcML

etcML (Easy Text Classification with Machine Learning) is a free text analysis tool from Stanford University that uses machine learning to identify positive and negative sentiments in texts. Users can analyze their own dataset, use a dataset provided ...
etcML
etcML

Voyant ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another. Once you arrive to ScatterPlot, insert / upload your content and let the tool perform its analysis. You may hover over these dots and click on them for ...
Voyant ScatterPlot
Voyant ScatterPlot

Summarizer - XML (TAPoRware)

This tool creates a summary of statistical information on a given document, and enables the user to select what types of information to display in the summary. The options include high frequency words, sentences with high frequency words, high frequency ...
Summarizer - XML (TAPoRware)
Summarizer - XML (TAPoRware)

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.
Voyant Corpus Term Frequencies
Voyant Corpus Term Frequencies

Voyant Mandala

Mandala is a visualization tool that imports “textual” files to perform analysis on the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.  
Voyant Mandala
Voyant Mandala

CATPAC

CATPAC is a program available for purchase designed to summarize a text's main ideas. It is multilingual and primarily aimed at researchers in the sciences, and can handle large volumes of text. Text must be formatted in ASCII or RTF. The CATPAC ...
CATPAC
CATPAC

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

SimpleTCT

SimpleTCT (Simple Text Comparison Tool) is a free Java-based text comparison tool offered by Open Digital Arts & Humanities Tools (OpenDAHT). It offers a simplified management environment that enables users to display .rtf files, in which they may ...
SimpleTCT
SimpleTCT

OntoViz

OntoViz is a tab widget included in the Protégé software package for visualizing Protégé ontologies in conjunction with GraphViz. It allows developers to manage and configure visualizations according to their needs.              ...
OntoViz
OntoViz

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...
Visual Browser
Visual Browser

Voyant Bubbles

Bubbles reads the words in a document (or corpus) and displays the highest frequency words within proportionately large bubbles. Once you arrive to Bubbles, insert / upload your content and let the tool perform its analysis.
Voyant Bubbles
Voyant Bubbles

SEASR: Date Entities to Simile Timeline

SEASR's Date Entities to Similie Timeline is a free tool for extracting data entities that may be displayed on a timeline. It uses OpenNLP to extract sentences containing dates, and SIMILE Timeline to display them.
SEASR: Date Entities to Simile Timeline
SEASR: Date Entities to Simile Timeline

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

Profiler Plus

Profiler Plus is a commercial general-purpose text analysis program based on natural language processing. It supports multiple languages and can output in text, XML and CSV.
Profiler Plus
Profiler Plus

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)

Gephi

Gephi is a free, open source interactive visualization and data exploration tool. Users can manipulate the display to uncover new facets of the data, enabling intutive exploration.
Gephi
Gephi

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.
Quadrigram
Quadrigram
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: