TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

CheckText

CheckText is a free, web-based text analysis tool. Users can paste in text, upload it from their files, or import content from a web page. For each text, CheckText generates statistics such as word count, syllable count or number of complex words, provides ...

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.

SEASR: Date Entities to Simile Timeline

SEASR's Date Entities to Simile Timeline is a free tool for extracting date entities that may be displayed on a timeline. It uses OpenNLP to extract sentences containing dates, and SIMILE Timeline to display them.

TextGrid

TextGrid is a virtual research environment for text-based humanities scholarship. It offers a variety of tools and services for collaboratively creating, analyzing, editing and publishing texts. The TextGrid environment is split into two components, ...

Voyant Links

Links finds collocates for words and displays the links between them in a force-directed graph. The visualization shows the frequencies of terms that appear in proximity to a keyword as a web of terms.

Umigon

Umigon is a free, web-based and open-source tool for sentiment analysis of tweets. From a person's Twitter handle, Umigon retrieves that account's tweets and processes them for sentiment while accounting for factual statements (ex: "I hate war" will be ...

Voyant Bubblelines

Bubblelines is a visualization tool that helps to understand patterns of word repetition in one or more documents. Each document is represented as a horizontal line and each search term is represented as a bubble – the bubble represents the frequency ...

Lippmannian Device to Gephi

The Lippmannian Device to Gephi is a free, web-based tool for converting the output of the Digital Methods Initiative's Googlescraper (Lippmannian Device) tool to a format suitable for Gephi visualization. From a Googlescraper file, it generates a ...

Discursis

Discursis is a tool for analyzing text-based natural language with a focus on sequential analysis. It is particularly designed to work with texts that have an internal temporal structure, such as a transcribed conversation. For each text, it generates ...

Old Bailey Data Warehousing Interface

This prototype was produced as a proof-of-concept for the Criminal Intent project.  It allows the records of the Old Bailey Online project to be searched such that the returned results can be viewed as either a timeline or a concordance of terms.  ...

Google Charts

Google Charts (formerly the Google Visualization API) is a web-based interface for interactive data visualization via HTML5 and SVG. It includes a number of preset chart types ranging from line charts to word trees and all are customizable. A separate ...

Alt.Text

Alt.Text is a free, working prototype application for exploring a text on both an outline and content level via a graphical user interface. It breaks down texts into components such as sections, passages or documents, and permits users to leverage these ...

Voyant Document KWICs

Document KWICs shows a table of keywords in their context. In other words, it provides a list of certain keywords and their occurrence within a corpus or document.

brat rapid annotation tool

The brat rapid annotation tool is an online environment for collaborative structured annotation of texts. It can be run in a browser (optimized for Chrome and Safari) or downloaded for local installation. It can be used for a variety of annotation tasks, ...

CATMA (Computer Aided Textual Markup and Analysis)

CATMA (Computer Aided Textual Markup and Analysis) is a free, open source markup and analysis tool from the University of Hamburg's Department of Languages, Literature and Media. It incorporates three interactive modules, a tagger enabling textual markup ...

Voyant FeatureClusters (beta)

FeatureClusters is a visualization tool for viewing associations between words, based on commonalities in the feature sets of the words. This is illustrated using a force-directed graph, wherein each word is a node in the graph. Nodes are connected ...

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...

Stanford Sentiment Analysis

The Stanford NLP Group's Sentiment Analysis tool is a free, web-based live demo built on a sentiment analysis treebank and trained to predict the sentiment in movie reviews. Users can test out the sentiment analysis model with their own text in the box ...

Voyant Cirrus

Cirrus is a visualization tool that displays a word cloud relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity.

Kaleidoscope

Kaleidoscope is a tool designed to assist users in spotting differences between versions of a text, source files or images. It allows direct comparison of images, texts to be merged into a single edition, and files to be differentiated.
View tools by tag:
1960s, 1970s, 1980s, 1990s, 2000s, 2010s, American, Canadian, Comparator, Dutch, English, English (language), French (language), German, Historic, Java, Metadata, Multilingual, Natural language processing, Social media