TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

CheckText

CheckText is a free, web-based text analysis tool. Users can paste in text, upload it from their files, or import content from a web page. For each text, CheckText generates statistics such as word count, syllable count or number of complex words, provides ...
CheckText
CheckText

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.
Voyant Corpus Term Frequencies
Voyant Corpus Term Frequencies

LEMMA

LEMMA was a program package for lemmatizing German word-forms released in 1978. It was designed to work with computer-readable corpora. The algorithm was implemented in PL/I on an IBM 370/168 computer.
LEMMA
LEMMA

TextGrid

TextGrid is a virtual research environment for text-based humanities scholarship. It offers a variety of tools and services for collaboratively creating, analyzing, editing and publishing texts. The TextGrid environment is split into two components, ...
TextGrid
TextGrid

Voyant Links

Links finds collocates for words and displays links between them using a force directed graph. It shows term frequencies in proximity to keyword. It is a visualization and shows a web of terms.
Voyant Links
Voyant Links

RDF2SVG (Rhizomik)

RDF2SVG is a web-based tool from the Rhizomik initiative for transforming RDF/XML data into an SVG representation. It can also accept N-triples and N3 input, and includes a feature for filtering the user's defined preferred language from the source ...
RDF2SVG (Rhizomik)
RDF2SVG (Rhizomik)

Umigon

Umigon is a free, web-based and open-source tool for sentiment analysis of tweets. From a person's Twitter handle, Umigon retrieves that account's tweets and processes it for sentiment with accounting for factual statements (ex: "I hate war" will be ...
Umigon
Umigon

Voyant Bubblelines

Bubblelines is a visualization tool that helps to understand patterns of word repetition in one or more documents. Each document is represented as a horizontal line and each seach term is represented as a bubble – the bubble represents the frequency ...
Voyant Bubblelines
Voyant Bubblelines

TokenX

TokenX is a free text visualization and analysis tool for XML documents. It offeres a web-based environment and can generate word clouds, highlight parts of text such as words, non-words and punctuation, KWIC (keyword in context) and more. TokenX also ...
TokenX
TokenX

Discursis

Discursis is a tool for analyzing text-based natural language with a focus on sequential analysis. It is particularly desgined to work with texts that have an internal temporal structure, such as a transcribed conversation. For each text, it generates ...
Discursis
Discursis

Old Bailey Data Warehousing Interface

This prototype was produced as a proof-of-concept for the Criminal Intent project.  It allows the records of the Old Bailey Online project to be searched such that the returned results can be viewed as either a timeline or a concordance of terms.  ...
Old Bailey Data Warehousing Interface
Old Bailey Data Warehousing Interface

Compare Networks Over Time

Compare Networks Over Time is a free, web-based tool for comparing Issuecrawler networks, or data from a regularly scheduled crawl. The tool displays ranked actor lists for each comparison, in a format suitable for graphing via spreadsheet software. ...
Compare Networks Over Time
Compare Networks Over Time

Alt.Text

Alt.Text is a free, working prototype application for exploring a text on both an outline and content level via a graphical user interface. It breaks down texts into components such as sections, passages or documents, and permits users to leverage these ...
Alt.Text
Alt.Text

Voyant Document KWICs

Document KWICs shows a table of keywords in their context. In other words, it provides a list of certain keywords and their occurrence within a corpus or document.
Voyant Document KWICs
Voyant Document KWICs

VARD 2

VARD 2 is a free, creative commons tool for preprocessing historical corpora. Built in Java, it enables researchers to easily match up historic variant spellings with modern conventions. Though optimized for Early Modern English, other languages can ...
VARD 2
VARD 2

CATMA (Computer Aided Textual Markup and Analysis)

CATMA (Computer Aided Textual Markup and Analysis) is a free, open source markup and analysis tool from the University of Hamburg's Department of Languages, Literature and Media. It incorporates three interactive modules, a tagger enabling textual markup ...
CATMA (Computer Aided Textual Markup and Analysis)
CATMA (Computer Aided Textual Markup and Analysis)

Voyant FeatureClusters (beta)

FeatureClusters is a visualization tool for viewing associations between words, based on commonalities in the feature sets of the words. This is illustrated using a force-directed graph, wherein each word is a node in the graph. Nodes are connected ...
Voyant FeatureClusters (beta)
Voyant FeatureClusters (beta)

GINGER II

GINGER II is a word-sense disambiguator for English, offering a new approach to the algorithm developed for GINGER I. It directly extracts semantic disambiguation rules from dictionary example phrases, and semantically tags, syntactically parses and ...
GINGER II
GINGER II

Stanford Sentiment Analysis

The Stanford NLP Group's Sentiment Analysis tool is a free, web based live demo based on a sentiment analysis treebank, trained to predict the sentiment in movie reviews. Users can test out the sentiment analysis model with their own text in the box ...
Stanford Sentiment Analysis
Stanford Sentiment Analysis

Voyant Cirrus

Cirrus is a visualization tool that displays a word cloud relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity.
Voyant Cirrus
Voyant Cirrus

Meandre

Meandre is a graphical programming language for creating text analysis flows, built on top of the Seasr infrastructure. Meandre uploads its flows to a Seasr server where they can be accessed and used by anyone who can access the server.
Meandre
Meandre
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: