TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...

DEREDEC

DEREDEC was a programming system and workbench for linguistics and text analysis written in LISP in the 1980s. It enabled syntactic and textual parsing, and could link phrases by their contextual dependency relations.

WordSeer

WordSeer is a free, easy-to-use text analysis tool offered by the University of California, Berkeley. It is entirely web-based, with tools for search, reading and interrogating, heat maps and frequency. At this time, the only corpus available for exploration ...

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.

etcML

etcML (Easy Text Classification with Machine Learning) is a free text analysis tool from Stanford University that uses machine learning to identify positive and negative sentiments in texts. Users can analyze their own dataset, use a dataset provided ...

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number of word types (unique words), and lexical density (the ratio of tokens to types).
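
The figures named in this description can be illustrated with a minimal Python sketch. It is not Voyant's code: the function name corpus_grid, the regular-expression tokenizer, and the sample documents are all assumptions made for illustration, and the ratio is computed as the description words it (tokens divided by types), which may differ from how the tool itself reports density.

```python
import re


def corpus_grid(documents):
    """Return one row per document: title, token count, type count, ratio.

    `documents` maps a title to its raw text. Tokenization here is a
    simplifying assumption: lowercase runs of letters and apostrophes.
    """
    rows = []
    for title, text in documents.items():
        tokens = re.findall(r"[a-z']+", text.lower())
        types = set(tokens)  # unique word forms
        # Ratio as stated in the description: tokens divided by types.
        ratio = len(tokens) / len(types) if types else 0.0
        rows.append(
            {
                "title": title,
                "tokens": len(tokens),
                "types": len(types),
                "ratio": ratio,
            }
        )
    return rows


if __name__ == "__main__":
    # Hypothetical two-document corpus.
    sample = {
        "A": "the cat saw the dog",
        "B": "a rose is a rose is a rose",
    }
    for row in corpus_grid(sample):
        print(row)
```

A document with much repetition (like "B" above) yields a higher tokens-per-type ratio than one with varied vocabulary.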

Mallet

Mallet is a Java-based library and command-line framework that provides statistical and machine learning tools for use with natural language processing.

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.

Collocation - HTML (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used to divide a text into sentences or words, or to split it on separators. It was capable of running counts on a text, printing ...

Transana

Transana is a free, open source program for transcribing and analyzing large collections of video and audio data. It is offered by the University of Wisconsin-Madison Center for Education Research and aimed at academic researchers. It enables users to manually ...

SEASR: Tag Cloud Viewer With Stemming

SEASR's Tag Cloud Viewer With Stemming is a free tool for creating a tag cloud from a text hosted at a URL, and can process PDF documents. While this tool is similar to SEASR's NGram Tag Cloud Viewer, it applies a stemmer to the text during processing. ...

HyperPo

HyperPo is an important legacy tool: the first web-based text analysis tool aimed at humanities scholars, it was available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...

Pressbooks

Pressbooks is a platform for collaboratively creating ebooks. Built on WordPress, it is available both as a free service and on a subscription basis. Subscriptions are only required for users wanting to manage more than 5 books. eBooks can be generated in ...

CHNM: Scribe

Scribe, now in version 3.5, is a free note-taking program available for both PC and Mac. Aimed particularly at historians, this program allows researchers to create digital note cards for managing sources, research notes, contacts, images, glossaries, ...

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...

MITRE Syntactic Analysis Procedure

The MITRE Syntactic Analysis Procedure was a program developed in the 1960s for analyzing English phrase structures.

Google Visualization API with RDF

The Google Visualization API provides a platform to create, share and reuse visualizations written by the developer community. The platform can also be used to create reports and dashboards, as well as to analyze and display the data through the visualization ...

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...

LATtice

LATtice is a free visualization tool for exploring and comparing texts across corpora. It also has features for 'drilling down' to determine what rhetorical categories make texts similar or different, and shows multiple visualizations in the same view. ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media