TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

BookLamp Labs: Suggestion Viewer

BookLamp's Suggestion Viewer is a faceted search and browser tool for finding new books based on how similar they are to another book. Users can search by either title or author and select the book to center the search on. The Suggestion Viewer then ...
BookLamp Labs: Suggestion Viewer
BookLamp Labs: Suggestion Viewer

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

Stanford Mobisocial Lab: Muse

Muse is a free, open source JavaScript tool for reflecting on and searching for patterns in the past by examining one's personal e-mail archive. It analyses e-mail to generate several views including a sentiment graph from messages that may reflect ...
Stanford Mobisocial Lab: Muse
Stanford Mobisocial Lab: Muse

WordSeer

WordSeer is a free, simple to use text analysis tool offered by the University of California Berkeley. It is entirely web based, with tools for search, reading and interrogating, heat maps and frequency. At this time, the only corpus available for exploration ...
WordSeer
WordSeer

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Voyant Cirrus

Cirrus is a visualization tool that displays a word cloud relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity.
Voyant Cirrus
Voyant Cirrus

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

Voyant Skin Builder

Voyant Skin Builder is a function within the larger Voyant toolset that enables users to create a customized selection and arrangement of tools with which to analyze a text. These custom skins can be saved or exported for future use.
Voyant Skin Builder
Voyant Skin Builder

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Tagger - Other (TAPoRware)

This tool was designed by the Globalization & Compendium project to tag glossary terms in an XML file, though it can also be used to tag other text formats. Only the first appearance of a term will be tagged.
Tagger - Other (TAPoRware)
Tagger - Other (TAPoRware)

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

SEASR: Tag Cloud Viewer With Stemming

SEASR's Tag Cloud Viewer With Stemming is a free tool for creating a tag cloud from a text hosted at a URL, and can process PDF documents. While this tool is similar to SEASR's NGram Tag Cloud Viewer, it applies a stemmer to the text during processing. ...
SEASR: Tag Cloud Viewer With Stemming
SEASR: Tag Cloud Viewer With Stemming

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...
Visual Browser
Visual Browser

Weighted Centroid - Other (TAPoRware)

The Weighted Centroid is a java applet designed to display a circular graph based on word distribution data. The text is divided up into an arbitrary number of units, which are positioned around the circumference of the circle in a clockwise sequence. ...
Weighted Centroid - Other (TAPoRware)
Weighted Centroid - Other (TAPoRware)

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: