TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

WebLicht

WebLicht is an architecture for creating annotated text corpora. It offers a fully-functional virtual research environment with chains of RESTful web services, each providing a linguistic tool such as format conversion, tokenizing, tagging or parsing. ...
WebLicht
WebLicht

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

Raw Grep - Other (TAPoRware)

This tool can find a user-specified string of characters anywhere in a text document. If the string is part of a word or another non-space string, the word or the non-space string will be displayed as a unit. The search can also be used to view a concordance ...
Raw Grep - Other (TAPoRware)
Raw Grep - Other (TAPoRware)

Textalyser

Textalyser is a free web-based text analysis tool offered by the Bernhard Huber Internet Engineering Company. Users can paste text into the provided entry field, upload a file or provide a URL for analysis. Textalyser provides detailed statistics on ...
Textalyser
Textalyser

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

SIMILE Widgets: JsTeX

JsTeX, a part of the SIMILE Widgets family of tools, is an open source JavaScript library capable of interpreting basic TeX encodings and transforming them into HTML definitions directly in a web page. This tool has been retired by SIMILE Widgets, ...
SIMILE Widgets: JsTeX
SIMILE Widgets: JsTeX

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

CorpusSearch 2

CorpusSearch 2 is a free Java-based program for constructing syntactically annotated (parsed) corpora and searching them. Its functions include finding and counting lexical and syntactic patterns, correcting systemic errors and coding linguistic features. ...
CorpusSearch 2
CorpusSearch 2

Text Variation Explorer (TVE)

Text Variation Explorer (TVE) is a Java tool for text visualization. TVE enables users to explore the effect of window size on a text's type-token ratio, proportion of hapax legomena and average word length; it can also cluster text fragments based ...
Text Variation Explorer (TVE)
Text Variation Explorer (TVE)

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Lexa

Lexa is a free legacy set of programs for corpus processing. It is designed to tag and lemmatize texts or series of texts. The website for Lexa is no longer maintained or active, but may still be viewed via the Internet Archive.
Lexa
Lexa

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

CETA Parser

The CETA (Centre pour l'Etude de la Traduction Automatique) Parser was a program developed in the 1960s that could automatically reduce Russian sentences to a string of words and provide equivalents for each in French. CETA developed its parser in conjunction ...
CETA Parser
CETA Parser

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: