TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Voyant Skin Builder

Voyant Skin Builder is a function within the larger Voyant toolset that enables users to create a customized selection and arrangement of tools with which to analyze a text. These custom skins can be saved or exported for future use.
Voyant Skin Builder
Voyant Skin Builder

CATPAC

CATPAC is a program available for purchase designed to summarize a text's main ideas. It is multilingual and primarily aimed at researchers in the sciences, and can handle large volumes of text. Text must be formatted in ASCII or RTF. The CATPAC ...
CATPAC
CATPAC

Timeline JS

Timeline JS is a free timeline tool that can be poplulated from either a Google spreadsheet or a JSON file, and can draw media material from a variety of sources, such as Twitter, Flickr, Wikipedia, YouTube and more. It has been designed to be both ...
Timeline JS
Timeline JS

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...
Neatline
Neatline

Tesseract OCR

Tesseract is a free raw OCR engine originally developed by HP Labs and now maintained by Google. It works with the Leptonica Image Processing Library, and is capable of reading a variety of image formats. It can convert images to text in over 40 languages. ...
Tesseract OCR
Tesseract OCR

Juxta

Juxta is a free, open source tool for comparing and collating texts, originally intended for comparing multiple versions of the same text. It offers several views and visualization options, including histograms and side by side comparison. Juxta is ...
Juxta
Juxta

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

Tom Sawyer Perspectives

Tom Sawyer Perspectives is a commercial software package for data visualization and analysis. It provides a graphical software development kit and preview environment, including layout and Tom Sawyer Software's proprietary data visualization reference ...
Tom Sawyer Perspectives
Tom Sawyer Perspectives

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

Timeline JS

Timeline JS is a free timeline tool that can be poplulated from either a Google spreadsheet or a JSON file, and can draw media material from a variety of sources, such as Twitter, Flickr, Wikipedia, YouTube and more. It has been designed to be both ...
Timeline JS
Timeline JS

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words, parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...
Stanford NLP Group: CoreNLP
Stanford NLP Group: CoreNLP

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

SNOBOL (String Oriented Symbolic Language)

SNOBOL is a programming language developed in the 1960s capable of symbol and free-form string manipulation that was of particular value to early computer-based Humanities research. SNOBOL4 was the final version released by AT&T Bell Laboratories. ...
SNOBOL (String Oriented Symbolic Language)
SNOBOL (String Oriented Symbolic Language)

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

DV-COLL (Donne Variorum Textual Collation Program)

DV-COLL (Donne Variorum Textual Collation Program) is a historically important textual collation program first introduced in the 1980s and maintained into the present. It was originally designed to assist in creating a digitized corpus of the works ...
DV-COLL (Donne Variorum Textual Collation Program)
DV-COLL (Donne Variorum Textual Collation Program)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: