TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

Wmatrix

Wmatrix is a free tool for corpus analysis and comparison. It provides a web interface for USAS and CLAWS, in addition to enabling standard corpus linguistic functions such as frequency lists and concordances.
Wmatrix
Wmatrix

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...
PRORA
PRORA

SplitsTree4

SplitsTree4 is a free Java tool for generating phylogenic (similarity) networks from Universitat Tubingen. While designed for molecular sequence data, it can also visualize humanities data such as document sequence alignments.
SplitsTree4
SplitsTree4

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

English Metrics

English Metrics is a free tutorial on English metrics, first written in Hypercard and maintained in a StackRunner version for both OS X and Windows environments. It traces the process of scanning iambic pentameter, and includes reference and exercise ...
English Metrics
English Metrics

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words, parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...
Stanford NLP Group: CoreNLP
Stanford NLP Group: CoreNLP

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

CulturalAnalytics

CulturalAnalytics, also known as Cultural Analytics for the Digital Humanities in R, is a free R package of functions for statistical analysis and plotting image properties, developed by Rob Myers specifically for the Digital Humanities, and of value ...
CulturalAnalytics
CulturalAnalytics

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Virtual Lightbox

Virtual Lightbox is a Java-base tool for comparing images online developed for the academic humanities, the library and museum community, and any users who use images as an important data type. It is available as both an application, which functions ...
Virtual Lightbox
Virtual Lightbox

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
Word Cloud - Beta (TAPoRware)
Word Cloud - Beta (TAPoRware)

CollateX

CollateX is a free Java library for collating texts offered by Interedition, designed to be the successor to Peter Robinson's Collate. It uses a component-oriented architecture, to enable users to mix and match components according to their needs. Though ...
CollateX
CollateX

RDF Gravity

RDF Gravity (RDF Graph Visualization Tool) is a free, open-source visualization tool from Salzburg Research's Knowledge Information Systems Group. It supports RDF graph structures and OWL ontologies, including multiple RDF files, and offers a variety ...
RDF Gravity
RDF Gravity

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - Plain Text (TAPoRware)
Collocation - Plain Text (TAPoRware)

AUTOSTYL

AUTOSTYL was a set of linked programs for stylistic analysis developed by Louis T. Milic in the 1980s. Its functions included frequency distributions for letters in words, syllable countering, affix analysis, indexing, and word classification.
AUTOSTYL
AUTOSTYL
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: