TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

DEREDEC

DEREDEC was a programming system and workbench for linguistics and text analysis written in LISP in the 1980s. It enabled syntactic and texual parsing, and could link phrases by their contenxual dependency relations.
DEREDEC
DEREDEC

Wmatrix

Wmatrix is a free tool for corpus analysis and comparison. It provides a web interface for USAS and CLAWS, in addition to enabling standard corpus linguistic functions such as frequency lists and concordances.
Wmatrix
Wmatrix

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

SplitsTree4

SplitsTree4 is a free Java tool for generating phylogenic (similarity) networks from Universitat Tubingen. While designed for molecular sequence data, it can also visualize humanities data such as document sequence alignments.
SplitsTree4
SplitsTree4

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words, parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...
Stanford NLP Group: CoreNLP
Stanford NLP Group: CoreNLP

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

SARIT (Search and Retrieval of Indic Texts)

SARIT (Search and Retrieval of Indic Texts) is a library of Indic texts with tools for search, retrieval and analysis built into the website. This includes options to apply a KWIC or concordance encompassing all matching texts for a site search. SARIT ...
SARIT (Search and Retrieval of Indic Texts)
SARIT (Search and Retrieval of Indic Texts)

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
Word Cloud - Beta (TAPoRware)
Word Cloud - Beta (TAPoRware)

List Tags - HTML (TAPoRware)

This tool lists all tags found in an HTML document, either uploaded by the user or from a web address. It is part of the TAPoRware collection of tools; see List XML Elements for an XML tool with similar functionality.
List Tags - HTML (TAPoRware)
List Tags - HTML (TAPoRware)

RDF Gravity

RDF Gravity (RDF Graph Visualization Tool) is a free, open-source visualization tool from Salzburg Research's Knowledge Information Systems Group. It supports RDF graph structures and OWL ontologies, including multiple RDF files, and offers a variety ...
RDF Gravity
RDF Gravity

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - Plain Text (TAPoRware)
Collocation - Plain Text (TAPoRware)

BIBCON

BIBCON was a key-word-out-of-context system for concordances developed in FORTRAN and available in the 1960s.  It was a modified version of another system, KWIC.
BIBCON
BIBCON
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: