TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
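The kind of extraction described above can be sketched with Python's standard-library `html.parser`. This is an illustration only, not the TAPoRware implementation; the class `TagTextExtractor` and the function `extract_text` are hypothetical names introduced for this example.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect the text found inside every occurrence of one tag (hypothetical helper)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag.lower()
        self.depth = 0      # greater than 0 while inside the target tag
        self.chunks = []    # text pieces for the current occurrence
        self.results = []   # one string per completed occurrence

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.results.append("".join(self.chunks).strip())
                self.chunks = []

    def handle_data(self, data):
        # Keep text only while inside the target tag (including nested tags).
        if self.depth > 0:
            self.chunks.append(data)


def extract_text(html, tag):
    """Return a list with the text of each occurrence of `tag` in `html`."""
    parser = TagTextExtractor(tag)
    parser.feed(html)
    parser.close()
    return parser.results


if __name__ == "__main__":
    page = "<html><body><p>Hello <b>world</b></p><div>skip</div><p>Second</p></body></html>"
    print(extract_text(page, "p"))
```

Text inside nested tags (such as the `<b>` element here) is kept as part of the enclosing occurrence, which is one possible design choice among several.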

PhiloGL

PhiloGL is an open source WebGL Framework for advanced data visualization, creative coding and game development. It includes a module system encompassing Program and Shader management, IO, XHR, JSONP, Web Worker management, Effects and Tweening, among ...

Wmatrix

Wmatrix is a free tool for corpus analysis and comparison. It provides a web interface for USAS and CLAWS, in addition to enabling standard corpus linguistic functions such as frequency lists and concordances.

List Words - Plain Text (TAPoRware)

This tool lists words in a plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
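As a rough illustration of word listing, the sketch below counts the words in a string and returns them with their frequencies. It assumes that a "word" is a run of word characters and that case is ignored; the actual tool's tokenization and options are not described here, and `list_words` is a hypothetical name.

```python
import re
from collections import Counter


def list_words(text):
    """Return (word, count) pairs, most frequent first (assumed behaviour)."""
    # Assumption: words are runs of letters, digits or underscores, lowercased.
    words = re.findall(r"\w+", text.lower())
    return Counter(words).most_common()


if __name__ == "__main__":
    for word, count in list_words("The cat and the hat. The end."):
        print(word, count)
```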

INL BlackLab

From the official BlackLab site: "BlackLab is a corpus retrieval engine built on top of Apache Lucene. It allows fast, complex searches with accurate hit highlighting on large, tagged and annotated, bodies of text. It was developed at the Institute ...

SplitsTree4

SplitsTree4 is a free Java tool from Universität Tübingen for generating phylogenetic (similarity) networks. While designed for molecular sequence data, it can also visualize humanities data such as document sequence alignments.

Hypergraph - XML (TAPoRware)

This tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. It requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.

W3C RDF Validation Service

The W3C RDF Validation Service is a free, web-based tool for checking RDF documents for errors and displaying the results. It can display in triples, a graph, or a combination of the two, and can format the graph in a variety of file formats including ...

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English-language text: it provides the base forms of words and their parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.

GINGER II

GINGER II is a word-sense disambiguator for English, offering a new approach to the algorithm developed for GINGER I. It directly extracts semantic disambiguation rules from dictionary example phrases, and semantically tags, syntactically parses and ...

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).

CAPs Finder - Beta (TAPoRware)

This tool finds groups of capital letters (CAPs) in a user-specified plain text file. If the user submits an XML or HTML document, the tool will strip all tags and then process it as plain text. XML and HTML versions are not currently available.
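The behaviour described above (strip any tags, then look for groups of capitals) might be approximated as follows. The sketch assumes that a "group" is a standalone run of two or more ASCII capital letters and that tags can be removed with a simple pattern; both are assumptions made for illustration, and `find_caps` is a hypothetical name, not part of TAPoRware.

```python
import re

# Assumption: anything between "<" and ">" is a tag to be stripped.
TAG_RE = re.compile(r"<[^>]+>")
# Assumption: a CAPs group is a standalone run of two or more capital letters.
CAPS_RE = re.compile(r"\b[A-Z]{2,}\b")


def find_caps(text):
    """Strip XML/HTML tags, then return every group of capital letters."""
    plain = TAG_RE.sub(" ", text)
    return CAPS_RE.findall(plain)


if __name__ == "__main__":
    print(find_caps("<p>The portal accepts XML and HTML files.</p>"))
```

Mixed-case words are ignored under this definition because the pattern only matches runs bounded by non-word characters.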

Voyant Reader

Reader lets the user read all documents within a specified corpus. It does not provide text analysis, only a way of viewing the contents of a corpus.

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...

Co-Occurrence - XML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an XML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and plain text ...
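A minimal sketch of this kind of search, measuring distance in words only and optionally restricting the search to one tag, could look like the following. It uses Python's standard `xml.etree.ElementTree`; the function name `cooccurrences`, its parameters, and the tokenization are assumptions made for illustration rather than the tool's actual interface. Sentence and line limits are not covered.

```python
import re
import xml.etree.ElementTree as ET


def cooccurrences(xml_text, word_a, word_b, max_distance, tag=None):
    """Return the word spans where word_a and word_b lie within max_distance words.

    If `tag` is given, each element with that tag is searched separately;
    otherwise the whole document is searched as one text.
    """
    root = ET.fromstring(xml_text)
    elements = root.iter(tag) if tag else [root]
    a, b = word_a.lower(), word_b.lower()
    hits = []
    for element in elements:
        # Assumption: words are runs of word characters, compared case-insensitively.
        words = re.findall(r"\w+", " ".join(element.itertext()).lower())
        positions_a = [i for i, w in enumerate(words) if w == a]
        positions_b = [i for i, w in enumerate(words) if w == b]
        for i in positions_a:
            for j in positions_b:
                if i != j and abs(i - j) <= max_distance:
                    start, end = min(i, j), max(i, j)
                    hits.append(" ".join(words[start:end + 1]))
    return hits


if __name__ == "__main__":
    doc = ("<doc><l>The cat sat near the dog</l>"
           "<l>A dog barked and later a sleepy cat yawned</l></doc>")
    print(cooccurrences(doc, "cat", "dog", 4, tag="l"))
```

Note that if elements with the chosen tag are nested inside one another, this sketch would report the inner text more than once.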

RDF Gravity

RDF Gravity (RDF Graph Visualization Tool) is a free, open-source visualization tool from Salzburg Research's Knowledge Information Systems Group. It supports RDF graph structures and OWL ontologies, including multiple RDF files, and offers a variety ...

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...

AUTOSTYL

AUTOSTYL was a set of linked programs for stylistic analysis developed by Louis T. Milic in the 1980s. Its functions included frequency distributions for letters in words, syllable counting, affix analysis, indexing, and word classification.
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media