TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...
Visual Browser
Visual Browser

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

Voyant Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted line – when the lines overlap it means a relevance or linkage within the terms.
Voyant Knots
Voyant Knots

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

I Write Like

I Write Like is a free, web-based text analysis tool that compares a user-provided text to the prose of well-known writers using statistics, word choice and writing style analysis. It then reports on which writer the text most resembles. The tool requires ...
I Write Like
I Write Like

WORDij

WORDij is a free semantic network tool for capturing relationships between words and assigning word-pair link strengths. This information is used as the basis for more sophisticated analysis, such as word network structure mapping. WORDij requires registration ...
WORDij
WORDij

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

cue.language

cue.language is a free library of Java code and resources for basic natural language processing emerging from the development of the Wordle word cloud tool and maintained by Jonathan Feinman of IBM's CUE Research Group. Its functions include tokenization, ...
cue.language
cue.language

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

INRAC Language Compiler

INRAC was an early language compiler useful for prototyping natural language processing interfaces, designing conversational systems, and experimental prose and poetry.
INRAC Language Compiler
INRAC Language Compiler

Textalyser

Textalyser is a free web-based text analysis tool offered by the Bernhard Huber Internet Engineering Company. Users can paste text into the provided entry field, upload a file or provide a URL for analysis. Textalyser provides detailed statistics on ...
Textalyser
Textalyser

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

SIMILE Widgets: Gadget

Gadget, a part of the SIMILE Widgets family of tools, is an open source command line XML inspector for generating useful summaries from large amounts of well-formed XML data. It is particularly valuable when exploring, migrating or cleaning XML, in ...
SIMILE Widgets: Gadget
SIMILE Widgets: Gadget

Stanford NLP Group: Stanford Topic Modelling Toolbox

Stanford Topic Modelling Toolbox is a free collection of topic modelling tools, and a part of the Stanford Natural Language Processing toolset. It is designed for social scientists and other users who need to analyze datasets with a substantial textual ...
Stanford NLP Group: Stanford Topic Modelling Toolbox
Stanford NLP Group: Stanford Topic Modelling Toolbox

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Compare Networks Over Time

Compare Networks Over Time is a free, web-based tool for comparing Issuecrawler networks, or data from a regularly scheduled crawl. The tool displays ranked actor lists for each comparison, in a format suitable for graphing via spreadsheet software. ...
Compare Networks Over Time
Compare Networks Over Time

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Wmatrix

Wmatrix is a free tool for corpus analysis and comparison. It provides a web interface for USAS and CLAWS, in addition to enabling standard corpus linguistic functions such as frequency lists and concordances.
Wmatrix
Wmatrix
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: