TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...
PRORA
PRORA

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.
Quadrigram
Quadrigram

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

TEXTCORD

TEXTCORD is a concordance program available in the 1980s. It was developed in SPITBOL at the University of Toronto's Divinity School.
TEXTCORD
TEXTCORD

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

TACT (Text Analysis Computing Tools)

TACT (Text Analysis Computing Tools) is a historically important text analysis and retrieval system that was developed from 1986 to 1989 at the University of Toronto in cooperation with IBM and remained in use into the 1990s. It was designed to run ...
TACT (Text Analysis Computing Tools)
TACT (Text Analysis Computing Tools)

Jigsaw

Jigsaw is a free visual analytics application for exploring collections of documents such as text or spreadsheets. It is aimed at analysists and researchers, particularly to "help analysts reach more timely and accurate understandings of the larger ...
Jigsaw
Jigsaw

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Mandala Browser

Mandala Browser is a rich-prospect browsing interface for exploring a data set in .txt, .rft, .pdf, .csv or .xml format. Searches can be constrained by columns or fields. A version of this tool is also available in the Voyant toolset.
Mandala Browser
Mandala Browser

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

TEXTPACK V

TEXTPACK V is a historic collection of interrelated text analysis utilities first released for mainframe computers in the 1970s. With the fifth edition, released in the 1980s, it was ported from FORTRAN to run on PC.
TEXTPACK V
TEXTPACK V

WordSeer

WordSeer is a free, simple to use text analysis tool offered by the University of California Berkeley. It is entirely web based, with tools for search, reading and interrogating, heat maps and frequency. At this time, the only corpus available for exploration ...
WordSeer
WordSeer

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

ARRAS

ARRAS is a historically important tool for analyzing and concording text. It notably provided inspiration for the TACT system.
ARRAS
ARRAS

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Co-Occurrence - XML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an XML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and plain text ...
Co-Occurrence - XML (TAPoRware)
Co-Occurrence - XML (TAPoRware)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: