TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

jsLDA

jsLDA is a free, open source tool for corpus-based in browser topic modelling. Users can test the tool via the provided demo, or download the source code to run on their own system. Users can load any corpus accessible from a URL, and can train the ...
jsLDA
jsLDA

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

Typical

Typical was a tool for language-independent corpus exploration. It assessed the significance of co-occuring words in a line and evaluated the significance of the whole line, to help in disambiguating words and finding characteristic example lines.
Typical
Typical

Voyant RezoViz

Voyant RezoViz is a free, web-based tool in the Voyant toolset for visualizing the relationships between people, locations and organizations in a text or collection of texts.
Voyant RezoViz
Voyant RezoViz

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...
PRORA
PRORA

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...
Visual Browser
Visual Browser

DiscoverText

DiscoverText is a cloud-based textual analytics tool available for purchase. It is designed to collect data from sources such as social media, blogs and a variety of document formats, and generate tag clouds and reports from the resulting corpus.
DiscoverText
DiscoverText

Stanford HCI Group: PaperToolkit

PaperToolkit is a free, open source toolkit for designing pen and paper applications. Applications generated with this tool are intended for printing on a paper surface that allows some digital interaction, such as the related ButterflyNet or Interactive ...
Stanford HCI Group: PaperToolkit
Stanford HCI Group: PaperToolkit

R

R is an open source programing language designed for statistical analysis and parallel computing. R began its life as a research project at the University of Aukland, but has since expanded to become a collaborativly run open source project run by the ...
R
R

Coggle

Coggle is a web-based tool for visualizing and applying non-linear structuring to information. Though Coggle has been used for mind mapping, it is primarily a method of documenting branching and non-linear information, and encompassing multiple perspectives. ...
Coggle
Coggle

CommentSpace

CommentSpace is a free, collaborative tool for analysis of comments and visualizations on a website. It uses tags and links to organize the findings and identify contributions from particular users. This tool can assist analysts in identifying evidence ...
CommentSpace
CommentSpace

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

SentiStrength

SentiStrength is a tool for sentiment analysis available for free to academic users (with registration), in Java and Windows-optimized versions. It estimates the strength of sentiment in short texts, and can handle informal language. Strength is expressed ...
SentiStrength
SentiStrength

Tropes

Tropes is a legacy commercial text analysis tool now available for free. It is designed for natural language processing and semantic classification, including chronological analysis of sequential pieces of text and summarization. It includes a graphical ...
Tropes
Tropes

Wisdom

Wisdom was a word sense disambiguation system that participated in the 1998 SENSEVAL competition. It used a simple supervised learning algorithm, drawing on co-occurrence statistics.
Wisdom
Wisdom

List Words - XML (TAPoRware)

This tool lists words in an XML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are HTML and plain text versions ...
List Words - XML (TAPoRware)
List Words - XML (TAPoRware)

Raining Words - Other (TAPoR)

'Raining Words' is a tool designed to display word frequency information, minus the words in a chosen stop list, from a user-specified document in a Java applet. High frequency words appear larger and move more slowly than lower frequency words. This ...
Raining Words - Other (TAPoR)
Raining Words - Other (TAPoR)

GapVis

GapVis is a free tool developed in conjunction with Google Ancient Places to visualize the locations within and allow the exploration of books referencing ancient places within the Google Books collection. Users can view text summaries, use an enhanced ...
GapVis
GapVis

TACT (Text Analysis Computing Tools)

TACT (Text Analysis Computing Tools) is a historically important text analysis and retrieval system that was developed from 1986 to 1989 at the University of Toronto in cooperation with IBM and remained in use into the 1990s. It was designed to run ...
TACT (Text Analysis Computing Tools)
TACT (Text Analysis Computing Tools)

CATMA (Computer Aided Textual Markup and Analysis)

CATMA (Computer Aided Textual Markup and Analysis) is a free, open source markup and analysis tool from the University of Hamburg's Department of Languages, Literature and Media. It incorporates three interactive modules, a tagger enabling textual markup ...
CATMA (Computer Aided Textual Markup and Analysis)
CATMA (Computer Aided Textual Markup and Analysis)

INL BlackLab

From the official BlackLab site: "BlackLab is a corpus retrieval engine built on top of Apache Lucene. It allows fast, complex searches with accurate hit highlighting on large, tagged and annotated, bodies of text. It was developed at the Institute ...
INL BlackLab
INL BlackLab
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: