TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

BookLamp Labs: Suggestion Viewer

BookLamp's Suggestion Viewer is a faceted search and browser tool for finding new books based on how similar they are to another book. Users can search by either title or author and select the book to center the search on. The Suggestion Viewer then ...
BookLamp Labs: Suggestion Viewer
BookLamp Labs: Suggestion Viewer

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

SIMILE Widgets: Timeplot

TimePlot, a part of the SIMILE Widgets family of tools, is a free, open source DHTML-based AJAXy widget for plotting time series and overlaying time-based events over them. The overlays use the same data formats supported by the related SIMILE Widgets ...
SIMILE Widgets: Timeplot
SIMILE Widgets: Timeplot

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

TACT (Text Analysis Computing Tools)

TACT (Text Analysis Computing Tools) is a historically important text analysis and retrieval system that was developed from 1986 to 1989 at the University of Toronto in cooperation with IBM and remained in use into the 1990s. It was designed to run ...
TACT (Text Analysis Computing Tools)
TACT (Text Analysis Computing Tools)

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

Concordance - Plain Text (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in a plain text document. Users may specify the context length, and whether the tool returns the context length in words, sentences, lines or paragraphs. It is also available for XML ...
Concordance - Plain Text (TAPoRware)
Concordance - Plain Text (TAPoRware)

Textable

Orange Textable is a an add-on tool developed for the Orange data mining and visualization package. Its functions include the ability to build data tables from textual data, to import data from other sources, and to build concordances and collocations. ...
Textable
Textable

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...
ATLAS.ti
ATLAS.ti

Collocation - HTML (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - HTML (TAPoRware)
Collocation - HTML (TAPoRware)

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

List Words - HTML (TAPoRware)

This tool lists words in an HTML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and plain text versions ...
List Words - HTML (TAPoRware)
List Words - HTML (TAPoRware)

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

RiTa

RiTa is a free, open-source natural language library for work with generative literature, offered as both a 'core' package of jar files and documentation, and a text-to-speech package. It is designed to be simple and intuitive while still offering flexibility ...
RiTa
RiTa
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: