TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Lexos

Lexos is a free, web-based workflow for text analysis offered by Wheaton College's Lexomics Project. Lexos is the most recent and distinct iteration of the Lexomics system of tools, enabling users to upload a text and use the Lexos interface to apply ...
Lexos
Lexos

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

SEASR: Date Entities to Simile Timeline

SEASR's Date Entities to Similie Timeline is a free tool for extracting data entities that may be displayed on a timeline. It uses OpenNLP to extract sentences containing dates, and SIMILE Timeline to display them.
SEASR: Date Entities to Simile Timeline
SEASR: Date Entities to Simile Timeline

Visual Understanding Environment (VUE)

Visual Understanding Environment (VUE) is a free, open source application for mapping concepts, ideas and digital content. It enables users to generate nodes and links, and apply a simple set of tools to explore the relationships. VUE is, in the words ...
Visual Understanding Environment (VUE)
Visual Understanding Environment (VUE)

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

Fixed Phrase - HTML (TAPoRware)

This tool locates fixed phrases of a user-chosen context length containing a specified word and displays all matching phrases in several different ways. Versions are also available for XML and plain text through the TAPoR toolset.
Fixed Phrase - HTML (TAPoRware)
Fixed Phrase - HTML (TAPoRware)

WordWanderer

WordWanderer is a free, web-based tool for visualizing and exploring text. It combines search, concordance and word cloud attributes to enable users to explore their texts. Users can paste their own texts into the box provided, or choose from a list ...
WordWanderer
WordWanderer

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

OpenRefine

OpenRefine (formerly Google Refine) is a free, open-source tool for working with messy data. It enables users to clean data, transform it between a variety of formats, extend it with web services, and link it to databases. This tool is available for ...
OpenRefine
OpenRefine

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Typical

Typical was a tool for language-independent corpus exploration. It assessed the significance of co-occuring words in a line and evaluated the significance of the whole line, to help in disambiguating words and finding characteristic example lines.
Typical
Typical

Voyant ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another. Once you arrive to ScatterPlot, insert / upload your content and let the tool perform its analysis. You may hover over these dots and click on them for ...
Voyant ScatterPlot
Voyant ScatterPlot

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

GERTWOL

GERTWOL is a tool for the morphological analysis of German texts. A limited version for analyzing single words is now available for free through the developer's website.
GERTWOL
GERTWOL

word tree

word tree is a free, web-based tool for generating dynamic word trees from user-supplied texts. Users can paste their text directly in the box provided, enter a URL or Twitter handle in the search bar, or install the bookmarklet into their browser's ...
word tree
word tree

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Project Quincy

Project Quincy is a free, open source Django application and MySQL database for tracing the development of social networks and instititions across time and space. Aimed at historians, it enables users to map networks in historical documentation such ...
Project Quincy
Project Quincy

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Tokenize - Plain Text (TAPoR)

This tool splits an HTML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - Plain Text (TAPoR)
Tokenize - Plain Text (TAPoR)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: