TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Visualizing Literature: Sentiment

Sentiment, from the Visualizing Literature toolset, allows users to select a text from Visualizing Literature's list of grade school curriculum exemplar creative commons texts and graph it for keywords suggestive of sentiment. All texts are searchable, ...
Visualizing Literature: Sentiment
Visualizing Literature: Sentiment

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Summarizer - HTML (TAPoRware)

This tool creates a summary of statistical information on a given document, and enables the user to select what types of information to display in the summary. The options include high frequency words, sentences with high frequency words, high frequency ...
Summarizer - HTML (TAPoRware)
Summarizer - HTML (TAPoRware)

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words, parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...
Stanford NLP Group: CoreNLP
Stanford NLP Group: CoreNLP

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

AWK

AWK is a programming language well suited to text processing first developed at Bell Labs in the late 1970s. It was particularly useful to humanists in the late 1980s and early 1990s, and was notably used to develop parts of the ARTFL project.
AWK
AWK

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Web Page Cleaner - Beta (TAPoRware)

This tool removes all HTML formatting from a web page or an uploaded HTML file, leaving the text for further processing. It is particularly good for preparing text-intensive web pages for analysis as plain text.
Web Page Cleaner - Beta (TAPoRware)
Web Page Cleaner - Beta (TAPoRware)

SEASR: NGram Tag Cloud Viewer

SEASR's NGram Tag Cloud Viewer is a free tool for generating a tag cloud from a text hosted at a URL. It can also process PDF documents. Although the page on SEASR's website is no longer active, it can still be viewed via the Internet Archive.
SEASR: NGram Tag Cloud Viewer
SEASR: NGram Tag Cloud Viewer

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

TimeRime

TimeRime is a free, web-based timeline application for creating, viewing and comparing interactive timelines. This application requires registration, and timelines created within the application are public.
TimeRime
TimeRime

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

List Words - XML (TAPoRware)

This tool lists words in an XML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are HTML and plain text versions ...
List Words - XML (TAPoRware)
List Words - XML (TAPoRware)

Overview

Overview is a free, web-based tool for document mining. Developed by and for journalists, Overview is also broadly applicable to other corpus-based research applications. Aside from providing advanced search features such as regular expressions and ...
Overview
Overview

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Textometrica

Textometrica is a free, web based text analysis tool offered by HUMlab at Umeå University. Users can upload a plain-text file and examine its word frequencies, see co-occurrences, and generate visualizations and graphs.
Textometrica
Textometrica

SplitsTree4

SplitsTree4 is a free Java tool for generating phylogenic (similarity) networks from Universitat Tubingen. While designed for molecular sequence data, it can also visualize humanities data such as document sequence alignments.
SplitsTree4
SplitsTree4

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

Flemm v3.1: Analyseur Flexionnel du français pour des corpus étiquetés

Flemm v3.1 is a free, open source tool for lemmatizing and generating inflections for a French text.
Flemm v3.1: Analyseur Flexionnel du français pour des corpus étiquetés
Flemm v3.1: Analyseur Flexionnel du français pour des corpus étiquetés
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: