TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

T-PEN

T-PEN is a free toolset from the Center for Digital Theology at Saint Louis University for working with images of manuscripts and attaching transcription data. It has features for collaborative transcription in XML, multiple data viewing, data manipulation ...
T-PEN
T-PEN

TAGS (Twitter Archiving Google Spreadsheet) v5.1

TAGS (Twitter Archiving Google Spreadsheet) v5.1 is a tool for automatically pulling Twitter search results into a Google spreadsheet for further analysis. Users can set TAGS 5.1 to update the resulting archive hourly or at a frequency of their specification, ...
TAGS (Twitter Archiving Google Spreadsheet) v5.1
TAGS (Twitter Archiving Google Spreadsheet) v5.1

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...
Aggregator - Other (TAPoRware)
Aggregator - Other (TAPoRware)

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

University of Maryland HCI Group: FeatureLens

FeatureLens is a free tool for visualizing and exploring patterns in text collections. This tool integrates the results of text-mining algorithms, and can assist in finding frequent words or ngrams, enabling the discovery of fuzzy repetition patterns. ...
University of Maryland HCI Group: FeatureLens
University of Maryland HCI Group: FeatureLens

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

BookLamp Labs: Suggestion Viewer

BookLamp's Suggestion Viewer is a faceted search and browser tool for finding new books based on how similar they are to another book. Users can search by either title or author and select the book to center the search on. The Suggestion Viewer then ...
BookLamp Labs: Suggestion Viewer
BookLamp Labs: Suggestion Viewer

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

Edit Flow

Edit Flow is a free, open source WordPress plugin designed for editorial collaboration. It includes a private discussion environment, metadata management and workflow tools.
Edit Flow
Edit Flow

SEASR: OpenNLP Entities To Protovis Network Graph

SEASR's OpenNLP Entities to Protovis Network Graph is a free tool for extracting entities within a specified sentence distance within a text. The OpenNLP system is used to extract entities, and their relationships are represented in a link node network ...
SEASR: OpenNLP Entities To Protovis Network Graph
SEASR: OpenNLP Entities To Protovis Network Graph

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

Stanford NLP Group: Stanford Tokenizer

Stanford Tokenizer is a free Java implementation for diving an English text into tokens such as words, and a part of the Stanford Natural Language Processing toolset. This tool is not available on its own, but is bundled with other tools in the same ...
Stanford NLP Group: Stanford Tokenizer
Stanford NLP Group: Stanford Tokenizer

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Stanford Vis Group: Data Wrangler

Data Wrangler is free web-based tool for interactive data cleaning and transformation. It takes raw data and transforms it into data tables for further analysis and cleaning. The results can be exported in a variety of data table formats, including ...
Stanford Vis Group: Data Wrangler
Stanford Vis Group: Data Wrangler

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Umigon

Umigon is a free, web-based and open-source tool for sentiment analysis of tweets. From a person's Twitter handle, Umigon retrieves that account's tweets and processes it for sentiment with accounting for factual statements (ex: "I hate war" will be ...
Umigon
Umigon
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: