TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Google News Scraper

Google News Scraper is a free, web-based tool for batch querying news.google.com. Users can specify search terms, news sources, date range, location, language, version of Google (ex. .com, .ca, .co.uk, etc.), and what facets to include in the output. ...
Google News Scraper
Google News Scraper

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

IBM: Many Eyes

Many Eyes is a free collection of data visualization tools enabling exploration and discussion of the data. Users who post comments on a visualization may also save their view for others to see in conjunction with their comment. Visualizations can be ...
IBM: Many Eyes
IBM: Many Eyes

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
Word Cloud - Beta (TAPoRware)
Word Cloud - Beta (TAPoRware)

Ngram Statistics Package (NSP)

The Ngram Statistics Package (NSP) is a free suite for identifying word and character ngrams in large corpora developed by Ted Pederson and his team. It also generates frequency data and co-occurrences, and can generate correlations between two files. ...
Ngram Statistics Package (NSP)
Ngram Statistics Package (NSP)

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - Plain Text (TAPoRware)
Collocation - Plain Text (TAPoRware)

minezy

minezy is a prototype application for exploring and mining large email archives. Its interface is designed to make it easy to find the answers to common questions: who the message originator was, when messages were sent, how frequently, who was included ...
minezy
minezy

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

Fixed Phrase - Plain Text (TAPoRware)

This tool locates fixed phrases of a user-chosen context length containing a specified word and displays all matching phrases in several different ways. Versions are also available for HTML and XML through the TAPoR toolset.
Fixed Phrase - Plain Text (TAPoRware)
Fixed Phrase - Plain Text (TAPoRware)

Sophie

Sophie is a free, creative commons authoring tool from the Institute for the Future of the Book and the University of Southern California's School of Cinematic Arts that enables collaboration, reading and publication. Its highlights include an authoring ...
Sophie
Sophie

Comparator - HTML (TAPoRware)

This tool compares two documents by comparing the words in each according to user specifications. XML and plain text versions are also available in the TAPoRware toolsets.
Comparator - HTML (TAPoRware)
Comparator - HTML (TAPoRware)

Co-Occurrence - Plain Text (TAPoRware)

This tool looks for two words a certain distance apart from one another in a plain text document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and XML ...
Co-Occurrence - Plain Text (TAPoRware)
Co-Occurrence - Plain Text (TAPoRware)

SALT (Statistic Analysis of Language Transcripts)

SALT (Systemic Analysis of Language Transcripts) is a commercial software package for working with large datasets derived from transcripts. It includes transcription tools, text analysis based on natural language processing, content analysis, classification, ...
SALT (Statistic Analysis of Language Transcripts)
SALT (Statistic Analysis of Language Transcripts)

Macro-Etymological Analyzer

The Macro-Etymological Analyzer is a free, open-source tool for etymological analysis of plain-text documents. Users can upload their own file, or choose from a list of pre-loaded texts. The tool conducts a frequency analysis and then identifies the ...
Macro-Etymological Analyzer
Macro-Etymological Analyzer

I Write Like

I Write Like is a free, web-based text analysis tool that compares a user-provided text to the prose of well-known writers using statistics, word choice and writing style analysis. It then reports on which writer the text most resembles. The tool requires ...
I Write Like
I Write Like

SELECT (Sum and Evaluate the Largest Exponentiated Correlation Terms)

SELECT (Sum and Evaluate the Largest Exponentiated Correlation Terms) was a computer program for identifying associationally rich words for content analysis emerging from the WORDS research program developed in the 1970s. Although historically important, ...
SELECT (Sum and Evaluate the Largest Exponentiated Correlation Terms)
SELECT (Sum and Evaluate the Largest Exponentiated Correlation Terms)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: