TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
Word Cloud - Beta (TAPoRware)
Word Cloud - Beta (TAPoRware)

Pundit

Pundit is a free, creative commons tool developed by the SemLib Project for creating structured annotations of web pages. These annotations can be collected in virtual notebooks and shared to create collaborative structured data. Annotations may be ...
Pundit
Pundit

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - Plain Text (TAPoRware)
Collocation - Plain Text (TAPoRware)

Tokenize - XML (TAPoR)

This tool splits an XML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - XML (TAPoR)
Tokenize - XML (TAPoR)

Voyant Document KWICs

Document KWICs shows a table of keywords in their context. In other words, it provides a list of certain keywords and their occurrence within a corpus or document.
Voyant Document KWICs
Voyant Document KWICs

Fixed Phrase - Plain Text (TAPoRware)

This tool locates fixed phrases of a user-chosen context length containing a specified word and displays all matching phrases in several different ways. Versions are also available for HTML and XML through the TAPoR toolset.
Fixed Phrase - Plain Text (TAPoRware)
Fixed Phrase - Plain Text (TAPoRware)

Tom Sawyer Perspectives

Tom Sawyer Perspectives is designed by Tom Sawyer Software to facilitate creation of complex visualization and analysis applications. The solution represents a complete Software Development Kit (SDK) with a graphics-based design and preview environment. ...
Tom Sawyer Perspectives
Tom Sawyer Perspectives

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Co-Occurrence - Plain Text (TAPoRware)

This tool looks for two words a certain distance apart from one another in a plain text document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and XML ...
Co-Occurrence - Plain Text (TAPoRware)
Co-Occurrence - Plain Text (TAPoRware)

Pattern (CLiPS)

Pattern (CLiPS) is a web mining module for Python designed for computational linguistic and psycholinguistic research. It integrates tools for data retrieval from search engines, social media, web spiders and individual websites, and can accomplish ...
Pattern (CLiPS)
Pattern (CLiPS)

word tree

word tree is a free, web-based tool for generating dynamic word trees from user-supplied texts. Users can paste their text directly in the box provided, enter a URL or Twitter handle in the search bar, or install the bookmarklet into their browser's ...
word tree
word tree

I Write Like

I Write Like is a free, web-based text analysis tool that compares a user-provided text to the prose of well-known writers using statistics, word choice and writing style analysis. It then reports on which writer the text most resembles. The tool requires ...
I Write Like
I Write Like

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Leximancer

Leximancer is an application for identifying key concepts in a text, and exploring the results through interactive visualizations and data exports. It includes concept and network cloud visualizations, a sentiment lens, a query system, and multiple ...
Leximancer
Leximancer

Voyant Skin Builder

Voyant Skin Builder is a function within the larger Voyant toolset that enables users to create a customized selection and arrangement of tools with which to analyze a text. These custom skins can be saved or exported for future use.
Voyant Skin Builder
Voyant Skin Builder

UMDHMM: Hidden Markov Model Toolkit

The UMDHMM: Hidden Markov Model Toolkit is a free UNIX implementation of hidden Markov models useful for speech recognition from Tapas Kanungo. It includes implementations of the Forward-Backward, Viterbi and Baum-Welch algorithms.
UMDHMM: Hidden Markov Model Toolkit
UMDHMM: Hidden Markov Model Toolkit

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...
Neatline
Neatline

GRIPHOS (General Retrieval and Information Processor for Humanities Oriented Studies)

GRIPHOS (General Retrieval and Information Processor for Humanities Oriented Studies) is a historically important suite of programs used extensively within the Museum Computer Network, a shared information management system developed jointly by fifteen ...
GRIPHOS (General Retrieval and Information Processor for Humanities Oriented Studies)
GRIPHOS (General Retrieval and Information Processor for Humanities Oriented Studies)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: