TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

ALGOL

ALGOL (Algorithmic Language) was a programming language developed in the mid 1950s and often discussed in relation to humanities research applications, such as concordances or computational linguistics.
ALGOL
ALGOL

Textalyser

Textalyser is a free web-based text analysis tool offered by the Bernhard Huber Internet Engineering Company. Users can paste text into the provided entry field, upload a file or provide a URL for analysis. Textalyser provides detailed statistics on ...
Textalyser
Textalyser

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

Digitate

Digitate is a free, open source application for making notes and annotations directly on an image of a cultural artifact such as a manuscript or a painting. Images can also be grouped into projects, saved for later or exported. This application is only ...
Digitate
Digitate

Leximancer

Leximancer is an application for identifying key concepts in a text, and exploring the results through interactive visualizations and data exports. It includes concept and network cloud visualizations, a sentiment lens, a query system, and multiple ...
Leximancer
Leximancer

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

Versioning Machine

The Versioning Machine, now in version 4.0, is a framework and an interface for displaying multiple versions of text, and encodes the text according to TEI guidelines. It incorporates features found in both critical editions and electronic publication, ...
Versioning Machine
Versioning Machine

University of Maryland HCI Group: FeatureLens

FeatureLens is a free tool for visualizing and exploring patterns in text collections. This tool integrates the results of text-mining algorithms, and can assist in finding frequent words or ngrams, enabling the discovery of fuzzy repetition patterns. ...
University of Maryland HCI Group: FeatureLens
University of Maryland HCI Group: FeatureLens

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

Google Blogsearch Scraper

The Google Blogsearch Scraper is a free, web-based tool for batch querying Google Blog Search. Users can submit a list of URLs to search, and a set of keywords to apply to them. For each query, the tool supplies a tag cloud and an HTML table, plus a ...
Google Blogsearch Scraper
Google Blogsearch Scraper

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Open Calais Issue Discovery

Open Calais Issue Discovery is a free, open source tool for textual analysis. From a text file, URLs or an Issuecrawler XML file, it generates a ranked table of terms and an overall count corresponding to the most relevant words and phrases in the source ...
Open Calais Issue Discovery
Open Calais Issue Discovery

NodeXL

NodeXL is a free, open source tool for generating and exploring network graphs from Microsoft Excel files. It is particularly suited to data from social media sources, and includes the ability to directly import data from Twitter, YouTube, Flickr and ...
NodeXL
NodeXL

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

BookLamp Labs: StoryDNA Viewer

BookLamp's StoryDNA Viewer is a corpus search tool designed to find all books sharing a user-selected set of characteristics ('StoryDNA'). From the resultant list, users can then view a book's entry in BookLamp to get more information about what characteristics ...
BookLamp Labs: StoryDNA Viewer
BookLamp Labs: StoryDNA Viewer

EURAC: Comparison Arcs

Comparison Arcs is a free proof of concept tool demonstrating a method for comparing the linguistic properties of multiple documents in graphical displays. Both texts may be searched in parallel for words, lemmas and parts of speech. This project is ...
EURAC: Comparison Arcs
EURAC: Comparison Arcs

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...
Neatline
Neatline
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: