TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

ProcessingJS

ProcessingJS is an open source visual programming language for the work with data visualization, electronic arts and visual design communities. ProcessingJS operates under the web standards and doesn’t require any plug-ins. The tool promotes visual ...
ProcessingJS
ProcessingJS

Textometrica

Textometrica is a free, web based text analysis tool offered by HUMlab at Umeå University. Users can upload a plain-text file and examine its word frequencies, see co-occurrences, and generate visualizations and graphs.
Textometrica
Textometrica

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.
Voyant Lava
Voyant Lava

FISHER

FISHER was a string-handling package for the FORTRAN programming language. It enabled researchers to manipluate characters and character strings with greater ease than FORTRAN permitted by itself.
FISHER
FISHER

Stanford NLP Group: Stanford Topic Modelling Toolbox

Stanford Topic Modelling Toolbox is a free collection of topic modelling tools, and a part of the Stanford Natural Language Processing toolset. It is designed for social scientists and other users who need to analyze datasets with a substantial textual ...
Stanford NLP Group: Stanford Topic Modelling Toolbox
Stanford NLP Group: Stanford Topic Modelling Toolbox

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

The Networked Corpus

The Networked Corpus is a free, open source tool for navigating a corpus of plain text files via topic modelling using MALLET. The tool consistes of a Python script, and generates a collection of HTML files that can be explored further via the user's ...
The Networked Corpus
The Networked Corpus

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

Stanford NLP Group: Stanford Parser

Stanford Parser is a free Java implementation for the statistical parsing of text, and a part of the Stanford Natural Language Processing toolset. This tool can be used with English, Chinese, German and Arabic. A demo web browser version is also available.  ...
Stanford NLP Group: Stanford Parser
Stanford NLP Group: Stanford Parser

Overview

Overview is a free, web-based tool for document mining. Developed by and for journalists, Overview is also broadly applicable to other corpus-based research applications. Aside from providing advanced search features such as regular expressions and ...
Overview
Overview

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Improvise

Improvise is a free Java software architecture and user interface designed to enable the construction and browsing of interactive visualizations, integrating a declarative visual query language.
Improvise
Improvise

TextDNA

TextDNA is a free tool for large-scale overview analysis of linguistic data offered by the University of Wisconsin, Madison. It identifies patterns within a text, and enables users to compare ordered sets of data with its sequence visualization. It ...
TextDNA
TextDNA

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

GeoTime

GeoTime is a tool for visualizing complex event-related data available for purchase. It can simultaneously visualize geospatial, temporal and link data to display activities and events. Data can be imported from ArcGIS or Microsoft Excel.
GeoTime
GeoTime

Juxta

Juxta is a free, open source tool for comparing and collating texts, originally intended for comparing multiple versions of the same text. It offers several views and visualization options, including histograms and side by side comparison. Juxta is ...
Juxta
Juxta

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

WordCruncher

WordCruncher is long-standing text indexing, retrieval and analysis program offered by Brigham Young University. Its functions include tagging, contextual searcing, collocation and analytical reporting, and its development has been active since the ...
WordCruncher
WordCruncher
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: