TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Juxta

Juxta is a free, open source tool for comparing and collating texts, originally intended for comparing multiple versions of the same text. It offers several views and visualization options, including histograms and side by side comparison. Juxta is ...
Juxta
Juxta

Voyant Skin Builder

Voyant Skin Builder is a function within the larger Voyant toolset that enables users to create a customized selection and arrangement of tools with which to analyze a text. These custom skins can be saved or exported for future use.
Voyant Skin Builder
Voyant Skin Builder

SIMILE Widgets: Timeline

Timeline, a part of the SIMILE Widgets family of tools, is a free, open source tool for creating interactive timelines from temporal data utilizing JavaScript and web markup in HTML and XML.
SIMILE Widgets: Timeline
SIMILE Widgets: Timeline

Concordle

Concordle is a free, web based word cloud and concordance tool built in Javascript. It describes itself as the "not so pretty cousin of Wordle" and first debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...
Concordle
Concordle

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...
Neatline
Neatline

HAWKEYE

HAWKEYE was a program for text analysis for the IBM 370 and developed in the 1970s. It could accept text from an IBM typewriter and had functions to describe text componants such as words and clause lengths, conduct frequency analyses, and generate ...
HAWKEYE
HAWKEYE

Text Variation Explorer (TVE)

Text Variation Explorer (TVE) is a Java tool for text visualization. TVE enables users to explore the effect of window size on a text's type-token ratio, proportion of hapax legomena and average word length; it can also cluster text fragments based ...
Text Variation Explorer (TVE)
Text Variation Explorer (TVE)

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

jsLDA

jsLDA is a free, open source tool for corpus-based in browser topic modelling. Users can test the tool via the provided demo, or download the source code to run on their own system. Users can load any corpus accessible from a URL, and can train the ...
jsLDA
jsLDA

Stanford Vis Group: d3.js - Data Driven Documents

D3.js is a free, open source JavaScript library for manipulating documents with data utilizing HTML5, SVG and CSS3. It is designed to create visualizations that work with current web standards to make the best possible use of the most recent browsers ...
Stanford Vis Group: d3.js - Data Driven Documents
Stanford Vis Group: d3.js - Data Driven Documents

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Prospéro (PROgramme de Sociologie Pragmatique, Expérimentale et Réflexive sur Ordinateur)

Prospéro (PROgramme de Sociologie Pragmatique, Expérimentale et Réflexive sur Ordinateur) is a free text analysis tool aimed at scholars in the Humanities capable of complex analysis on natural language. It permits users to classify and track records ...
Prospéro (PROgramme de Sociologie Pragmatique, Expérimentale et Réflexive sur Ordinateur)
Prospéro (PROgramme de Sociologie Pragmatique, Expérimentale et Réflexive sur Ordinateur)

Timeline JS

Timeline JS is a free timeline tool that can be poplulated from either a Google spreadsheet or a JSON file, and can draw media material from a variety of sources, such as Twitter, Flickr, Wikipedia, YouTube and more. It has been designed to be both ...
Timeline JS
Timeline JS

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

CONCORD

CONCORD was a concordance program developed in 1968 to identify and sort collocating words or phrases. Notably, it allowed users to sort words ending in one of several suffixes (such as -er or -est) to be grouped together.
CONCORD
CONCORD

DocuScope

DocuScope is a text analysis environment first developed in 1998. It contains a suite of interactive visualization tools for corpus-based rhetorical analysis. At present, DocuScope is not available outside the originating research group. However, the ...
DocuScope
DocuScope

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

EXPLEX

EXPLEX (EXtracting information about Proper nouns to provide LEXical information) was a natural language processing system available in the 1990s. It was developed to create lexical entries for nouns appearing in the Wall Street Journal.
EXPLEX
EXPLEX
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: