TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

CulturalAnalytics

CulturalAnalytics, also known as Cultural Analytics for the Digital Humanities in R, is a free R package of functions for statistical analysis and plotting image properties, developed by Rob Myers specifically for the Digital Humanities, and of value ...

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...

REDUX

REDUX was a natural language program for making generalizations from cases such as case studies, legal cases, case histories and so forth. It searched for repeated patterns, and was presented as a metalanguage for the analysis of the behavior of types. ...

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
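As a rough illustration of the idea described above (not TAPoRware's actual implementation), a keyword-in-context search over HTML can be sketched with Python's standard library. The names `TextExtractor`, `concordance`, `context_words`, and `only_tag` are hypothetical, and the sketch assumes context is measured in words only.

```python
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects text, optionally only the text inside one tag name (hypothetical helper)."""

    def __init__(self, only_tag=None):
        super().__init__()
        self.only_tag = only_tag
        self.depth = 0      # how many matching tags are currently open
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == self.only_tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.only_tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.only_tag is None or self.depth > 0:
            self.chunks.append(data)


def concordance(html, pattern, context_words=3, only_tag=None):
    """Return (left context, keyword, right context) for each word matching pattern."""
    parser = TextExtractor(only_tag)
    parser.feed(html)
    parser.close()
    words = " ".join(parser.chunks).split()
    regex = re.compile(pattern, re.IGNORECASE)
    hits = []
    for i, word in enumerate(words):
        # Assumption: surrounding punctuation is ignored when matching.
        if regex.fullmatch(word.strip(".,;:!?\"'()")):
            left = " ".join(words[max(0, i - context_words):i])
            right = " ".join(words[i + 1:i + 1 + context_words])
            hits.append((left, word, right))
    return hits


sample = ("<html><body><h1>The whale</h1>"
          "<p>Call me Ishmael. The whale swam away.</p></body></html>")
hits = concordance(sample, "whale", context_words=2, only_tag="p")
print(hits)
```

Passing `only_tag=None` searches the whole document, which in this sample also picks up the heading.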

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...

Voyant Bubblelines

Bubblelines is a visualization tool that helps users understand patterns of word repetition in one or more documents. Each document is represented as a horizontal line and each search term is represented as a bubble – the bubble represents the frequency ...

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...

TUSTEP

TUSTEP (Tübingen System of Text Processing Programs) is a free, open source, widely used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both Latin and non-Latin scripts, and is primarily designed for humanities applications. ...

ARRAS

ARRAS is a historically important tool for analyzing and concording text. It notably provided inspiration for the TACT system.

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number of word types (unique words), and lexical density (the ratio of tokens to types).
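The per-document figures described above can be illustrated with a minimal Python sketch. This is not Voyant's code: the function name `corpus_grid_row` and the tokenization rule are assumptions made for illustration, and the ratio follows the wording of the description (tokens divided by types). Some tools report the inverse, types divided by tokens.

```python
import re


def corpus_grid_row(title, text):
    """Summarize one document: token count, type count, and tokens per type."""
    # Assumption: a token is a lowercased run of letters, digits, or apostrophes.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    types = set(tokens)
    ratio = len(tokens) / len(types) if types else 0.0
    return {
        "title": title,
        "tokens": len(tokens),
        "types": len(types),
        "tokens_per_type": ratio,
    }


row = corpus_grid_row("sample", "the cat sat on the mat")
print(row)
```

In the sample sentence, "the" occurs twice, so there are six tokens but only five types.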

Voyant Reader

Reader provides a way to read all documents within a specified corpus. It does not perform text analysis; it is a means of viewing the contents of a corpus.

LISP

LISP is a longstanding high-level programming language first developed in the late 1950s. Within the humanities, it was applied to computational linguistics and natural language processing research.

Google Visualization API with RDF

The Google Visualization API provides a platform to create, share and reuse visualizations written by the developer community. The platform can also be used to create reports and dashboards, and to analyze and display the data through the visualization ...

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for qualitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.

SEASR: Date Entities to Simile Timeline

SEASR's Date Entities to Simile Timeline is a free tool for extracting date entities that may be displayed on a timeline. It uses OpenNLP to extract sentences containing dates, and SIMILE Timeline to display them.

Twitter Capture and Analysis Toolset (DMI-TCAT)

Twitter Capture and Analysis Toolset (DMI-TCAT) is a free, open source tool for capturing and analyzing tweets. The tool's web interface is currently closed to researchers outside the University of Amsterdam's Media Studies department; however, the ...

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...

TEXTCORD

TEXTCORD is a concordance program available in the 1980s. It was developed in SPITBOL at the University of Toronto's Divinity School.

BookLamp Labs: Stream Graph Viewer

BookLamp's Stream Graph Viewer is a tool for viewing where and how much of a story element ('StoryDNA') appears in a book. Books can be searched by title or author, and then graphed for one or several pieces of StoryDNA.

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media