TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Voyant Bubbles

Bubbles reads the words in a document (or corpus) and displays the highest-frequency words in proportionately large bubbles. Once you arrive at Bubbles, insert or upload your content and let the tool perform its analysis.

LodLive

LodLive is a web-based tool developed to demonstrate Linked Data standards applied to browsing RDF resources with a simple interface. Users can search preset datasets, including DBpedia and Freebase, or search a resource available at a web address of ...

Alt.Text

Alt.Text is a free, working prototype application for exploring a text on both an outline and content level via a graphical user interface. It breaks down texts into components such as sections, passages or documents, and permits users to leverage these ...

Voyant Mandala

Mandala is a visualization tool that imports “textual” files and analyzes the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number of word types (unique words), and lexical density (the ratio of tokens to types).
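The per-document figures described above can be sketched in a few lines of Python. This is only an illustration, not Voyant's implementation: the function name `corpus_grid_row` and the simple regular-expression tokenizer are assumptions, and the ratio is computed exactly as the description words it (tokens divided by types).

```python
import re

def corpus_grid_row(title, text):
    """Return token count, type count, and tokens-per-type for one document.

    Hypothetical helper: tokenization here is a lowercase letter/apostrophe
    match, which is an assumption and may differ from what Voyant does.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    types = set(tokens)  # unique word forms
    ratio = len(tokens) / len(types) if types else 0.0
    return {
        "title": title,
        "tokens": len(tokens),
        "types": len(types),
        "tokens_per_type": ratio,
    }

row = corpus_grid_row("Sample", "the cat sat on the mat and the dog sat too")
print(row)
```

A real corpus overview would call such a helper once per document and list the rows in a table.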

EULER

EULER was a general programming language proposed as a successor to and with many of the characteristics of ALGOL 60. Extended EULER, which included string-handling facilities based on ALGOL W, was useful to humanities research for search, frequency ...

WordHoard

WordHoard is a tool for the study of large texts or transcribed speech. It annotates or tags texts by applying morphological, lexical, prosodic, and narratological criteria. Users may apply WordHoard to their own texts, or to the corpora included with ...

GATE (General Architecture for Text Engineering)

GATE (General Architecture for Text Engineering) is free, open source software offered by the University of Sheffield since 1995. It provides a framework for users to gather a corpus, apply an ontology to it, develop markup, automate the application ...

Tableau Public

Tableau Public is a free data visualization tool aimed at online publishers and academics. It can be used to create an interactive visualization, and also enables users to publish it to the web as an embed or share it via a link. Users must download ...

Textexture

Textexture is a free, web-based tool for visualizing texts as a network. The visualization gives a quick visual summary of the text. Clicking on nodes brings up the excerpts the tool has identified as most relevant, and permits users to locate similar ...

Distribution Graph - XML (TAPoRware)

This tool creates a graphical distribution list of words found within specific XML elements. HTML and plain text versions are also available within the TAPoRware toolsets.

PAIR (Pairwise Alignment for Intertextual Relations)

PAIR (Pairwise Alignment for Intertextual Relations) is a free, open source sequence alignment algorithm for humanities text analysis. It identifies similar passages in a large corpus. Corpora are indexed for future use, and incoming texts can be compared ...

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Toolkit) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OS X and Linux. ...

OrlandoVision (OVis)

An application for visualizing a specific collection of authors, and the links or associations between them. Links are determined by co-occurrence in the Orlando dataset. The current dataset consists of authors, other people associated with them, ...

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...

Voyant Bubblelines

Bubblelines is a visualization tool that helps users understand patterns of word repetition in one or more documents. Each document is represented as a horizontal line and each search term is represented as a bubble – the bubble represents the frequency ...

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
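As a rough illustration of the idea, a keyword-in-context search with a word-based context length might look like the sketch below. It is not TAPoRware's code: the function name `concordance`, its parameters, and the punctuation handling are assumptions, it works on plain text (stripping HTML tags is left out), and it only covers context measured in words.

```python
import re

def concordance(text, pattern, width=3):
    """Return (left context, keyword, right context) for each matching word.

    Hypothetical helper: `width` is the context length in words on each side.
    """
    words = text.split()
    rx = re.compile(pattern, re.IGNORECASE)
    hits = []
    for i, word in enumerate(words):
        # Ignore surrounding punctuation when testing the word against the pattern.
        if rx.fullmatch(word.strip(".,;:!?\"'()")):
            left = " ".join(words[max(0, i - width):i])
            right = " ".join(words[i + 1:i + 1 + width])
            hits.append((left, word, right))
    return hits

sample = "The whale surfaced at dawn. Nobody on deck saw the whale dive again."
for left, keyword, right in concordance(sample, r"whale", width=2):
    print(f"{left:>20} [{keyword}] {right}")
```

Because the pattern is a regular expression, a call such as `concordance(sample, r"wh.*", width=2)` would match any word beginning with "wh".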

etcML

etcML (Easy Text Classification with Machine Learning) is a free text analysis tool from Stanford University that uses machine learning to identify positive and negative sentiments in texts. Users can analyze their own dataset, use a dataset provided ...

Textalyser

Textalyser is a free web-based text analysis tool offered by the Bernhard Huber Internet Engineering Company. Users can paste text into the provided entry field, upload a file or provide a URL for analysis. Textalyser provides detailed statistics on ...

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...

VARD 2

VARD 2 is a free, Creative Commons tool for preprocessing historical corpora. Built in Java, it enables researchers to easily match up historic variant spellings with modern conventions. Though optimized for Early Modern English, other languages can ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media