TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Voyant Bubbles

Bubbles reads the words in a document (or corpus) and displays the highest-frequency words within proportionately large bubbles. Once you arrive at Bubbles, paste or upload your content and let the tool perform its analysis.

LodLive

LodLive is a web-based tool developed to demonstrate Linked Data standards applied to browsing RDF resources with a simple interface. Users can search preset datasets, including DBpedia and Freebase, or search a resource available at a web address of ...

List Words - XML (TAPoRware)

This tool lists words in an XML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are HTML and plain text versions ...

Voyant Mandala

Mandala is a visualization tool that imports “textual” files to perform analysis on the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.  

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number of word types (unique words), and lexical density (the ratio of tokens to types).
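The per-document figures described above can be illustrated with a minimal Python sketch. This is not Voyant's implementation: the function name `corpus_grid_row` and the tokenization rule (lowercased runs of letters, digits, or apostrophes) are assumptions made for illustration, and the ratio follows the tokens-to-types definition given in the description.

```python
import re

def corpus_grid_row(title, text):
    """Return title, token count, type count, and tokens-per-type ratio."""
    # Assumption: a token is a lowercased run of letters, digits, or apostrophes.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    types = set(tokens)  # unique word forms
    ratio = len(tokens) / len(types) if types else 0.0
    return {
        "title": title,
        "tokens": len(tokens),
        "types": len(types),
        "ratio": ratio,
    }

row = corpus_grid_row("Sample", "the cat sat on the mat the end")
print(row)
```

In this hypothetical sample, "the" occurs three times, so the document has 8 tokens but only 6 types.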

Coggle

Coggle is a web-based tool for visualizing and applying non-linear structuring to information. Though Coggle has been used for mind mapping, it is primarily a method of documenting branching and non-linear information, and encompassing multiple perspectives. ...

WordHoard

WordHoard is a tool for the study of large texts or transcribed speech. It annotates or tags texts by applying morphological, lexical, prosodic, and narratological criteria. Users may apply WordHoard to their own texts, or to the corpora included with ...

GATE (General Architecture for Text Engineering)

GATE (General Architecture for Text Engineering) is free, open source software offered by the University of Sheffield since 1995. It provides a framework for users to gather a corpus, apply an ontology to it, develop markup, automate the application ...

Open Calais Issue Discovery

Open Calais Issue Discovery is a free, open source tool for textual analysis. From a text file, URLs or an Issuecrawler XML file, it generates a ranked table of terms and an overall count corresponding to the most relevant words and phrases in the source ...

Textexture

Textexture is a free, web-based tool for visualizing texts as a network. The visualization gives a quick visual summary of the text. Clicking on nodes brings up the excerpts the tool has identified as most relevant, and permits users to locate similar ...

Distribution Graph - XML (TAPoRware)

This tool creates a graphical distribution list of words found within specific XML elements. HTML and plain text versions are also available within the TAPoRware toolsets.

Principal Components Analysis on Plain Text - Beta (TAPoRware)

This tool applies Principal Components Analysis rules to a text to generate relationships between words and text units. It works best with large texts where users can specify units of over 500 words. HTML and XML versions are not currently available. ...

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Toolkit) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OS X and Linux. ...

OrlandoVision (OVis)

An application for visualizing a specific collection of authors, and the links or associations between them. Links are determined by co-occurrence in the Orlando dataset. The current dataset consists of authors, other people associated with them, ...

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.

Voyant Bubblelines

Bubblelines is a visualization tool that helps users understand patterns of word repetition in one or more documents. Each document is represented as a horizontal line and each search term is represented as a bubble – the bubble represents the frequency ...

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
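As a rough illustration of the concordance idea described above, the following Python sketch finds a word pattern and returns a fixed number of context words on each side. It is not the TAPoRware code: the function name `concordance`, the `context_words` parameter, and the whitespace-based word splitting are assumptions, and the sketch works on plain text rather than on the contents of HTML tags.

```python
import re

def concordance(text, pattern, context_words=3):
    """Return (left context, matched word, right context) for each hit."""
    # Assumption: words are separated by whitespace.
    words = text.split()
    hits = []
    for i, word in enumerate(words):
        # Strip surrounding punctuation before matching, case-insensitively.
        bare = word.strip(".,;:!?\"'()").lower()
        if re.fullmatch(pattern, bare):
            left = " ".join(words[max(0, i - context_words):i])
            right = " ".join(words[i + 1:i + 1 + context_words])
            hits.append((left, word, right))
    return hits

sample = "The whale surfaced. Then the white whale dived again."
hits = concordance(sample, "whale", context_words=2)
for left, word, right in hits:
    print(f"{left:>12} [{word}] {right}")
```

Measuring context in words is only one of the options the description mentions; a sentence-based context would need a sentence splitter in place of `text.split()`.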

PL/C

PL/C (Programming Language Cornell) was a programming language based on PL/I developed at Cornell University in the early 1970s as an entry-level language to facilitate programming instruction. It was occasionally used to introduce humanists to programming ...

Textalyser

Textalyser is a free web-based text analysis tool offered by the Bernhard Huber Internet Engineering Company. Users can paste text into the provided entry field, upload a file or provide a URL for analysis. Textalyser provides detailed statistics on ...

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...

Leximancer

Leximancer is an application for identifying key concepts in a text, and exploring the results through interactive visualizations and data exports. It includes concept and network cloud visualizations, a sentiment lens, a query system, and multiple ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media