TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

University of Maryland HCI Group: FeatureLens

FeatureLens is a free tool for visualizing and exploring patterns in text collections. This tool integrates the results of text-mining algorithms and can assist in finding frequent words or n-grams, enabling the discovery of fuzzy repetition patterns. ...

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
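The counting and sizing step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the TAPoRware implementation: the function names `top_words` and `font_sizes`, the tiny `STOP_WORDS` set (a stand-in for the modified Glasgow list) and the linear size scaling are all hypothetical.

```python
import re
from collections import Counter

# Tiny illustrative stop list; NOT the modified Glasgow list the tool uses.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is"}


def top_words(text, n=10, use_stop_words=True):
    """Return the n most frequent words as (word, count) pairs."""
    words = re.findall(r"[a-z']+", text.lower())
    if use_stop_words:
        words = [w for w in words if w not in STOP_WORDS]
    return Counter(words).most_common(n)


def font_sizes(counts, min_size=10, max_size=40):
    """Map each word's frequency linearly onto a font-size range."""
    if not counts:
        return {}
    freqs = [c for _, c in counts]
    lo, hi = min(freqs), max(freqs)
    span = (hi - lo) or 1  # avoid division by zero when all counts are equal
    return {
        w: min_size + (c - lo) * (max_size - min_size) / span
        for w, c in counts
    }


if __name__ == "__main__":
    sample = "the cat and the dog and the cat"
    print(font_sizes(top_words(sample, n=5)))
```

A real word cloud would pass these sizes to a layout or drawing routine; only the frequency-to-size mapping is shown here.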

Tesseract OCR

Tesseract is a free raw OCR engine originally developed by HP Labs and now maintained by Google. It works with the Leptonica Image Processing Library, and is capable of reading a variety of image formats. It can convert images to text in over 40 languages. ...

BookLamp Labs: Sentiment Viewer

BookLamp's Sentiment Viewer is a visualization tool that graphs out a book's sentiments according to intensity and location in the text. Books to graph can be searched by title or author.

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...

SUSS (Sunderland University SENSEVAL System)

SUSS (Sunderland University SENSEVAL System) was an algorithm for word sense disambiguation developed for the inaugural SENSEVAL event in 1998.

Concordance - Plain Text (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in a plain text document. Users may specify the context length, and whether the tool returns the context length in words, sentences, lines or paragraphs. It is also available for XML ...
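As a rough illustration of what a concordance does, the sketch below returns keyword-in-context rows with the context measured in words. It is an assumption-laden example rather than the TAPoRware code: the function name `concordance`, the whitespace tokenization and the punctuation stripping are hypothetical choices, and sentence, line or paragraph contexts are not covered.

```python
import re


def concordance(text, pattern, context=3):
    """Return (left, match, right) rows for tokens matching `pattern`.

    `context` is the number of words kept on each side of the match.
    """
    tokens = text.split()
    regex = re.compile(pattern, re.IGNORECASE)
    rows = []
    for i, tok in enumerate(tokens):
        # Ignore surrounding punctuation when testing the token.
        if regex.fullmatch(tok.strip(".,;:!?\"'()")):
            left = " ".join(tokens[max(0, i - context):i])
            right = " ".join(tokens[i + 1:i + 1 + context])
            rows.append((left, tok, right))
    return rows


if __name__ == "__main__":
    sample = "The quick brown fox jumps over the lazy dog."
    for left, hit, right in concordance(sample, "the", context=1):
        print(f"{left:>10} [{hit}] {right}")
```

Because `pattern` is compiled as a regular expression, a call such as `concordance(text, "jump.*")` would also match inflected forms.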

Fixed Phrase - Plain Text (TAPoRware)

This tool locates fixed phrases of a user-chosen context length containing a specified word and displays all matching phrases in several different ways. Versions are also available for HTML and XML through the TAPoR toolset.

PL/C

PL/C (Programming Language Cornell) was a programming language based on PL/I developed at Cornell University in the early 1970s as an entry-level language to facilitate programming instruction. It was occasionally used to introduce humanists to programming ...

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...

Co-Occurrence - Plain Text (TAPoRware)

This tool looks for two words a certain distance apart from one another in a plain text document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and XML ...
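The idea of finding two words within a given distance can be illustrated with a short Python sketch. It assumes the distance is counted in words only; the function name `co_occurrences` and the simple `\w+` tokenization are hypothetical, and the tag filtering and the sentence or line limits mentioned above are not modelled.

```python
import re


def co_occurrences(text, word_a, word_b, max_distance=5):
    """Return (index_a, index_b) token positions where the two words
    appear within `max_distance` words of each other."""
    tokens = re.findall(r"\w+", text.lower())
    a, b = word_a.lower(), word_b.lower()
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return [
        (i, j)
        for i in pos_a
        for j in pos_b
        if i != j and abs(i - j) <= max_distance
    ]


if __name__ == "__main__":
    sample = "The king spoke and the queen listened while the king left"
    print(co_occurrences(sample, "king", "queen", max_distance=4))
```

Comparing every pair of positions is quadratic in the number of hits, which is acceptable for a short document but would need a smarter scan for large corpora.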

Issue Discovery

Issue Discovery is a tool for extracting the most prominent words and phrases from user-supplied URLs, texts or Issuecrawler XML result files. It is designed as a heuristic for data exploration rather than as an empirical tool.

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.

I Write Like

I Write Like is a free, web-based text analysis tool that compares a user-provided text to the prose of well-known writers using statistics, word choice and writing style analysis. It then reports on which writer the text most resembles. The tool requires ...

NITE XML Toolkit (NXT)

The NITE XML Toolkit (NXT) is an open source toolkit for working with language corpora, particularly useful for multimodal and cross-annotated data sets.

SIMILE Widgets: Timeline

Timeline, a part of the SIMILE Widgets family of tools, is a free, open source tool for creating interactive timelines from temporal data utilizing JavaScript and web markup in HTML and XML.

Voyant Skin Builder

Voyant Skin Builder is a function within the larger Voyant toolset that enables users to create a customized selection and arrangement of tools with which to analyze a text. These custom skins can be saved or exported for future use.

List Words - Plain Text (TAPoRware)

This tool lists words in a plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...

Voyant ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another. Once you arrive at ScatterPlot, insert or upload your content and let the tool perform its analysis. You may hover over the plotted dots and click on them for ...

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...

WebLicht

WebLicht is an architecture for creating annotated text corpora. It offers a fully-functional virtual research environment with chains of RESTful web services, each providing a linguistic tool such as format conversion, tokenizing, tagging or parsing. ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media