TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

KH Coder

KH Coder is a tool for quantitative content analysis and text mining that has been under continuous development since 2001. It was originally developed for Japanese text and now supports numerous other languages, including English, Italian and French. ...
KH Coder
KH Coder

OrlandoVision (OVis)

An application for visualizing a specific collection of authors, and the links or associations between them.  Links are determined by co-occurrence in the Orlando dataset.  The current dataset consists of authors, other people associated with them, ...
OrlandoVision (OVis)
OrlandoVision (OVis)

KLIC (Key Letter in Context)

KLIC (Key Letter in Context) was a program for graphological anaylsis particularly useful to scholars analyzing the language of medieval texts, created by Grace B. Logan of the Waterloo Arts Computing Office. It assisted in determining where different ...
KLIC (Key Letter in Context)
KLIC (Key Letter in Context)

Voyant ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another. Once you arrive to ScatterPlot, insert / upload your content and let the tool perform its analysis. You may hover over these dots and click on them for ...
Voyant ScatterPlot
Voyant ScatterPlot

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

Issue Discovery

Issue Discovery is a tool for extracting the most prominent words and phrases from user-supplied URLs, texts or Issuecrawler XML result files. It is designed as a heuristic for data exploration rather than as an empirical tool.
Issue Discovery
Issue Discovery

SEASR: NGram Tag Cloud Viewer

SEASR's NGram Tag Cloud Viewer is a free tool for generating a tag cloud from a text hosted at a URL. It can also process PDF documents. Although the page on SEASR's website is no longer active, it can still be viewed via the Internet Archive.
SEASR: NGram Tag Cloud Viewer
SEASR: NGram Tag Cloud Viewer

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...
Aggregator - Other (TAPoRware)
Aggregator - Other (TAPoRware)

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

INRAC Language Compiler

INRAC was an early language compiler useful for prototyping natural language processing interfaces, designing conversational systems, and experimental prose and poetry.
INRAC Language Compiler
INRAC Language Compiler

RDF Gravity

RDF Gravity (RDF Graph Visualization Tool) is a free, open-source visualization tool from Salzburg Research's Knowledge Information Systems Group. It supports RDF graph structures and OWL ontologies, including multiple RDF files, and offers a variety ...
RDF Gravity
RDF Gravity

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

Link Extractor - HTML (TAPoRware)

This tool extracts all the URLs, except image sources, in an HTML text. It also converts relative links to clickable absolute links.
Link Extractor - HTML (TAPoRware)
Link Extractor - HTML (TAPoRware)

Keywords Finder - Beta (TAPoRware)

This tool identifies keywords or key phrases within a user-specified text, using the assumption that they will appear with the greatest frequency. It applies a stemmer to every word. Plain text input is recommended. All tags will be stripped from an ...
Keywords Finder - Beta (TAPoRware)
Keywords Finder - Beta (TAPoRware)

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

Raw Grep - Other (TAPoRware)

This tool can find a user-specified string of characters anywhere in a text document. If the string is part of a word or another non-space string, the word or the non-space string will be displayed as a unit. The search can also be used to view a concordance ...
Raw Grep - Other (TAPoRware)
Raw Grep - Other (TAPoRware)

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

VARD 2

VARD 2 is a free, creative commons tool for preprocessing historical corpora. Built in Java, it enables researchers to easily match up historic variant spellings with modern conventions. Though optimized for Early Modern English, other languages can ...
VARD 2
VARD 2
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator Dutch English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: