TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw off SATO's corpus or upload their own for analysis, and the texts for analysis may be ...
SATO (Systeme d'analyse des textes par ordinateur)
SATO (Systeme d'analyse des textes par ordinateur)

List Words - XML (TAPoRware)

This tool lists words in an XML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are HTML and plain text versions ...
List Words - XML (TAPoRware)
List Words - XML (TAPoRware)

CLAWS Part-of-Speech Tagger

CLAWS (Constituent Likelihood Automatic Word-tagging System) is a free parts-of-speech tagging tool from Lancaster University. It has been continuously developed since the 1980s, has consistently achieved 96-97% accuracy, and has been applied to the ...
CLAWS Part-of-Speech Tagger
CLAWS Part-of-Speech Tagger

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...
Domeo Annotation Toolkit
Domeo Annotation Toolkit

RapidMiner

RapidMiner is an open source data mining tool with a GUI interface, available in both a free, unsupported 'community version' and a for-pay, fully supported 'enterprise edition'. It utilizes a standardized XML interchange format for processing, and ...
RapidMiner
RapidMiner

Version Variation Visualization

Version Variation Visualization is an interrelated set of free, usable prototypes for visualizing and exploring culturally important works in relation to their world-wide translations. It allows users to explore either the inlcuded corpora or their ...
Version Variation Visualization
Version Variation Visualization

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...
Orlando Degrees of Separation
Orlando Degrees of Separation

FANGORN

FANGORN was a historically important programming language designed for linguistic analysis in the humanities.
FANGORN
FANGORN

TextDNA

TextDNA is a free tool for large-scale overview analysis of linguistic data offered by the University of Wisconsin, Madison. It identifies patterns within a text, and enables users to compare ordered sets of data with its sequence visualization. It ...
TextDNA
TextDNA

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...
Laurence Anthony: AntWordProfiler
Laurence Anthony: AntWordProfiler

Concordance

Concordance is a commerical text analysis and concordance tool originally developed for the Humanities (formerly available via the University of Dundee School of Humanities). It offers features for text analysis, word lists, indexes, full concordances, ...
Concordance
Concordance

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

Voyant Links

Links finds collocates for words and displays links between them using a force directed graph. It shows term frequencies in proximity to keyword. It is a visualization and shows a web of terms.
Voyant Links
Voyant Links

WordCruncher

WordCruncher is long-standing text indexing, retrieval and analysis program offered by Brigham Young University. Its functions include tagging, contextual searcing, collocation and analytical reporting, and its development has been active since the ...
WordCruncher
WordCruncher

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

RelFinder

RelFinder is part of Visual DataWeb, a set of four different tools (RelFinder, SemLens, gFacet, and tFacet) designed less for presentation and more for exploration/mining of RDF data. All the tools are web based and require Flash to run. Key principles ...
RelFinder
RelFinder

IBM: Many Eyes

Many Eyes is a free collection of data visualization tools enabling exploration and discussion of the data. Users who post comments on a visualization may also save their view for others to see in conjunction with their comment. Visualizations can be ...
IBM: Many Eyes
IBM: Many Eyes

LEXICO

LEXICO was an interactive system designed to assist lexographers in text analysis. It assisted in storing, editing and concording texts, lemmatizing word lists and generating 'slips' containing a single word with its lemma, context and source.
LEXICO
LEXICO

INRAC Language Compiler

INRAC was an early language compiler useful for prototyping natural language processing interfaces, designing conversational systems, and experimental prose and poetry.
INRAC Language Compiler
INRAC Language Compiler
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: