TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

CLAWS Part-of-Speech Tagger

CLAWS (Constituent Likelihood Automatic Word-tagging System) is a free parts-of-speech tagging tool from Lancaster University. It has been continuously developed since the 1980s, has consistently achieved 96-97% accuracy, and has been applied to the ...
CLAWS Part-of-Speech Tagger
CLAWS Part-of-Speech Tagger

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

SNOBOL (String Oriented Symbolic Language)

SNOBOL is a programming language developed in the 1960s capable of symbol and free-form string manipulation that was of particular value to early computer-based Humanities research. SNOBOL4 was the final version released by AT&T Bell Laboratories. ...
SNOBOL (String Oriented Symbolic Language)
SNOBOL (String Oriented Symbolic Language)

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw off SATO's corpus or upload their own for analysis, and the texts for analysis may be ...
SATO (Systeme d'analyse des textes par ordinateur)
SATO (Systeme d'analyse des textes par ordinateur)

REDUX

REDUX was a natural language processing program for making generalizations from cases such as case studies, legal cases, case histories and so forth. It searched for repeated patterns, and was presented as a metalanguage for the analysis of the behavior ...
REDUX
REDUX

Speech Tagger - Plain Text (TAPoRware)

The speech tagger allows you to highlight different parts of a text where each part is identified by a different colour. The tool uses TreeTagger to find the different parts of speech. Note that using the TreeTagger tool is very taxing on the server ...
Speech Tagger - Plain Text (TAPoRware)
Speech Tagger - Plain Text (TAPoRware)

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...
Domeo Annotation Toolkit
Domeo Annotation Toolkit

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.
Quadrigram
Quadrigram

CHNM: Scribe

Scribe, now in version 3.5, is a free note-taking program available for both PC and Mac. Aimed particularly at historians, this program allows researchers to create digital note cards for managing sources, research notes, contacts, images, glossaries, ...
CHNM: Scribe
CHNM: Scribe

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...
Orlando Degrees of Separation
Orlando Degrees of Separation

VisualEyes

VisualEyes is web-based authoring tool for creating dynamic visualizations offered by the University of Virginia. It can be used to combine images, maps, charts, video and research data into a single visualization. It is comprehensively documented, ...
VisualEyes
VisualEyes

Visual Understanding Environment (VUE)

Visual Understanding Environment (VUE) is a free, open source application for mapping concepts, ideas and digital content. It enables users to generate nodes and links, and apply a simple set of tools to explore the relationships. VUE is, in the words ...
Visual Understanding Environment (VUE)
Visual Understanding Environment (VUE)

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...
Laurence Anthony: AntWordProfiler
Laurence Anthony: AntWordProfiler

NITE XML Toolkit (NXT)

The NITE XML Toolkit (NXT) is an open source toolkit for working with language corpora, particularly useful for multimodal and cross-annotated data sets.
NITE XML Toolkit (NXT)
NITE XML Toolkit (NXT)

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

BookLamp Labs: Stream Graph Viewer

BookLamp's Stream Graph Viewer is a tool for viewing where and how much of a story element ('StoryDNA') appears in a book. Books can be searched by title or author, and then graphed for one or several pieces of StoryDNA. BookLamp and its tools have ...
BookLamp Labs: Stream Graph Viewer
BookLamp Labs: Stream Graph Viewer

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...
TextSTAT
TextSTAT

RapidMiner

RapidMiner is an open source data mining tool with a GUI interface, available in both a free, unsupported 'community version' and a for-pay, fully supported 'enterprise edition'. It utilizes a standardized XML interchange format for processing, and ...
RapidMiner
RapidMiner
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: