TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the most frequent words in a text document, with each word sized according to its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw on SATO's corpus or upload their own for analysis, and the texts for analysis may be ...

WordCruncher

WordCruncher is a long-standing text indexing, retrieval and analysis program offered by Brigham Young University. Its functions include tagging, contextual searching, collocation and analytical reporting, and its development has been active since the ...

CLAWS Part-of-Speech Tagger

CLAWS (Constituent Likelihood Automatic Word-tagging System) is a free part-of-speech tagging tool from Lancaster University. It has been continuously developed since the 1980s, has consistently achieved 96-97% accuracy, and has been applied to the ...

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...

Visual Browser

Visual Browser is a Java application for visualizing RDF data using the Jena framework. The resultant graphs are animated and permit users to expand and hide nodes and switch the view of edges, allowing them to focus on a small part of the network. The ...

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...

STRAP 2.0

STRAP 2.0 (Structural Analysis Programs) was a package of interrelated programs for analyzing tagged texts. It was intended to index, conduct contextual searches and generate distribution diagrams for literary texts. It could also recognize punctuation ...

TACT (Text Analysis Computing Tools)

TACT (Text Analysis Computing Tools) is a historically important text analysis and retrieval system that was developed from 1986 to 1989 at the University of Toronto in cooperation with IBM and remained in use into the 1990s. It was designed to run ...

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...

Google News Scraper

Google News Scraper is a free, web-based tool for batch querying news.google.com. Users can specify search terms, news sources, date range, location, language, version of Google (e.g. .com, .ca, .co.uk), and which facets to include in the output. ...

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...

Mallet

Mallet is a Java-based library and command-line framework that provides statistical and machine learning tools for natural language processing.

TokenX

TokenX is a free text visualization and analysis tool for XML documents. It offers a web-based environment and can generate word clouds, highlight parts of a text such as words, non-words and punctuation, produce KWIC (keyword in context) displays and more. TokenX also ...

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...

Orange

Orange is a free, open source data visualization and analysis tool. It allows users to conduct data mining either through its visual programming language, or via Python scripts, and includes components for machine learning. Orange includes both add-ons ...

Timeline JS

Timeline JS is a free timeline tool that can be populated from either a Google spreadsheet or a JSON file, and can draw media material from a variety of sources, such as Twitter, Flickr, Wikipedia, YouTube and more. It has been designed to be both ...

LEXICO

LEXICO was an interactive system designed to assist lexicographers in text analysis. It assisted in storing, editing and concording texts, lemmatizing word lists and generating 'slips' containing a single word with its lemma, context and source.

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media