TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

RapidMiner

RapidMiner is an open source data mining tool with a GUI interface, available in both a free, unsupported 'community version' and a for-pay, fully supported 'enterprise edition'. It utilizes a standardized XML interchange format for processing, and ...
RapidMiner
RapidMiner

Stanford Mobisocial Lab: Muse

Muse is a free, open source JavaScript tool for reflecting on and searching for patterns in the past by examining one's personal e-mail archive. It analyses e-mail to generate several views including a sentiment graph from messages that may reflect ...
Stanford Mobisocial Lab: Muse
Stanford Mobisocial Lab: Muse

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

Stanford NLP Group: Stanford Phrasal

Stanford Phrasal is a a free Java implementation for phrase-based machine translation. It provides an easy to use API for implementating new decoding model features and supports unique capabilities such as translating using phrases that include gaps ...
Stanford NLP Group: Stanford Phrasal
Stanford NLP Group: Stanford Phrasal

Weighted Centroid - Other (TAPoRware)

The Weighted Centroid is a java applet designed to display a circular graph based on word distribution data. The text is divided up into an arbitrary number of units, which are positioned around the circumference of the circle in a clockwise sequence. ...
Weighted Centroid - Other (TAPoRware)
Weighted Centroid - Other (TAPoRware)

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

CHNM: Timeline Builder (Beta)

Timeline Builder is a free tool for creating and maintaining interactive, Flash-based timelines for viewing on the web. Basic text formatting and links to other resources can be added through basic HTML. This tool requires registration through the web ...
CHNM: Timeline Builder (Beta)
CHNM: Timeline Builder (Beta)

University of Maryland HCI Group: FeatureLens

FeatureLens is a free tool for visualizing and exploring patterns in text collections. This tool integrates the results of text-mining algorithms, and can assist in finding frequent words or ngrams, enabling the discovery of fuzzy repetition patterns. ...
University of Maryland HCI Group: FeatureLens
University of Maryland HCI Group: FeatureLens

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw off SATO's corpus or upload their own for analysis, and the texts for analysis may be ...
SATO (Systeme d'analyse des textes par ordinateur)
SATO (Systeme d'analyse des textes par ordinateur)

Stanford NLP Group: Stanford Topic Modelling Toolbox

Stanford Topic Modelling Toolbox is a free collection of topic modelling tools, and a part of the Stanford Natural Language Processing toolset. It is designed for social scientists and other users who need to analyze datasets with a substantial textual ...
Stanford NLP Group: Stanford Topic Modelling Toolbox
Stanford NLP Group: Stanford Topic Modelling Toolbox

Concordle

Concordle is a free, web based word cloud and concordance tool built in Javascript. It describes itself as the "not so pretty cousin of Wordle" and first debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...
Concordle
Concordle

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...
Domeo Annotation Toolkit
Domeo Annotation Toolkit

Pajek

Pajek is a free tool for large network analysis and visualization in continuous development since 1996. It can handle large datasets from diverse sources, such as collaboration networks, citiation networks, data mining, and Internet-derived networks. ...
Pajek
Pajek

Crawdad Text Analysis Software

Crawdad is a commercial software package for qualitative data analysis based on natural language processing. It generates a network model of a text and calculates word influence based on its position within the network. It also includes visualization, ...
Crawdad Text Analysis Software
Crawdad Text Analysis Software

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...
Orlando Degrees of Separation
Orlando Degrees of Separation

Twitter Capture and Analysis Toolset (DMI-TCAT)

Twitter Capture and Analysis Toolset (DMI-TCAT) is a free, open source tool for capturing and analyzing tweets. The tool's web interface is currently closed to researchers outside the University of Amsterdam's Media Studies department, however, the ...
Twitter Capture and Analysis Toolset (DMI-TCAT)
Twitter Capture and Analysis Toolset (DMI-TCAT)

TokenX

TokenX is a free text visualization and analysis tool for XML documents. It offeres a web-based environment and can generate word clouds, highlight parts of text such as words, non-words and punctuation, KWIC (keyword in context) and more. TokenX also ...
TokenX
TokenX

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...
Laurence Anthony: AntWordProfiler
Laurence Anthony: AntWordProfiler

DocuBurst

DocuBurst is a free web-based visualization tool for exploring the contents of a text.  Visitors can upload their own text or view those provided by others. DocuBurst presents an interactive chart called a ‘radial sunburst’ diagram which organizes ...
DocuBurst
DocuBurst
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: