TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

Stanford Mobisocial Lab: Muse

Muse is a free, open source JavaScript tool for reflecting on and searching for patterns in the past by examining one's personal e-mail archive. It analyses e-mail to generate several views including a sentiment graph from messages that may reflect ...
Stanford Mobisocial Lab: Muse
Stanford Mobisocial Lab: Muse

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...
DM (Digital MappaeMundi)
DM (Digital MappaeMundi)

Topic Modelling Tool

Topic Modelling Tool is a free, open source tool for Latent Dirichlet Allocation topic modelling with a graphical user interface. The tool learns topics in a user-supplied corpus of plain text files, and outputs results as a CSV file for further analysis. ...
Topic Modelling Tool
Topic Modelling Tool

Weighted Centroid - Other (TAPoRware)

The Weighted Centroid is a java applet designed to display a circular graph based on word distribution data. The text is divided up into an arbitrary number of units, which are positioned around the circumference of the circle in a clockwise sequence. ...
Weighted Centroid - Other (TAPoRware)
Weighted Centroid - Other (TAPoRware)

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...
CLAS (Computerized Language Analysis System)
CLAS (Computerized Language Analysis System)

Concordle

Concordle is a free, web based word cloud and concordance tool built in Javascript. It describes itself as the "not so pretty cousin of Wordle" and first debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...
Concordle
Concordle

University of Maryland HCI Group: FeatureLens

FeatureLens is a free tool for visualizing and exploring patterns in text collections. This tool integrates the results of text-mining algorithms, and can assist in finding frequent words or ngrams, enabling the discovery of fuzzy repetition patterns. ...
University of Maryland HCI Group: FeatureLens
University of Maryland HCI Group: FeatureLens

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw off SATO's corpus or upload their own for analysis, and the texts for analysis may be ...
SATO (Systeme d'analyse des textes par ordinateur)
SATO (Systeme d'analyse des textes par ordinateur)

INTEX

INTEX is a linguistic development environment active until 2005. It includes large-coverage dictionaries and grammers, can parse texts of several million words in real-time, and tools to create and maintain large-coverage lexical resources, morphological ...
INTEX
INTEX

Concordle

Concordle is a free, web based word cloud and concordance tool built in Javascript. It describes itself as the "not so pretty cousin of Wordle" and first debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...
Concordle
Concordle

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...
Domeo Annotation Toolkit
Domeo Annotation Toolkit

AWK

AWK is a programming language well suited to text processing first developed at Bell Labs in the late 1970s. It was particularly useful to humanists in the late 1980s and early 1990s, and was notably used to develop parts of the ARTFL project.
AWK
AWK

Crawdad Text Analysis Software

Crawdad is a commercial software package for qualitative data analysis based on natural language processing. It generates a network model of a text and calculates word influence based on its position within the network. It also includes visualization, ...
Crawdad Text Analysis Software
Crawdad Text Analysis Software

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...
Orlando Degrees of Separation
Orlando Degrees of Separation

Tesserae

Tesserae is a web-based interface for exploring intertextual parallels. At this stage of the project, it only permits comparison of texts from three curated corpuses, representing selections from canonical Latin, Greek and English literature.
Tesserae
Tesserae

TokenX

TokenX is a free text visualization and analysis tool for XML documents. It offeres a web-based environment and can generate word clouds, highlight parts of text such as words, non-words and punctuation, KWIC (keyword in context) and more. TokenX also ...
TokenX
TokenX

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...
Laurence Anthony: AntWordProfiler
Laurence Anthony: AntWordProfiler

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: