TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Comparator - Plain Text (TAPoRware)

This tool compares two documents by comparing the words in each according to user specifications. HTML and XML versions are also available in the TAPoRware toolsets.

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...

COGS-3

COGS-3 was a general-purpose concordance program for IBM and DEC mainframe computers at the University of Toronto. It was developed in the 1980s in PL/I and also included a lemmatization feature.

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...

Keywords Finder - Beta (TAPoRware)

This tool identifies keywords or key phrases within a user-specified text, using the assumption that they will appear with the greatest frequency. It applies a stemmer to every word. Plain text input is recommended. All tags will be stripped from an ...

Mallet

Mallet is a Java-based library and command-line framework that provides statistical and machine learning tools for use with natural language processing.

Neatline

Neatline is a free, open-source geotemporal exhibit-builder for creating complex maps and narrative sequences from collections of archives and artifacts. It is first and foremost a suite of plugins for the Omeka framework, but can also be accessed as ...

Concord, the Interactive Concordance Generator (Virtual Muse)

Concord is a free concordance generator written in Python for OS X and Windows XP (a new Windows version is forthcoming). It generates a KWIC concordance entry from a plain-text file for any word or phrase and permits sorting in a variety of ways. Concord ...
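A KWIC (keyword-in-context) concordance of the kind described above can be sketched in a few lines of Python. This is only an illustration of the general technique, not Concord's own code: the function names `kwic` and `sort_by_right_context`, the `width` parameter, and the whole-word, case-insensitive matching are all assumptions made for the example.

```python
import re

def kwic(text, keyword, width=30):
    """Return (left, match, right) tuples for each whole-word match.

    `width` is the number of characters of context kept on each side
    (a hypothetical parameter chosen for this sketch).
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    rows = []
    for m in pattern.finditer(text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        rows.append((left, m.group(0), right))
    return rows

def sort_by_right_context(rows):
    """Sort concordance rows alphabetically by the text after the keyword."""
    return sorted(rows, key=lambda row: row[2].lower())

if __name__ == "__main__":
    sample = "The cat sat. A cat ran. The dog saw the cat nap."
    for left, word, right in sort_by_right_context(kwic(sample, "cat", width=10)):
        # Right-align the left context so the keywords line up in a column.
        print(f"{left:>10} [{word}] {right}")
```

Sorting on the right-hand context is one of several possible orderings; sorting on the left context or on position in the file would follow the same pattern.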

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used to divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...

Acronym Finder - Beta (TAPoRware)

This tool locates acronyms and matches them with the corresponding full name from a user-specified input text.

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars and available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...

TimeRime

TimeRime is a free, web-based timeline application for creating, viewing and comparing interactive timelines. This application requires registration, and timelines created within the application are public.

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.

DM (Digital MappaeMundi)

The DM (Digital MappaeMundi) is an environment for studying and annotating images and texts. It enables users to link together images, texts, or fragments of images or texts, such as a textual annotation of an image or text. It is aimed at scholars ...

Pattern (CLiPS)

Pattern (CLiPS) is a web mining module for Python designed for computational linguistic and psycholinguistic research. It integrates tools for data retrieval from search engines, social media, web spiders and individual websites, and can accomplish ...

SIMILE Widgets: Timeline

Timeline, part of the SIMILE Widgets family of tools, is a free, open-source tool for creating interactive timelines from temporal data using JavaScript and web markup in HTML and XML.

CLAS (Computerized Language Analysis System)

CLAS (Computerized Language Analysis System) was an important historic text analysis system available in the 1970s. It was written in PL/I for IBM 360/370 punch card machines and performed standard statistical tests and concordances on natural language ...

SEASR: Flesch-Kincaid Readability Test

SEASR's Flesch-Kincaid Readability Test is a free tool that analyses text from a URL, calculates the Flesch-Kincaid readability measure of the text, and displays the results.
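For readers unfamiliar with the measure, the standard Flesch-Kincaid grade-level formula is 0.39 × (words / sentences) + 11.8 × (syllables / words) − 15.59. The sketch below applies it to a plain string; it is not SEASR's implementation and does not fetch a URL. The function names are hypothetical, and the syllable counter is a crude vowel-group heuristic assumed for illustration, so scores will differ from tools that use a pronunciation dictionary.

```python
import re

def count_syllables(word):
    """Estimate syllables as the number of vowel groups (rough assumption)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))

def flesch_kincaid_grade(text):
    """Return the Flesch-Kincaid grade level of `text`."""
    # Sentences are split on runs of ., ! or ?; words are runs of letters.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word and one sentence")
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)

if __name__ == "__main__":
    print(round(flesch_kincaid_grade("The cat sat on the mat."), 2))
```

Very simple text can score below zero, as the formula is a linear fit rather than a bounded scale.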
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media