TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

TextDNA

TextDNA is a free tool for large-scale overview analysis of linguistic data offered by the University of Wisconsin, Madison. It identifies patterns within a text, and enables users to compare ordered sets of data with its sequence visualization. It ...

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a long-standing, historic text analysis system, now available as a free, web-based tool. Users can either draw on SATO's corpus or upload their own for analysis, and the texts for analysis may be ...

WordCruncher

WordCruncher is a long-standing text indexing, retrieval and analysis program offered by Brigham Young University. Its functions include tagging, contextual searching, collocation and analytical reporting, and its development has been active since the ...

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words and their parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...

Domeo Annotation Toolkit

The Domeo Annotation Toolkit is an extensible web application for creating and sharing ontology-based stand-off annotations on HTML or XML documents. Users can add annotations manually, or via the tool's full or partial automation options. It also includes ...

Apache Xalan

Apache Xalan is a free XSLT processor for transforming XML documents into HTML, text or other XML document types, offered by the Apache Software Foundation. This tool uses the Xerces XML parser to convert XML into internal nodesets, and is offered ...

SIMILE Widgets: Timeline

Timeline, a part of the SIMILE Widgets family of tools, is a free, open source tool for creating interactive timelines from temporal data utilizing JavaScript and web markup in HTML and XML.

Orlando Degrees of Separation

Orlando contains a relatively large corpus, currently consisting of details about the life and writing careers of roughly 1000 British women writers, amounting to 6.8 million words with 2.2 million semantic tags for everything from paragraphs to politics, ...

SEASR: OpenNLP Entities To Protovis Network Graph

SEASR's OpenNLP Entities to Protovis Network Graph is a free tool for extracting entities within a specified sentence distance within a text. The OpenNLP system is used to extract entities, and their relationships are represented in a link node network ...

TABARI (Text Analysis By Augmented Replacement Instructions)

TABARI (Text Analysis By Augmented Replacement Instructions) is a legacy open-source successor to the KEDS program, written in C++ and maintained online to the present in an OS X edition. It runs in the Terminal command prompt, and is designed for analyzing ...

Laurence Anthony: AntWordProfiler

AntWordProfiler is a free tool for word profiling. For each word in a document, it will generate the base form and a list of possible related words, provide statistics and frequency data and list word types. It can also process files separately or as ...

PAIR (Pairwise Alignment for Intertextual Relations)

PAIR (Pairwise Alignment for Intertextual Relations) is a free, open source sequence alignment algorithm for humanities text analysis. It identifies similar passages in a large corpus. Corpora are indexed for future use, and incoming texts can be compared ...

TokenX

TokenX is a free text visualization and analysis tool for XML documents. It offers a web-based environment and can generate word clouds, highlight parts of text such as words, non-words and punctuation, KWIC (keyword in context) and more. TokenX also ...

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...

TimeRime

TimeRime is a free, web-based timeline application for creating, viewing and comparing interactive timelines. This application requires registration, and timelines created within the application are public.

CulturalAnalytics

CulturalAnalytics, also known as Cultural Analytics for the Digital Humanities in R, is a free R package of functions for statistical analysis and plotting image properties, developed by Rob Myers specifically for the Digital Humanities, and of value ...

TextSTAT

TextSTAT is a free text analysis tool offered by Niederländische Philologie, FU Berlin. It is a simple program designed to accept plain text, HTML, Word and OpenOffice files to produce word frequency lists and concordances, and versions are available ...

CorpusSearch 2

CorpusSearch 2 is a free Java-based program for constructing syntactically annotated (parsed) corpora and searching them. Its functions include finding and counting lexical and syntactic patterns, correcting systemic errors and coding linguistic features. ...

Concord, the Interactive Concordance Generator (Virtual Muse)

Concord is a free concordance generator written in Python for OS X and Windows XP (a new Windows version is forthcoming). It generates a KWIC concordance entry from a plain-text file for any word or phrase and permits sorting in a variety of ways. Concord ...

LEXICO

LEXICO was an interactive system designed to assist lexicographers in text analysis. It assisted in storing, editing and concording texts, lemmatizing word lists and generating 'slips' containing a single word with its lemma, context and source.

Version Variation Visualization

Version Variation Visualization is an interrelated set of free, usable prototypes for visualizing and exploring culturally important works in relation to their world-wide translations. It allows users to explore either the included corpora or their ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media