TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Text Variation Explorer (TVE)

Text Variation Explorer (TVE) is a Java tool for text visualization. TVE enables users to explore the effect of window size on a text's type-token ratio, proportion of hapax legomena and average word length; it can also cluster text fragments based ...
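
As a rough illustration of the measures named above, here is a minimal Python sketch that computes type-token ratio, proportion of hapax legomena and average word length over fixed-size windows. It is not TVE's own code: the function name `window_stats`, the use of non-overlapping windows, and the choice to divide the hapax count by the number of tokens in the window are all assumptions made for this example.

```python
from collections import Counter


def window_stats(tokens, window_size):
    """Compute simple lexical measures for each non-overlapping window.

    Hypothetical helper for illustration only; TVE itself may window
    and normalize differently.
    """
    if window_size <= 0:
        raise ValueError("window_size must be positive")

    results = []
    # Step through the text one full window at a time; a trailing
    # partial window is ignored so every window has the same size.
    for start in range(0, len(tokens) - window_size + 1, window_size):
        window = tokens[start:start + window_size]
        counts = Counter(window)
        # Hapax legomena: word types that occur exactly once in the window.
        hapaxes = sum(1 for c in counts.values() if c == 1)
        results.append({
            "start": start,
            "ttr": len(counts) / window_size,           # types / tokens
            "hapax_proportion": hapaxes / window_size,  # assumption: per token
            "avg_word_length": sum(len(w) for w in window) / window_size,
        })
    return results


if __name__ == "__main__":
    sample = "the cat sat on the mat the end".split()
    for row in window_stats(sample, 4):
        print(row)
```

Calling the function with several different `window_size` values on the same token list gives a simple way to see how the measures shift as the window grows.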

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Toolkit) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OS X and Linux. ...

Prefuse

Prefuse is a Java framework for creating interactive information visualization applications from the University of California at Berkeley's Visualization Lab. It can be used to design standalone applications, visual components within other applications ...

Gephi

Gephi is a free, open source interactive visualization and data exploration tool. Users can manipulate the display to uncover new facets of the data, enabling intuitive exploration.

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...

Wisdom

Wisdom was a word sense disambiguation system that participated in the 1998 SENSEVAL competition. It used a simple supervised learning algorithm, drawing on co-occurrence statistics.

Scripto

Scripto is a lightweight, open source tool for crowdsourcing transcriptions for Humanities projects. Projects utilizing Scripto can manage contributions via full editorial controls and a versioning history.

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...

W3C RDF Validation Service

The W3C RDF Validation Service is a free, web-based tool for checking RDF documents for errors and displaying the results. It can display in triples, a graph, or a combination of the two, and can format the graph in a variety of file formats including ...

Pliny

Pliny is a tool for working with annotations when conducting personal scholarly research, and can be used with both digital and print materials. It also has data management and organizational features, and is designed to facilitate reading, reacting ...

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...

Stanford NLP Group: Stanford Tregex and Tsurgeon

Stanford Tregex and Tsurgeon is a bundled pair of tools from the Stanford Natural Language Processing toolset. Tregex is a utility for using tree relationships and regular expressions to match patterns in trees, while Tsurgeon is a tree transformation ...

Umigon

Umigon is a free, web-based, open-source tool for sentiment analysis of tweets. Given a person's Twitter handle, Umigon retrieves that account's tweets and processes them for sentiment, accounting for factual statements (e.g., "I hate war" will be ...

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...

Stanford NLP Group: Stanford Tokenizer

Stanford Tokenizer is a free Java implementation for dividing an English text into tokens such as words, and is part of the Stanford Natural Language Processing toolset. This tool is not available on its own, but is bundled with other tools in the same ...

Stanford NLP Group: Part-of-Speech Tagger

Stanford Part-of-Speech Tagger is a free Java implementation for the recognition of parts of speech, and a part of the Stanford Natural Language Processing toolset. It reads text and assigns parts of speech to each word such as noun, verb or adjective. ...

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...

cue.language

cue.language is a free library of Java code and resources for basic natural language processing, emerging from the development of the Wordle word cloud tool and maintained by Jonathan Feinberg of IBM's CUE Research Group. Its functions include tokenization, ...

Voyant Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further.

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...

SEASR: Google Search To Entities To Protovis Network Graph

SEASR's Google Search To Entities To Protovis Network Graph is a free tool for submitting a search to Google and returning a specified number of documents for further processing. It then analyses the documents for connections within a specified number ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media