TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

RoSE

RoSE is a web-based system blending social computing with humanities bibliographical resources, enabling these resources to be explored as a social network. It incorporates data mined from YAGO and Project Gutenberg, offers profile pages for both persons ...
RoSE
RoSE

Cytoscape

Cytoscape is an open source software platform for visualizing data networks and pathways. Though designed for bioinformatic systems, it has been generalized to complex network analysis and has applications extending to the semantic web. Its core distribution ...
Cytoscape
Cytoscape

Lexa

Lexa is a free legacy set of programs for corpus processing. It is designed to tag and lemmatize texts or series of texts. The website for Lexa is no longer maintained or active, but may still be viewed via the Internet Archive.
Lexa
Lexa

WordSmith

WordSmith Tools is a commercial integrated suite of programs designed to analyze word behaviour in a text. It can be used to generate a list of all words or word clusters, concord, find keywords and more. This tool is recommended for publishers, language ...
WordSmith
WordSmith

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OS X and Linux. ...
NLTK 2.0 (Natural Language Toolkit)
NLTK 2.0 (Natural Language Toolkit)

SCAN

SCAN was a conversational programming language available in the 1970s for text analysis. It was specific to text processing and could be used divide a text into sentences or words or split on separators. It was capable of running counts on a text, printing ...
SCAN
SCAN

EURAC: End to End

End to End is a visualization application for exploratory corpus analysis focused on collocations. This tool starts with two words and constructs a visual network of all collocations of those words within the corpus, while its interface enables interactive ...
EURAC: End to End
EURAC: End to End

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...
ATLAS.ti
ATLAS.ti

KWIC

KWIC was an indexing and concordance tool available in the 1960s. Contrary to its name, KWIC functioned as a 'key-word-out-of-context' tool.
KWIC
KWIC

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...
ORBIS: Stanford Geospatial Network Model of the Roman World
ORBIS: Stanford Geospatial Network Model of the Roman World

EXPLEX

EXPLEX (EXtracting information about Proper nouns to provide LEXical information) was a natural language processing system available in the 1990s. It was developed to create lexical entries for nouns appearing in the Wall Street Journal.
EXPLEX
EXPLEX

SIMILE Widgets: Timeline

Timeline, a part of the SIMILE Widgets family of tools, is a free, open source tool for creating interactive timelines from temporal data utilizing JavaScript and web markup in HTML and XML.
SIMILE Widgets: Timeline
SIMILE Widgets: Timeline

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

Textexture

Textexture is a free, web-based tool for visualizing texts as a network. The visualization gives a quick visual summary of the text. Clicking on nodes brings up the excerpts the tool has identified as most relivant, and permits users to locate similar ...
Textexture
Textexture

Collocation - HTML (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - HTML (TAPoRware)
Collocation - HTML (TAPoRware)

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...
JConcorder
JConcorder

Weka (Waikato Environment for Knowledge Analysis)

Weka (Waikato Environment for Knowledge Analysis) is a free Java-based data mining workbench of machine learning algorithms, offered by the Machine Learning Group of the University of Waikato. It includes tools for data pre-processing, classification, ...
Weka (Waikato Environment for Knowledge Analysis)
Weka (Waikato Environment for Knowledge Analysis)

Casual

Casual is a free, open source tool for generating a visualization of concepts from a key word search. It uses Wikipedia for conceptual information, and can also include Google Image search results where applicable. Users can click on a term to generate ...
Casual
Casual

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

ANTHROPAC

ANTHROPAC, now in version 4.98, is a long-standing menu-driven DOS program developed for working with anthropological and other cultural data. It assists both the collection and analysis of datasets, and can accommodate both qualitiative and quantitative ...
ANTHROPAC
ANTHROPAC
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: