TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Stanford NLP Group: Stanford Phrasal

Stanford Phrasal is a a free Java implementation for phrase-based machine translation. It provides an easy to use API for implementating new decoding model features and supports unique capabilities such as translating using phrases that include gaps ...
Stanford NLP Group: Stanford Phrasal
Stanford NLP Group: Stanford Phrasal

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OSX and Linux. ...
NLTK 2.0 (Natural Language Toolkit)
NLTK 2.0 (Natural Language Toolkit)

FromThePage

FromThePage is a free software for manuscript transcription, allowing volunteers to transcribe document pages online. Transcriptions can then be marked up and annotated in a wiki-like enviroment, with the resultant text displayed on the public web. ...
FromThePage
FromThePage

Micro-OCP

Micro-OCP (Oxford Concordance Program) was a major and historically important textual analysis tool for microprocessor computers. It enabled users to generate concordances, word lists and indexes, in addition to facilitating text markup in COCOA or ...
Micro-OCP
Micro-OCP

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...
ATLAS.ti
ATLAS.ti

Tokenize - XML (TAPoR)

This tool splits an XML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - XML (TAPoR)
Tokenize - XML (TAPoR)

List Word Pairs - Beta (TAPoRware)

This tool lists all word pairs found in a document alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. Users may specify restrictions to narrow the results. This tool is currently only available for plain text documents. ...
List Word Pairs - Beta (TAPoRware)
List Word Pairs - Beta (TAPoRware)

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...
ORBIS: Stanford Geospatial Network Model of the Roman World
ORBIS: Stanford Geospatial Network Model of the Roman World

WordFreak

WordFreak is a free, open-source tool for linguistic annotation. It is designed to support both human and automated annotation of linguistic data, and learns human corrections of automatically annotated data.
WordFreak
WordFreak

SIMILE Widgets: Timeplot

TimePlot, a part of the SIMILE Widgets family of tools, is a free, open source DHTML-based AJAXy widget for plotting time series and overlaying time-based events over them. The overlays use the same data formats supported by the related SIMILE Widgets ...
SIMILE Widgets: Timeplot
SIMILE Widgets: Timeplot

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

Google News Scraper

Google News Scraper is a free, web-based tool for batch querying news.google.com. Users can specify search terms, news sources, date range, location, language, version of Google (ex. .com, .ca, .co.uk, etc.), and what facets to include in the output. ...
Google News Scraper
Google News Scraper

Word and Phrase

Word and Phrase is a free, web-based text analysis tool created by Dr. Mark Davies of Brigham Young University. Users can paste texts directly into the box provided. The tool provides a range of detailed information on a text's words and phrases, on ...
Word and Phrase
Word and Phrase

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...
JConcorder
JConcorder

DfR Browser

The DfR Browser is a free, open source visualization interface for exploring aggragates of articles from the JSTOR database. It uses topic modelling, co-occurrances and document metadata to provide multiple views on the corpus of interest based on topic ...
DfR Browser
DfR Browser

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

PC-KIMMO

PC-KIMMO is a tool for morphological parsing available since 1985. It is designed to generate and/or parse words, for use by computational linguists, descriptive linguists and others interested in natural language processing. Though this tool is no ...
PC-KIMMO
PC-KIMMO

Collocation - Plain Text (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - Plain Text (TAPoRware)
Collocation - Plain Text (TAPoRware)

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...
PRORA
PRORA

TOSCA

TOSCA was a syntactic analysis system available in the 1990s. It required users to apply word class tags and syntactic markers, which it then used for syntactic parsing.
TOSCA
TOSCA
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: