TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number or word types (unique words), and lexical density (the ratio of tokens to types).
Voyant Corpus Grid
Voyant Corpus Grid

Cytoscape

Cytoscape is an open source software platform for visualizing data networks and pathways. Though designed for bioinformatic systems, it has been generalized to complex network analysis and has applications extending to the semantic web. Its core distribution ...
Cytoscape
Cytoscape

Citeline

Citeline is a tool for publishing bibliographies and citation collections on the web. it takes a BibTex file exported from Endnotes, Refworks and similar tools and turns the collection into an interactive, shareable exhibit. Citeline citation collections ...
Citeline
Citeline

Compare With Control - Beta (TAPoRware)

This tool compare the text submitted by the user with a predefined control corpus. The tool lists the words common in both texts, in an order set by the user. At present, the tool only offers the Brown Corpus, with more predefined control corpus forthcoming. ...
Compare With Control - Beta (TAPoRware)
Compare With Control - Beta (TAPoRware)

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OSX and Linux. ...
NLTK 2.0 (Natural Language Toolkit)
NLTK 2.0 (Natural Language Toolkit)

word2vec

word2vec is a tool for computing vector representations of words for natural language processign and other related research. It generates a vocabulary from the user's text and learns a vector representation of its words.
word2vec
word2vec

Meandre

Meandre is a graphical programming language for creating text analysis flows, built on top of the Seasr infrastructure. Meandre uploads its flows to a Seasr server where they can be accessed and used by anyone who can access the server.
Meandre
Meandre

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...
ATLAS.ti
ATLAS.ti

Oxford Concordance Program (OCP)

The Oxford Concordance Program (OCP) was a major and influential early concordance program developed in the late 1970s and early 1980s at the Oxford University Computing Service. It was influnced by CLOC and COCOA, and aimed to be more user-friendly ...
Oxford Concordance Program (OCP)
Oxford Concordance Program (OCP)

Concordance - XML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an XML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - XML (TAPoRware)
Concordance - XML (TAPoRware)

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...
ORBIS: Stanford Geospatial Network Model of the Roman World
ORBIS: Stanford Geospatial Network Model of the Roman World

CMU-Cambridge Statistical Language Modelling Toolkit

The CMU-Cambridge Statistical Language Modelling Toolkit is a longstanding set of Unix tools offered by Carnegie Mellon University and Cambridge University. It includes functions for word frequency, n-grams, vocabulary and words marked as 'context cues'. ...
CMU-Cambridge Statistical Language Modelling Toolkit
CMU-Cambridge Statistical Language Modelling Toolkit

ContaWords

With ContaWords you can quickly and easily get a frequency analysis of your texts (pdf, html, txt). The results can be downloaded and easily handled. When you give ContaWords a task, it first reads the words of a text file and decides what part ...
ContaWords
ContaWords

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...
Aggregator - Other (TAPoRware)
Aggregator - Other (TAPoRware)

TextDNA

TextDNA is a free tool for large-scale overview analysis of linguistic data offered by the University of Wisconsin, Madison. It identifies patterns within a text, and enables users to compare ordered sets of data with its sequence visualization. It ...
TextDNA
TextDNA

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...
JConcorder
JConcorder

InTEXT

InTEXT is a legacy, commercial suite of programs designed to supplement web search, intranet management and accomplish variety of document generation, search and manipulation functions. It includes natural language query, full-text search and retrieval ...
InTEXT
InTEXT

Collocation - HTML (TAPoRware)

This tool provides all words directly before and after a user-specified word, based on the desired context of words, lines, sentences or paragraphs. The results can be sorted alphabetically, by frequency, or by Z-score (an indication of how far and ...
Collocation - HTML (TAPoRware)
Collocation - HTML (TAPoRware)

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

TEXTAN

TEXTAN was a program developed by Suzanne Hanon of the University of Aarhus in the 1970s for locating English loan words in French texts. It was written in ALGOL 4 and consisted of two programs. TEXTAN 1 learned anglicisms developed manually from French ...
TEXTAN
TEXTAN
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator Dutch English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: