TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Google Visualization API with RDF

The Google Visualization API provides a platform to create, share and reuse visualizations written by the developer community. Another option of the platform is to create reports, dashboards as well as analyze and display the data through the visualization ...
Google Visualization API with RDF
Google Visualization API with RDF

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

CHNM: Scribe

Scribe, now in version 3.5, is a free note-taking program available for both PC and Mac. Aimed particularly at historians, this program allows researchers to create digital note cards for managing sources, research notes, contacts, images, glossaries, ...
CHNM: Scribe
CHNM: Scribe

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

CLAWS Part-of-Speech Tagger

CLAWS (Constituent Likelihood Automatic Word-tagging System) is a free parts-of-speech tagging tool from Lancaster University. It has been continuously developed since the 1980s, has consistently achieved 96-97% accuracy, and has been applied to the ...
CLAWS Part-of-Speech Tagger
CLAWS Part-of-Speech Tagger

SEASR: Tag Cloud Viewer With Stemming

SEASR's Tag Cloud Viewer With Stemming is a free tool for creating a tag cloud from a text hosted at a URL, and can process PDF documents. While this tool is similar to SEASR's NGram Tag Cloud Viewer, it applies a stemmer to the text during processing. ...
SEASR: Tag Cloud Viewer With Stemming
SEASR: Tag Cloud Viewer With Stemming

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Compare Lists

Compare Lists is a free, web-based tool for comparing text from two lists of URLs supplied by the user. This tool was developed and is maintained by the Digital Methods Initiative.
Compare Lists
Compare Lists

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Voyant Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number or word types (unique words), and lexical density (the ratio of tokens to types).
Voyant Corpus Grid
Voyant Corpus Grid

Textable

Orange Textable is a an add-on tool developed for the Orange data mining and visualization package. Its functions include the ability to build data tables from textual data, to import data from other sources, and to build concordances and collocations. ...
Textable
Textable

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Collate: Interactive Collation of Large Textual Traditions

Collate was a program designed for scholars concerned with the difficulties of medieval vernacular traditions. It aimed to help scholars with the preparation of critical editions, and could collate up to a hundred texts. Collate was also capable of ...
Collate: Interactive Collation of Large Textual Traditions
Collate: Interactive Collation of Large Textual Traditions

MONK (Metadata Offer New Knowledge)

MONK (Metadata Offer New Knowledge) is a digital environment for humanities scholars. It is desgined to assist with the discovery and analysis of patterns within texts, incorporating full text content from corpora such as ECCO, EEBO and Early American ...
MONK (Metadata Offer New Knowledge)
MONK (Metadata Offer New Knowledge)

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

NodeXL

NodeXL is a free, open source tool for generating and exploring network graphs from Microsoft Excel files. It is particularly suited to data from social media sources, and includes the ability to directly import data from Twitter, YouTube, Flickr and ...
NodeXL
NodeXL
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: