TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw on SATO's corpus or upload their own for analysis, and the texts for analysis may be ...

SIMILE Widgets: Timeline

Timeline, a part of the SIMILE Widgets family of tools, is a free, open source tool for creating interactive timelines from temporal data utilizing JavaScript and web markup in HTML and XML.

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...

Scharfenberg-Smith Concordance Program

The Scharfenberg-Smith Concordance Program, briefly known as the ICRH (Institute for Computer Research in the Humanities) Concordance Generator, was developed at the Thomas J. Watson Research Center as part of research in machine translation from Russian ...

Textometrica

Textometrica is a free, web-based text analysis tool offered by HUMlab at Umeå University. Users can upload a plain-text file and examine its word frequencies, see co-occurrences, and generate visualizations and graphs.

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both Latin and non-Latin scripts, and is primarily designed for humanities applications. ...

YAGO2

YAGO2 is a knowledge base built on data from Wikipedia, WordNet and GeoNames. As part of the linked data cloud, YAGO2 is linked to DBpedia as well. The YAGO ontology works within temporal and spatial dimensions and is considered one of the largest public ...

Concordle

Concordle is a free, web-based word cloud and concordance tool built in JavaScript. It describes itself as the "not so pretty cousin of Wordle" and debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...

Voyant Reader

Reader provides a way to read all documents within a specified corpus. It does not perform text analysis; it simply displays the contents of the corpus.

BIBCON

BIBCON was a key-word-out-of-context system for concordances developed in FORTRAN and available in the 1960s. It was a modified version of another system, KWIC.

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars and available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for qualitative analysis and data management. It contains features for searching, applying metadata and conducting corpus analysis.

DEREDEC

DEREDEC was a programming system and workbench for linguistics and text analysis written in LISP in the 1980s. It enabled syntactic and textual parsing, and could link phrases by their contextual dependency relations.

Overview

Overview is a free, web-based tool for document mining. Developed by and for journalists, Overview is also broadly applicable to other corpus-based research applications. Aside from providing advanced search features such as regular expressions and ...

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
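
As a rough illustration of the idea behind this kind of tool, the sketch below finds two words that occur within a given number of words of each other. It is not TAPoRware's code: the function name `cooccurrences`, the `max_distance` parameter and the sample sentence are all hypothetical, and the sketch works on plain text only, ignoring HTML tags and the sentence and line limits the tool offers.

```python
import re

def cooccurrences(text, word_a, word_b, max_distance=5):
    """Return (position_a, position_b) pairs of word indexes where
    word_a and word_b occur within max_distance words of each other.

    Hypothetical helper for illustration; matching is case-insensitive.
    """
    # Split the text into lowercase word tokens, dropping punctuation.
    tokens = [t.lower() for t in re.findall(r"[A-Za-z']+", text)]
    pos_a = [i for i, t in enumerate(tokens) if t == word_a.lower()]
    pos_b = [i for i, t in enumerate(tokens) if t == word_b.lower()]
    # Keep every pair of positions that is close enough.
    return [(i, j) for i in pos_a for j in pos_b
            if i != j and abs(i - j) <= max_distance]

sample = ("The whale surfaced near the ship. The ship turned, "
          "and the crew watched the whale dive.")
print(cooccurrences(sample, "whale", "ship", max_distance=4))
```

Widening `max_distance` returns more pairs, which mirrors the user-specified word limit described above.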

Co-Occurrence - XML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an XML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and plain text ...

LIWC (Linguistic Inquiry and Word Count)

LIWC (Linguistic Inquiry and Word Count) is a text analysis program available for purchase. It calculates the degree to which various categories of words are used in a text, and can process texts ranging from e-mails to speeches, poems and transcribed ...

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.

Micro-OCP

Micro-OCP (Oxford Concordance Program) was a major and historically important textual analysis tool for microprocessor computers. It enabled users to generate concordances, word lists and indexes, in addition to facilitating text markup in COCOA or ...
View tools by tag:
1960s, 1970s, 1980s, 1990s, 2000s, 2010s, American, Canadian, Comparator, Dutch, English, English (language), French (language), German, Historic, Java, Metadata, Multilingual, Natural language processing, Social media