TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

CONCORD

CONCORD was a concordance program developed in 1968 to identify and sort collocating words or phrases. Notably, it allowed users to sort words ending in one of several suffixes (such as -er or -est) to be grouped together.
CONCORD
CONCORD

Voyant Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, ...
Voyant Corpus Summary
Voyant Corpus Summary

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

Transformer - XML (TAPoRware)

This tool performs XML to HTML transformation using an XSL stylesheet. The XSL stylesheet can be from another website or as defined by the user.
Transformer - XML (TAPoRware)
Transformer - XML (TAPoRware)

Trend Miner

Trend Miner is a tool designed to enable portable open-source real-time methods for cross-lingual mining and summarizing of large-scale stream media. It combines elements from natural language processing, knowledge-based reasoning, machine learning, ...
Trend Miner
Trend Miner

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

CNRTL Extension for Firefox

CNRTL is a free XUI and Javascript extension for Firefox designed to enable users to access the French lexical CNRTL portal directly within their web browser. With its toolbar, users can also double-click on a word within a webpage to view lexical information ...
CNRTL Extension for Firefox
CNRTL Extension for Firefox

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

BookLamp Labs: StoryDNA Viewer

BookLamp's StoryDNA Viewer is a corpus search tool designed to find all books sharing a user-selected set of characteristics ('StoryDNA'). From the resultant list, users can then view a book's entry in BookLamp to get more information about what characteristics ...
BookLamp Labs: StoryDNA Viewer
BookLamp Labs: StoryDNA Viewer

CulturalAnalytics

CulturalAnalytics, also known as Cultural Analytics for the Digital Humanities in R, is a free R package of functions for statistical analysis and plotting image properties, developed by Rob Myers specifically for the Digital Humanities, and of value ...
CulturalAnalytics
CulturalAnalytics

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Meandre

Meandre is a graphical programming language for creating text analysis flows, built on top of the Seasr infrastructure. Meandre uploads its flows to a Seasr server where they can be accessed and used by anyone who can access the server.
Meandre
Meandre

Keywords Finder - Beta (TAPoRware)

This tool identifies keywords or key phrases within a user-specified text, using the assumption that they will appear with the greatest frequency. It applies a stemmer to every word. Plain text input is recommended. All tags will be stripped from an ...
Keywords Finder - Beta (TAPoRware)
Keywords Finder - Beta (TAPoRware)

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Pajek

Pajek is a free tool for large network analysis and visualization in continuous development since 1996. It can handle large datasets from diverse sources, such as collaboration networks, citiation networks, data mining, and Internet-derived networks. ...
Pajek
Pajek
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: