TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Wmatrix

Wmatrix is a free tool for corpus analysis and comparison. It provides a web interface for USAS and CLAWS, in addition to enabling standard corpus linguistic functions such as frequency lists and concordances.
Wmatrix
Wmatrix

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

jsLDA

jsLDA is a free, open source tool for corpus-based in browser topic modelling. Users can test the tool via the provided demo, or download the source code to run on their own system. Users can load any corpus accessible from a URL, and can train the ...
jsLDA
jsLDA

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

CNRTL Extension for Firefox

CNRTL is a free XUI and Javascript extension for Firefox designed to enable users to access the French lexical CNRTL portal directly within their web browser. With its toolbar, users can also double-click on a word within a webpage to view lexical information ...
CNRTL Extension for Firefox
CNRTL Extension for Firefox

Word Cloud - Beta (TAPoRware)

This tool generates a word cloud of the top frequency words from a text document, with word size determined by its frequency. The user can specify how many words are to be included from the document, whether to apply a modified Glasgow Stop Words list, ...
Word Cloud - Beta (TAPoRware)
Word Cloud - Beta (TAPoRware)

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Annotation Studio

Annotation Studio is an open source, web-based annotation application that integrates a powerful set of textual interpretation tools behind an intuitive and easy-to-use interface. Users can upload their own texts, and annotate with styled text, video, ...
Annotation Studio
Annotation Studio

Twitter Capture and Analysis Toolset (DMI-TCAT)

Twitter Capture and Analysis Toolset (DMI-TCAT) is a free, open source tool for capturing and analyzing tweets. The tool's web interface is currently closed to researchers outside the University of Amsterdam's Media Studies department, however, the ...
Twitter Capture and Analysis Toolset (DMI-TCAT)
Twitter Capture and Analysis Toolset (DMI-TCAT)

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Summarizer - Plain Text (TAPoRware)

This tool creates a summary of statistical information on a given document, and enables the user to select what types of information to display in the summary. The options include high frequency words, sentences with high frequency words, high frequency ...
Summarizer - Plain Text (TAPoRware)
Summarizer - Plain Text (TAPoRware)

Google Visualization API with RDF

The Google Visualization API provides a platform to create, share and reuse visualizations written by the developer community. Another option of the platform is to create reports, dashboards as well as analyze and display the data through the visualization ...
Google Visualization API with RDF
Google Visualization API with RDF

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

SimpleTCT

SimpleTCT (Simple Text Comparison Tool) is a free Java-based text comparison tool offered by Open Digital Arts & Humanities Tools (OpenDAHT). It offers a simplified management environment that enables users to display .rtf files, in which they may ...
SimpleTCT
SimpleTCT

CulturalAnalytics

CulturalAnalytics, also known as Cultural Analytics for the Digital Humanities in R, is a free R package of functions for statistical analysis and plotting image properties, developed by Rob Myers specifically for the Digital Humanities, and of value ...
CulturalAnalytics
CulturalAnalytics

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Berkeley Parser

The Berkeley Parser was a program for parsing English sentences developed at the University of California at Berkeley and available in the 1960s
Berkeley Parser
Berkeley Parser

TXM

TXM is a free, open source text corpus analysis environment. Its features include concordance, collocate search, frequencies based on the CQP full text search engine and statistical functions based on R packages. It can export results in CSV, XML or ...
TXM
TXM

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

WinBrill

WinBrill is a part-of-speech tagger developed by Eric Brill in the 1990s for the French language laboratory ATILF. It was originally available for UNIX, with WinBrill optimized for Windows 95/98/NT as of 1999. The ATILP provided a vocabulary and rules ...
WinBrill
WinBrill
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: