TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

TAGS (Twitter Archiving Google Spreadsheet) v5.1

TAGS (Twitter Archiving Google Spreadsheet) v5.1 is a tool for automatically pulling Twitter search results into a Google spreadsheet for further analysis. Users can set TAGS 5.1 to update the resulting archive hourly or at a frequency of their specification, ...
TAGS (Twitter Archiving Google Spreadsheet) v5.1
TAGS (Twitter Archiving Google Spreadsheet) v5.1

VINCI

VINCI is a natural language generation environment, first introduced in 1986 and with a sustained web presence. It provides linguists with a collection of linguist-friendly metalanguages for modelling natural language. It can generate sentences and ...
VINCI
VINCI

Mallet

Mallet is a Java based library and command line framework that provides statistical and machine learning tools for use with natural language processing.
Mallet
Mallet

WordSeer

WordSeer is a free, simple to use text analysis tool offered by the University of California Berkeley. It is entirely web based, with tools for search, reading and interrogating, heat maps and frequency. At this time, the only corpus available for exploration ...
WordSeer
WordSeer

EURAC: Double Tree

Double Tree is a free, open source Java application providing a visualization component for supporting exploratory corpus analysis. It focuses particularly on analyzing concordances, and can also represent a KWIC for a single word by collapsing the ...
EURAC: Double Tree
EURAC: Double Tree

NETMET

NETMET is a tool for generating and interpreting metaphors designed to run in DOS. Though this tool is no longer under development, it is still available for download. It includes a number of sample input metaphor files and is designed to be modified ...
NETMET
NETMET

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Compare With Control - Beta (TAPoRware)

This tool compare the text submitted by the user with a predefined control corpus. The tool lists the words common in both texts, in an order set by the user. At present, the tool only offers the Brown Corpus, with more predefined control corpus forthcoming. ...
Compare With Control - Beta (TAPoRware)
Compare With Control - Beta (TAPoRware)

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

Voyant Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.
Voyant Reader
Voyant Reader

Twitter Capture and Analysis Toolset (DMI-TCAT)

Twitter Capture and Analysis Toolset (DMI-TCAT) is a free, open source tool for capturing and analyzing tweets. The tool's web interface is currently closed to researchers outside the University of Amsterdam's Media Studies department, however, the ...
Twitter Capture and Analysis Toolset (DMI-TCAT)
Twitter Capture and Analysis Toolset (DMI-TCAT)

Quadrigram

Quadrigram is an environment for data gathering, query and visualization, based on a visual programming language and intended for users with no experience programming or creating visualizations. At present, it contains 50 different visualizers.
Quadrigram
Quadrigram

Ethnograph

Ethnograph is a long-standing legacy software package, maintained into the present, for quantitative analysis and data management. It contains features for search, applying metadata and conducting corpus analysis.
Ethnograph
Ethnograph

UAM CorpusTool

UAM CorpusTool is an annotation environment for text corpora in linguistic studies. It includes a graphical schema editor and saves annotations in XML format.
UAM CorpusTool
UAM CorpusTool

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

Co-Occurrence - HTML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an HTML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. XML and plain text ...
Co-Occurrence - HTML (TAPoRware)
Co-Occurrence - HTML (TAPoRware)

Scripto

Scripto is a lightweight, open source tool for crowdsourcing transcriptions for Humanities projects. Projects utilizing Scripto can manage contributions via full editorial controls and a versioning history.
Scripto
Scripto

BookLamp Labs: Suggestion Viewer

BookLamp's Suggestion Viewer is a faceted search and browser tool for finding new books based on how similar they are to another book. Users can search by either title or author and select the book to center the search on. The Suggestion Viewer then ...
BookLamp Labs: Suggestion Viewer
BookLamp Labs: Suggestion Viewer

Voyant Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below.
Voyant Document Term Frequencies
Voyant Document Term Frequencies

Stanford NLP Group: CoreNLP

Stanford CoreNLP is a free Natural Language Processing tool. It processes English language text and provides the base forms of words, parts of speech, indicates whether they are proper names, normalizes dates, times and numeric quantities, and marks ...
Stanford NLP Group: CoreNLP
Stanford NLP Group: CoreNLP
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: