TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

TagCrowd

TagCrowd is a tool for generating a frequency-based word cloud from a source text, with a free browser version available through the TagCrowd website. A commercial version may also be purchased, subject to a creative commons license.
TagCrowd
TagCrowd

Concordance - HTML (TAPoRware)

This tool finds the context for a specified word or pattern anywhere in an HTML document, and can be narrowed to only the text within specified tags. Users may specify the context length, and whether the tool returns the context length in words, sentences, ...
Concordance - HTML (TAPoRware)
Concordance - HTML (TAPoRware)

COMIT

COMIT was an early string processing programming language developed for IBM 700/7000 computers at MIT from the late 1950s to mid 1960s. It was particularly designed for linguistics and natural language processing, and was taught to humanists to aid ...
COMIT
COMIT

HyperPo

HyperPo is an important legacy tool, developed as the first web-based text analysis tool aimed at humanities scholars available from 1996 through 2006. Users could input a web address, upload a file or directly enter text for analysis. HyperPo's interface ...
HyperPo
HyperPo

Aggregator - Other (TAPoRware)

This tool aggregates texts/subtexts from different locations into a single text. The original texts can be from a user-specified web page or files located on one's computer. Aggregating subtexts requires all documents to share a common subtext tag, ...
Aggregator - Other (TAPoRware)
Aggregator - Other (TAPoRware)

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...
JConcorder
JConcorder

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Extract Text - HTML (TAPoRware)

This tool extracts text found within specific tags in an HTML document. It is part of the TAPoRware toolset; an XML version is also available.
Extract Text - HTML (TAPoRware)
Extract Text - HTML (TAPoRware)

Cytoscape

Cytoscape is an open source software platform for visualizing data networks and pathways. Though designed for bioinformatic systems, it has been generalized to complex network analysis and has applications extending to the semantic web. Its core distribution ...
Cytoscape
Cytoscape

TAGS (Twitter Archiving Google Spreadsheet) v5.1

TAGS (Twitter Archiving Google Spreadsheet) v5.1 is a tool for automatically pulling Twitter search results into a Google spreadsheet for further analysis. Users can set TAGS 5.1 to update the resulting archive hourly or at a frequency of their specification, ...
TAGS (Twitter Archiving Google Spreadsheet) v5.1
TAGS (Twitter Archiving Google Spreadsheet) v5.1

List Words - Plain Text (TAPoRware)

This tool lists words in an plain text document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and HTML versions ...
List Words - Plain Text (TAPoRware)
List Words - Plain Text (TAPoRware)

brat rapid annotation tool

The brat rapid annotation tool is an online environment for collaborative structured annotation of texts. It can be run in a browser (optimized for Chrome and Safari) or downloaded for local installation. It can be used for a variety of annotation tasks, ...
brat rapid annotation tool
brat rapid annotation tool

MONK (Metadata Offer New Knowledge)

MONK (Metadata Offer New Knowledge) is a digital environment for humanities scholars. It is desgined to assist with the discovery and analysis of patterns within texts, incorporating full text content from corpora such as ECCO, EEBO and Early American ...
MONK (Metadata Offer New Knowledge)
MONK (Metadata Offer New Knowledge)

Hypergraph - XML (TAPoRware)

Tool uses the Hypergraph Package (http://hypergraph.sourceforge.net/) to display the structure of a user-selected XML file. This tool requires jsdk1.4.2 or later; otherwise only the default hypergraph is displayed.
Hypergraph - XML (TAPoRware)
Hypergraph - XML (TAPoRware)

IITagger

IITagger was a part of speech tagging system developed in the 1990s to assign part of speech information to words from the Wall Street Journal.
IITagger
IITagger

WordHoard

WordHoard is a tool for the study of large texts or transcribed speech. It annotates or tags texts by applying morphological, lexical, prosodic, and narratological criteria. Users may apply WordHoard to their own texts, or to the corpora included with ...
WordHoard
WordHoard

Voyant Term Fountain (beta)

Term Fountain is a tool for visualizing word frequencies, and a part of the Voyant suite of tools. It takes the most common words in a corpus and represents them as a fountain. This tool is still in beta.
Voyant Term Fountain (beta)
Voyant Term Fountain (beta)

UVic Image Markup Tool

The UVic Image Markup Tool is a free tool for annotating images and storing the annotations in XML files from the University of Victoria's Humanities Computing and Media Centre. Its interface is designed for users without prior XML experience with additional ...
UVic Image Markup Tool
UVic Image Markup Tool

General Inquirer

The General Inquirer is a historically important program for content analysis of textual data originally developed in the 1960s by Philip Stone and his colleagues at the Harvard Laboratory of Social Relations. Though the original release used punched ...
General Inquirer
General Inquirer

Voyant Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).
Voyant Term Frequencies Chart
Voyant Term Frequencies Chart

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OSX and Linux. ...
NLTK 2.0 (Natural Language Toolkit)
NLTK 2.0 (Natural Language Toolkit)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: