TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Netlytic

Netlytic is a free, web-based tool for analyzing and visualizing text and social network data. It enables users to capture data from sources such as social media, blogs or online forums; identify themes and actors; and generate both chain and personal ...
Netlytic
Netlytic

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OSX and Linux. ...
NLTK 2.0 (Natural Language Toolkit)
NLTK 2.0 (Natural Language Toolkit)

Pattern (CLiPS)

Pattern (CLiPS) is a web mining module for Python designed for computational linguistic and psycholinguistic research. It integrates tools for data retrieval from search engines, social media, web spiders and individual websites, and can accomplish ...
Pattern (CLiPS)
Pattern (CLiPS)

Stanford NLP Group: Stanford Phrasal

Stanford Phrasal is a a free Java implementation for phrase-based machine translation. It provides an easy to use API for implementating new decoding model features and supports unique capabilities such as translating using phrases that include gaps ...
Stanford NLP Group: Stanford Phrasal
Stanford NLP Group: Stanford Phrasal

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...
ATLAS.ti
ATLAS.ti

SCBD (Sentence and Chunk Boundaries Detector)

SCBD (Sentence and Chunk Boundaries Detector) was an NLP tool suited to Modern Greek text. It could detect sentence and chunk boundaries within an unrestricted text, could assign style markers, and was effective even on a large corpus of 200,000 words ...
SCBD (Sentence and Chunk Boundaries Detector)
SCBD (Sentence and Chunk Boundaries Detector)

Annotation Studio

Annotation Studio is an open source, web-based annotation application that integrates a powerful set of textual interpretation tools behind an intuitive and easy-to-use interface. Users can upload their own texts, and annotate with styled text, video, ...
Annotation Studio
Annotation Studio

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...
ORBIS: Stanford Geospatial Network Model of the Roman World
ORBIS: Stanford Geospatial Network Model of the Roman World

Keywords Finder - Beta (TAPoRware)

This tool identifies keywords or key phrases within a user-specified text, using the assumption that they will appear with the greatest frequency. It applies a stemmer to every word. Plain text input is recommended. All tags will be stripped from an ...
Keywords Finder - Beta (TAPoRware)
Keywords Finder - Beta (TAPoRware)

DocuScope

DocuScope is a text analysis environment first developed in 1998. It contains a suite of interactive visualization tools for corpus-based rhetorical analysis. At present, DocuScope is not available outside the originating research group. However, the ...
DocuScope
DocuScope

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...
Stanford NLP Group: Stanford Word Segmenter
Stanford NLP Group: Stanford Word Segmenter

Tokenize - XML (TAPoR)

This tool splits an XML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - XML (TAPoR)
Tokenize - XML (TAPoR)

Principal Components Analysis on Plain Text - Beta (TAPoRware)

This tool applies Principal Components Analysis rules to a text to generate relationships between words and text units. It works best with large texts where users can specify units of over 500 words. HTML and XML versions are not currently avaiable. ...
Principal Components Analysis on Plain Text - Beta (TAPoRware)
Principal Components Analysis on Plain Text - Beta (TAPoRware)

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...
JConcorder
JConcorder

Tokenize - HTML (TAPoR)

This tool splits an HTML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - HTML (TAPoR)
Tokenize - HTML (TAPoR)

W3C RDF Validation Service

The W3C RDF Validation Service is a free, web-based tool for checking RDF documents for errors and displaying the results. It can display in triples, a graph, or a combination of the two, and can format the graph in a variety of file formats including ...
W3C RDF Validation Service
W3C RDF Validation Service

Meld

Meld is a free, open source tool for comparing files, directories and version-controlled projects developed in Python by Kai Willadsen. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...
Meld
Meld

Get TEI Meta Data - Beta (TAPoRware)

This tool extracts metadata from TEI-compatible XML documents and displays it in name/value format. It is only available for XML.
Get TEI Meta Data - Beta (TAPoRware)
Get TEI Meta Data - Beta (TAPoRware)

Letter-Pairs Analysis

Letter-Pairs Analysis is a free, web-based tool for calculating and visualizing the number of times a pair of letters appears in a given text. Each pair is represented as a 'bubble', sized according to how frequently that pair appears in the text. An ...
Letter-Pairs Analysis
Letter-Pairs Analysis

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...
PRORA
PRORA

SATO (Systeme d'analyse des textes par ordinateur)

SATO (Systeme d'analyse des textes par ordinateur) is a longstanding historic text analysis system, now available as a free, web-based tool. Users can either draw off SATO's corpus or upload their own for analysis, and the texts for analysis may be ...
SATO (Systeme d'analyse des textes par ordinateur)
SATO (Systeme d'analyse des textes par ordinateur)
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: