TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

SIMILE Widgets: Welkin

Welkin is a free tool for visualizing complex RDF models offered for download by the SIMILE Project. Rather than permitting users to focus on specific nodes, it provides analytical views such as an overview of a dataset's connectivity or potential mappings ...

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Toolkit) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OS X and Linux. ...

PRORA

PRORA was a historically important set of six interrelated programs for creating computerized concordances, first released in the 1960s. It output on punched cards, could be used for any language represented by the Latin alphabet, and also had functions ...

Topic Modelling Tool

Topic Modelling Tool is a free, open source tool for Latent Dirichlet Allocation topic modelling with a graphical user interface. The tool learns topics in a user-supplied corpus of plain text files, and outputs results as a CSV file for further analysis. ...

ATLAS.ti

ATLAS.ti is a tool for systematic qualitative data analysis released under a for-pay license. It offers multi-window frames, overviews generated through interactive network views utilizing visual interconnections, and cloud views and reports. A free ...

Co-Occurrence - XML (TAPoRware)

This tool looks for two words a certain distance apart from one another in an XML document, within the user-specified limits of words, sentences or lines. The results can be narrowed to only include words found within certain tags. HTML and plain text ...

TextDNA

TextDNA is a free tool for large-scale overview analysis of linguistic data offered by the University of Wisconsin, Madison. It identifies patterns within a text, and enables users to compare ordered sets of data with its sequence visualization. It ...

ORBIS: Stanford Geospatial Network Model of the Roman World

ORBIS: Stanford Geospatial Network Model of the Roman World is a tool and academic resource for reconstructing the time and financial costs of travel in the ancient world. Its model is based on a simplified network of Roman cities, roads, rivers and ...

Apache UIMA

Apache UIMA (Unstructured Information Management Architecture) is a software system for analyzing large amounts of unstructured data, such as a plain text document, and identifying entities, such as persons, places, organizations, or relations between ...

Orange

Orange is a free, open source data visualization and analysis tool. It allows users to conduct data mining either through its visual programming language, or via Python scripts, and includes components for machine learning. Orange includes both add-ons ...

Stanford NLP Group: Stanford Word Segmenter

Stanford Word Segmenter is a free, open source Java-based tokenization tool for Chinese and Arabic text that integrates token pre-processing, or segmentation. For Arabic, the tool processes text according to the Penn Arabic Treebank 3 standard. For ...

List Tags - HTML (TAPoRware)

This tool lists all tags found in an HTML document, either uploaded by the user or from a web address. It is part of the TAPoRware collection of tools; see List XML Elements for an XML tool with similar functionality.

Comparator - Plain Text (TAPoRware)

This tool compares two documents by comparing the words in each according to user specifications. HTML and XML versions are also available in the TAPoRware toolsets.

JConcorder

JConcorder is Java software for building and managing word catalogues, originally released for the Macintosh as Concorder / Le Concordeur. Amongst its features are functions for listing and cataloguing words, generating concordances, exporting concordances ...

FORTRAN

FORTRAN (from 'formula translating system') was a major programming language first developed by IBM in the late 1950s. While originally intended for science and engineering, with substantial drawbacks for users seeking to conduct text processing, its ...

Netlytic

Netlytic is a free, web-based tool for analyzing and visualizing text and social network data. It enables users to capture data from sources such as social media, blogs or online forums; identify themes and actors; and generate both chain and personal ...

Meld

Meld is a free, open source tool, developed in Python by Kai Willadsen, for comparing files, directories and version-controlled projects. It features two- and three-way comparison of files and directories, supports numerous version control systems, and ...

XTRACT

XTRACT was a tool for lexical collocation developed by Frank Smadja, then of Columbia University. It was designed to use statistical techniques to identify collocations of arbitrary length, and to generate syntactic relationships between words. This ...

Stanford NLP Group: Part-of-Speech Tagger

Stanford Part-of-Speech Tagger is a free Java implementation for the recognition of parts of speech, and a part of the Stanford Natural Language Processing toolset. It reads text and assigns parts of speech to each word such as noun, verb or adjective. ...

Weighted Centroid - Other (TAPoRware)

The Weighted Centroid is a Java applet designed to display a circular graph based on word distribution data. The text is divided into an arbitrary number of units, which are positioned around the circumference of the circle in a clockwise sequence. ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator Dutch English English (language) French (language) German Historic Java Metadata Multilingual Natural language processing Social media