TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Compare With Control - Beta (TAPoRware)

This tool compare the text submitted by the user with a predefined control corpus. The tool lists the words common in both texts, in an order set by the user. At present, the tool only offers the Brown Corpus, with more predefined control corpus forthcoming. ...
Compare With Control - Beta (TAPoRware)

Web Page Cleaner - Beta (TAPoRware)

This tool removes all HTML formatting from a web page or an uploaded HTML file, leaving the text for further processing. It is particularly good for preparing text-intensive web pages for analysis as plain text.
Web Page Cleaner - Beta (TAPoRware)

CHNM: Scribe

Scribe, now in version 3.5, is a free note-taking program available for both PC and Mac. Aimed particularly at historians, this program allows researchers to create digital note cards for managing sources, research notes, contacts, images, glossaries, ...
CHNM: Scribe

Anthologize

Anthologize is a free, open-source plugin for WordPress 3.0. It enables WordPress to be used as a platform for publishing electronic texts in PDF, ePUB or TEI format. The plugin enables users to grab their existing blog posts, export content from external ...
Anthologize

Flamenco Search

Flamenco Search is a downloadable web-based interface from the UC Berkley School of Information based in Python, designed to run on a server. It streamlines the browsing of large collections. It is particularly valuable for items such as documents or ...
Flamenco Search

Commentpress (Future of the Book)

Commentpress is an open source theme and plugin for Wordpress 3.3.1 or later designed to enable readers to contribute comments to each paragraph of a text and display those comments in the margins. Commentpress turns a text into a conversation, whether ...
Commentpress (Future of the Book)

TILE (Text Image Linking Environment)

The Text-Image Linking Environment (TILE) is a web-based tool that enables users to select regions of an image and link them to text. TILE is a system for creating and editing image-based electronic editions and connecting them to digital archives of ...
TILE (Text Image Linking Environment)

MONK (Metadata Offer New Knowledge)

MONK (Metadata Offer New Knowledge) is a digital environment for humanities scholars. It is desgined to assist with the discovery and analysis of patterns within texts, incorporating full text content from corpora such as ECCO, EEBO and Early American ...
MONK (Metadata Offer New Knowledge)

MorphAdorner

MorphAdorner is a Java command-line program for the adornment of words in a text. At present, available adornments include standard spellings, parts of speech and lemmata, in addition to tokanization, the recognition of sentence boundaries and extracting ...
MorphAdorner

NLTK 2.0 (Natural Language Toolkit)

NLTK 2.0 (Natural Language Tooklt) is a free, open source collection of Python modules, linguistic data and documentation for research and development in natural language processing and text analytics, with distributions for Windows, Mac OSX and Linux. ...
NLTK 2.0 (Natural Language Toolkit)

WordHoard

WordHoard is a tool for the study of large texts or transcribed speech. It annotates or tags texts by applying morphological, lexical, prosodic, and narratological criteria. Users may apply WordHoard to their own texts, or to the corpora included with ...
WordHoard

PhiloGL

PhiloGL is an open source WebGL Framework for advanced data visualization, creative coding and game development. It includes a module system encompassing Program and Shader management, IO, XHR, JSONP, Web Worker management, Effects and Tweening, among ...
PhiloGL

RiTa

RiTa is a free, open-source natural language library for work with generative literature, offered as both a 'core' package of jar files and documentation, and a text-to-speech package. It is designed to be simple and intuitive while still offering flexibility ...
RiTa

RoSE

RoSE is a web-based system blending social computing with humanities bibliographical resources, enabling these resources to be explored as a social network. It incorporates data mined from YAGO and Project Gutenberg, offers profile pages for both persons ...
RoSE

Scripto

Scripto is a lightweight, open source tool for crowdsourcing transcriptions for Humanities projects. Projects utilizing Scripto can manage contributions via full editorial controls and a versioning history.
Scripto
Sort
User supplied tags
2000s American English (language) 2010s Legacy Java 1990s Natural language processing Metadata Canadian French (language) 1980s German English Comparator 1970s Multilingual Word cloud Summarizer Social media Annotation French Collocation Timeline Content analysis Wordpress Collation European Transformer Transcription 1960s German (language) Statistical Disambiguation Lexography Publishing Multinational Word classification Sentiment analysis Distribution Data mining Co-occurence Poetry Computational linguistics Lemmatization Qualitative analysis Collaboration Arabic Concordance Norwegian Frequency Translation Voyant Transformation Tokenizer Svg Finnish (language) Irish Versioning Bibliographic management Danish Classification British Network analysis Command line Collaborative Word list Australian Quantitative analysis Environment Visualization Dictionary generation Chinese Stemmer Morphological analysis Word pair analysis Spanish (language) Flash Topic modelling Email analysis Italian (language) Corpus linguistics Estonian Readability Galacian Web interface Estonian (language) Multimedia management Relative frequency Hypergraph Scottish Ngram Hypercard Principal components analysis Word frequency Linguistics Polish (language) Danish (language) Text mining Document management Czech (language) Web mining Italian Sentence generation Analytics Rich-prospect browser Belgian Crowdsourcing Composition Writing analysis Multimedia Finnish Spanish Network management Curation Workbench Indexing Mixed methods Semantic web Welsh Austrian Tokenization Latin (language) Argentinian Optical character recognition Framework Ancient greek (language) Programming language Faceted browser Browser extension Chinese (language) Development library Russian (language) Compiler Japanese (language) Video analysis Mapping Dutch (language) Early modern english Stylistic analysis Portugese (language) Audio analysis Portuguese (language) Spelling variation Dutch Word clusters Tokenizing Toolkit