TextArc is a free visualization tool that represents an entire text on a single page. It has elements of an index, concordance and summary all in one place, encouraging the viewer to use its juxtapositions to uncover meaning. The web-based applet is preloaded with thousands of texts collected from Project Gutenberg. Note: TextArc was designed for browsers circa 2002 and requires Java SE 6 to run.
LitStats is a tool for statistical analysis of natural language texts developed in the 1980s by Dr. Stephen Reimer of the University of Alberta. From an ASCII text, it can generate word frequency counts, word lengths, initial letter frequencies, sentence length frequencies and verbal segment frequencies. It was originally developed for an IBM 3033, and can still be run on systems using Windows XP.
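To give a sense of the kinds of counts listed above, here is a minimal Python sketch that computes word frequencies, word lengths, initial letter frequencies and sentence length frequencies from a plain-text string. It is not LitStats itself: the function names, the tokenizing pattern and the sentence-splitting rule are all assumptions made for illustration, and verbal segment frequencies are left out because the description does not define them.

```python
import re
from collections import Counter

# Assumed tokenizer: runs of letters, optionally with one internal apostrophe.
WORD_PATTERN = re.compile(r"[a-z]+(?:'[a-z]+)?")


def tokenize(text):
    """Return the lowercased words found in text."""
    return WORD_PATTERN.findall(text.lower())


def text_statistics(text):
    """Compute a few simple frequency tables for an ASCII text."""
    words = tokenize(text)
    # Assumed sentence rule: split on runs of '.', '!' or '?'.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return {
        "word_freq": Counter(words),
        "word_lengths": Counter(len(w) for w in words),
        "initial_letters": Counter(w[0] for w in words),
        "sentence_lengths": Counter(len(tokenize(s)) for s in sentences),
    }


stats = text_statistics("The cat sat. The dog ran away!")
print(stats["word_freq"].most_common(3))
print(stats["sentence_lengths"])
```

Each table is a `Counter`, so the most common entries can be read with `most_common()`.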
DocuBurst is a free web-based visualization tool for exploring the contents of a text. Visitors can upload their own text or view those provided by others. DocuBurst presents an interactive chart called a ‘radial sunburst’ diagram which organizes the nouns extracted from the user-supplied text based on their meaning, and colours them based on frequency, revealing common themes in the text. The visualization also shows the proper nouns (e.g. character names) in a linked word cloud. The visualization may be zoomed, filtered, or refocused to target types of words of interest (e.g. “animal” words or “feeling” words). The visualization also provides a comparison tool to contrast word use across two documents. DocuBurst views can be bookmarked, annotated, shared, and embedded in your own website.
Wordle is an online toy for generating word clouds using the text you provide. Text can be submitted by providing a URL or by pasting raw text into an input field. The most frequent words from the text are then used as the source for the resulting visualization, where frequency is used to determine the size of each word.
You may customize the output by choosing from a variety of display options (layout, font, colours) or use the randomize feature to get a random visualization. Additionally, there is built-in stop word support for many common languages, and the option to change the word case.
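The idea of sizing words by frequency after removing stop words can be sketched in a few lines of Python. This is only an illustration, not Wordle's actual method: the stop word list, the `word_sizes` function, its parameters and the linear scaling between a minimum and maximum font size are all assumptions.

```python
import re
from collections import Counter

# Tiny illustrative stop word list (an assumption, not Wordle's own list).
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


def word_sizes(text, max_words=5, min_size=10, max_size=48):
    """Map the most frequent non-stop words to font sizes.

    Sizes are scaled linearly between min_size and max_size, which is
    an assumed rule chosen for simplicity.
    """
    words = [w for w in re.findall(r"[a-z]+", text.lower())
             if w not in STOP_WORDS]
    top = Counter(words).most_common(max_words)
    if not top:
        return {}
    highest = top[0][1]
    lowest = top[-1][1]
    # Avoid dividing by zero when every word has the same count.
    span = (highest - lowest) or 1
    return {
        word: min_size + (count - lowest) * (max_size - min_size) / span
        for word, count in top
    }


sizes = word_sizes("the whale and the sea and the whale and the ship whale")
print(sizes)
```

Changing `max_words` or the size range plays a role similar to the display options described above.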
FeatureClusters is a visualization tool for viewing associations between words, based on commonalities in the feature sets of the words. This is illustrated using a force-directed graph, wherein each word is a node in the graph. Two nodes are connected to one another if they share at least one feature. Features are represented by the smaller circles (or petals) which surround each word. Moving your mouse over a word will show the feature details associated with that word. Clicking on a word will highlight its connections to other words.
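The connection rule described above (two words are linked when their feature sets overlap) can be sketched as follows. The feature names, the sample words and the `shared_feature_edges` function are hypothetical, and the sketch only builds the edge list; it does not perform the force-directed layout.

```python
from itertools import combinations


def shared_feature_edges(features_by_word):
    """Return {(word_a, word_b): shared_features} for every pair of
    words whose feature sets have at least one feature in common."""
    edges = {}
    for a, b in combinations(sorted(features_by_word), 2):
        shared = features_by_word[a] & features_by_word[b]
        if shared:
            edges[(a, b)] = shared
    return edges


# Hypothetical feature sets, for illustration only.
features = {
    "run": {"verb", "motion"},
    "walk": {"verb", "motion"},
    "river": {"noun", "motion"},
    "stone": {"noun"},
}

edges = shared_feature_edges(features)
for pair, shared in edges.items():
    print(pair, sorted(shared))
```

An edge list of this shape is what a graph layout routine would typically take as input.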
Flowerbed is a simple visualization tool for viewing up to two documents. Each document is presented as a “flowerbed”, wherein each word is a flower. The height of the flower is determined by the relative frequency of that word within the document. The petals on each flower represent features associated with the word. When two documents are present in the chosen corpus, the second is mirrored below the first. In this way it is possible to do a cursory comparison of the documents.
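As a rough illustration of comparing two documents by relative word frequency, the sketch below computes, for each word, its share of the words in each document. The function names and the simple letters-only tokenizer are assumptions; how Flowerbed actually turns these values into flower heights is not specified here.

```python
import re
from collections import Counter


def relative_frequencies(text):
    """Return each word's count divided by the total number of words."""
    words = re.findall(r"[a-z]+", text.lower())
    total = len(words)
    if total == 0:
        return {}
    return {word: count / total for word, count in Counter(words).items()}


def compare(doc_a, doc_b):
    """Pair up the relative frequency of every word in both documents.

    Words missing from one document get a frequency of 0.0 there.
    """
    freq_a = relative_frequencies(doc_a)
    freq_b = relative_frequencies(doc_b)
    return {
        word: (freq_a.get(word, 0.0), freq_b.get(word, 0.0))
        for word in sorted(set(freq_a) | set(freq_b))
    }


heights = compare("rose rose lily", "lily lily lily rose")
for word, (first, second) in heights.items():
    print(f"{word}: {first:.2f} vs {second:.2f}")
```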