Roget’s Thesaurus

node type: category
edge type: references or is related to
number of nodes: 994 (largest connected component of 1010)
number of edges: 5058 (largest connected component of 5074)
source: The Stanford GraphBase, http://www-cs-faculty.stanford.edu/~knuth/sgb.html
data: Roget.txt

The edge-repulsion LinLog drawing of Roget’s thesaurus provides a nice map of (parts of) the English language, because semantically related categories are grouped together (see the VRML). This is exemplified by the two zoomed areas. 

The node-repulsion LinLog model and the Fruchterman-Reingold model mainly cluster together categories with many references in the center of the drawing.

 

Complete thesaurus, edge-repulsion LinLog model

 

Zoom into the left part Zoom into the bottom part

 

Fruchterman-Reingold model (for comparision) Node-repulsion LinLog model (for comparison)

 

back