Indexing Tab

This tab displays, in tree form, a list of the words that occur in all uncategorized e-mails.

The index tree consists of folder icons, each labeled with a word, with the number of occurrences (number of e-mails it occurs in) in square brackets. These words can be called head words.

Each head word folder expands to a list of the words that co-occur with the head word. Each co-occurring word is followed by square brackets containing two numbers: the number of e-mails this word occurs in, and a ratio. This ratio is the rate of occurrence with this head word divided by rate of occurrence in whole corpus. For example:

This part of the index tree includes the following information:

This indicates that the words articles and newsstand are highly likely to occur together, which means e-mails that contain both words are good candidates for grouping together in a category.

At the bottom of the tab are the following boxes and buttons:

Find WordsEnter a word in this box to restrict the index tree to displaying only that word as a head word.

Min. Texts with words—Enter the minimum number of e-mails that a word must occur in if it is to be displayed in the list.

Rebuild Index Tree—Rebuild the tree to apply the filters that you set in the Find Words and Min. Texts with words boxes.

Select Texts—Select a word in the index tree, then click this button to put all e-mails containing this word in the Candidate messages list on the Main tab.

For more details on the Indexing tab, see the "Indexing Tab" section in the "Genesys Knowledge Management: Content Analyzer" chapter of the Multimedia 7.6 User's Guide.