Input Section

The data file "book.dat" is a plain-text file in which each row holds one N-dimensional data item, with the values separated by white space (spaces or tabs). A three-dimensional data item would be a row like:

2.0    3.4    128.0

When you load the data, the program tries to determine the number of data items and the dimensionality of the data from the contents of the file.
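
The detection described above can be sketched as follows. This is only an illustration of the file format, not the program's own loader: the function name parse_data, the choice to skip blank lines, and the check that every row has the same number of values are all assumptions.

```python
def parse_data(text):
    """Parse whitespace-separated numeric rows.

    Returns (rows, n_items, n_dims). Hypothetical helper, not part of the program.
    """
    rows = []
    for line in text.splitlines():
        fields = line.split()  # splits on any run of spaces or tabs
        if not fields:
            continue  # assumption: blank lines are ignored
        rows.append([float(f) for f in fields])

    if not rows:
        raise ValueError("no data rows found")

    # The dimensionality is taken from the first row.
    n_dims = len(rows[0])
    for row_no, row in enumerate(rows, start=1):
        if len(row) != n_dims:
            raise ValueError(
                f"row {row_no} has {len(row)} values, expected {n_dims}"
            )
    return rows, len(rows), n_dims


if __name__ == "__main__":
    sample = "2.0    3.4    128.0\n1.5\t0.2\t64.0\n"
    rows, n_items, n_dims = parse_data(sample)
    print(n_items, "items of", n_dims, "dimensions")
```

To try it on a real file, read the file's contents into a string first, for example with open("book.dat").read().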

Clustering Section

Indicate the number of clusters desired and the maximum number of iterations.

The default is a random seeding of the data points into the clusters. It is also possible to start from a crisp initial partition file: uncheck "Random Initial Assignment" and you will be asked to select a partition file.
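
One way a random initial assignment could work is sketched below. How the program actually seeds its clusters is not documented here, so the function name random_assignment, the seed parameter, and the guarantee that no cluster starts empty are assumptions made for illustration.

```python
import random


def random_assignment(n_points, n_clusters, seed=None):
    """Return a list giving a cluster index (0-based) for each data point.

    Hypothetical helper: every cluster receives at least one point.
    """
    if not 1 <= n_clusters <= n_points:
        raise ValueError("need 1 <= n_clusters <= n_points")

    rng = random.Random(seed)
    # One label per cluster first, so that no cluster is empty,
    # then random labels for the remaining points.
    labels = list(range(n_clusters))
    labels += [rng.randrange(n_clusters) for _ in range(n_points - n_clusters)]
    rng.shuffle(labels)  # spread the guaranteed labels over the points
    return labels


if __name__ == "__main__":
    print(random_assignment(5, 2, seed=0))
```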

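The partition file "book.par" is a text file that gives the cluster membership of each data point. Each row corresponds to one cluster and each column to one data point; a 1 marks a data point that belongs to that cluster and a 0 marks one that does not. If there are five data points, the first row of "book.par" lists the first cluster's data points: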

1 0 0 0 1

This indicates that Cluster One contains the first and last data points.
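
A partition file in this layout can be read and checked with a short sketch like the one below. It assumes one row per cluster and that "crisp" means every data point belongs to exactly one cluster; the function names parse_partition and cluster_members are hypothetical, and the program's own checks may differ.

```python
def parse_partition(text, n_points):
    """Parse a 0/1 membership matrix with one row per cluster.

    Hypothetical helper: raises ValueError if the partition is not crisp.
    """
    matrix = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # assumption: blank lines are ignored
        row = [int(f) for f in fields]
        if len(row) != n_points:
            raise ValueError(f"expected {n_points} entries, got {len(row)}")
        if any(v not in (0, 1) for v in row):
            raise ValueError("entries must be 0 or 1")
        matrix.append(row)

    # Crisp partition: each column (data point) has exactly one 1.
    for j in range(n_points):
        if sum(row[j] for row in matrix) != 1:
            raise ValueError(f"data point {j + 1} is not in exactly one cluster")
    return matrix


def cluster_members(matrix):
    """List the 1-based data point numbers in each cluster."""
    return [[j + 1 for j, v in enumerate(row) if v == 1] for row in matrix]


if __name__ == "__main__":
    sample = "1 0 0 0 1\n0 1 1 1 0\n"  # two clusters, five data points
    print(cluster_members(parse_partition(sample, 5)))
```

In the two-cluster sample, the first row is the example row shown above and the second row places the remaining three data points in Cluster Two.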

Output Section

Type in the name of the output file. For large data sets it is impractical to output all the intermediate values, so the user should check "Last Only". The "View" button opens the text file in Notepad.