Reading Data 

Home
About Clustan
Cluster Analysis
Applications
ClustanGraphics
User Support
Clustan/PC
Orders
What's New
White Papers
Contact Us
Reading Data
Proximities
Hierarchical
Constraints
k-Means
FocalPoint
Cluster Keys
Profiles
Exemplars
Classify Cases
ClustanPCA
Scatterplots
Auto Script
Wizard
This section describes various aspects of reading your data.

The simplest method of data input is from a text file, with the values separated by spaces or tab characters.  The data should be a square matrix, with the rows corresponding to the cases and the columns corresponding to the variables.  The cluster analysis will find groups of cases which are similar - if you are interested in clustering your variables, click here.

Another way of reading data is from an Excel spreadsheet.  We recently improved our Excel interface so that, for example, a spreadsheet of 5000 rows and 10 columns is read in 2 seconds.

Your data can contain missing values, which should be coded with a unique missing value code.

You may have different types of variables, in which case it will be necessary to specify their types.  You can do this in ClustanGraphics after you have read your data.

Another decision is whether or not to transform your variables prior to clustering.