raw.data is a simulated dataset of 3,000 independent SNPs and 1,004 individuals: 1,000 individuals belonging to one of 5 populations (200 individuals each) plus 4 outlying individuals. The matrix raw.data contains the values 0, 1, and 2, representing SNPs in additive coding. The pairwise genetic distances between populations are listed below (see Balding and Nichols, 1995):

        pop1     pop2     pop3     pop4     pop5
pop1    -        0.0040   0.0059   0.0085   0.0101
pop2    0.0040   -        0.0055   0.0082   0.0099
pop3    0.0059   0.0055   -        0.0104   0.0119
pop4    0.0085   0.0082   0.0104   -        0.0139
pop5    0.0101   0.0099   0.0119   0.0139   -
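
For working with these distances programmatically, the pairwise values listed above can be entered as a symmetric matrix in base R. This is only a minimal sketch: the object names pops and pop_dist are illustrative and are not part of the dataset, and the diagonal is set to NA because the source gives no diagonal values.

```r
# Pairwise genetic distances copied from the listing above.
# `pops` and `pop_dist` are illustrative names, not objects shipped with the data.
pops <- paste0("pop", 1:5)
pop_dist <- matrix(NA_real_, nrow = 5, ncol = 5, dimnames = list(pops, pops))

# upper.tri() fills column by column: (1,2), (1,3), (2,3), (1,4), ...
pop_dist[upper.tri(pop_dist)] <- c(
  0.0040,
  0.0059, 0.0055,
  0.0085, 0.0082, 0.0104,
  0.0101, 0.0099, 0.0119, 0.0139
)

# Mirror the upper triangle into the lower triangle.
pop_dist[lower.tri(pop_dist)] <- t(pop_dist)[lower.tri(pop_dist)]

# Locate the most and least differentiated pairs (each pair appears twice
# because the matrix is symmetric).
farthest <- which(pop_dist == max(pop_dist, na.rm = TRUE), arr.ind = TRUE)
closest  <- which(pop_dist == min(pop_dist, na.rm = TRUE), arr.ind = TRUE)
print(pop_dist)
```

In this matrix the smallest distance is between pop1 and pop2 (0.0040) and the largest is between pop4 and pop5 (0.0139).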

Usage

data(ipcaps_example)

Format

A matrix with 3,000 columns and 1,004 rows
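
A quick way to confirm this format after loading is sketched below. It assumes the dataset is distributed with a package named IPCAPS, which this page does not state explicitly; the code is skipped if that package is not installed.

```r
# Assumption: ipcaps_example ships with a package called "IPCAPS".
if (requireNamespace("IPCAPS", quietly = TRUE)) {
  data(ipcaps_example, package = "IPCAPS")

  # Rows are individuals, columns are SNPs.
  print(dim(raw.data))

  # Additive coding: the entries should be 0, 1, or 2.
  print(table(raw.data, useNA = "ifany"))
}
```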

References

Balding, D.J., and Nichols, R.A. (1995). A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica 96, 3-12.

See also

label and PC