PMID- 21170921 OWN - NLM STAT- MEDLINE DCOM- 20110531 LR - 20211203 IS - 1097-0258 (Electronic) IS - 0277-6715 (Linking) VI - 29 IP - 30 DP - 2010 Dec 30 TI - Multiple correspondence discriminant analysis: an application to detect stratification in copy number variation. PG - 3284-93 LID - 10.1002/sim.3890 [doi] AB - We illustrate the use of multiple correspondence analysis (MCA) to correct for population stratification of copy number alteration data. In addition, we propose the use of multiple correspondence discriminant analysis (MCDA) to identify an optimal set of copy number variants (CNVs) that correctly infers the population stratification of a CNV map. Within MCDA, we highlight the novel use of correlation with class directions for variable ranking. We found a set of 20 CNVs with 98 per cent predictability in a CNV map of the HapMap populations. On this sample, the selection of variables based on centroid ranking outperformed the most common practice of ranking variables with their correlation to the principal axes. CI - Copyright (c) 2010 John Wiley & Sons, Ltd. FAU - Caceres, Alejandro AU - Caceres A AD - Center for Research in Environmental Epidemiology (CREAL), Parc de Recerca Biomedica de Barcelona, 88 Doctor Aiguader, Barcelona, Spain. FAU - Basagana, Xavier AU - Basagana X FAU - Gonzalez, Juan R AU - Gonzalez JR LA - eng PT - Journal Article PT - Research Support, Non-U.S. Gov't PL - England TA - Stat Med JT - Statistics in medicine JID - 8215016 SB - IM MH - Computer Simulation MH - *DNA Copy Number Variations MH - *Data Interpretation, Statistical MH - *Discriminant Analysis MH - Genetic Association Studies/*methods MH - Humans MH - Polymorphism, Single Nucleotide MH - Racial Groups/genetics EDAT- 2010/12/21 06:00 MHDA- 2011/06/01 06:00 CRDT- 2010/12/21 06:00 PHST- 2010/12/21 06:00 [entrez] PHST- 2010/12/21 06:00 [pubmed] PHST- 2011/06/01 06:00 [medline] AID - 10.1002/sim.3890 [doi] PST - ppublish SO - Stat Med. 2010 Dec 30;29(30):3284-93. doi: 10.1002/sim.3890.