![]() |
Non-parametric Analysis of Population StructureI recently had a chance to look over this paper which n/a had pointed out a while back on his blog. Unlike popular Bayesian clustering approaches such as structure and BAPS, the authors’ new method (Maximization of Genetic Distance, MGD) does not make assumptions about intra-population or inter-locus allele distributions, namely that they are in Hardy-Weinberg and linkage equilibrium. In this sense, the method is analogous to statistical methods like Wilcoxon rank-sum (Mann-Whitney) and Kruskal-Wallis, which are non-parametric versions of the t-test and one-way ANOVA, respectively (these methods are non-parametric in that they do not assume a specific underlying distribution for the data, in these cases Normal). MGD uses a simulated annealing algorithm to find the partition that maximizes the average genetic distance between populations. In this respect, it is very similar to traditional clustering methods like k-means. One interesting feature of the method is that it permits use of different distance measures. The authors use Nei’s D, which, while superior to Fst/Gst in that it is less sensitive to the level of heterozygosity, has the disadvantage of approaching infinity under certain conditions. Jost’s D, a more recently developed measure, may prove superior here. In simulation studies, the method performs better than structure or BAPS when the number of clusters is large or population structure is more complex. However, only structure gives the correct clustering solution when Fst is as low as 0.01. Perhaps the most interesting result from the paper is the following, which compares the clustering solutions of structure and MGD over different numbers of desired clusters (2-7). The data set used is the HGDP microsatellite markers from Rosenberg’s 2002 paper. [IMG]http://i760.photobucket.com/albums/xx249/Aletheia14/Structure_vs_MGD.jpg[/IMG]
Based on the results of this paper, in particular the nonexistence of the Kalash as a separate cluster using MGD, it is questionable as to whether structure always provides the ‘most correct’ answer. Posted by Dasein on Monday, June 7, 2010 at 08:00 AM in Genetics & Human Bio-Diversity Comments:2
Posted by Guessedworker on June 07, 2010, 05:27 PM | # Based on the results of this paper, in particular the nonexistence of the Kalash as a separate cluster using MGD, it is questionable as to whether structure always provides the ‘most correct’ answer. So if structure is not the whole solution but must be augmented, on what consistent basis can you eliminate the negative results of possible augmentary methods and not be accused of simply picking the answer that is correct for you? 3
Posted by Dasein on June 08, 2010, 07:54 AM | #
The ideal consistent basis would be a stranglehold control over media and academia that forces acceptance of a method politically favourable to Europeans Kidding aside, I think when parametric and non-parametric methods give different results (however different something needs to be to be considered noteworthy), then at the very least the assumptions of the parametric method need to be questioned. I’m not aware that Kalash have been evaluated for levels of Hardy-Weinberg and linkage disequilibrium that would make them inappropriate for inclusion in structure analysis. Perhaps they have been. To make what could be a long and convoluted answer short, I don’t think we can really speak of a ‘correct’ answer. There are trade-offs, and correct is dependent on the context. Parametric methods are more powerful than non-parametric ones (and we see above that the parametric method, structure, works better when differentiation is very low), but one has to be careful about when to use them. As for context dependence, if you’re looking to model historic migration patterns, then Fst/Gst might be the best solution. If you’re looking to quantify population diversity using very heterozygous markers, like microsatellites, Fst/Gst is a bad choice. A distance measure that accounted for allelic correlation structure would find lots of practical application to questions of interest to us, like how to quantify genetic interests. 4
Posted by James Bowery on June 08, 2010, 10:31 AM | # Dasein writes: “The data set used is the HGDP microsatellite markers from Rosenberg’s 2002 paper.” GW asks: “...not be accused of simply picking the answer that is correct for you?” To put my answer into more concrete terms: You take the taxonomy produced by each other the methods and use them to compress the data set. The most compressed form of the data set (fewest total bits—including the decompression algorithm) is that based on the most generally “correct” taxonomy given what is known (the data set). That is to say, it will more often produce correct predictions. 5
Posted by Fred Scrooby on June 22, 2010, 07:59 AM | # A little off topic (though Salter is mentioned in the entry): I consider the following new piece by Frank Salter a must-read — I cite it here because it’s just gone off the front page over at Richard Spencer’s, so guys who wander over there to browse may miss it: http://www.alternativeright.com/main/blogs/hbd-human-biodiversity/open-borders-closed-minds/ . It consists of a series of rejoinders to academic race-replacement advocacy. Be careful, it’s often deceptively concise and compact, so that Prof. Salter may occasionally do stuff like condense three major show-stopping arguments in a few words casually hidden in half-a-sentence or so: although it’s simple you have to read it carefully to be sure of getting all the meat. No one on our side ought to be unfamiliar with any of the counter-arguments to race-replacement which Prof. Salter touches on in this extremely important piece, so simply-worded yet so jam-packed. 6
Posted by Fred Scrooby on June 22, 2010, 08:10 AM | # (I’m sure I had seen this new Salter piece discussed briefly somewhere before I read it — if it was somewhere here at MR.com that I saw it forgive me for not referring back to that; I can’t recall where it was and don’t have time at the moment to hunt for it.) 7
Posted by Armor on June 22, 2010, 08:35 AM | # K.MacDonald, in his blog, called attention to Salter’s article. Next entry: You my Heidegger: Dasein vs. The World of They Previous entry: They Are Kind to Animals |
|
Existential IssuesWhite Genocide ProjectOf note
Recent CommentsAlso see trash folder. Marie commented in entry 'The Cubans of Miami' on 05/24/12, 08:07 AM. (go) (view) daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 07:41 AM. (go) (view) uh commented in entry 'Golden Dawn - Greece' on 05/24/12, 07:18 AM. (go) (view) daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 06:53 AM. (go) (view) Swan commented in entry 'The facial proportions of beautiful people' on 05/24/12, 06:48 AM. (go) (view) Swan commented in entry 'The facial proportions of beautiful people' on 05/24/12, 06:47 AM. (go) (view) daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 06:32 AM. (go) (view) Guest commented in entry 'The Torment of the Mulattoes' on 05/24/12, 06:17 AM. (go) (view) daniel commented in entry 'Beyond the 14 words' on 05/24/12, 03:05 AM. (go) (view) Lee John Barnes commented in entry 'Golden Dawn - Greece' on 05/24/12, 02:31 AM. (go) (view) daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 02:03 AM. (go) (view) Captainchaos commented in entry 'Beyond the 14 words' on 05/23/12, 11:08 PM. (go) (view) Captainchaos commented in entry 'Golden Dawn - Greece' on 05/23/12, 09:13 PM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/23/12, 07:47 PM. (go) (view) Swan commented in entry 'Indian beauty' on 05/23/12, 12:52 PM. (go) (view) Lee John Barnes commented in entry 'Golden Dawn - Greece' on 05/23/12, 12:45 PM. (go) (view) Swan commented in entry 'More on the Indian beauty question' on 05/23/12, 12:31 PM. (go) (view) Leon Haller commented in entry 'Beyond the 14 words' on 05/23/12, 11:43 AM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/23/12, 11:32 AM. (go) (view) Mellaba Pechios commented in entry 'Golden Dawn - Greece' on 05/23/12, 07:55 AM. (go) (view) daniel commented in entry 'Beyond the 14 words' on 05/23/12, 03:51 AM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/22/12, 10:40 PM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/22/12, 10:40 PM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/22/12, 10:26 PM. (go) (view) Leon Haller commented in entry 'Golden Dawn - Greece' on 05/22/12, 10:23 PM. (go) (view) 7 Year BA commented in entry 'Golden Dawn - Greece' on 05/22/12, 09:19 PM. (go) (view) DARYL commented in entry 'A repeatable comment for mass-pasting on American public message boards' on 05/22/12, 08:57 PM. (go) (view) Thorn commented in entry 'Golden Dawn - Greece' on 05/22/12, 08:31 PM. (go) (view) Church of Jed commented in entry 'Golden Dawn - Greece' on 05/22/12, 07:40 PM. (go) (view) Selous Scout commented in entry 'Golden Dawn - Greece' on 05/22/12, 05:56 PM. (go) (view) Silver commented in entry 'Beyond the 14 words' on 05/22/12, 11:37 AM. (go) (view) AnalogMan commented in entry 'Golden Dawn - Greece' on 05/22/12, 11:29 AM. (go) (view) Wandrin commented in entry 'Beyond the 14 words' on 05/22/12, 07:42 AM. (go) (view) Srakotraqu commented in entry 'A repeatable comment for mass-pasting on American public message boards' on 05/22/12, 07:30 AM. (go) (view) Wandrin commented in entry 'Beyond the 14 words' on 05/22/12, 07:19 AM. (go) (view) Recent Posts
General NewsScience NewsScience CategoriesAll CategoriesThe WritersEach author's name links to a list of all articles posted by the writer; the hashes link to authors' homepages. LinksEndorsement not implied. Controlled Opposition Crime
General
Immigration
Islam Jews
Nationalist Political Parties
Science Whites in Africa |
Posted by James Bowery on June 07, 2010, 11:01 AM | #
The most general metric is minimum description length. I suspect that in the absence of quantum “computers” (necessary for optimally choosing the MDL “code”), the closest practical approximation will be Kolmogorov complexity. In that event I further suspect a fruitful line of research will involve SA/GP hybrid search strategies—some of which have already shown promise in other areas.