Kidd Lab 

Department of Genetics, Yale University

Interpretation of "phylogenetic" trees of populations based on gene frequency data.

In the abstract a tree is simply one of many graphical ways to represent similarities. An additive tree is simply a representation of a set of linear equations that gives graphically the segment lengths that best fit the pairwise distances. However, it is unavoidable that a tree be thought of in terms of historical divisions and diversifications of populations. Without major qualifications that would be an incorrect interpretation for most human populations. Few extant populations have had complete endogamy for many generations and it is unlikely that ancient populations were any different. Moreover, at most loci that have been studied on many populations, the allele frequencies tend to show a clinal pattern.

In spite of these and other problems with an evolutionary interpretation of trees, there are plausible historical inferences that can be made. For example, in the interim analyses depicted the Native American branch separates from the East Asian branch considerably before the majority of the East Asian populations diverge from one another. The exception is the Siberian Yakut, which seems intermediate between the two clusters. Once North America was colonized there was probably little gene flow between there and the bulk of East Asia. This pattern in the tree argues that the ancestors of Native Americans were a separate group of populations prior to the diversification of modern East Asians. The Yakut may be descended from a less diversified ancestral lineage or may be more recently the result of amalgamation of modern East Asians and populations related to the ancestors of American Indians, giving the impression in a tree diagram of a more intermediate origin.

To the degree various simplifying assumptions are met the branch lengths are in units of t/2Ne, time in generations divided by twice the effective population size. These additive trees are unrooted – from the data analyzed there is no information on where the population ancestral to all modern populations connects to the tree shown. However, many sets of data indicate that our species arose in Africa, placing the root among the African populations. Assuming the root of this tree is among the African populations, the distance along the tree to each extant population has the same elapsed time – the differences are in the effective population sizes over that time period. Populations that have recently been very small and isolated are on long terminal branches, something particularly noticeable for the Samaritans. But, if the population is not isolated and/or its history does not fit a model of successive bifurcations of ancestry, its position in the tree may be misleading if one assumes an always-branching ancestry. Thus, a population arising from amalgamation of already diversified lineages would in effect represent a population with higher effective population size giving a shorter terminal branch. It would also have more intermediate gene frequencies and tend to be placed in the tree roughly intermediate between the lineages that founded it. The Yakut are an excellent example of this possibility.
 


Back to Contents



© 1999 Kenneth K. Kidd, Yale University. All rights reserved.
These figures may be reproduced for classroom use only.