Biology:Genetics and archaeogenetics of South Asia

Short description: Biological field of study

Genetics and archaeogenetics of South Asia is the study of the genetics and archaeogenetics of the ethnic groups of South Asia. It aims at uncovering these groups' genetic histories. The geographic position of the Indian subcontinent makes its biodiversity important for the study of the early dispersal of anatomically modern humans across Asia.

Based on mitochondrial DNA (mtDNA) variations, genetic unity across various South Asian subpopulations have shown that most of the ancestral nodes of the phylogenetic tree of all the mtDNA types originated in the subcontinent.^[2]^[3]^[4]^[5] Conclusions of studies based on Y chromosome variation and autosomal DNA variation have been varied.

The genetic makeup of modern South Asians can be described at the deepest level as a combination of West Eurasian (related to ancient and modern people in Europe and West Asia) ancestries with divergent East Eurasian ancestries. The latter primarily include a proposed indigenous South Asian component (termed Ancient Ancestral South Indians, short "AASI") that is distantly related to the Andamanese peoples, as well as to East/Southeast Asians and Australasians, and further include additional, regionally variable East/Southeast Asians components.^[6]^[7]^[8]^[9]

The proposed AASI type ancestry is closest to the non-West Eurasian part, termed S-component, extracted from South Asian samples, especially those from the Irula tribe, and is generally found throughout all South Asian ethnic groups in varying degrees.^[7] The West Eurasian ancestry, which is closely related to Mesolithic hunter-gatherers and Neolithic farmers who lived on the Iranian Plateau (who are also closely related to Caucasus hunter-gatherers), forms the major source of the South Asian genetic makeup, and combined with varying degrees of AASI ancestry, formed the Indus Periphery Cline around ~5400–3700 BCE, which constitutes the main ancestral heritage of most modern South Asian groups. The Indus Periphery ancestry, around the 2nd millennium BCE, mixed with another West Eurasian wave, the incoming mostly male-mediated Yamnaya-Steppe component (archaeogenetically dubbed the Western Steppe Herders) to form the Ancestral North Indians (ANI), while at the same time it contributed to the formation of Ancestral South Indians (ASI) by admixture with hunter-gatherers having higher proportions of AASI-related ancestry. The ANI–ASI gradient, as demonstrated by the higher proportion of ANI in traditionally upper caste and Indo-European speakers, that resulted because of the admixture between the ANI and the ASI after 2000 BCE at various proportions is termed as the Indian Cline.^[6]^[7]^[10]^[11] The East Asian ancestry component forms the major ancestry among Tibeto-Burmese and Khasian speakers, and is generally restricted to the Himalayan foothills and Northeast India, with substantial presence also in Munda-speaking groups, as well as in some populations of northern, central and eastern South Asia.^[12]^[13]^[14]^[15]^[16]^[17]^[18]

Overview

Graph model showing various admixture proportions in ancient and modern populations of South Asia (Narasimhan 2019)^[6]

Ancestral components modelled at K2 to K5 for modern South Asian ("Indian-like") populations based on improved "masks" with 456330 SNPs (Yelmen 2019)^[7]

Results of ADMIXTURE analysis at K8 ancestral components with global populations. The populations are ordered geographically in a bar plot (Pathak 2018).^[9]

Modern South Asians are descendants of a combination of Western Eurasian ancestries (notably "Iranian Neolithic Farmers" and "Western Steppe Herders" components) with an indigenous East Eurasian component (termed Ancient Ancestral South Indians, short "AASI") closest to the non–West Eurasian part extracted from South Asian samples; distantly related to the Andamanese peoples, East/Southeast Asians and Australasians, as well as regional variable additional East/Southeast Asian components respectively.^[6]^[7]^[8]^[9]^[19]

Phylogenetic position of the Indigenous South Asian (AASI) lineage among other East Eurasians

Modern South Asians display high genetic affinities to Ancient Iranian and Caucasus hunter-gatherer lineages.

The proposed AASI lineage, which is hypothesized to represent the ancestry of the very first hunter-gatherers and peoples of the Indian subcontinent, formed around ~40,000 years BCE. It was found that the AASI are distinct from Western Eurasian groups and have a closer genetic affinity with Ancient East Eurasians (such as Andamanese Onge or East Asian peoples) which is suggested to have diverged from Ancient West Eurasians around 48,000 years ago likely on the Persian plateau. Based on this, it has been inferred that the AASI lineage diverged from other Eastern Eurasian lineages, such as 'East and Southeast Asians' and 'Australasians', during their dispersal using a Southern route.^[8]^[20]^[21] The Andamanese people are among the relatively most closely related modern populations to the AASI component and henceforth used as an (imperfect) proxy for it,^[6]^[10] but others (Yelmen et al. 2019) note that both are deeply diverged from each other, and propose that the AASI type ancestry is closest to the non-West Eurasian part, termed S-component, extracted from the South Asian samples, especially those from the Irula tribe.^[7]^[22]^[23]^[24] Shinde et al. 2019 noted that both Andamanese Onge or East Siberian groups can be used as proxy for the non-West Eurasian-related component in the "qpAdm" admixture-modelling of an IVC-related individual (labelled "I6113") because both populations "have the same phylogenetic relationship to the non-West Eurasian-related of I6113 likely due to shared ancestry deeply in time".^[25]

Bennett et al. 2024 summarised:^[19]

AASI: – Ancient Ancestral South Indian, one of three deeply branching East Asian lineages (with AA and ESEA). This South Asian hunter-gatherer ancestry is found primarily in present-day southern India and South Asia.

Genetic data shows that the main West Eurasian geneflow event happened during the Neolithic period,^[26] or already during the Holocene (pre-Neolithic period) by Iranian hunter-gatherers.^[25]^[6]^[11]^[15]^[16]^[18]^[27]^[28]^[29]^[30] There is also evidence that some West Eurasian like ancestry reached South Asia earlier, during the Upper Paleolithic (around 40,000–30,000 years BCE)."^{[web 1]} Yet, the pre-Neolithic ancestry of South Asia belonged to "an indisputably EEC genetic component ... that made up the majority of the pre-Neolithic genetic landscape".^[31]^[30]

The Neolithic or Pre-Neolithic Iranian geneflow, in tandem with variable amounts of AASI admixture, gave rise to the Indus Periphery Cline, which is characteristic for modern South Asians and forms the major source of the gene pool. The introduction of this ancestry might be associated with the spread of Dravidian languages. Genetic data suggests that the specific Ancient Iranian-related lineage diverged from other lineages from the Neolithic Iranian plateau about 10,000 years ago.^[25]^[6]^[11]^[15]^[16]^[29]^[27]^[32]^[33]^[34] According to an international research team led by palaeogeneticists of the Johannes Gutenberg University Mainz (JGU), the main ancestry component of South Asians is derived from a population related to Neolithic farmers from the eastern Fertile Crescent and Iran.^[11] The Iranian-related ancestry found in >95% of individuals on the Indian cline can also be modelled as deriving from Early Neolithic Sarazm populations from Tajikistan.^[35]

In the 2nd millennium BCE, the Indus Periphery-related ancestry mixed with the arriving Yamnaya-Steppe component forming the Ancestral North Indians (ANI), while at the same time it contributed to the formation of Ancestral South Indians (ASI) by admixture with hunter-gatherers further South having higher proportions of AASI-related ancestry. The proximity to West Eurasian populations is based on the ANI-ASI gradient, also termed the Indian Cline, with the groups harbouring higher ANI-ancestry being closer to West Eurasians as compared to populations harbouring higher ASI-ancestry. Tribal groups from southern India harbour mostly ASI ancestry and sits farthest from West Eurasian groups on the PCA compared to other South Asians. The Yamnaya or Western Steppe pastoralist component is found in higher frequency among Indo-Aryan speakers, and is distributed throughout the Indian subcontinent at lower frequency.^[6]^[10] Certain communities and caste groups from the northern Indian subcontinent display a peak of Western Steppe Herder ancestry at similar amounts as Northern Europeans.^[16]^[9]

An East Asian-related ancestry component forms the major ancestry among Tibeto-Burmese and Khasi an speakers in the Himalayan foothills and Northeast India, and is also found in substantial presence in Mundari-speaking groups.^[12]^[13]^[36]^[28] According to Zhang et al., Austroasiatic migrations from Southeast Asia into India took place after the last Glacial maximum, circa 10,000 years ago.^[37] Arunkumar et al. suggest Austroasiatic migrations from Southeast Asia occurred into Northeast India 5.2 ± 0.6 kya and into East India 4.3 ± 0.2 kya.^[38] Tätte et al. 2019 estimated that the Austroasiatic language speaking people admixed with Indian population about 2000–3800 years ago, which may suggest arrival of Southeast Asian genetic component in the area.^[39]

It has been found that the ancestral node of the phylogenetic tree of all the mtDNA types (mitochondrial DNA haplogroups) typically found in Central Asia, the West Asia and Europe are also to be found in South Asia at relatively high frequencies. The inferred divergence of this common ancestral node is estimated to have occurred slightly less than 50,000 years ago.^[40] In India, the major maternal lineages are various M subclades, followed by R and U sublineages. These mitochondrial haplogroups' coalescence times have been approximated to date to 50,000 BP.^[40]

The major paternal lineages of South Asians are represented by the West Eurasian-affiliated haplogroups R1a1, R2, H, L and J2. A minority belongs to the East Eurasian-affiliated Haplogroup O-M175. O-M175 is mainly restricted to Austroasiatic and Tibeto-Burmese speakers, and also common among East and Southeast Asians, while H is largely restricted to South Asians and R1a1, J2 and L as well as a subclade of H (H2) are commonly found among European and Middle Eastern populations.^[41]^[42] Some researchers have argued that Y-DNA Haplogroup R1a1 (M17) is of autochthonous South Asian origin.^[43]^[44] However, proposals for a Central Asian/Eurasian steppe origin for R1a1 are also quite common and supported by several more recent studies.^[43]^[45]^[46]^{[web 2]}^[47] Other minor haplogroups include subclades of Q-M242, G-M201, R1b, as well as Haplogroup C-M130.^[43]^[48]^[49]

Genetic studies comparing eight X chromosome based STR markers using a multidimensional scaling plot (MDS plot), revealed that modern-day South Asians like Indians, Pakistanis, Bangladeshis and Sinhalese people cluster close to each other, but also closer to Europeans. In contrast Southeast Asians, East Asians and Africans were placed at a distant positions, outside the main cluster.^[50]

mtDNA

The spatial distribution of M, R and U mtDNA haplogroups and their sub-haplogroups in South Asia

The most frequent mtDNA haplogroups in South Asia are M, R and U (where U is a descendant of R).^[41] Arguing for the longer term "rival Y-Chromosome model",^[43] Stephen Oppenheimer believes that it is highly suggestive that India is the origin of the Eurasian mtDNA haplogroups which he calls the "Eurasian Eves". According to Oppenheimer it is highly probable that nearly all human maternal lineages in Central Asia, the Middle East and Europe descended from only four mtDNA lines that originated in South Asia 50,000–100,000 years ago.^[51]

Macrohaplogroup M

The macrohaplogroup M, which is considered as a cluster of the proto-Asian maternal lineages,^[40] represents more than 60% of South Asian MtDNA.^[52]

The M macrohaplotype in India includes many subgroups that differ profoundly from other sublineages in East Asia especially Mongoloid populations.^[40] The deep roots of M phylogeny clearly ascertain the relic of South Asian lineages as compared to other M sublineages (in East Asia and elsewhere) suggesting 'in-situ' origin of these sub-haplogroups in South Asia, most likely in India. These deep-rooting lineages are not language specific and spread over all the language groups in India.^[52]

Virtually all modern Central Asian MtDNA M lineages seem to belong to the Eastern Eurasian (Mongolian) rather than the South Asian subtypes of haplogroup M, which indicates that no large-scale migration from the present Turkic-speaking populations of Central Asia occurred to India. The absence of haplogroup M in Europeans, compared to its equally high frequency among South Asians, East Asians and in some Central Asian populations contrasts with the Western Eurasian leanings of South Asian paternal lineages.^[40]

Most of the extant mtDNA boundaries in South and Southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans.^[53]

Haplogroup	Important Sub clades	Populations
M2	M2a, M2b	Throughout the continent with low presence in Northwest Peaking in Bangladesh, Andhra Pradesh, coastal Tamil Nadu and Sri Lanka
M3	M3a	Concentrated into northwestern India Highest amongst the Parsees of Mumbai
M4	M4a	Peaks in Pakistan, Kashmir and Andhra Pradesh
M6	M6a, M6b	Kashmir and near the coasts of the Bay of Bengal, Sri Lanka
M18		Throughout South Asia Peaking at Rajasthan and Andhra Pradesh
M25		Moderately frequent in Kerala and Maharashtra but rather infrequent elsewhere in India

Macrohaplogroup R

The macrohaplogroup R (a very large and old subdivision of macrohaplogroup N) is also widely represented and accounts for the other 40% of South Asian MtDNA. A very old and most important subdivision of it is haplogroup U that, while also present in West Eurasia, has several subclades specific to South Asia.

Most important South Asian haplogroups within R:^[53]

Haplogroup	Populations
R2	Distributed widely across the sub continent
R5	widely distributed in most of India. Peaks in coastal SW India
R6	widespread at low rates across India. Peaks among Tamils and Kashmiris
W	Found in northwestern states. Peaks in Gujarat, Punjab and Kashmir, frequency is low elsewhere.

Haplogroup U

Haplogroup U is a sub-haplogroup of macrohaplogroup R.^[53] The distribution of haplogroup U is a mirror image of that for haplogroup M: the former has not been described so far among eastern Asians but is frequent in European populations as well as among South Asians.^[54] South Asian U lineages differ substantially from those in Europe and their coalescence to a common ancestor also dates back to about 50,000 years.^[2]

Haplogroup	Populations
U2*	(a parahaplogroup) is sparsely distributed specially in the northern half of the South Asia. It is also found in SW Arabia.
U2a	shows relatively high density in Pakistan and NW India but also in Karnataka, where it reaches its higher density.
U2b	has highest concentration in Uttar Pradesh but is also found in many other places, specially in Kerala and Sri Lanka. It is also found in Oman.
U2c	is specially important in Bangladesh and West Bengal.
U2i	is maybe the most important numerically among U subclades in South Asia, reaching specially high concentrations (over 10%) in Uttar Pradesh, Sri Lanka, Sindh and parts of Karnataka. It also has some importance in Oman. mtDNA haplogroup U2i is dubbed "Western Eurasian" in Bamshad et al. study but "Eastern Eurasian (mostly India specific)" in Kivisild et al. study.
U7	this haplogroup has a significant presence in Gujarat, Punjab and Pakistan. The possible homeland of this haplogroup spans Gujarat (highest frequency, 12%) and Iran because from there its frequency declines steeply both to the east and to the west.

Y chromosome

The major South Asian Y-chromosome DNA haplogroups are H, J2, L, R1a1, R2, which are commonly found among other West Eurasian populations, such as Middle Easterners or Europeans.^[41] Their geographical origins are listed as follows, according to the latest scholarship:

Major South Asian Y-chromosomal lineages:	H H-L901	J2 J-M172	L L-M20	R1a1a1 R-M417	R2 R-M479
Basu et al. (2003)	no comment	no comment	no comment	Central Asia	no comment
Kivisild et al. (2003)	India	Western Asia	India	Southern and Western Asia	South-Central Asia
Cordaux et al. (2004)	India	West or Central Asia	Middle Eastern	Central Asia	South-Central Asia
Sengupta et al. (2006)	India	The Middle East and Central Asia	South India	North India	North India
Thanseem et al. (2006)	India	The Levant	The Middle East	Southern and Central Asia	Southern and Central Asia
Sahoo et al. (2006)	South Asia	The Near East	South Asia	South or West Asia	South Asia
Mirabal et al. (2009)	no comment	no comment	no comment	Northwestern India or Central Asia	no comment
Zhao et al. (2009)	India	The Middle East	The Middle East	Central Asia or West Eurasia	Central Asia or West Eurasia
Sharma et al. (2009)	no comment	no comment	no comment	South Asia	no comment
Thangaraj et al. (2010)	South Asia	The Near East	The Near East	South Asia	South Asia

Haplogroup H

Haplogroup H (Y-DNA) is found at a high frequency in South Asia and is considered to represent the major paternal lineage. H is today rarely found outside of South Asia, but is common among South Asian-descended populations, such as the Romanis, particularly the H-M82 subgroup. H was also found in some ancient samples of Europe and is still found today at a low frequency in certain southeastern Europeans and Arabs of the Levant. Haplogroup H is frequently found among populations of India, Sri Lanka, Nepal, Bangladesh, Pakistan and the Maldives. All three branches of Haplogroup H (Y-DNA) are found in South Asia.

Probable site of introduction; South Asia or West Asia^[55] or Southern Central Asia.^[56] It seems to represent the main Y-Chromosome haplogroup of the Paleolithic inhabitants of South Asia and Europe respectively. Some individuals in South Asia have also been shown to belong to the much rarer subclade H3 (Z5857).^[57] Haplogroup H is by no means restricted to specific populations. For example, H is possessed by about 28.8% of Indo-Aryan castes.^[43]^[58] and in tribals about 25–35%.^[45]^[58]

Haplogroup J2

Haplogroup J2 has been present in South Asia mostly as J2a-M410 and J2b-M102, since Neolithic times (9500 YBP).^[59]^[60] J2 clades attain peak frequencies in the North-West and South India^[59] and is found at 19% within South Indian castes, 11% in North Indian castes and 12% in Pakistan.^[43] In South India, the presence of J2 is higher among middle castes at 21%, followed by upper castes at 18.6% and lower castes at 14%.^[43] Among caste groups, the highest frequency of J2-M172 is observed among Tamil Vellalars of South India, at 38.7%.^[43] J2 is present in tribals too^[59] and has a frequency of 11% in Austro-Asiatic tribals. Among the Austro-Asiatic tribals, the predominant J2 occurs in the Lodha (35%).^[43] J2 is also present in the South Indian hill tribe Toda at 38.46%,^[61] in the Andh tribe of Telangana at 35.19%^[45] and in the Kol tribe of Uttar Pradesh at a frequency of 33.34%.^[62] Haplogroup J-P209 was found to be more common in India's Shia Muslims, of which 28.7% belong to haplogroup J, with 13.7% in J-M410, 10.6% in J-M267 and 4.4% in J2b.^[63]

In Pakistan, the highest frequencies of J2-M172 were observed among the Parsis at 38.89%, the Dravidian-speaking Brahuis at 28.18% and the Makrani Balochs at 24%.^[64] It also occurs at 18.18% in Makrani Siddis and at 3% in Karnataka Siddis.^[64]^[65]

J2-M172 is found at an overall frequency of 10.3% among the Sinhalese people of Sri Lanka.^[48] In Maldives, 20.6% of Maldivian population were found to be haplogroup J2 positive.^[66]

Haplogroup L

According to Dr. Spencer Wells, L-M20 originated either in India or the Middle East, among the K-M9 descendants that migrated eastwards from the Middle East and later southwards from the Pamir Knot, before reaching India c. 30,000 years ago.^[67]^[68] Other studies have proposed either a West Asian or South Asian origin for L-M20 and associated its expansion in the Indus valley (~7,000 YBP) to Neolithic farmers.^[45]^[58]^[64]^[69]^[70]^[71] Genetic studies suggest that L-M20 may be one of the haplogroups of the original creators of the Indus Valley Civilisation.^[72]^[73] There are three subbranches of haplogroup L: L1-M76 (L1a1), L2-M317 (L1b) and L3-M357 (L1a2), found at varying levels in South Asia.^[43]

India

Haplogroup L shows time of Neolithic expansion.^[74] The clade is present in the Indian population at an overall frequency of c. 7–15%.^[43]^[45]^[58]^[75] Haplogroup L has a higher frequency among south Indian castes (c. 17–19%) and reaches 68% in some castes in Karnataka but is somewhat rarer in northern Indian castes (c. 5–6%).^[43] The presence of haplogroup L is quite rare among tribal groups (c. 5.6–7%);^[43]^[45]^[58] however, 14.6% has been observed among the Chenchus.^[48]

Among regional and social groups, moderate to high frequencies have been observed in Jats (36.8%),^[73] Konkanastha Brahmins (18.6%), Lambadis (17.1%), Punjabis (12.1%) and Gujaratis (10.4%).^[48]

Pakistan

In Pakistan, L1-M76 and L3-M357 subclades of L-M20 reach overall frequencies of 5.1% and 6.8%, respectively.^[43] Haplogroup L3 (M357) is found frequently among Burusho (approx. 12%^[76]) and Pashtuns (approx. 7%^[76]). Its highest frequency can be found in south western Balochistan province along the Makran coast (28%) to Indus River delta. L3a (PK3) is found in approximately 23% of Nuristani in northwest Pakistan.^[76]

The clade is present in moderate distribution among the general Pakistani population (14% approx).^[64]^[69]

Sri Lanka

In one study, 16% of the Sinhalese were found to be Haplogroup L-M20 positive.^[77] In another study 18% were found to belong to L1.^[48]

Haplogroup R1a1

In South Asia, R1a1 has been observed often with high frequency in a number of demographic groups,^[44]^[43]^[78] as well as with highest STR diversity which lead some to see it as the locus of origin.^[62]^[48]^[79]

While R1a originated c. 22,000^[62] to 25,000^[80] years ago, its subclade M417 (R1a1a1) diversified c. 5,800 years ago.^[80] The distribution of M417-subclades R1-Z282 (including R1-Z280)^[81] in Central and Eastern Europe and R1-Z93 in Asia^[80]^[81] suggests that R1a1a diversified within the Eurasian Steppes or the Middle East and Caucasus region.^[81] The place of origin of these subclades plays a role in the debate about the origins of Indo-Europeans.

India

In India, a high percentage of this haplogroup is observed in West Bengal Brahmins (72%)^[43] to the east, Gujarat Lohanas (60%)^[78] to the west, Khatris (67%)^[78] in the north, and Karnataka Medars (39%) in the south.^[65] It has also been found in several South Indian Dravidian-speaking tribals including the Kotas (41%) of Tamil Nadu,^[61] Chenchu (26%) and Valmikis of Andhra Pradesh^[48] as well as the Yadav and Kallar of Tamil Nadu suggesting that M17 is widespread in these southern Indians tribes.^[48] Besides these, studies show high percentages in regionally diverse groups such as Manipuris (50%)^[78] to the extreme northeast and in among Punjabis (47%)^[48] to the extreme northwest.

Pakistan

In Pakistan, it is found at 71% among the Mohanna of Sindh Province to the south and 46% among the Baltis of Gilgit-Baltistan to the north.^[78]

Sri Lanka

23% of the Sinhalese people out of a sample of 87 subjects were found to be R1a1a (R-SRY1532) positive according to a 2003 research,^[77] while another research in the same year found 12.8% of 38 samples belonged to this specific haplogroup.^[48]

Maldives

In the Maldives, 23.8% of the Maldivian people were found to be R1a1a (M17) positive.^[66]

Nepal

People in Terai region, Nepal show R1a1a at 69%.^[82]

Haplogroup R2

In South Asia, the frequency of R2 and R2a lineage is around 10–15% in India and Sri Lanka and 7–8% in Pakistan. At least 90% of R-M124 individuals are located in South Asia.^[83] It is also reported in Caucasus and Central Asia at a lower frequency. A genetic study by Mondal et al. in 2017 concluded that Haplogroup R2 originated in northern India and was already present before the Steppe migration.^[84] Though, some of the oldest samples were detected among Mesolithic and Neolithic individuals from Iranian Plateau and Turan.^[6]^[85]

India

Among regional groups, it is found among West Bengalis (23%), New Delhi Hindus (20%), Punjabis (5%) and Gujaratis (3%).^[48] Among tribal groups, the Karmali tribe of West Bengal showed highest at 100%^[44] followed by Lodhas (43%)^[86] to the east, while Bhil of Gujarat in the west were at 18%,^[62] Tharus of the north showed it at 17%,^[5] the Chenchu and Pallan of the south were at 20% and 14% respectively.^[43]^[44] Among caste groups, high percentages are shown by Jaunpur Kshatriyas (87%), Kamma (73%), Bihar Yadav (50%), Khandayat (46%)and Kallar (44%).^[44]

It is also significantly high in many Brahmin groups including Punjabi Brahmins (25%), Bengali Brahmins (22%), Konkanastha Brahmins (20%), Chaturvedis (32%), Bhargavas (32%), Kashmiri Pandits (14%) and Lingayat Brahmins (30%).^[5]^[44]^[46]^[62]

North Indian Muslims have a frequency of 19% (Sunni) and 13% (Shia),^[46] while Dawoodi Bohra Muslim in the western state of Gujarat have a frequency of 16% and Mappila Muslims of southern India have a frequency of 5%.^[87]

Pakistan

Sri Lanka

38% of the Sinhalese of Sri Lanka were found to have R2 according to a 2003 research.^[48]

Maldives

12% of the Maldivians are found to have R2.^[66]

Nepal

In Nepal, R2 percentages range from 2% to 26% within different groups under various studies. Newars show a significantly high frequency of 26% while people of Kathmandu show it at 10%.

Haplogroup O

Haplogroup O1 (O-F265) and O2 (O-M122), the primary branches of Haplogroup O-M175 are very common among the Austroasiatic and Tibeto-Burmese speaking populations of South Asia respectively.^[88]

Haplogroup O-M95, a subclade of O1-F265, is mainly restricted in Austroasiatic-speaking groups in South Asia.^[86]^[89] According to Kumar et al 2007, M95 averages at 55% in Munda and 41% of Khasi-Khmuic speakers of from Northeast India, while Reddy et al. 2007 found an average frequency 53% among Mundari and 31% among Khasi speakers.^[86]^[88] Zhang et al. 2015, found a higher average of 67.53% and 74,00% among Munda and Khasi-speaking groups respectively.^[89] Abundant in the Andaman and Nicobar Islands (averaging ~45%), it is fixed (100%) in some populations like Shompen, Onge and Nicobarese.^[86]^[89] A migration of O-M95 from Southeast Asia into India has been suggested with an expansion time of 5.2 ± 0.6 KYA in Northeast India.^[42]

Haplogroup O2-M122 is primarily found among the males of Tibeto-Burmese ancestry in the Himalayas and Northeast India.^[90] Haplogroup O-M122, believed to have originated in Southern China shows very high percentages.^[91] It is found at 86.6% among Tamangs of Nepal, with similarly high frequencies, 75% to 85%, among the northeastern Indian Tibeto-Burman groups, including Adi, Naga, Apatani, Nyishi, Kachari and Rabha.^[90]^[92] In Northeast India, Baric speakers display a high frequency and homogeneity of O-M134, indicating a population bottleneck effect that occurred during a westward and then southward migration of the founding population of Tibeto-Burmans during its branching from the parental population.^[90] It has a significant presence among the Khasis (29%), despite being generally absent in other Austroasiatics of India, and it shows up at 55% among neighbouring Garos, a Tibeto-Burman group.^[86]

Reconstructing South Asian population history

The Indian Genome Variation Consortium, divides the population of South Asia into four ethnolinguistic (not genetic) groups: Indo-European, Dravidian, Tibeto-Burman and Austro-Asiatic.^[93]^[94]^[95]^[96]^[97] The molecular anthropology studies use three different type of markers: Mitochondrial DNA (mtDNA) variation which is maternally inherited and highly polymorphic, Y Chromosome variation which involves uniparental transmission along the male lines, and Autosomal DNA variation.^[5]^: 04

mtDNA variation

Most of the studies based on mtDNA variation have reported genetic unity of South Asian populations across language, caste and tribal groups.^[2]^[3]^[4] It is likely that haplogroup M was brought to Asia from East Africa along the southern route by earliest migration wave 78,000 years ago.^[2]

According to Kivisild et al. (1999), "Minor overlaps with lineages described in other Eurasian populations clearly demonstrate that recent immigrations have had very little impact on the innate structure of the maternal gene pool of South Asians. Despite the variations found within India, these populations stem from a limited number of founder lineages. These lineages were most likely introduced to South Asia during the Middle Palaeolithic, before the peopling of Europe 48,000 years ago and perhaps the Old World in general."^[2] Basu et al. (2003) also emphasises underlying unity of female lineages in India.^[75]

Y Chromosome variation

Conclusions based on Y Chromosome variation have been more varied than those based on mtDNA variation. While Kivisild et al.^[48] proposes an ancient and shared genetic heritage of male lineages in South Asia, Bamshad et al. (2001) suggests an affinity between South Asian male lineages and modern west Eurasians proportionate to upper-caste rank and places upper-caste populations of southern Indian states closer to East Europeans.^[98]

Basu et al. (2003) concludes that Austro–Asiatic tribal populations entered India first from the Northwest corridor and much later some of them through Northeastern corridor.^[75] Whereas, Kumar et al. (2007) analysed 25 South Asian Austro-Asiatic tribes and found a strong paternal genetic link among the sub-linguistic groups of the South Asian Austro-Asiatic populations.^[86] Mukherjee et al. (2001) places Pakistanis and North Indians between west Asian and Central Asian populations,^[99] whereas Cordaux et al. (2004) argues that the Indian caste populations are closer to Central Asian populations.^[58] Earlier studies like Sahoo et al. (2006) and Sengupta et al. (2006) suggested that Indian caste populations have not been subject to any recent admixtures.^[43]^[44]

Closest-neighbor analysis done by Mondal et al. in 2017 concluded that Indian Y-lineages are close to southern European populations and the time of divergence between the two at least in part predated Bronze-Age Steppe migration into India:^[84]

These results suggest that the European-related ancestry in Indian populations might be much older and more complex than anticipated, and might originate from the first wave of agriculturists or even earlier

— Mondal et al. 2017

Autosomal DNA variation

AASI-ANI-ASI

Results of studies based upon autosomal DNA variation have also been varied. In a major study (2009) using over 500,000 biallelic autosomal markers, Reich hypothesized that the modern South Asian population was the result of admixture between two genetically divergent ancestral populations dating from the post-Holocene era. These two "reconstructed" ancient populations he termed "Ancestral South Indians" (ASI) and "Ancestral North Indians" (ANI). According to Reich: "ANI ancestry is significantly higher in Indo-European than Dravidian speakers, suggesting that the ancestral ASI may have spoken a Dravidian language before mixing with the ANI." While the ANI is genetically close to Middle Easterners, Central Asians and Europeans, the ASI is not closely related to groups outside of the subcontinent. As no "ASI" ancient DNA is available, the indigenous Andamanese Onge are used as an (imperfect) proxy of ASI (according to Reich et al., the Andamanese, though distinct from them, are the closest living population to the ASI). According to Reich et al., both ANI and ASI ancestry are found all over the subcontinent (in both northern and southern India) in varying proportions, and that "ANI ancestry ranges from 39–71% in India, and is higher in traditionally upper caste and Indo-European speakers."^[10]

According to Gallego Romero et al. (2011), their research on lactose tolerance in India suggests that "the west Eurasian genetic contribution identified by Reich et al. (2009) principally reflects gene flow from Iran and the Middle East".^[100] Gallego Romero notes that Indians who are lactose-tolerant show a genetic pattern regarding this tolerance which is "characteristic of the common European mutation".^[101] According to Romero, this suggests that "the most common lactose tolerance mutation made a two-way migration out of the Middle East less than 10,000 years ago. While the mutation spread across Europe, another explorer must have brought the mutation eastward to India – likely traveling along the coast of the Persian Gulf where other pockets of the same mutation have been found."^[101]

Moorjani et al. 2013 state that the ASI, though not closely related to any living group, are "related (distantly) to indigenous Andaman Islanders." Moorjani et al. however suggest possible gene flow into the Andamanese from a population related to the ASI, causing the modelled relationship. The study concluded that "almost all groups speaking Indo-European or Dravidian languages lie along a gradient of varying relatedness to West-Eurasians in PCA (referred to as "Indian cline")".^[102]

A 2013 study by Chaubey using the single-nucleotide polymorphism (SNP), shows that the genome of Andamanese people (Onge) is closer to those of other Oceanic Negrito groups than to that of South Asians.^[103]

According to Basu et al. 2016, further analysis revealed that the genomic structure of mainland Indian populations is best explained by contributions from four ancestral components. In addition to the ANI and ASI, Basu et al. (2016) identified two East Asian ancestral components in mainland India that are major for the Austro-Asiatic-speaking tribals and the Tibeto-Burman speakers, which they denoted as AAA (for "Ancestral Austro-Asiatic") and ATB (for "Ancestral Tibeto-Burman") respectively. The study also infers that the populations of the Andaman Islands archipelago form a distinct ancestry, which "was found to be coancestral to Oceanic populations" but more distant from South Asians.^[36]

The cline of admixture between the ANI and ASI lineages is dated to the period of c. 4.2–1.9 kya by Moorjani et al. (2013), corresponding to the Indian Bronze Age, and associated by the authors with the process of deurbanisation of the Indus Valley civilisation and the population shift to the Gangetic system in the incipient Indian Iron Age.^[33] Basu et al. (2003) suggests that "Dravidian speakers were possibly widespread throughout India before the arrival of the Indo-European-speaking nomads" and that "formation of populations by fission that resulted in founder and drift effects have left their imprints on the genetic structures of contemporary populations".^[75] The geneticist PP Majumder (2010) has recently argued that the findings of Reich et al. (2009) are in remarkable concordance with previous research using mtDNA and Y-DNA:^[104]

Central Asian populations are supposed to have been major contributors to the Indian gene pool, particularly to the northern Indian gene pool, and the migrants had supposedly moved into India through what is now Afghanistan and Pakistan. Using mitochondrial DNA variation data collated from various studies, we have shown that populations of Central Asia and Pakistan show the lowest coefficient of genetic differentiation with the north Indian populations, a higher differentiation with the south Indian populations, and the highest with the northeast Indian populations. Northern Indian populations are genetically closer to Central Asians than populations of other geographical regions of India... . Consistent with the above findings, a recent study using over 500,000 biallelic autosomal markers has found a north to south gradient of genetic proximity of Indian populations to western Eurasians. This feature is likely related to the proportions of ancestry derived from the western Eurasian gene pool, which, as this study has shown, is greater in populations inhabiting northern India than those inhabiting southern India.

Chaubey et al. 2015 detected a distinctive East Asian ancestral component, mainly restricted to specific populations in the foothills of Himalaya and northeastern part of India. Highest frequency of the component is observed among the Tibeto-Burmese speaking groups of northeast India and was also detected in Andamanese populations at 32%, with substantial presence also among Austroasiatic speakers. It is found to be largely absent in Indo-European and Dravidian speakers, except in some specific ethnic groups living in the Himalayan foothills and central-south India.^[12] The researchers however suggested that the East Asian ancestry (represented by the Han) measured in the studied Andamanese groups may actually reflect the capture of the affinity of the Andamanese with Melanesians and Malaysian Negritos (rather than true East Asian admixture),^[12] as a previous study by Chaubey et al. suggested "a deep common ancestry" between Andamanese, Melanesians and other Negrito groups,^[12] and an affinity between Southeast Asian Negritos and Melanesians (as well as the Andamanese) with East Asians.^[103] Other studies also reveal varying degrees of admixture with Ancestral North Indian and Ancestral South Indian for Northeast Indians. Their East Asian-related ancestry is likewise more similar to Trans-Himalayan populations, who are situated between East and South Asia. However, there is evidence that Northeast India was initially populated by Dai-related Southern East Asians before mixing with Yakut-related Northern East Asians.^[105]^[106] Himalayan populations are also estimated to have input from Tibetan-related lineages, including non-Tibetan Tibeto-Burman speaking populations such as Naga, Tamang, Gurung, Bhutanese etc., and this Tibetan-related lineage can be modeled as a mixture of Late Neolithic Upper Yellow River-related ancestry (80–92%) and a deep lineage that is phylogenetically near the split between West and East Eurasian lineages ( 8–20%). Steppe-related ancestry is also found in northern Himalayan populations and was introduced about ~2700 to 3800 yr B.P.^[107]^[108]

Lazaridis et al. (2016) notes "The demographic impact of steppe related populations on South Asia was substantial, as the Mala, a south Indian Dalit population with minimal ANI (Ancestral North Indian) along the 'Indian Cline' of such ancestry is inferred to have ~ 18% steppe-related ancestry, while the Kalash of Pakistan are inferred to have ~ 50%, similar to present-day northern Europeans." The study estimated (6.5–50.2%) steppe-related admixture in South Asians. Lazaridis et al. further notes that "A useful direction of future research is a more comprehensive sampling of ancient DNA from steppe populations, as well as populations of central Asia (east of Iran and south of the steppe), which may reveal more proximate sources of the ANI than the ones considered here, and of South Asia to determine the trajectory of population change in the area directly.^[16]

Pathak et al. 2018 concluded that the Indo-European speakers of the Gangetic Plains and the Dravidian speakers have significant Yamnaya Early-Middle Bronze Age (Steppe_EMBA) ancestry but no Middle-Late Bronze Age Steppe (Steppe_MLBA) ancestry. On the other hand, the "North-Western Indian and Pakistani" populations (PNWI) showed significant Steppe_MLBA ancestry along with Yamnaya (Steppe_EMBA) ancestry. The study also noted that ancient South Asian samples had significantly higher Steppe_MLBA than Steppe_EMBA (or Yamnaya). The study also suggested that the Rors could be used as a proxy for the ANI.^[9]

David Reich in his 2018 book Who We Are and How We Got Here states that the 2016 analyses found the ASI to have significant amounts of an ancestry component deriving from Iranian farmers (about 25% of their ancestry), with the remaining 75% of their ancestry deriving from native South Asian hunter-gatherers. He adds that ASI were unlikely the local hunter-gatherers of South Asia as previously established, but a population responsible for spreading agriculture throughout South Asia. In the case of the ANI, the Iranian farmer ancestry is 50%, with the rest being from steppe groups related to the Yamnaya.^[109]

(Narasimhan et al. 2018), similarly, conclude that ANI and ASI were formed in the 2nd millennium BCE.^[26]^: 15 They were preceded by a mixture of AASI (ancient ancestral south Indian, i.e. hunter-gatherers sharing a distant root with the Andamanese, Australian Aboriginals, and East Asians); and Iranian agriculturalists who arrived in India c. 4700–3000 BCE, and "must have reached the Indus Valley by the 4th millennium BCE".^[26]^: 15 According to Narasimhan et al., this mixed population, which probably was native to the Indus Valley Civilisation, "contributed in large proportions to both the ANI and ASI", which took shape during the 2nd millennium BCE. ANI formed out of a mixture of "Indus Periphery-related groups" and migrants from the steppe, while ASI was formed out of "Indus Periphery-related groups" who moved south and mixed further with local hunter-gatherers. The ancestry of the ASI population is suggested to have averaged about 73% from the AASI and 27% from Iranian-related farmers. Narasimhan et al. observe that samples from the Indus periphery group are always mixes of the same two proximal sources of AASI and Iranian agriculturalist-related ancestry; with "one of the Indus Periphery individuals having ~42% AASI ancestry and the other two individuals having ~14–18% AASI ancestry" (with the remainder of their ancestry being from the Iranian agriculturalist-related population).^[26]^: 15 The authors propose that the AASI indigenous hunter-gatherers represent a divergent branch that split off around the same time that East Asian, Onge (Andamanese) and Australian Aboriginal ancestors separated from each other. It inferred, "essentially all the ancestry of present-day eastern and southern Asians (prior to West Eurasian-related admixture in southern Asians) derives from a single eastward spread, which gave rise in a short span of time to the lineages leading to AASI, East Asians, Onge, and Australians."^[26]^: 15

A genetic study by Yelmen et al. (2019) found that the native South Asian genetic component, termed the S-component, is distinct from the Andamanese, and that the Andamanese are an imperfect proxy for it. This component (when represented by the Andamanese Onge) was not detected in the northern Indian Gujarati samples, and hence they assumed that the South Indian tribal Paniya people (a group of predominantly ASI ancestry) would serve as a better source for the component in modern South Asians. However, unlike the Paniya samples, the S-component extracted from the tribal Irula samples were found to be devoid of any West Eurasian contribution, suggesting it to be a better representative of the native South Asian genetic component. Their improved results, based on "local ancestry deconvolution and masking of 500 samples from 25 South Asian populations via coalescent simulations", suggest that the AASI diverged from the common ancestor of Andamanese and East Asians shortly after these have diverged from West Eurasians. They also found that there were "multiple waves of West Eurasian arrival, as opposed to a simpler one wave scenario".^[7]

Two genetic studies (Narasimhan et al. 2019 & Shinde et al. 2019) analysing remains from the Indus Valley civilisation (of parts of Bronze Age Northwest India and East Pakistan), found them to have a mixture of ancestry, both from native South Asian hunter-gatherers sharing a distant root with the Andamanese, and from a group related to Iranian farmers. The samples analysed by Shinde derived about 50–98% of their genome from Iranian-related peoples and from 2–50% from native South Asian hunter-gatherers. The samples analysed by Narasimhan et al. had 45–82% of Iranian farmer-related ancestry and 11–50% of South Asian hunter-gatherer origin. The analysed samples of both studies have little to none of the "Steppe ancestry" component associated with later Indo-European migrations into India. The authors found that the respective amounts of those ancestries varied significantly between individuals, and concluded that more samples are needed to get the full picture of Indian population history.^[6]^[25]

Genetic distance between caste groups and tribes

Studies by Watkins et al. (2005) and Kivisild et al. (2003) based on autosomal markers conclude that Indian caste and tribal populations have a common ancestry.^[48]^[110] Reddy et al. (2005) found fairly uniform allele frequency distributions across caste groups of southern Andhra Pradesh, but significantly larger genetic distance between caste groups and tribes indicating genetic isolation of the tribes and castes.^[111]

Viswanathan et al. (2004) in a study on genetic structure and affinities among tribal populations of southern India concludes,^[112]

Genetic differentiation was high and genetic distances were not significantly correlated with geographic distances. Genetic drift therefore probably played a significant role in shaping the patterns of genetic variation observed in southern Indian tribal populations. Otherwise, analyses of population relationships showed that all Indian and South Asian populations are still similar to one another, regardless of phenotypic characteristics, and do not show any particular affinities to Africans. We conclude that the phenotypic similarities of some Indian groups to Africans do not reflect a close relationship between these groups, but are better explained by convergence.

A 2011 study published in the American Journal of Human Genetics^[32] indicates that Indian ancestral components are the result of a more complex demographic history than was previously thought. According to the researchers, South Asia harbours two major ancestral components, one of which is spread at comparable frequency and genetic diversity in populations of Central Asia, West Asia and Europe; the other component is more restricted to South Asia. However, if one were to rule out the possibility of a large-scale Indo-Aryan migration, these findings suggest that the genetic affinities of both Indian ancestral components are the result of multiple gene flows over the course of thousands of years.^[32]

Modeling of the observed haplotype diversities suggests that both Indian ancestry components are older than the purported Indo-Aryan invasion 3,500 YBP. Consistent with the results of pairwise genetic distances among world regions, Indians share more ancestry signals with West than with East Eurasians.

Narasimhan et al. 2019 found Austroasiatic-speaking Munda tribals could not be modelled simply as mixture of ASI, AASI, or ANI ancestry unlike other South Asians but required additional ancestry component from Southeast Asia. They were modelled as mixture of 64% AASI, and 36% East Asian-related ancestry, represented by the Nicobarese, thus the ancestry profile of the Mundas provides an independent line of ancestry from Southeast Asia around the 3rd millennium BCE.^[6] Lipson et al. 2018 found similar admixture results in regard to Munda tribals stating "we obtained a good fit with three ancestry components: one western Eurasian, one deep eastern Eurasian (interpreted as an indigenous South Asian lineage), and one from the Austroasiatic clade".^[113] Lipson et al. 2018 further found that the Austroasiatic source clad (proportion 35%) in Munda tribals was inferred to be closest to Mlabri.^[113] Singh et al. 2020 similarly found Austroasiatic speakers in South Asia fall out of the South Asian cline due to their Southeast Asian genetic affinity.^[114]

Origin of caste endogamy in India

Tournebize et al.^[115] analyse founder events spread across the world and write:

Our direct estimates of founder ages provide an independent line of evidence to understand the origin of endogamy in India. We inferred that these founder events occurred between ~120–3,500 years ago across 78 ethno-linguistic groups in India. Our dates are consistent with a previous smaller survey including 13 ethno-linguistic groups from India [18]. In a majority of the populations, the founder events occurred within the past 600–1,000 years, suggesting this period was integral to shaping endogamy in India.

Notes

↑ Srinath Perur (2 December 2014). "The origins of Indians. What our genes are telling us". Fountain Ink. https://fountainink.in/reportage/the-origins-of-indians.
"The origins of Indians. What our genes are telling us". Fountain Ink. December 2013. pp. 42–55. https://genetics.med.harvard.edu/reich/Reich_Lab/Press_files/Fountain%20Ink%20-%20December%202013%20-%20Cover.pdf.
↑ "How genetics is settling the Aryan migration debate". The Hindu. 16 June 2017. https://www.thehindu.com/sci-tech/science/how-genetics-is-settling-the-aryan-migration-debate/article19090301.ece.

References

↑ Schorkowitz, Dittmar; Chávez, John R.; Schröder, Ingo W. (28 September 2019) (in en). Shifting Forms of Continental Colonialism: Unfinished Struggles and Tensions. Springer Nature. p. 251-252. ISBN 978-981-13-9817-9. "The Indo-Aryans (the Eurasian Steppe people) brought with them the mastery of the chariot, an early version of Sanskrit, and various cultural practices, such as sacrificial rituals, that formed the basis of early Vedic-Hindu culture. ... The first two major migrations had thus culminated in the development of Harappa or Indus Valley civilization. The third, Indo-Aryan migration might have caused some amount of upheaval when it encountered the Indus Valley population. Consequently, some of the latter moved farther south, joined, and mixed with South Asian hunter-gatherers, the Ancient Ancestral South Indian (AASI), to create the Ancestral South Indian (ASI) population. The Indo-Aryan steppe pastoralists mixed with groups of the Indus Valley periphery living in the northern fringe, to create the Ancestral North Indian (ANI) branch. More migration into the Indian subcontinent occurred in later times, though mostly from East Asia. These groups assimilated with one of the two dominant groups. Thus, most of the South Asian populations carry either the lineage of ASI or ANI or a mixture of both."
↑ ^2.0 ^2.1 ^2.2 ^2.3 ^2.4 "The Place of the Indian Mitochondrial DNA Variants in the Global Network of Maternal Lineages and the Peopling of the Old World". Genomic Diversity. 1999. pp. 135–152. doi:10.1007/978-1-4615-4263-6_11. ISBN 978-1-4613-6914-1.
↑ ^3.0 ^3.1 "Mitochondrial DNA diversity in tribal and caste groups of Maharashtra (India) and its implication on their genetic origins". Annals of Human Genetics 68 (Pt 5): 453–460. September 2004. doi:10.1046/j.1529-8817.2004.00108.x. PMID 15469422.
↑ ^4.0 ^4.1 Science & Technology For Upsc. Tata McGraw-Hill Education. 2007. p. 595. ISBN 978-0-07-065548-5. https://books.google.com/books?id=CzV1MgFH6oMC&pg=PA595. Retrieved 24 May 2016.
↑ ^5.0 ^5.1 ^5.2 ^5.3 "Trends in Molecular Anthropological Studies in India". International Journal of Human Genetics 8 (1–2): 1–20. 4 September 2017. doi:10.1080/09723757.2008.11886015.
↑ ^6.00 ^6.01 ^6.02 ^6.03 ^6.04 ^6.05 ^6.06 ^6.07 ^6.08 ^6.09 ^6.10 "The formation of human populations in South and Central Asia". Science 365 (6457). September 2019. doi:10.1126/science.aat7487. PMID 31488661. Bibcode: 2019Sci...365t7487N.
↑ ^7.0 ^7.1 ^7.2 ^7.3 ^7.4 ^7.5 ^7.6 "Ancestry-Specific Analyses Reveal Differential Demographic Histories and Opposite Selective Pressures in Modern South Asian Populations". Molecular Biology and Evolution 36 (8): 1628–1642. August 2019. doi:10.1093/molbev/msz037. PMID 30952160. "The two main components (i.e., autochthonous South Asian and West Eurasian) of Indian genetic variation form one of the deepest splits among non-African groups, which took place when South Asian populations separated from East Asian and Andamanese populations, shortly after having separated from West Eurasian populations (Mondal et al. 2016; Narasimhan et al. 2018).".
↑ ^8.0 ^8.1 ^8.2 "A genetic history of migration, diversification, and admixture in Asia" (in en). Human Population Genetics and Genomics 2 (1): 1–32. 6 January 2022. doi:10.47248/hpgg2202010001. ISSN 2770-5005. https://www.pivotscipub.com/hpgg/2/1/0001/html. "The branches predominantly associated with present-day Asian populations include the Ancient Ancestral South Indian (AASI) lineage, Australasian (AA) lineage, and East and Southeast Asian (ESEA) lineage.".
↑ ^9.0 ^9.1 ^9.2 ^9.3 ^9.4 Pathak, Ajai K.; Kadian, Anurag; Kushniarevich, Alena; Montinaro, Francesco; Mondal, Mayukh; Ongaro, Linda; Singh, Manvendra; Kumar, Pramod et al. (6 December 2018). "The Genetic Ancestry of Modern Indus Valley Populations from Northwest India". The American Journal of Human Genetics 103 (6): 918–929. doi:10.1016/j.ajhg.2018.10.022. ISSN 0002-9297. PMID 30526867. "A previous ancient-DNA study has suggested that the Iran_N and Steppe_EMBA groups are the best proxies for the ancient West Eurasian component in South Asians. The study also suggested that most South Asians can be modeled as a mixture of these two groups but also have Onge- and Han-related ancestries.".
↑ ^10.0 ^10.1 ^10.2 ^10.3 "Reconstructing Indian population history". Nature 461 (7263): 489–494. September 2009. doi:10.1038/nature08365. PMID 19779445. Bibcode: 2009Natur.461..489R.
↑ ^11.0 ^11.1 ^11.2 ^11.3 "Early Neolithic genomes from the eastern Fertile Crescent". Science 353 (6298): 499–503. July 2016. doi:10.1126/science.aaf7943. PMID 27417496. Bibcode: 2016Sci...353..499B. ; Lay summary in: "Prehistoric genomes from the world's first farmers in the Zagros mountains reveal different Neolithic ancestry for Europeans and South Asians". https://www.sciencedaily.com/releases/2016/07/160714151201.htm. "The research team found that the Iranian genomes represent the main ancestors of modern-day South Asians. ...the Zagros people of the Neolithic eastern Fertile Crescent that are ancestral to most modern South Asians..."
↑ ^12.0 ^12.1 ^12.2 ^12.3 ^12.4 "East Asian ancestry in India". Indian Journal of Physical Anthropology and Human Genetics 34 (2): 193–199. January 2015. https://serialsjournals.com/abstract/78963_2.pdf. "Here the analysis of genome wide data on Indian and East/Southeast Asian demonstrated their restricted distinctive ancestry in India mainly running along the foothills of Himalaya and northeastern part.".
↑ ^13.0 ^13.1 "Population genetic structure in Indian Austroasiatic speakers: the role of landscape barriers and sex-specific admixture". Molecular Biology and Evolution 28 (2): 1013–1024. February 2011. doi:10.1093/molbev/msq288. PMID 20978040.
↑ "Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana". PLOS ONE 10 (6). 10 June 2015. doi:10.1371/journal.pone.0127655. PMID 26061398. Bibcode: 2015PLoSO..1027655C.
↑ ^15.0 ^15.1 ^15.2 "Investigating the West Eurasian ancestry of Pakistani Hazaras". Journal of Genetics 98 (2). June 2019. doi:10.1007/s12041-019-1093-2. PMID 31204712.
↑ ^16.0 ^16.1 ^16.2 ^16.3 ^16.4 "Genomic insights into the origin of farming in the ancient Near East". Nature 536 (7617): 419–424. August 2016. doi:10.1038/nature19310. PMID 27459054. Bibcode: 2016Natur.536..419L.
↑ "Unravelling the distinct strains of Tharu ancestry". European Journal of Human Genetics 22 (12): 1404–1412. December 2014. doi:10.1038/ejhg.2014.36. PMID 24667789.
↑ ^18.0 ^18.1 "Demographic History and Genetic Adaptation in the Himalayan Region Inferred from Genome-Wide SNP Genotypes of 49 Populations". Molecular Biology and Evolution 35 (8): 1916–1933. August 2018. doi:10.1093/molbev/msy094. PMID 29796643.
↑ ^19.0 ^19.1 Bennett, E. Andrew; Liu, Yichen; Fu, Qiaomei (3 December 2024). "Reconstructing the Human Population History of East Asia through Ancient Genomics" (in en). Elements in Ancient East Asia. doi:10.1017/9781009246675. ISBN 978-1-009-24667-5. https://www.cambridge.org/core/elements/reconstructing-the-human-population-history-of-east-asia-through-ancient-genomics/0524D629660B5E43FC7094C043D54C6A.
↑ Aoki, Kenichi; Takahata, Naoyuki; Oota, Hiroki; Wakano, Joe Yuichiro; Feldman, Marcus W. (30 August 2023). "Infectious diseases may have arrested the southward advance of microblades in Upper Palaeolithic East Asia". Proceedings of the Royal Society B: Biological Sciences 290 (2005). doi:10.1098/rspb.2023.1262. PMID 37644833. "A single major migration of modern humans into the continents of Asia and Sahul was strongly supported by earlier studies using mitochondrial DNA, the non-recombining portion of Y chromosomes, and autosomal SNP data [42–45]. Ancestral Ancient South Indians with no West Eurasian relatedness, East Asians, Onge (Andamanese hunter–gatherers) and Papuans all derive in a short evolutionary time from the eastward dispersal of an out-of-Africa population [46,47]".
↑ Aragon, Jose A. Urban; Bandyopadhyay, Esha; Fernando, Amali S.; Castro, Constanza de la Fuente; Welikala, Anjana H. J.; Biddanda, Arjun; Witonsky, David; Sander, Nathan et al. (9 June 2025). "Population histories of the Indigenous Adivasi and Sinhalese from Sri Lanka using whole genomes" (in English). Current Biology 35 (11): 2554–2566.e7. doi:10.1016/j.cub.2025.04.039. ISSN 0960-9822. PMID 40494279. Bibcode: 2025CBio...35.2554U. "The high levels of the AASI-like genetic ancestry, an unsampled basal Asian lineage that shares deep ancestry with ancestral East Asians,19 in the Adivasi could partially explain the extra allele sharing between them and populations with East Asian-related genetic ancestry.".
↑ "Papuan mitochondrial genomes and the settlement of Sahul". Journal of Human Genetics 65 (10): 875–887. October 2020. doi:10.1038/s10038-020-0781-3. PMID 32483274.
↑ "Genome-wide data substantiate Holocene gene flow from India to Australia". Proceedings of the National Academy of Sciences of the United States of America 110 (5): 1803–1808. January 2013. doi:10.1073/pnas.1211927110. PMID 23319617. Bibcode: 2013PNAS..110.1803P.
↑ "A genomic view of the peopling and population structure of India". Cold Spring Harbor Perspectives in Biology 7 (4). August 2014. doi:10.1101/cshperspect.a008540. PMID 25147176.
↑ ^25.0 ^25.1 ^25.2 ^25.3 "An Ancient Harappan Genome Lacks Ancestry from Steppe Pastoralists or Iranian Farmers". Cell 179 (3): 729–735.e10. October 2019. doi:10.1016/j.cell.2019.08.048. PMID 31495572.
↑ ^26.0 ^26.1 ^26.2 ^26.3 ^26.4 "The formation of human populations in South and Central Asia". Science 365 (6457). 2019. doi:10.1126/science.aat7487. PMID 31488661. Bibcode: 2019Sci...365t7487N.
↑ ^27.0 ^27.1 "Genetic study of Dravidian castes of Tamil Nadu". Journal of Genetics 87 (2): 175–9. August 2008. doi:10.1007/s12041-008-0027-1. PMID 18776648.
↑ ^28.0 ^28.1 "Genetic structure in the Sherpa and neighboring Nepalese populations". BMC Genomics 18 (1). January 2017. doi:10.1186/s12864-016-3469-5. PMID 28103797.
↑ ^29.0 ^29.1 (in en) The Evolution and History of Human Populations in South Asia: Inter-disciplinary Studies in Archaeology, Biological Anthropology, Linguistics and Genetics. Springer Science & Business Media. 22 May 2007. ISBN 978-1-4020-5562-1. https://books.google.com/books?id=Qm9GfjNlnRwC&pg=PA201.
↑ ^30.0 ^30.1 Vallini, Leonardo; Zampieri, Carlo; Shoaee, Mohamed Javad; Bortolini, Eugenio; Marciani, Giulia; Aneli, Serena; Pievani, Telmo; Benazzi, Stefano et al. (25 March 2024). "The Persian plateau served as hub for Homo sapiens after the main out of Africa dispersal" (in en). Nature Communications 15 (1): 1882. doi:10.1038/s41467-024-46161-7. ISSN 2041-1723. PMID 38528002. Bibcode: 2024NatCo..15.1882V.
↑ "Genetics and Material Culture Support Repeated Expansions into Paleolithic Eurasia from a Population Hub Out of Africa". Genome Biology and Evolution 14 (4). April 2022. doi:10.1093/gbe/evac045. PMID 35445261.
↑ ^32.0 ^32.1 ^32.2 "Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia". American Journal of Human Genetics 89 (6): 731–744. December 2011. doi:10.1016/j.ajhg.2011.11.010. PMID 22152676.
↑ ^33.0 ^33.1 "Genetic evidence for recent population mixture in India". American Journal of Human Genetics 93 (3): 422–438. September 2013. doi:10.1016/j.ajhg.2013.07.006. PMID 23932107.
↑ "A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals". BMC Evolutionary Biology 17 (1). March 2017. doi:10.1186/s12862-017-0936-9. PMID 28335724. Bibcode: 2017BMCEE..17...88S.
↑ Kerdoncuff, Elise; Skov, Laurits; Patterson, Nick et al. (2025). "50,000 years of evolutionary history of India: Impact on health and disease variation". Cell 188 (13): 3389–3404. doi:10.1016/j.cell.2025.04.027. PMID 40578318.
↑ ^36.0 ^36.1 "Genomic reconstruction of the history of extant populations of India reveals five distinct ancestral components and a complex structure". Proceedings of the National Academy of Sciences of the United States of America 113 (6): 1594–1599. February 2016. doi:10.1073/pnas.1513197113. PMID 26811443. Bibcode: 2016PNAS..113.1594B.
↑ Zhang, X.; Liao, S.; Qi, X. et al. (2015). "Y-chromosome diversity suggests southern origin and Paleolithic backwave migration of Austro-Asiatic speakers from eastern Asia to the Indian subcontinent". Scientific Reports 5. doi:10.1038/srep15486. PMID 26482917. Bibcode: 2015NatSR...515486Z.
↑ Arunkumar, G. (2015). "A late Neolithic expansion of Y chromosomal haplogroup O2a1-M95 from east to west". Journal of Systematics and Evolution 53 (6): 546–560. doi:10.1111/jse.12147. Bibcode: 2015JSyEv..53..546A.
↑ Tätte, Kai; Pagani, Luca; Pathak, Ajai K.; Kõks, Sulev; Ho Duy, Binh; Ho, Xuan Dung; Sultana, Gazi Nurun Nahar; Sharif, Mohd Istiaq et al. (7 March 2019). "The genetic legacy of continental scale admixture in Indian Austroasiatic speakers" (in en). Scientific Reports 9 (1): 3818. doi:10.1038/s41598-019-40399-8. ISSN 2045-2322. PMID 30846778. Bibcode: 2019NatSR...9.3818T.
↑ ^40.0 ^40.1 ^40.2 ^40.3 ^40.4 An Indian Ancestry: a Key for Understanding Human Diversity in Europe and Beyond. McDonald Institute Monographs. 2000. http://evolutsioon.ut.ee/publications/Kivisild2000.pdf. Retrieved 11 November 2005.
↑ ^41.0 ^41.1 ^41.2 "Y Haplogroups of the World". 2004. http://www.scs.uiuc.edu/~mcdonald/WorldHaplogroupsMaps.pdf.
↑ ^42.0 ^42.1 "A late Neolithic expansion of Y chromosomal haplogroup O2a1-M95 from east to west" (in en). Journal of Systematics and Evolution 53 (6): 546–560. 2015. doi:10.1111/jse.12147. Bibcode: 2015JSyEv..53..546A.
↑ ^43.00 ^43.01 ^43.02 ^43.03 ^43.04 ^43.05 ^43.06 ^43.07 ^43.08 ^43.09 ^43.10 ^43.11 ^43.12 ^43.13 ^43.14 ^43.15 ^43.16 ^43.17 "Polarity and temporality of high-resolution y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of Central Asian pastoralists". American Journal of Human Genetics 78 (2): 202–221. February 2006. doi:10.1086/499411. PMID 16400607. Bibcode: 2006AmJHG..78..202S.
↑ ^44.0 ^44.1 ^44.2 ^44.3 ^44.4 ^44.5 ^44.6 "A prehistory of Indian Y chromosomes: evaluating demic diffusion scenarios". Proceedings of the National Academy of Sciences of the United States of America 103 (4): 843–848. January 2006. doi:10.1073/pnas.0507714103. PMID 16415161. Bibcode: 2006PNAS..103..843S.
↑ ^45.0 ^45.1 ^45.2 ^45.3 ^45.4 ^45.5 "Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA". BMC Genetics 7. August 2006. doi:10.1186/1471-2156-7-42. PMID 16893451.
↑ ^46.0 ^46.1 ^46.2 "Presence of three different paternal lineages among North Indians: a study of 560 Y chromosomes". Annals of Human Biology 36 (1): 46–59. 2009. doi:10.1080/03014460802558522. PMID 19058044.
↑ Reich 2018.
↑ ^48.00 ^48.01 ^48.02 ^48.03 ^48.04 ^48.05 ^48.06 ^48.07 ^48.08 ^48.09 ^48.10 ^48.11 ^48.12 ^48.13 Kivisild, T.; Rootsi, S.; Metspalu, M.; Mastana, S.; Kaldma, K.; Parik, J.; Metspalu, E.; Adojaan, M. et al. (February 2003). "The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations". American Journal of Human Genetics 72 (2): 313–332. doi:10.1086/346068. ISSN 0002-9297. PMID 12536373. Bibcode: 2003AmJHG..72..313K.
↑ Singh, Mugdha; Sarkar, Anujit; Nandineni, Madhusudan R. (18 October 2018). "A comprehensive portrait of Y-STR diversity of Indian populations and comparison with 129 worldwide populations" (in en). Scientific Reports 8 (1): 15421. doi:10.1038/s41598-018-33714-2. ISSN 2045-2322. PMID 30337554. Bibcode: 2018NatSR...815421S.
↑ "X-chromosomal STR based genetic polymorphisms and demographic history of Sri Lankan ethnicities and their relationship with global populations". Scientific Reports 11 (1). June 2021. doi:10.1038/s41598-021-92314-9. PMID 34140598. Bibcode: 2021NatSR..1112748P.
↑ The Real Eve: Modern Man's Journey out of Africa. New York: Carroll and Graf Publishers. 2003. ISBN 978-0-7867-1192-5.
↑ ^52.0 ^52.1 "Comparative analysis of cancer genes in the human and chimpanzee genomes". BMC Genomics 7. January 2006. doi:10.1186/1471-2164-7-15. PMID 16438707.
↑ ^53.0 ^53.1 ^53.2 "Most of the extant mtDNA boundaries in south and southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans". BMC Genetics 5 (1). August 2004. doi:10.1186/1471-2156-5-26. PMID 15339343. Bibcode: 2004BMCGe...5...26M.
↑ "Deep common ancestry of indian and western-Eurasian mitochondrial DNA lineages". Current Biology 9 (22): 1331–1334. November 1999. doi:10.1016/s0960-9822(00)80057-3. PMID 10574762. Bibcode: 1999CBio....9.1331K.
↑ Mahal, David G.; Matsoukas, Ianis G. (2018). "The Geographic Origins of Ethnic Groups in the Indian Subcontinent: Exploring Ancient Footprints with Y-DNA Haplogroups". Frontiers in Genetics 9. doi:10.3389/fgene.2018.00004. PMID 29410676.
↑ Tariq, Muhammad; Ahmad, Habib; Hemphill, Brian E.; Farooq, Umar; Schurr, Theodore G. (2022). "Contrasting maternal and paternal genetic histories among five ethnic groups from Khyber Pakhtunkhwa, Pakistan". Scientific Reports 12 (1): 1027. doi:10.1038/s41598-022-05076-3. PMID 35046511. Bibcode: 2022NatSR..12.1027T.
↑ "Y-DNA Haplogroup H and its Subclades – 2015". http://www.isogg.org/tree/ISOGG_HapgrpH.html.
↑ ^58.0 ^58.1 ^58.2 ^58.3 ^58.4 ^58.5 "Independent origins of Indian caste and tribal paternal lineages". Current Biology 14 (3): 231–235. February 2004. doi:10.1016/j.cub.2004.01.024. PMID 14761656. Bibcode: 2004CBio...14..231C.
↑ ^59.0 ^59.1 ^59.2 "Dissecting the influence of Neolithic demic diffusion on Indian Y-chromosome pool through J2-M172 haplogroup". Scientific Reports 6 (1). January 2016. doi:10.1038/srep19157. PMID 26754573. Bibcode: 2016NatSR...619157S.
↑ (in en) Ancestral DNA, Human Origins, and Migrations. Academic Press. 2018. p. 250. ISBN 978-0-12-804128-4. https://books.google.com/books?id=ZF1gDwAAQBAJ&q=Ancestral+DNA+Human+Origins+and+Migrations+J2b-M102+South+Asia&pg=PA250.
↑ ^61.0 ^61.1 "Population differentiation of southern Indian male lineages correlates with agricultural expansions predating the caste system". PLOS ONE 7 (11). 2012. doi:10.1371/journal.pone.0050269. PMID 23209694. Bibcode: 2012PLoSO...750269A.
↑ ^62.0 ^62.1 ^62.2 ^62.3 ^62.4 "The Indian origin of paternal haplogroup R1a1* substantiates the autochthonous origin of Brahmins and the caste system". Journal of Human Genetics 54 (1): 47–55. January 2009. doi:10.1038/jhg.2008.2. PMID 19158816.
↑ "Diverse genetic origin of Indian Muslims: evidence from autosomal STR loci". Journal of Human Genetics 54 (6): 340–8. June 2009. doi:10.1038/jhg.2009.38. PMID 19424286.
↑ ^64.0 ^64.1 ^64.2 ^64.3 "Y-chromosomal DNA variation in Pakistan". American Journal of Human Genetics 70 (5): 1107–1124. May 2002. doi:10.1086/339929. PMID 11898125.
↑ ^65.0 ^65.1 "Indian Siddis: African descendants with Indian admixture". American Journal of Human Genetics 89 (1): 154–161. July 2011. doi:10.1016/j.ajhg.2011.05.030. PMID 21741027.
↑ ^66.0 ^66.1 ^66.2 "Indian Ocean crossroads: human genetic origin and population structure in the Maldives". American Journal of Physical Anthropology 151 (1): 58–67. May 2013. doi:10.1002/ajpa.22256. PMID 23526367. Bibcode: 2013AJPA..151...58P.
↑ (in en) Deep Ancestry: The Landmark DNA Quest to Decipher Our Distant Past. National Geographic Books. 20 November 2007. pp. 161–162. ISBN 978-1-4262-0211-7. https://books.google.com/books?id=NWgDAQAAQBAJ. "This part of the M9 Eurasian clan migrated south once they reached the rugged and mountainous Pamir Knot region. The man who gave rise to marker M20 was possibly born in India or the Middle East. His ancestors arrived in India around 30,000 years ago and represent the earliest significant settlement of India."
↑ (in en) The Journey of Man: A Genetic Odyssey. Princeton University Press. 28 March 2017. pp. 111–113. ISBN 978-0-691-17601-7. https://books.google.com/books?id=Sus9DwAAQBAJ.
↑ ^69.0 ^69.1 "A population genetics perspective of the Indus Valley through uniparentally-inherited markers". Annals of Human Biology 32 (2): 154–162. 2005. doi:10.1080/03014460500076223. PMID 16096211.
↑ "Presence of three different paternal lineages among North Indians: a study of 560 Y chromosomes". Annals of Human Biology 36 (1): 46–59. 2009. doi:10.1080/03014460802558522. PMID 19058044.
↑ "The influence of natural barriers in shaping the genetic structure of Maharashtra populations". PLOS ONE 5 (12). December 2010. doi:10.1371/journal.pone.0015283. PMID 21187967. Bibcode: 2010PLoSO...515283T.
↑ "The Geographic Origins of Ethnic Groups in the Indian Subcontinent: Exploring Ancient Footprints with Y-DNA Haplogroups". Frontiers in Genetics 9. 23 January 2018. doi:10.3389/fgene.2018.00004. PMID 29410676.
↑ ^73.0 ^73.1 "Y-STR Haplogroup Diversity in the Jat Population Reveals Several Different Ancient Origins". Frontiers in Genetics 8. 20 September 2017. doi:10.3389/fgene.2017.00121. PMID 28979290.
↑ "The influence of natural barriers in shaping the genetic structure of Maharashtra populations". PLOS ONE 5 (12). December 2010. doi:10.1371/journal.pone.0015283. PMID 21187967. Bibcode: 2010PLoSO...515283T.
↑ ^75.0 ^75.1 ^75.2 ^75.3 "Ethnic India: a genomic view, with special reference to peopling and structure". Genome Research 13 (10): 2277–2290. October 2003. doi:10.1101/gr.1413403. PMID 14525929.
↑ ^76.0 ^76.1 ^76.2 "Y-chromosomal evidence for a limited Greek contribution to the Pathan population of Pakistan". European Journal of Human Genetics 15 (1): 121–126. January 2007. doi:10.1038/sj.ejhg.5201726. PMID 17047675.
↑ ^77.0 ^77.1 "The Genetics of Language and Farming Spread in India". Examining the farming/language dispersal hypothesis. McDonald Institute for Archaeological Research, Cambridge, United Kingdom. 2003. pp. 215–222. http://evolutsioon.ut.ee/publications/Kivisild2003a.pdf. Retrieved 11 November 2005.
↑ ^78.0 ^78.1 ^78.2 ^78.3 ^78.4 "Separating the post-Glacial coancestry of European and Asian Y chromosomes within haplogroup R1a". European Journal of Human Genetics 18 (4): 479–484. April 2010. doi:10.1038/ejhg.2009.194. PMID 19888303.
↑ "Y-chromosome distribution within the geo-linguistic landscape of northwestern Russia". European Journal of Human Genetics 17 (10): 1260–1273. October 2009. doi:10.1038/ejhg.2009.6. PMID 19259129.
↑ ^80.0 ^80.1 ^80.2 "The phylogenetic and geographic structure of Y-chromosome haplogroup R1a". European Journal of Human Genetics 23 (1): 124–131. January 2015. doi:10.1038/ejhg.2014.50. PMID 24667786.
↑ ^81.0 ^81.1 ^81.2 "Brief communication: new Y-chromosome binary markers improve phylogenetic resolution within haplogroup R1a1". American Journal of Physical Anthropology 149 (4): 611–615. December 2012. doi:10.1002/ajpa.22167. PMID 23115110. Bibcode: 2012AJPA..149..611P.
↑ "Mitochondrial and Y-chromosome diversity of the Tharus (Nepal): a reservoir of genetic variation". BMC Evolutionary Biology 9 (1). July 2009. doi:10.1186/1471-2148-9-154. PMID 19573232. Bibcode: 2009BMCEE...9..154F.
↑ "A Synthesis of Haplogroup R2". 2006. http://www.ethnoancestry.com/index_files/index_data/Haplogroup_R2_Manoukian.pdf.
↑ ^84.0 ^84.1 "Y-chromosomal sequences of diverse Indian populations and the ancestry of the Andamanese". Human Genetics 136 (5): 499–510. May 2017. doi:10.1007/s00439-017-1800-0. PMID 28444560.
↑ Amjadi, Motahareh Ala; Özdemir, Yusuf Can; Ramezani, Maryam; Jakab, Kristóf; Megyes, Melinda; Bibak, Arezoo; Salehi, Zeinab; Hayatmehar, Zahra et al. (13 May 2025). "Ancient DNA indicates 3,000 years of genetic continuity in the Northern Iranian Plateau, from the Copper Age to the Sassanid Empire" (in en). Scientific Reports 15 (1): 16530. doi:10.1038/s41598-025-99743-w. ISSN 2045-2322. PMID 40360796. Bibcode: 2025NatSR..1516530A.
↑ ^86.0 ^86.1 ^86.2 ^86.3 ^86.4 ^86.5 "Y-chromosome evidence suggests a common paternal heritage of Austro-Asiatic populations". BMC Evolutionary Biology 7 (1). March 2007. doi:10.1186/1471-2148-7-47. PMID 17389048. Bibcode: 2007BMCEE...7...47K.
↑ "Traces of sub-Saharan and Middle Eastern lineages in Indian Muslim populations". European Journal of Human Genetics 18 (3): 354–363. March 2010. doi:10.1038/ejhg.2009.168. PMID 19809480.
↑ ^88.0 ^88.1 "Austro-Asiatic tribes of Northeast India provide hitherto missing genetic link between South and Southeast Asia". PLOS ONE 2 (11). November 2007. doi:10.1371/journal.pone.0001141. PMID 17989774. Bibcode: 2007PLoSO...2.1141R.
↑ ^89.0 ^89.1 ^89.2 "Y-chromosome diversity suggests southern origin and Paleolithic backwave migration of Austro-Asiatic speakers from eastern Asia to the Indian subcontinent". Scientific Reports 5 (1). October 2015. doi:10.1038/srep15486. PMID 26482917. Bibcode: 2015NatSR...515486Z.
↑ ^90.0 ^90.1 ^90.2 "Y chromosome haplotypes reveal prehistorical migrations to the Himalayas". Human Genetics 107 (6): 582–590. December 2000. doi:10.1007/s004390000406. PMID 11153912.
↑ "Y-chromosome evidence of southern origin of the East Asian-specific haplogroup O3-M122". American Journal of Human Genetics 77 (3): 408–419. September 2005. doi:10.1086/444436. PMID 16080116. Bibcode: 2005AmJHG..77..408S.
↑ "The Himalayas as a directional barrier to gene flow". American Journal of Human Genetics 80 (5): 884–894. May 2007. doi:10.1086/516757. PMID 17436243.
↑ Indian Genome Variation Consortium (April 2008). "Genetic landscape of the people of India: a canvas for disease gene exploration". Journal of Genetics 87 (1): 3–20. doi:10.1007/s12041-008-0002-x. PMID 18560169.
↑ "The Place of the Indian mtDNA Variants in the Global Network of Maternal Lineages and the Peopling of the Old World". http://www.imtech.res.in/raghava/reprints/IGVdb.pdf.
↑ "Ethnologue report for Indo-European". Ethnologue.com. http://www.ethnologue.com/show_family.asp?subid=2-16.
↑ Linguistic Change and Reconstruction Methodology. Walter de Gruyter. 1990. p. 342. ISBN 978-3-11-011908-4.
↑ "Languages and language families in China". Encyclopedia of Chinese Language and Linguistics. Leiden: Brill. 2015. doi:10.1163/2210-7363_ecll_COM_00000219. https://www.academia.edu/1542763. "MK in the wider sense including the Munda languages of eastern South Asia is also known as Austroasiatic."
↑ "Genetic evidence on the origins of Indian caste populations". Genome Research 11 (6): 994–1004. June 2001. doi:10.1101/gr.GR-1733RR. PMID 11381027.
↑ "High-resolution analysis of Y-chromosomal polymorphisms reveals signatures of population movements from Central Asia and West Asia into India". Journal of Genetics 80 (3): 125–135. December 2001. doi:10.1007/BF02717908. PMID 11988631.
↑ "Herders of Indian and European cattle share their predominant allele for lactase persistence". Molecular Biology and Evolution 29 (1): 249–260. January 2012. doi:10.1093/molbev/msr190. PMID 21836184.
↑ ^101.0 ^101.1 "Lactose Tolerance in the Indian Dairyland". ScienceLife. University of Chicago Medicine & Biological Sciences. 2011. http://sciencelife.uchospitals.edu/2011/09/14/lactose-tolerance-in-the-indian-dairyland/.
↑ "Genetic evidence for recent population mixture in India". American Journal of Human Genetics 93 (3): 422–438. September 2013. doi:10.1016/j.ajhg.2013.07.006. PMID 23932107.
↑ ^103.0 ^103.1 "The Andaman Islanders in a regional genetic context: reexamining the evidence for an early peopling of the archipelago from South Asia". Human Biology 85 (1–3): 153–172. June 2013. doi:10.3378/027.085.0307. PMID 24297224. https://digitalcommons.wayne.edu/cgi/viewcontent.cgi?article=2055&context=humbiol.
↑ "The human genetic history of South Asia". Current Biology 20 (4): R184–R187. February 2010. doi:10.1016/j.cub.2009.11.053. PMID 20178765. Bibcode: 2010CBio...20.R184M.
↑ Tagore, Debashree; Majumder, Partha P.; Chatterjee, Anupam; Basu, Analabha (2022). "Multiple migrations from East Asia led to linguistic transformation in NorthEast India and mainland Southeast Asia". Frontiers in Genetics 13. doi:10.3389/fgene.2022.1023870. PMID 36303544.
↑ Bankura, Biswabandhu; Basak, Bishnupriya; Singh, Prajjval Pratap et al. (2026). "Northeast india: Genetic inconsistency across ethnicity and geography". Molecular Genetics and Genomics 301 (1). doi:10.1007/s00438-026-02358-7. PMID 41619049.
↑ Liu, Chi-Chun; Witonsky, David; Gosling, Anna et al. (2022). "Ancient genomes from the Himalayas illuminate the genetic history of Tibetans and their Tibeto-Burman speaking neighbors". Nature Communications 13 (1203). doi:10.1038/s41467-022-28827-2. PMC 8904508. Bibcode: 2022NatCo..13.1203L. https://www.nature.com/articles/s41467-022-28827-2#Sec9.
↑ Bandyopadhyay, Esha; Witonsky, David; Castro, Constanza de la Fuente et al. (2025). "Dynamic human admixture histories over the past ~1300 years at the northern Himalayan frontier". Science Advances 11 (44). doi:10.1126/sciadv.adu9625. PMID 41160688. Bibcode: 2025SciA...11.9625B.
↑ Reich 2018, pp. 149–152.
↑ "Diversity and divergence among the tribal populations of India". Annals of Human Genetics 69 (Pt 6): 680–692. November 2005. doi:10.1046/j.1529-8817.2005.00200.x. PMID 16266407.
↑ "Microsatellite diversity in Andhra Pradesh, India: genetic stratification versus social stratification". Human Biology 77 (6): 803–823. December 2005. doi:10.1353/hub.2006.0018. PMID 16715839.
↑ "Genetic structure and affinities among tribal populations of southern India: a study of 24 autosomal DNA markers". Annals of Human Genetics 68 (Pt 2): 128–138. March 2004. doi:10.1046/j.1529-8817.2003.00083.x. PMID 15008792.
↑ ^113.0 ^113.1 "Ancient genomes document multiple waves of migration in Southeast Asian prehistory". Science 361 (6397): 92–95. July 2018. doi:10.1126/science.aat3188. PMID 29773666. Bibcode: 2018Sci...361...92L.
↑ "Dissecting the paternal founders of Mundari (Austroasiatic) speakers associated with the language dispersal in South Asia". European Journal of Human Genetics 29 (3): 528–532. March 2021. doi:10.1038/s41431-020-00745-1. PMID 33087879.
↑ "Reconstructing the history of founder events using genome-wide patterns of allele sharing across individuals". PLOS Genetics 18 (6). June 2022. doi:10.1371/journal.pgen.1010243. PMID 35737729.

External links

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Genetics and archaeogenetics of South Asia. Read more

[Perur-31] Srinath Perur (2 December 2014). "The origins of Indians. What our genes are telling us". Fountain Ink. https://fountainink.in/reportage/the-origins-of-indians.
"The origins of Indians. What our genes are telling us". Fountain Ink. December 2013. pp. 42–55. https://genetics.med.harvard.edu/reich/Reich_Lab/Press_files/Fountain%20Ink%20-%20December%202013%20-%20Cover.pdf.

[48] "How genetics is settling the Aryan migration debate". The Hindu. 16 June 2017. https://www.thehindu.com/sci-tech/science/how-genetics-is-settling-the-aryan-migration-debate/article19090301.ece.

[SchorkowitzChávezSchröder2019-1] Schorkowitz, Dittmar; Chávez, John R.; Schröder, Ingo W. (28 September 2019) (in en). Shifting Forms of Continental Colonialism: Unfinished Struggles and Tensions. Springer Nature. p. 251-252. ISBN 978-981-13-9817-9. "The Indo-Aryans (the Eurasian Steppe people) brought with them the mastery of the chariot, an early version of Sanskrit, and various cultural practices, such as sacrificial rituals, that formed the basis of early Vedic-Hindu culture. ... The first two major migrations had thus culminated in the development of Harappa or Indus Valley civilization. The third, Indo-Aryan migration might have caused some amount of upheaval when it encountered the Indus Valley population. Consequently, some of the latter moved farther south, joined, and mixed with South Asian hunter-gatherers, the Ancient Ancestral South Indian (AASI), to create the Ancestral South Indian (ASI) population. The Indo-Aryan steppe pastoralists mixed with groups of the Indus Valley periphery living in the northern fringe, to create the Ancestral North Indian (ANI) branch. More migration into the Indian subcontinent occurred in later times, though mostly from East Asia. These groups assimilated with one of the two dominant groups. Thus, most of the South Asian populations carry either the lineage of ASI or ANI or a mixture of both."

[Kivisild_1999b-2] 2.0 ^2.1 ^2.2 ^2.3 ^2.4 "The Place of the Indian Mitochondrial DNA Variants in the Global Network of Maternal Lineages and the Peopling of the Old World". Genomic Diversity. 1999. pp. 135–152. doi:10.1007/978-1-4615-4263-6_11. ISBN 978-1-4613-6914-1.

[Baig_2004-3] 3.0 ^3.1 "Mitochondrial DNA diversity in tribal and caste groups of Maharashtra (India) and its implication on their genetic origins". Annals of Human Genetics 68 (Pt 5): 453–460. September 2004. doi:10.1046/j.1529-8817.2004.00108.x. PMID 15469422.

[Kumar-4] 4.0 ^4.1 Science & Technology For Upsc. Tata McGraw-Hill Education. 2007. p. 595. ISBN 978-0-07-065548-5. https://books.google.com/books?id=CzV1MgFH6oMC&pg=PA595. Retrieved 24 May 2016.

[Tripathy_2008-5] 5.0 ^5.1 ^5.2 ^5.3 "Trends in Molecular Anthropological Studies in India". International Journal of Human Genetics 8 (1–2): 1–20. 4 September 2017. doi:10.1080/09723757.2008.11886015.

[Narasimhan_2019-6] 6.00 ^6.01 ^6.02 ^6.03 ^6.04 ^6.05 ^6.06 ^6.07 ^6.08 ^6.09 ^6.10 "The formation of human populations in South and Central Asia". Science 365 (6457). September 2019. doi:10.1126/science.aat7487. PMID 31488661. Bibcode: 2019Sci...365t7487N.

[Yelmen_2019-7] 7.0 ^7.1 ^7.2 ^7.3 ^7.4 ^7.5 ^7.6 "Ancestry-Specific Analyses Reveal Differential Demographic Histories and Opposite Selective Pressures in Modern South Asian Populations". Molecular Biology and Evolution 36 (8): 1628–1642. August 2019. doi:10.1093/molbev/msz037. PMID 30952160. "The two main components (i.e., autochthonous South Asian and West Eurasian) of Indian genetic variation form one of the deepest splits among non-African groups, which took place when South Asian populations separated from East Asian and Andamanese populations, shortly after having separated from West Eurasian populations (Mondal et al. 2016; Narasimhan et al. 2018).".

[Yang_2022-8] 8.0 ^8.1 ^8.2 "A genetic history of migration, diversification, and admixture in Asia" (in en). Human Population Genetics and Genomics 2 (1): 1–32. 6 January 2022. doi:10.47248/hpgg2202010001. ISSN 2770-5005. https://www.pivotscipub.com/hpgg/2/1/0001/html. "The branches predominantly associated with present-day Asian populations include the Ancient Ancestral South Indian (AASI) lineage, Australasian (AA) lineage, and East and Southeast Asian (ESEA) lineage.".

[Pathak-9] 9.0 ^9.1 ^9.2 ^9.3 ^9.4 Pathak, Ajai K.; Kadian, Anurag; Kushniarevich, Alena; Montinaro, Francesco; Mondal, Mayukh; Ongaro, Linda; Singh, Manvendra; Kumar, Pramod et al. (6 December 2018). "The Genetic Ancestry of Modern Indus Valley Populations from Northwest India". The American Journal of Human Genetics 103 (6): 918–929. doi:10.1016/j.ajhg.2018.10.022. ISSN 0002-9297. PMID 30526867. "A previous ancient-DNA study has suggested that the Iran_N and Steppe_EMBA groups are the best proxies for the ancient West Eurasian component in South Asians. The study also suggested that most South Asians can be modeled as a mixture of these two groups but also have Onge- and Han-related ancestries.".

[Reich_2009-10] 10.0 ^10.1 ^10.2 ^10.3 "Reconstructing Indian population history". Nature 461 (7263): 489–494. September 2009. doi:10.1038/nature08365. PMID 19779445. Bibcode: 2009Natur.461..489R.

[Broushaki_2016-11] 11.0 ^11.1 ^11.2 ^11.3 "Early Neolithic genomes from the eastern Fertile Crescent". Science 353 (6298): 499–503. July 2016. doi:10.1126/science.aaf7943. PMID 27417496. Bibcode: 2016Sci...353..499B. ; Lay summary in: "Prehistoric genomes from the world's first farmers in the Zagros mountains reveal different Neolithic ancestry for Europeans and South Asians". https://www.sciencedaily.com/releases/2016/07/160714151201.htm. "The research team found that the Iranian genomes represent the main ancestors of modern-day South Asians. ...the Zagros people of the Neolithic eastern Fertile Crescent that are ancestral to most modern South Asians..."

[ChaubeyEast-12] 12.0 ^12.1 ^12.2 ^12.3 ^12.4 "East Asian ancestry in India". Indian Journal of Physical Anthropology and Human Genetics 34 (2): 193–199. January 2015. https://serialsjournals.com/abstract/78963_2.pdf. "Here the analysis of genome wide data on Indian and East/Southeast Asian demonstrated their restricted distinctive ancestry in India mainly running along the foothills of Himalaya and northeastern part.".

[Chaubey_2010-13] 13.0 ^13.1 "Population genetic structure in Indian Austroasiatic speakers: the role of landscape barriers and sex-specific admixture". Molecular Biology and Evolution 28 (2): 1013–1024. February 2011. doi:10.1093/molbev/msq288. PMID 20978040.

[Chaubey_2015-14] "Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana". PLOS ONE 10 (6). 10 June 2015. doi:10.1371/journal.pone.0127655. PMID 26061398. Bibcode: 2015PLoSO..1027655C.

[Pakistan-15] 15.0 ^15.1 ^15.2 "Investigating the West Eurasian ancestry of Pakistani Hazaras". Journal of Genetics 98 (2). June 2019. doi:10.1007/s12041-019-1093-2. PMID 31204712.

[Lazaridis_2016-16] 16.0 ^16.1 ^16.2 ^16.3 ^16.4 "Genomic insights into the origin of farming in the ancient Near East". Nature 536 (7617): 419–424. August 2016. doi:10.1038/nature19310. PMID 27459054. Bibcode: 2016Natur.536..419L.

[Chaubey2014-17] "Unravelling the distinct strains of Tharu ancestry". European Journal of Human Genetics 22 (12): 1404–1412. December 2014. doi:10.1038/ejhg.2014.36. PMID 24667789.

[Arciero-18] 18.0 ^18.1 "Demographic History and Genetic Adaptation in the Himalayan Region Inferred from Genome-Wide SNP Genotypes of 49 Populations". Molecular Biology and Evolution 35 (8): 1916–1933. August 2018. doi:10.1093/molbev/msy094. PMID 29796643.

[:0-19] 19.0 ^19.1 Bennett, E. Andrew; Liu, Yichen; Fu, Qiaomei (3 December 2024). "Reconstructing the Human Population History of East Asia through Ancient Genomics" (in en). Elements in Ancient East Asia. doi:10.1017/9781009246675. ISBN 978-1-009-24667-5. https://www.cambridge.org/core/elements/reconstructing-the-human-population-history-of-east-asia-through-ancient-genomics/0524D629660B5E43FC7094C043D54C6A.

[20] Aoki, Kenichi; Takahata, Naoyuki; Oota, Hiroki; Wakano, Joe Yuichiro; Feldman, Marcus W. (30 August 2023). "Infectious diseases may have arrested the southward advance of microblades in Upper Palaeolithic East Asia". Proceedings of the Royal Society B: Biological Sciences 290 (2005). doi:10.1098/rspb.2023.1262. PMID 37644833. "A single major migration of modern humans into the continents of Asia and Sahul was strongly supported by earlier studies using mitochondrial DNA, the non-recombining portion of Y chromosomes, and autosomal SNP data [42–45]. Ancestral Ancient South Indians with no West Eurasian relatedness, East Asians, Onge (Andamanese hunter–gatherers) and Papuans all derive in a short evolutionary time from the eastward dispersal of an out-of-Africa population [46,47]".

[21] Aragon, Jose A. Urban; Bandyopadhyay, Esha; Fernando, Amali S.; Castro, Constanza de la Fuente; Welikala, Anjana H. J.; Biddanda, Arjun; Witonsky, David; Sander, Nathan et al. (9 June 2025). "Population histories of the Indigenous Adivasi and Sinhalese from Sri Lanka using whole genomes" (in English). Current Biology 35 (11): 2554–2566.e7. doi:10.1016/j.cub.2025.04.039. ISSN 0960-9822. PMID 40494279. Bibcode: 2025CBio...35.2554U. "The high levels of the AASI-like genetic ancestry, an unsampled basal Asian lineage that shares deep ancestry with ancestral East Asians,19 in the Adivasi could partially explain the extra allele sharing between them and populations with East Asian-related genetic ancestry.".

[22] "Papuan mitochondrial genomes and the settlement of Sahul". Journal of Human Genetics 65 (10): 875–887. October 2020. doi:10.1038/s10038-020-0781-3. PMID 32483274.

[23] "Genome-wide data substantiate Holocene gene flow from India to Australia". Proceedings of the National Academy of Sciences of the United States of America 110 (5): 1803–1808. January 2013. doi:10.1073/pnas.1211927110. PMID 23319617. Bibcode: 2013PNAS..110.1803P.

[24] "A genomic view of the peopling and population structure of India". Cold Spring Harbor Perspectives in Biology 7 (4). August 2014. doi:10.1101/cshperspect.a008540. PMID 25147176.

[Shinde_2019-25] 25.0 ^25.1 ^25.2 ^25.3 "An Ancient Harappan Genome Lacks Ancestry from Steppe Pastoralists or Iranian Farmers". Cell 179 (3): 729–735.e10. October 2019. doi:10.1016/j.cell.2019.08.048. PMID 31495572.

[Narasimhan_2018-26] 26.0 ^26.1 ^26.2 ^26.3 ^26.4 "The formation of human populations in South and Central Asia". Science 365 (6457). 2019. doi:10.1126/science.aat7487. PMID 31488661. Bibcode: 2019Sci...365t7487N.

[Kanthimathi_2008-27] 27.0 ^27.1 "Genetic study of Dravidian castes of Tamil Nadu". Journal of Genetics 87 (2): 175–9. August 2008. doi:10.1007/s12041-008-0027-1. PMID 18776648.

[Cole-28] 28.0 ^28.1 "Genetic structure in the Sherpa and neighboring Nepalese populations". BMC Genomics 18 (1). January 2017. doi:10.1186/s12864-016-3469-5. PMID 28103797.

[Petraglia-29] 29.0 ^29.1 (in en) The Evolution and History of Human Populations in South Asia: Inter-disciplinary Studies in Archaeology, Biological Anthropology, Linguistics and Genetics. Springer Science & Business Media. 22 May 2007. ISBN 978-1-4020-5562-1. https://books.google.com/books?id=Qm9GfjNlnRwC&pg=PA201.

[:2-30] 30.0 ^30.1 Vallini, Leonardo; Zampieri, Carlo; Shoaee, Mohamed Javad; Bortolini, Eugenio; Marciani, Giulia; Aneli, Serena; Pievani, Telmo; Benazzi, Stefano et al. (25 March 2024). "The Persian plateau served as hub for Homo sapiens after the main out of Africa dispersal" (in en). Nature Communications 15 (1): 1882. doi:10.1038/s41467-024-46161-7. ISSN 2041-1723. PMID 38528002. Bibcode: 2024NatCo..15.1882V.

[32] "Genetics and Material Culture Support Repeated Expansions into Paleolithic Eurasia from a Population Hub Out of Africa". Genome Biology and Evolution 14 (4). April 2022. doi:10.1093/gbe/evac045. PMID 35445261.

[Metspalu_2011-33] 32.0 ^32.1 ^32.2 "Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia". American Journal of Human Genetics 89 (6): 731–744. December 2011. doi:10.1016/j.ajhg.2011.11.010. PMID 22152676.

[Moorjani_2013-34] 33.0 ^33.1 "Genetic evidence for recent population mixture in India". American Journal of Human Genetics 93 (3): 422–438. September 2013. doi:10.1016/j.ajhg.2013.07.006. PMID 23932107.

[35] "A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals". BMC Evolutionary Biology 17 (1). March 2017. doi:10.1186/s12862-017-0936-9. PMID 28335724. Bibcode: 2017BMCEE..17...88S.

[36] Kerdoncuff, Elise; Skov, Laurits; Patterson, Nick et al. (2025). "50,000 years of evolutionary history of India: Impact on health and disease variation". Cell 188 (13): 3389–3404. doi:10.1016/j.cell.2025.04.027. PMID 40578318.

[Basu2016-37] 36.0 ^36.1 "Genomic reconstruction of the history of extant populations of India reveals five distinct ancestral components and a complex structure". Proceedings of the National Academy of Sciences of the United States of America 113 (6): 1594–1599. February 2016. doi:10.1073/pnas.1513197113. PMID 26811443. Bibcode: 2016PNAS..113.1594B.

[38] Zhang, X.; Liao, S.; Qi, X. et al. (2015). "Y-chromosome diversity suggests southern origin and Paleolithic backwave migration of Austro-Asiatic speakers from eastern Asia to the Indian subcontinent". Scientific Reports 5. doi:10.1038/srep15486. PMID 26482917. Bibcode: 2015NatSR...515486Z.

[39] Arunkumar, G. (2015). "A late Neolithic expansion of Y chromosomal haplogroup O2a1-M95 from east to west". Journal of Systematics and Evolution 53 (6): 546–560. doi:10.1111/jse.12147. Bibcode: 2015JSyEv..53..546A.

[40] Tätte, Kai; Pagani, Luca; Pathak, Ajai K.; Kõks, Sulev; Ho Duy, Binh; Ho, Xuan Dung; Sultana, Gazi Nurun Nahar; Sharif, Mohd Istiaq et al. (7 March 2019). "The genetic legacy of continental scale admixture in Indian Austroasiatic speakers" (in en). Scientific Reports 9 (1): 3818. doi:10.1038/s41598-019-40399-8. ISSN 2045-2322. PMID 30846778. Bibcode: 2019NatSR...9.3818T.

[Kivisild_2000a-41] 40.0 ^40.1 ^40.2 ^40.3 ^40.4 An Indian Ancestry: a Key for Understanding Human Diversity in Europe and Beyond. McDonald Institute Monographs. 2000. http://evolutsioon.ut.ee/publications/Kivisild2000.pdf. Retrieved 11 November 2005.

[McDonald_2004-42] 41.0 ^41.1 ^41.2 "Y Haplogroups of the World". 2004. http://www.scs.uiuc.edu/~mcdonald/WorldHaplogroupsMaps.pdf.

[Arunkumar_2015-43] 42.0 ^42.1 "A late Neolithic expansion of Y chromosomal haplogroup O2a1-M95 from east to west" (in en). Journal of Systematics and Evolution 53 (6): 546–560. 2015. doi:10.1111/jse.12147. Bibcode: 2015JSyEv..53..546A.

[Sengupta_2006-44] 43.00 ^43.01 ^43.02 ^43.03 ^43.04 ^43.05 ^43.06 ^43.07 ^43.08 ^43.09 ^43.10 ^43.11 ^43.12 ^43.13 ^43.14 ^43.15 ^43.16 ^43.17 "Polarity and temporality of high-resolution y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of Central Asian pastoralists". American Journal of Human Genetics 78 (2): 202–221. February 2006. doi:10.1086/499411. PMID 16400607. Bibcode: 2006AmJHG..78..202S.

[Sahoo_2006-45] 44.0 ^44.1 ^44.2 ^44.3 ^44.4 ^44.5 ^44.6 "A prehistory of Indian Y chromosomes: evaluating demic diffusion scenarios". Proceedings of the National Academy of Sciences of the United States of America 103 (4): 843–848. January 2006. doi:10.1073/pnas.0507714103. PMID 16415161. Bibcode: 2006PNAS..103..843S.

[Thanseem_2006-46] 45.0 ^45.1 ^45.2 ^45.3 ^45.4 ^45.5 "Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA". BMC Genetics 7. August 2006. doi:10.1186/1471-2156-7-42. PMID 16893451.

[Zhao_2009-47] 46.0 ^46.1 ^46.2 "Presence of three different paternal lineages among North Indians: a study of 560 Y chromosomes". Annals of Human Biology 36 (1): 46–59. 2009. doi:10.1080/03014460802558522. PMID 19058044.

[FOOTNOTEReich2018-49] Reich 2018.

[Kivisild_2003-50] 48.00 ^48.01 ^48.02 ^48.03 ^48.04 ^48.05 ^48.06 ^48.07 ^48.08 ^48.09 ^48.10 ^48.11 ^48.12 ^48.13 Kivisild, T.; Rootsi, S.; Metspalu, M.; Mastana, S.; Kaldma, K.; Parik, J.; Metspalu, E.; Adojaan, M. et al. (February 2003). "The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations". American Journal of Human Genetics 72 (2): 313–332. doi:10.1086/346068. ISSN 0002-9297. PMID 12536373. Bibcode: 2003AmJHG..72..313K.

[51] Singh, Mugdha; Sarkar, Anujit; Nandineni, Madhusudan R. (18 October 2018). "A comprehensive portrait of Y-STR diversity of Indian populations and comparison with 129 worldwide populations" (in en). Scientific Reports 8 (1): 15421. doi:10.1038/s41598-018-33714-2. ISSN 2045-2322. PMID 30337554. Bibcode: 2018NatSR...815421S.

[52] "X-chromosomal STR based genetic polymorphisms and demographic history of Sri Lankan ethnicities and their relationship with global populations". Scientific Reports 11 (1). June 2021. doi:10.1038/s41598-021-92314-9. PMID 34140598. Bibcode: 2021NatSR..1112748P.

[53] The Real Eve: Modern Man's Journey out of Africa. New York: Carroll and Graf Publishers. 2003. ISBN 978-0-7867-1192-5.

[Thangaraj_2006-54] 52.0 ^52.1 "Comparative analysis of cancer genes in the human and chimpanzee genomes". BMC Genomics 7. January 2006. doi:10.1186/1471-2164-7-15. PMID 16438707.

[Metspalu_2004-55] 53.0 ^53.1 ^53.2 "Most of the extant mtDNA boundaries in south and southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans". BMC Genetics 5 (1). August 2004. doi:10.1186/1471-2156-5-26. PMID 15339343. Bibcode: 2004BMCGe...5...26M.

[Kivisild_1999a-56] "Deep common ancestry of indian and western-Eurasian mitochondrial DNA lineages". Current Biology 9 (22): 1331–1334. November 1999. doi:10.1016/s0960-9822(00)80057-3. PMID 10574762. Bibcode: 1999CBio....9.1331K.

[frontiers_h-57] Mahal, David G.; Matsoukas, Ianis G. (2018). "The Geographic Origins of Ethnic Groups in the Indian Subcontinent: Exploring Ancient Footprints with Y-DNA Haplogroups". Frontiers in Genetics 9. doi:10.3389/fgene.2018.00004. PMID 29410676.

[kpk-58] Tariq, Muhammad; Ahmad, Habib; Hemphill, Brian E.; Farooq, Umar; Schurr, Theodore G. (2022). "Contrasting maternal and paternal genetic histories among five ethnic groups from Khyber Pakhtunkhwa, Pakistan". Scientific Reports 12 (1): 1027. doi:10.1038/s41598-022-05076-3. PMID 35046511. Bibcode: 2022NatSR..12.1027T.

[isogg.org-59] "Y-DNA Haplogroup H and its Subclades – 2015". http://www.isogg.org/tree/ISOGG_HapgrpH.html.

[Cordaux2004-60] 58.0 ^58.1 ^58.2 ^58.3 ^58.4 ^58.5 "Independent origins of Indian caste and tribal paternal lineages". Current Biology 14 (3): 231–235. February 2004. doi:10.1016/j.cub.2004.01.024. PMID 14761656. Bibcode: 2004CBio...14..231C.

[Singh2016-61] 59.0 ^59.1 ^59.2 "Dissecting the influence of Neolithic demic diffusion on Indian Y-chromosome pool through J2-M172 haplogroup". Scientific Reports 6 (1). January 2016. doi:10.1038/srep19157. PMID 26754573. Bibcode: 2016NatSR...619157S.

[Herrera2018-62] (in en) Ancestral DNA, Human Origins, and Migrations. Academic Press. 2018. p. 250. ISBN 978-0-12-804128-4. https://books.google.com/books?id=ZF1gDwAAQBAJ&q=Ancestral+DNA+Human+Origins+and+Migrations+J2b-M102+South+Asia&pg=PA250.

[Arunkumar_2012-63] 61.0 ^61.1 "Population differentiation of southern Indian male lineages correlates with agricultural expansions predating the caste system". PLOS ONE 7 (11). 2012. doi:10.1371/journal.pone.0050269. PMID 23209694. Bibcode: 2012PLoSO...750269A.

[Sharma_2009-64] 62.0 ^62.1 ^62.2 ^62.3 ^62.4 "The Indian origin of paternal haplogroup R1a1* substantiates the autochthonous origin of Brahmins and the caste system". Journal of Human Genetics 54 (1): 47–55. January 2009. doi:10.1038/jhg.2008.2. PMID 19158816.

[Eaaswarkhanth_2009b-65] "Diverse genetic origin of Indian Muslims: evidence from autosomal STR loci". Journal of Human Genetics 54 (6): 340–8. June 2009. doi:10.1038/jhg.2009.38. PMID 19424286.

[Qamar_2002-66] 64.0 ^64.1 ^64.2 ^64.3 "Y-chromosomal DNA variation in Pakistan". American Journal of Human Genetics 70 (5): 1107–1124. May 2002. doi:10.1086/339929. PMID 11898125.

[Shah2011-67] 65.0 ^65.1 "Indian Siddis: African descendants with Indian admixture". American Journal of Human Genetics 89 (1): 154–161. July 2011. doi:10.1016/j.ajhg.2011.05.030. PMID 21741027.

[Pijpe2013-68] 66.0 ^66.1 ^66.2 "Indian Ocean crossroads: human genetic origin and population structure in the Maldives". American Journal of Physical Anthropology 151 (1): 58–67. May 2013. doi:10.1002/ajpa.22256. PMID 23526367. Bibcode: 2013AJPA..151...58P.

[Wells2007-69] (in en) Deep Ancestry: The Landmark DNA Quest to Decipher Our Distant Past. National Geographic Books. 20 November 2007. pp. 161–162. ISBN 978-1-4262-0211-7. https://books.google.com/books?id=NWgDAQAAQBAJ. "This part of the M9 Eurasian clan migrated south once they reached the rugged and mountainous Pamir Knot region. The man who gave rise to marker M20 was possibly born in India or the Middle East. His ancestors arrived in India around 30,000 years ago and represent the earliest significant settlement of India."

[Wells_2017-70] (in en) The Journey of Man: A Genetic Odyssey. Princeton University Press. 28 March 2017. pp. 111–113. ISBN 978-0-691-17601-7. https://books.google.com/books?id=Sus9DwAAQBAJ.

[McElreavey_2005-71] 69.0 ^69.1 "A population genetics perspective of the Indus Valley through uniparentally-inherited markers". Annals of Human Biology 32 (2): 154–162. 2005. doi:10.1080/03014460500076223. PMID 16096211.

[72] "Presence of three different paternal lineages among North Indians: a study of 560 Y chromosomes". Annals of Human Biology 36 (1): 46–59. 2009. doi:10.1080/03014460802558522. PMID 19058044.

[73] "The influence of natural barriers in shaping the genetic structure of Maharashtra populations". PLOS ONE 5 (12). December 2010. doi:10.1371/journal.pone.0015283. PMID 21187967. Bibcode: 2010PLoSO...515283T.

[Mahal_2018-74] "The Geographic Origins of Ethnic Groups in the Indian Subcontinent: Exploring Ancient Footprints with Y-DNA Haplogroups". Frontiers in Genetics 9. 23 January 2018. doi:10.3389/fgene.2018.00004. PMID 29410676.

[David_G_2017-75] 73.0 ^73.1 "Y-STR Haplogroup Diversity in the Jat Population Reveals Several Different Ancient Origins". Frontiers in Genetics 8. 20 September 2017. doi:10.3389/fgene.2017.00121. PMID 28979290.

[Thangaraj_2010-76] "The influence of natural barriers in shaping the genetic structure of Maharashtra populations". PLOS ONE 5 (12). December 2010. doi:10.1371/journal.pone.0015283. PMID 21187967. Bibcode: 2010PLoSO...515283T.

[Basu_2003-77] 75.0 ^75.1 ^75.2 ^75.3 "Ethnic India: a genomic view, with special reference to peopling and structure". Genome Research 13 (10): 2277–2290. October 2003. doi:10.1101/gr.1413403. PMID 14525929.

[Firasat_2007-78] 76.0 ^76.1 ^76.2 "Y-chromosomal evidence for a limited Greek contribution to the Pathan population of Pakistan". European Journal of Human Genetics 15 (1): 121–126. January 2007. doi:10.1038/sj.ejhg.5201726. PMID 17047675.

[Kivisild_2003a-79] 77.0 ^77.1 "The Genetics of Language and Farming Spread in India". Examining the farming/language dispersal hypothesis. McDonald Institute for Archaeological Research, Cambridge, United Kingdom. 2003. pp. 215–222. http://evolutsioon.ut.ee/publications/Kivisild2003a.pdf. Retrieved 11 November 2005.

[Underhill_2009-80] 78.0 ^78.1 ^78.2 ^78.3 ^78.4 "Separating the post-Glacial coancestry of European and Asian Y chromosomes within haplogroup R1a". European Journal of Human Genetics 18 (4): 479–484. April 2010. doi:10.1038/ejhg.2009.194. PMID 19888303.

[Mirabal_2009-81] "Y-chromosome distribution within the geo-linguistic landscape of northwestern Russia". European Journal of Human Genetics 17 (10): 1260–1273. October 2009. doi:10.1038/ejhg.2009.6. PMID 19259129.

[Underhill_2015-82] 80.0 ^80.1 ^80.2 "The phylogenetic and geographic structure of Y-chromosome haplogroup R1a". European Journal of Human Genetics 23 (1): 124–131. January 2015. doi:10.1038/ejhg.2014.50. PMID 24667786.

[Pamjav_2012-83] 81.0 ^81.1 ^81.2 "Brief communication: new Y-chromosome binary markers improve phylogenetic resolution within haplogroup R1a1". American Journal of Physical Anthropology 149 (4): 611–615. December 2012. doi:10.1002/ajpa.22167. PMID 23115110. Bibcode: 2012AJPA..149..611P.

[Fornarino_2009-84] "Mitochondrial and Y-chromosome diversity of the Tharus (Nepal): a reservoir of genetic variation". BMC Evolutionary Biology 9 (1). July 2009. doi:10.1186/1471-2148-9-154. PMID 19573232. Bibcode: 2009BMCEE...9..154F.

[Manoukian_2006-85] "A Synthesis of Haplogroup R2". 2006. http://www.ethnoancestry.com/index_files/index_data/Haplogroup_R2_Manoukian.pdf.

[Y-chromosomal_sequences_of_diverse-86] 84.0 ^84.1 "Y-chromosomal sequences of diverse Indian populations and the ancestry of the Andamanese". Human Genetics 136 (5): 499–510. May 2017. doi:10.1007/s00439-017-1800-0. PMID 28444560.

[87] Amjadi, Motahareh Ala; Özdemir, Yusuf Can; Ramezani, Maryam; Jakab, Kristóf; Megyes, Melinda; Bibak, Arezoo; Salehi, Zeinab; Hayatmehar, Zahra et al. (13 May 2025). "Ancient DNA indicates 3,000 years of genetic continuity in the Northern Iranian Plateau, from the Copper Age to the Sassanid Empire" (in en). Scientific Reports 15 (1): 16530. doi:10.1038/s41598-025-99743-w. ISSN 2045-2322. PMID 40360796. Bibcode: 2025NatSR..1516530A.

[Vikrant2007-88] 86.0 ^86.1 ^86.2 ^86.3 ^86.4 ^86.5 "Y-chromosome evidence suggests a common paternal heritage of Austro-Asiatic populations". BMC Evolutionary Biology 7 (1). March 2007. doi:10.1186/1471-2148-7-47. PMID 17389048. Bibcode: 2007BMCEE...7...47K.

[Eaaswarkhanth_2009-89] "Traces of sub-Saharan and Middle Eastern lineages in Indian Muslim populations". European Journal of Human Genetics 18 (3): 354–363. March 2010. doi:10.1038/ejhg.2009.168. PMID 19809480.

[Reddy2007-90] 88.0 ^88.1 "Austro-Asiatic tribes of Northeast India provide hitherto missing genetic link between South and Southeast Asia". PLOS ONE 2 (11). November 2007. doi:10.1371/journal.pone.0001141. PMID 17989774. Bibcode: 2007PLoSO...2.1141R.

[Zhang2015-91] 89.0 ^89.1 ^89.2 "Y-chromosome diversity suggests southern origin and Paleolithic backwave migration of Austro-Asiatic speakers from eastern Asia to the Indian subcontinent". Scientific Reports 5 (1). October 2015. doi:10.1038/srep15486. PMID 26482917. Bibcode: 2015NatSR...515486Z.

[Su2012-92] 90.0 ^90.1 ^90.2 "Y chromosome haplotypes reveal prehistorical migrations to the Himalayas". Human Genetics 107 (6): 582–590. December 2000. doi:10.1007/s004390000406. PMID 11153912.

[93] "Y-chromosome evidence of southern origin of the East Asian-specific haplogroup O3-M122". American Journal of Human Genetics 77 (3): 408–419. September 2005. doi:10.1086/444436. PMID 16080116. Bibcode: 2005AmJHG..77..408S.

[94] "The Himalayas as a directional barrier to gene flow". American Journal of Human Genetics 80 (5): 884–894. May 2007. doi:10.1086/516757. PMID 17436243.

[95] Indian Genome Variation Consortium (April 2008). "Genetic landscape of the people of India: a canvas for disease gene exploration". Journal of Genetics 87 (1): 3–20. doi:10.1007/s12041-008-0002-x. PMID 18560169.

[96] "The Place of the Indian mtDNA Variants in the Global Network of Maternal Lineages and the Peopling of the Old World". http://www.imtech.res.in/raghava/reprints/IGVdb.pdf.

[97] "Ethnologue report for Indo-European". Ethnologue.com. http://www.ethnologue.com/show_family.asp?subid=2-16.

[98] Linguistic Change and Reconstruction Methodology. Walter de Gruyter. 1990. p. 342. ISBN 978-3-11-011908-4.

[99] "Languages and language families in China". Encyclopedia of Chinese Language and Linguistics. Leiden: Brill. 2015. doi:10.1163/2210-7363_ecll_COM_00000219. https://www.academia.edu/1542763. "MK in the wider sense including the Munda languages of eastern South Asia is also known as Austroasiatic."

[Bamshad_2001-100] "Genetic evidence on the origins of Indian caste populations". Genome Research 11 (6): 994–1004. June 2001. doi:10.1101/gr.GR-1733RR. PMID 11381027.

[Mukherjee_2001-101] "High-resolution analysis of Y-chromosomal polymorphisms reveals signatures of population movements from Central Asia and West Asia into India". Journal of Genetics 80 (3): 125–135. December 2001. doi:10.1007/BF02717908. PMID 11988631.

[102] "Herders of Indian and European cattle share their predominant allele for lactase persistence". Molecular Biology and Evolution 29 (1): 249–260. January 2012. doi:10.1093/molbev/msr190. PMID 21836184.

[ScienceLife2011-103] 101.0 ^101.1 "Lactose Tolerance in the Indian Dairyland". ScienceLife. University of Chicago Medicine & Biological Sciences. 2011. http://sciencelife.uchospitals.edu/2011/09/14/lactose-tolerance-in-the-indian-dairyland/.

[:1-104] "Genetic evidence for recent population mixture in India". American Journal of Human Genetics 93 (3): 422–438. September 2013. doi:10.1016/j.ajhg.2013.07.006. PMID 23932107.

[Chaubey_and_Endicott-105] 103.0 ^103.1 "The Andaman Islanders in a regional genetic context: reexamining the evidence for an early peopling of the archipelago from South Asia". Human Biology 85 (1–3): 153–172. June 2013. doi:10.3378/027.085.0307. PMID 24297224. https://digitalcommons.wayne.edu/cgi/viewcontent.cgi?article=2055&context=humbiol.

[Majumder_2010-106] "The human genetic history of South Asia". Current Biology 20 (4): R184–R187. February 2010. doi:10.1016/j.cub.2009.11.053. PMID 20178765. Bibcode: 2010CBio...20.R184M.

[107] Tagore, Debashree; Majumder, Partha P.; Chatterjee, Anupam; Basu, Analabha (2022). "Multiple migrations from East Asia led to linguistic transformation in NorthEast India and mainland Southeast Asia". Frontiers in Genetics 13. doi:10.3389/fgene.2022.1023870. PMID 36303544.

[108] Bankura, Biswabandhu; Basak, Bishnupriya; Singh, Prajjval Pratap et al. (2026). "Northeast india: Genetic inconsistency across ethnicity and geography". Molecular Genetics and Genomics 301 (1). doi:10.1007/s00438-026-02358-7. PMID 41619049.

[109] Liu, Chi-Chun; Witonsky, David; Gosling, Anna et al. (2022). "Ancient genomes from the Himalayas illuminate the genetic history of Tibetans and their Tibeto-Burman speaking neighbors". Nature Communications 13 (1203). doi:10.1038/s41467-022-28827-2. PMC 8904508. Bibcode: 2022NatCo..13.1203L. https://www.nature.com/articles/s41467-022-28827-2#Sec9.

[110] Bandyopadhyay, Esha; Witonsky, David; Castro, Constanza de la Fuente et al. (2025). "Dynamic human admixture histories over the past ~1300 years at the northern Himalayan frontier". Science Advances 11 (44). doi:10.1126/sciadv.adu9625. PMID 41160688. Bibcode: 2025SciA...11.9625B.

[FOOTNOTEReich2018149–152-111] Reich 2018, pp. 149–152.

[Watkins_2005-112] "Diversity and divergence among the tribal populations of India". Annals of Human Genetics 69 (Pt 6): 680–692. November 2005. doi:10.1046/j.1529-8817.2005.00200.x. PMID 16266407.

[Reddy_2005-113] "Microsatellite diversity in Andhra Pradesh, India: genetic stratification versus social stratification". Human Biology 77 (6): 803–823. December 2005. doi:10.1353/hub.2006.0018. PMID 16715839.

[Vishwanathan_2004-114] "Genetic structure and affinities among tribal populations of southern India: a study of 24 autosomal DNA markers". Annals of Human Genetics 68 (Pt 2): 128–138. March 2004. doi:10.1046/j.1529-8817.2003.00083.x. PMID 15008792.

[Lipson_2018-115] 113.0 ^113.1 "Ancient genomes document multiple waves of migration in Southeast Asian prehistory". Science 361 (6397): 92–95. July 2018. doi:10.1126/science.aat3188. PMID 29773666. Bibcode: 2018Sci...361...92L.

[116] "Dissecting the paternal founders of Mundari (Austroasiatic) speakers associated with the language dispersal in South Asia". European Journal of Human Genetics 29 (3): 528–532. March 2021. doi:10.1038/s41431-020-00745-1. PMID 33087879.

[117] "Reconstructing the history of founder events using genome-wide patterns of allele sharing across individuals". PLOS Genetics 18 (6). June 2022. doi:10.1371/journal.pgen.1010243. PMID 35737729.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[web 1]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[web 2]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

[69]

[70]

[71]

[72]

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83]

[84]

[85]

[86]

[87]

[88]

[89]

[90]

[91]

[92]

[93]

[94]

[95]

[96]

[97]

[98]

v t e Human genetics
Sub-topics	Human genome Human Genome Project Evolutionary genetics Human-chimp MRCA Neanderthal genetics Neanderthal genome project Timeline Genetic variation Blood type distribution by country Genealogical DNA test Genetic genealogy Race and genetics Recent evolution Surname DNA project Genetic engineering
Genetic history by region	Sub-Saharan African Australo-Melanesian South Asian North African Near Eastern Early Anatolian Farmers Caucasus Caucasian Hunter-Gatherer Europe Western Hunter-Gatherer British Isles Iberia Italy Central Asian Ancient North Eurasian East Asian Southeast Asian American Ancient Beringian
Population genetics by group	Europe Albanians Basques Bosniaks Bulgarians Croats Romanians Russians Sami Serbs Jews MENA Arabs Azerbaijanis Egyptians Moroccans Turks South Asia Gujaratis Sinhalese Tamils (Sri Lankan) East Asia Han Chinese Japanese Sub-Saharan Africa Hutu/Tutsi Khoisan Pygmies
Category Commons

Anonymous

Search

Biology:Genetics and archaeogenetics of South Asia

Overview

mtDNA

Macrohaplogroup M

Macrohaplogroup R

Haplogroup U

Y chromosome

Haplogroup H

Haplogroup J2

Haplogroup L

India

Pakistan

Sri Lanka

Haplogroup R1a1

India

Pakistan

Sri Lanka

Maldives

Nepal

Haplogroup R2

India

Pakistan

Sri Lanka

Maldives

Nepal

Haplogroup O

Reconstructing South Asian population history

mtDNA variation

Y Chromosome variation

Autosomal DNA variation

AASI-ANI-ASI

Genetic distance between caste groups and tribes

See also

Notes

References

Further reading

External links

Navigation

Wiki tools

Page tools

Other projects

Categories