Featured Post

Major update reorganizes entire haplogroup I2 tree

Thursday, December 4, 2014

December 2014 Tree for I-P37

This tree is dated 2014-11-25c

There are also several changes from the 2014-11-05 tree which you can still see in an earlier post.




Friday, November 7, 2014

November 2014 tree for I-P37

This shows some interesting new SNPs below I-L161 and some other changes. We might make slight updates to this tree soon.


Friday, September 12, 2014

The Disles group is complex

The "Disles" group is rare. We give it the nickname Disles because it is related to the very common Dinaric group but it is found very often in the British Isles while the Dinaric group is usually in Eastern and Central Europe. (Disles= D for Dinaric + isles for British Isles).

And there are some important STR marker differences between Dinaric and Disles. The most important is that Dinaric has the rare value DYS565=9, while Disles has DYS565=11 like many other haplogroups.

And now we discovering more SNPs and learning of SNP differences between Dinaric and Disles. For example all Dinarics are CTS5966+ and CTS10228+ while Disles people so far are negative (ancestral) for these two SNPs.

One Disles man did the Geno 2.0 test and the Big Y test earlier this year (kit N52277 who is Australian with British paternal ancestry). He is L621+ but CTS4022- CTS10936- CTS11768- L147- CTS5966- CTS4002- (see figure 1 below). Our working assumption was that all Disles men would have the same results.

But in August another Disles man (kit N113464 who is American with likely paternal ancestry) did the Big Y test, and he is L621+ and CTS4022+ CTS10936+  CTS11768+ L147- CTS5966- CTS4002-. There is another man with paternal ancestry from Poland who has exactly the same SNP results and we have been calling him a "Dinaric cousin". The American and the Polish man are not closely related becaue they have 67 markers quite different from each other.

So now we have two groups of Disles which we are calling Disles B (a more recent common ancestor with the Dinaric group) and Disles A (a more distant common ancestor with the Dinaric group).

We are recommending that all Disles people do additional testing to determine which Disles group they belong to.
--the cheapest test we are recommending is CTS10936 which costs $39 at Family Tree DNA.
--but, it's possible that some Disles people might have differences at CTS4002 or CTS11768 or other SNPs (see Figure 3 below). If you are interested we recommend the Geno 2.0 test which includes all the SNPs shown on the tree, it is currently $159.95 plus $9.95 shipping to the US (higher to other countries):
click to buy Geno 2.0

And the best test is the expensive Big Y or other "full Y chromosome" sequencing.

Please let me know if you have questions at berniecullen@gmail.com and please inform me if you order one of these tests


Tuesday, September 9, 2014

New SNPs for I-L233 "Western"; A417 has been confirmed




This tree summarizes six Big Y tests for I-L233 "Western". The Big Ys sequenced around 10 million base pairs on each person's Y chromosome, and the six men had differences at two SNPs called A4223 and A417. The German-ancestry man at top was ancestral for both SNPs (Y4223- and A417-), the two Irish-ancestry men in the middle were derived for both (Y4223+ and A417+), and the three men at the bottom were Y4223+ and A417-.

Dozens of additional new SNPs were discovered in each man's Big Y results but at this point we have good evidence for only Y4223 and A417. When more Big Y tests are completed we will find a SNP specific to the branch at the top of the tree (for example), and we will find more branches not shown on this tree.

A417 is available for purchase at FTDNA and another company called YSeq.net for $39, and six people not shown on the tree have tested A417 as an individual test. Five were A417- and one was A417+. So in summary, twelve L233+ men have tested A417, and nine were A417- and three were A417+.

The five people who received A417- results from these individual tests could be Y4223+ or Y4223-. The one person who received A417+ results from his individual test is sure to be Y4223+ based on the tree above.

Unfortunately Y4223 is not available at FTDNA or YSeq.net for individual testing, but we have requested that FTDNA add it to their catalog.

From this testing so far, around 25% of L233+ men belong to the A417+ group. We can compare the 67 markers of the three A417+. and they are not very similar to each other. So the three A417+ men are not closely related, and the A417 mutation happened well over 1,000 years ago. I think we will find man more A417+ men, and I recommend the A417 test to all I-L233 "Western" men. Please feel free to email berniecullen@gmail.com with any questions.

Friday, September 5, 2014

Latest I-P37 haplotree

There are interesting new discoveries for several branches of the tree:
--Dinaric (I-CTS5966) can now be divided into several subgroups based on SNPs.
--The Disles group is more complex than previously known: some people who were placed in Disles based on STR markers (and who have paternal ancestry from Great Britain) have the same SNP results as the "Dinaric-cousin" person who is from southern Poland. Other Disles men are in the I-L621* group as shown on this tree,
--I-L161 ("Isles): There is one SNP found in  Isles-C but not in Isles-A, B, D. And there are three SNPs found in Isles-D but not in Isles-A, B, C. 
--I-L233 ("Western"): there are at least two important SNPs that split I-L233. This is so new that it's not shown on this tree. One of these, A417, is available for individual testing at FTDNA and YSeq.net.  Another SNP called Y4223 is even more common than A417, Y4223 is available at YSeq but not yet at FTDNA.

Remember that this tree is preliminary and may change with new information. And not all branches and not all SNPs are shown.

To get an idea of how large each of these branches is, and where each branch is found, take a look at the groupings at the FTDNA I-P37 project:
https://www.familytreedna.com/public/I2aHapGroup/default.aspx?section=ycolorized






Tuesday, May 20, 2014

Important new SNPs for Dinarics

The Dinaric group is a very large paternal haplogroup centered in Eastern and Central Europe, at various times it has been known as I-P37.2, I-P37.2xM26, I-M423, I-L69.2, I-L147.2, I-L621, I-CTS10228 etc. (and some of these names are not completely accurate). Currently we call it I-CTS5966.

We have found the first important SNPs to divide the Dinaric group, see the tree below. The most important SNP is S17250. So far, 5 Dinaric-North people are S17250- and 7 Dinaric-North people are S17250+. And all 5 Dinaric-South people who have done the test are S17250+.



S17250 is now available at FTDNA for order as an individual SNP, if you are already a FTDNA customer with 12 or more marker results, you can order it from the bottom of your Y-DNA....Haplotree & SNPs page.

You can also order S17250 from http://www.yseq.net/ and no prior testing is required at this company. And S17250 is included as part of the Chromo2 test at BritainsDNA/ ScotlandsDNA/ IrelandsDNA/ YorkshireDNA http://www.britainsdna.com/

How to read the tree above: the long numbers like 15531354 are positions on the Y chromosome and the letter/number codes like S17250 are the name of the SNP found at that position. At right, the numbers like 76814 are FTDNA kit numbers, all of these people did the expensive Big Y test which covers all of these SNPs. Thanks to the FTDNA Polish Project administrator and Zdenko Markovic of the I2a Project for analyzing the Big Y data and preparing this tree.

Thursday, April 10, 2014

Evidence for a small "Out of Sardinia" migration?

Around 40% of men living on the island of Sardinia belong to the I-M26 haplogroup, which is much less common in other places. Paolo Francalacci sequenced the Y chromosomes of hundreds of Sardinian men of all haplogroups, and found thousands of new SNPs, and arranged the men into a tree. Unfortunately I haven't been able to find a version of his tree that shows all the SNPs on each branch, but I think it is clear that all the I-M26 men living in his study belong to the more specific I-PF4189 branch.



The designers of the Genographic 2.0 test did have access to Dr. Francalacci's discoveries, and they included many of his PF series SNPs. I know of over 50 I-M26 men who have taken the Geno 2.0 test, and 45+ of them belong to other branches, not to I-PF4189 (and these 45+ men have paternal ancestry from Spain, the Azores, France, mainland Italy, Germany, Switzerland, Britain, Ireland, Belgium).

But six men do belong to I-PF4189 and they have ancestry from Sardinia and Malta, but also from Germany and England. See the attached tree--the two German men belong to different branches of I-PF4189, and they share several downstream SNPs with the Sardinian men from Francalacci's study, and one shares many SNPs with a Sardinian-American man in the FTDNA I2a Project.

I don't think this is recent (last 500 year) migration from Sardinia or because of adoptions etc. in Germany or America, one of the German men has STR marker similarities to other Germans who have not yet done the Geno 2.0 test indicating a long presence in Germany.

Was this an out of Sardinia migration? Or did the Sardinians and Germans both come from a founding population on the mainland of Europe?


Friday, March 14, 2014

Big discoveries in I-P37.2 tree

(Click on tree to enlarge--if you are having problems seeing everything, maybe try downloading it (with right click then save maybe) and then open the tree)

The relationships of the major branches of I-P37.2 are now known. These five branches and their nicknames:
M26  (Sardinian)
M423 (Dinaric, Disles, Isles)
L1286 (Alpine and L233 Western)
L880 (Northern France)
L1294 (France)

The key SNPs that show the history of these branches are:
CTS595,  S21825, and  L1286. The importance of each of these was learned in a different way:
CTS595 was included in the Geno 2.0 test
S21825 was included in the Chromo2 test at www.britainsdna.com and their chief scientist Jim Wilson released 2000 anonymous results in January. Ken Nordtvedt compared these results. See the most recent Chromo2P37 tree at Ken's website.
Finally, L1286 was discovered back in 2012 in a WTY (Walk Thorough the Y) by Thomas and Astrid Krahn back when they were at Family Tree DNA. For a long time we thought that only the Alpine and the I-L233 Western groups were L1286+, but then Zdenko Markovic noticed that not all I-P37.2 groups had tested it, and it turned out that the I-L880 "Northern France" group is also L1286+.

We used the nicknames "Sardinian" "Dinaric" etc. for many years because in many cases no SNPs were known to define the branches. For example, there were a few men with P37.2+ M26- M42- L233- results who all had a distinctive pattern of STR markers including DYS388=9. Many of the men had paternal origins from France (but others from Germany, Britain etc), so haplogroup I expert Ken Nordtvedt called the group P37.2*-"France" just to have an easy way to refer to the group.

Now all of the major groups have their own defining SNP and most of the subgroups do too. But very few of these SNPs are on the ISOGG tree. Zdenko Markovic joined the FTDNA I2a Project as co-administrator in January, and began coordinating the P37.2 part of the ISOGG tree at the same time. Thanks to his efforts several SNPs in the M423 area have been added to the ISOGG tree, and so has L880, and he's working on many more additions.



Wednesday, January 22, 2014

Updated I-L160 tree



We've had several sets of Geno 2.0 results and I have updated the tree for I-L160 (which is called I2a1b at FTDNA, it's part of I-M26 "Sardinian").

I've added CTS787 to the tree for the first time. I had omitted it on some earlier trees, it was thought to be equivalent to Z118. But now we have a person who is CTS787+ and negative for all downstream SNPs including Z118.

It's also become clear that there is a large and complex group of people who are PF4088+ and CTS11338-. There are several SNPs that have been found in this group, but so far they have each been found in only one family: L1299, PF5084, CTS8386. There are 8 people so far with PF4088+ CTS11338- results and they have 67 and 111 STR markers that are quite different from each other.

The top tree is thanks to Zdenko Markovic and the handwritten tree is my own--Bernie

Top level SNPs in I-P37.2


Click on tree to enlarge. There are five main branches of I-P37.2 and this tree shows how they are related:
the M423+ branch is CTS595-
and the other four branches (L1286, L880, L1294 and M26) are all CTS595+.

We still need to sort out these four CTS595+ branches. It's possible that PF3983 will be useful: we know that three branches are PF3983+ (these three branches are L880, L1294 and M26) but we aren't sure what the status of PF3983 is for the L1286 group. (We know that the M423 branch is PF3983-). But we have been unable to determine a PF3983 result for L1286 at Geno 2.0 or FTDNA, probably L1286+ people
have some mutation in the area of PF3983 that is interfering with the test.

Eventually we will have full Y/Big Y sequencing for all of these groups that will help us figure out the rest of the branching pattern. We have  multiple Big Y tests in progress for people in L621, L161, L233 and M26 but none for the other groups which are all much rarer.

We already have at least one Geno 2.0 result for every group except I-L1295 "France-Scotland", and we have learned a lot from Geno 2.0, especially for the L621, L161 and M26 groups. At this point I think we have learned most of what there is to learn from Geno 2.0, and I am recommending it mainly for people in the I-L160 group (which is part of M26).

(The top tree is thanks to Zdenko Markovic and the handwritten tree is my own--Bernie)

Friday, January 3, 2014

(from Sept 2013) Tree for I-L160



(from Sept 2013) Main branches of I-M26

All known I-M26 is either F1915+ or L672+. I-L672 is by far the larger group.

I-L672 can be L160+, PF6947+ or can be I-L672*, with the * meaning no known additional derived SNPs beyond L672.
I-L160 is by far the biggest group, and it has at least 10 SNP defined branches of its own, see the separate L160 tree.
I-PF6947 is a small group known only from Ireland.
All of the I-L672* "France" and "DYS413=19,22" groups and most or all of the "generic" group are more recently related to I-PF6947, not I-L160. We know this from 111 marker results, they share some distinctive marker values, in particular DYS532=9.

(from July 2013) Geno 2.0 Results for I-M26 Maltese gentleman

We now have a Geno 2.0 CSV file for someone with paternal ancestry from Malta (an island in the Mediterranean, between Africa and Sicily/Italy). He clearly is related to the I-M26 people in Sardinia. The question I have is, does he have recent ancestry from Sardinia/Sicily/Italy, or has his paternal line been in Malta for thousands of years?

(from July 2013) L160 Tree with Geno 2.0 SNPs, updated and simplified

We have nine Geno 2.0 results for I-L160 which fall into eight different subgroups. Unless one of your relatives or close matches have done Geno 2.0, it's impossible to predict what group you will belong to. And it's very likely that we will discover new groups when more I-L160 people do Geno 2.0. All of the SNPs on this tree are included in the Geno 2.0 test which costs $199, and they all can be ordered at FTDNA for $39 each. But you would need to order at least two SNPs and probably many more, so Geno is probably a better idea.

(from July 2013) Early Branches of I-M26


Four branches of I-M26 have been known for a few years:
M26+ L672+
M26+ L277+
M26+ L277+ L247+
M26+ (L672- and L277-)

What's new is that several L672+ men and one L247+ man have completed Geno 2.0 tests. We learned that the L247+ man is ancestral for 17 SNPs that previously were thought to be equivalent to M26. Now the M26+ L672- L277- man is doing the Geno 2.0 test, and I expect that he will be on his own branch that split off even earlier, and he will be ancestral for some of the 36 SNPs that currently give results equivalent to M26.

The Mexican-American M26+ L277+ L247+ also had two new SNP results at Geno 2.0: he is F1915+ and YSC0000078+. These occur somewhere in the area of L277 or L247.

(from July 2013) M26 Men in Sardinia

It's well known that around 40% of men on the island of Sardinia belong to the I-M26 paternal haplogroup, but which subgroup? And can that tell us anything about where M26 originated? Did M26 people expand "out of Sardinia"? Or did M26 arise on the continent, and a few M26 made it to Sardinia early in its human history?

It's starting to look like the second scenario is closer to what happened. The I2a Project at Family Tree DNA (FTDNA) has around 8 I-M26 members who were born in Sardinia, or who have fathers or grandfathers born in Sardinia. From testing at FTDNA we know that they are all likely L160+, the I-L160 group is the most common group of I-M26 but there are several L160- branches known only from continental Europe and Britain and Ireland. The design of the new Genographic 2.0 test involved sequencing several men living in Sardinia, so it includes many new SNPs relevant to these men. So far, one Sardinian-American man and one Maltese ancestry man belong to the PF4190 branch of I-L160. I think probably many or most I-M26 men living in Sardinia will belong to this branch, but we will need more testing to be sure.

(from April 2013) Updated I-L160 Tree

Click on picture to enlarge. At this point we have 6 groups of L160+ (and the Z110+ group only known from 1000 Genomes Project sequences). I don't know if F1295+ and PF4088+ are the only basic branches of L160+ or if there will be more. Usually I can tell if someone belongs to Z118+ or Z106+ based on 67 marker results, but we don't have enough results to make predictions for the other groups. About 40% of L160+ belongs to Z106+, but I don't know how big the other groups will turn out to be.

F1295, PF4088  and CTS11338are available for order as individual SNPs at Family Tree DNA, but each costs $39. My recommendation is that L160+ people order the Genographic 2.0 test for $199, it will test all the SNPs on the tree above and many more (except for a few of the Z SNPs and L1299).

(from Jan 2013) I-L160 Tree

Another Geno 2.0 result this week creates another level in the I-M26 tree, and now the L160 section is so large that it requires its own diagram, see above.

Someone with paternal ancestry from the Azores is the first person at Geno 2.0 to test L160+ and PF4088-. We have one English ancestry man and one Welsh ancestry man who have L160+ PF4088+ results from Geno 2.0.

The Azorean person is also F1295+ and PF6950+. He is the first person in I-P37.2 to be derived for those SNPs. 

More later.

(from Jan 2013) I-M26 Tree


This tree includes all known SNPs in I-M26. The format is based on this tree which shows mainly Z SNPs discovered by volunteers looking at 1000 Genomes data:
https://www.box.com/s/5be433c53d36df9ed57a
There are probably millions of men belonging to I-M26, they live in Spain, Portugal, France (up to 5% of men in some parts of these countries); and in descendants of Spanish, Portuguese and Frenchmen in the Americas etc, and they also comprise about 40% of the men on the island of Sardinia.   I-M26 is found in less than 1% of men in Ireland, Great Britain, the low countries, western Germany, Switzerland, the Italian peninsula etc. but it is very characteristic of western Europe and has a long presence there (i.e. 5,000 year old I-M26 skeletons have been found in southern France and near Paris).

There are at least 11 different SNP-defined groups on my tree, but I estimate that over 90% of I-M26 men belong to the green, blue, or red groups. With 67 STR markers, almost everyone can be placed in one of the minor groups, the green group, the blue group, or one of the red groups. But we don't yet have enough results to know if there are STR patterns that can distinguish between the three red groups.

Probably the PF4189+ group will have many more members besides the "1 English Family". But I don't know which of the three red groups will be biggest when we have done more testing. Unfortunately, some of the important tests are not offered by Family Tree DNA (Z105, Z120, CTS11338, PF4189); these are only available as part of the Geno 2.0 test at this time.

In the near future with more Geno 2.0 results we will learn much more about the three red groups. And we will learn if the PF6947+ group is a major part of the green group or not. But unfortunately Geno 2.0 will probably not teach us much more about about the green group--there is surely a SNP out there that is shared by almost all of the green group, but it hasn't been found at Geno 2.0 or in WTYs so far.

(From Dec 2012) Geno 2.0 Heat Maps

Genographic Project 2.0 heat maps for paternal haplogroup I-M26 and I-PF4189. They are exactly the same, and supposedly Sardinia doesn't have these groups. Are the I-M26 people who make up 40% of men in Sardinia considered a different, more specific haplogroup?

Click on pictures to enlarge.




(from March 2011) Z SNPs added to I-M26 Tree


Here is a drawing of the current I-M26 tree (March 2012), click on it to enlarge. I will write more later.

(from 2011) L672 placed in I-M26 Tree


Since my most recent post we have tested the new SNPs in many different I-M26 people from all the major subgroups. All were L673+ and L707+ so I think L673 and L707 are equivalent to M26. (It's possible that these SNPs and L158 and L159 are actually above M26, not equivalent. But the other major groups of I-P37.2 are ancestral (negative) for all these SNPs, there are a few small groups who have not tested all of them yet).

L672 is more interesting, some I-M26 people are L672+ and some are L673- which introduces a new level in the tree. It turns out that all the people with YCAIIa,b=11,21 (or who are close matches to these people even if they have different values) are L672+. The people who are expected to test L277+ are L672-, and these people have YCAIIa,b=18,21 and are quite different in all their markers from the L672+ people. (L277 is placed in quotes because Family Tree DNA cannot make this test work for L277+ people, but we have results from 23andMe).

(from 2011) Tree of Haplogroup I-M26

(Click on tree to enlarge)
This is a hasty phylogenetic tree I drew up showing the major groups of I-M26 (which has been known as I2a1 for the last 3 years or so). This is the group well-known for occurring in around 40% of men in Sardinia. But in the last three years we have discovered several new SNPs that split M26 into more specific groups which occur in different regions, all in westernmost Europe.

The Sardinian M26 seems to be all L160+ which makes an out of Sardinia origin unlikely. Spain and France contain most of the major groups of M26 and I think our M26 ancestors started expanding from somewhere like southern France, up the Atlantic coast to northern France, Britain and Ireland, and probably up the Rhine to Western Germany and Switzerland. A very few M26 made it to Denmark, Norway and Sweden. The single M26 I know of from Iceland has a 37 marker STR haplotype similar to some Irish M26+ L160- people. There are also a very few M26+ people from Jewish families from Poland, Belarus and Odessa, I believe most of these probably go back to Sephardic (Spanish/Portuguese Jewish) roots, and in fact some of the families have a tradition that their paternal ancestors were Sephardic.

Some specifics about the tree: the green triangles show the periods when population have been expanding, the L160+ triangle is the biggest because this group is most common. The dates when the expansions started and when the different groups split is very loosely based on Ken Nordtvedt's work but I didn't attempt to show his calculations very accurately. Not shown is M161 which was discovered before the year 2000 but has never been found in testing at Family Tree DNA or as far as I know at 23andMe. Also not shown is the Z106 SNP, this is a subset of L160+ which probably occurs in about 1/3 of Spanish L160+ and in a much smaller percentage of English and other northern L160+. The L277 SNP is placed in quotes because FTDNA has not been able to sequence the L277 area, our information on L277 comes mostly from 23andMe. Finally, there is yet another SNP called L707 which shows the same pattern so far as L672 and L673, we are currently testing all of these in the L277+ group.

(Note: in the final line of the tree picture "happened in position A or B" should be "B or C")