You’ve got your DNA test back, and now you have a million questions to ask. Here is a list of some questions and answers that might help to get you started. Find out about Mickey Mouse, X DNA, Haplogroups, Clusters, DNA companies, GEDmatch and much much more…
Q: Help! I have my test results – what do I do now?
A: First thing most people do is look at their Ethnicity/Origins percentages – and are usually happy or confused! Check out some questions further down about ethnicity, and why it’s just an estimate. After that take a look at your match list. The match list is the most powerful part of a DNA test, it can help you confirm your family tree, breakdown brick-walls, and/or find biological family. Your match list is ranked from highest match to lowest. Matches are categorised based on the amount of DNA you share, the more DNA you share the closer your relationship is.
Most sites will category a match for you but this is an estimate or category – only genealogy will be able to determine the correct relationship. More about how DNA is measured, and how to work your match list, further down.
Q: Why do my ethnicity results not match my tree?
A: Ethnicity Estimates are just that “estimates”. They can relate to your ancestry from hundreds or even thousands of years ago and therefore will not match your paper trail from the genealogical time period. Ethnicity results are calculated by comparing you to a reference group of people (often called a reference panel). These are people that meet certain criteria requested by the test company (e.g. a reference person might have 4 grandparents all born within 50 miles of each other, all with known Ancestry in that region). Your DNA is compared to these people, often by scanning it several times and taking an average of the readings. The results are good for a continental view of where your DNA comes from, but the science is not able to really produce anything more detailed than that (yet!). Companies are continuing to update the reference panel (note that Ancestry have gone from an original 3,000 people to now 40,000 people) and continue to update their ethnicity estimates. Ancestry have updated their estimates in 2019, 2020 and again in 2021. My Heritage have promised an ethnicity update very soon although have added their genetic groups.
Q: I’ve tested at another company, or uploaded my DNA at another company, why are my results different?
A: Each test company has a different way of working out ethnicity (sometimes also called admixture). They all use different a different reference panel and thereby arrange people into different regions. One company might have a category “Great Britain and Ireland” and another one might split that into two regions. If that particular company does not have a reference person for a specific region, then you will never get that region in your results – even if that is what you are 100%. How do you know which one is accurate? Well none of them are “accurate”, choose the one you like the best!
Debbie Kennett did an excellent presentation at Family Tree Live in April 2019 that covered everything you need to know about Ethnicity – it is free here online at YouTube
Q: Should I download/upload my DNA to other sites? And which sites can I upload to?
A: Quite often in Facebook groups as soon as someone mentions they have their results back, people start commenting to “Download your raw file and upload to ….” . Whilst uploading to other sites can be a good idea, it can be easier to learn all the necessary details and jargon at the test site before tackling other sites. Always check the T&Cs and privacy settings of the sites you want to upload to, and ensure you are comfortable with them. ftDNA and GEDmatch also have law enforcement matching that you may want to opt-in or opt-out of. Many sites accept your DNA for free and some request payment for analysis and use of any tools they provide. Not all upload sites accept files from all test companies, so always check which company files can be uploaded.
Some common sites accepting DNA uploads:
- ftDNA there is a small fee to unlock their chromosome browser and access to My Origins
- My Heritage – if uploaded before Dec 2018 then grandfathered into tools for free – after that date there is a fee for DNA tools (if you are not a subscriber). Sometimes they have had free upload days where you would also be grandfathered in
- LivingDNA (for matching only, but you can upgrade for the regional ethnicity breakdown for a fee. Matching is still limited with most people only having a small number)
- GEDmatch, WeGene, Geneanet and Geni.
Q: What does my match list mean, how do I use this information?
A: Your match list is usually arranged in order of highest match at the top and lowest match at the bottom. The test sites categorise your DNA into groups of cousins. Usually close family are listed as Immediate or Close Family, the names can be slightly different at each site. Although a match might be in the say 2nd cousin category, it may not be a 2nd cousin as various relationships fall into the same category due to the amount of DNA Shared. Things like half relationships, once or twice removed cousins, or a cousin marriage in that branch can mean the relationship is different than the category it is in. It’s key to learn about how to predict a relationship based on the cM shared (cM stands for centiMorgan, which is a complex measurement of the amount of DNA you share with someone). At Ancestry you can do this by clicking on the cM amount in your match list – this will open up a chart with the list of possible relationships and the probability of each. The more DNA shared, the higher the cM amount and therefore the closer the relationship with the match.
For matches at other sites there is a fantastic third party resource called the Shared cM Project . There is more about this site in a couple of questions further on. Some sites (e.g. 23andMe) show your shared DNA as percentage rather than cM (although you can see the cM by looking further into the detail of your match). The SharedcM tool will take percentages as well as cM amount.
Remember: The higher the cM amount the closer the most recent common ancestor (mrca) is.
Q: What does shared cM, largest cM and Segment numbers mean?
A: cM = centiMorgan, a complex calculation to work out how much DNA you share. The test usually reports the total amount shared as either a cM number or a percentage (23andMe). They also usually show how many segments this includes. A segment is just a block of DNA you share. For beginners the number of segments doesn’t really help determine a relationship. In distant relationships the number of segments is usually 1 (ie just one small block of DNA is shared), and in closer relationships usually more. The segments are often on one or more chromosomes – we’ll cover off chromosomes later. Beginners can probably ignore segment numbers for now. Some sites will show a third number, and that’s the largest block. So a match might be 40cM / 2 segments and 25 longest block. This means the total amount of DNA Shared (the most meaningful number) is 40cM. The DNA is on 2 segments, one is 25cM (the longest block) and so the other must then be 15cM. The most relevant number is the amount of total shared cM as this will help you predict a relationship.
Note: DNA results of someone from an endogamous population will find the longest segment information the most useful.
Q: How do I know what relationship the match is to me?
A: As above the test site will give you a prediction, ie 2nd cousin, 3rd cousin etc. But these are high level estimates only – based on the cM number. There are charts available to help you look at various possible relationships. The ISOGG wiki (International Society of Genetic Genealogy) has a huge amount of useful data, one of those is the Autosomal sharing statistics. The Autosomal DNA Statistics page has three charts, one that shows the various relationships in a family tree, one that shows the average likely relationship based on cM shared, and the last chart has a range of reported cM numbers by many people who have been confirming their DNA matches. The ranges are really helpful and can see that some cM amounts could mean many different relationships.
The third party tool previously mentioned is the SharedcM Project and is available in the tools section of the DNApainter site, this is really useful as you can type in the cM amount of your match and all the various possibilities will be highlighted. This is invaluable for genetic genealogists – so make sure you bookmark this site. As previously mentioned, if you have tested at Ancestry they now have a chart that pops up when you click on the cM amount of your match.
Once you have a predicted relationship, it is only by building family trees that you can work out exactly what the paper trail relationship is. Note that if your paper trail says someone is your second cousin, but the DNA does not fit into the range reported, then you have a mystery to solve. Maybe they are only your half second cousin, in this case look for other shared matches to see what might be going on, are they also matching lots of unexpected families? (or are you not matching a lot of your own family?). Unfortunately DNA can produce some surprises, not all of them good ones.
DNA research requires a lot of traditional tree building, so it helps to also get skilled up on traditional genealogy research.
Q: How can I work out if my match is paternal or maternal?
A: The test cannot tell you this. The best way to do this is to test a parent, as then any match that doesn’t match them, must be the other side. Of course not everyone can test a parent, so the next best is an aunt or uncle – they will help work out which matches are their side, but you cannot make the same assumption about those they do not match … some of those could still be their side, they just did not get the same DNA as your parent did. First cousins can also be helpful as then you can pinpoint Grandparents that the match belongs to. Once you start confirming matches, shared matches (also called ICW, In common with) will really help. As I’ve confirmed a lot of 3rd cousins as soon as a new close matches comes in I can click on shared matches and almost place the new match immediately, based on who they share DNA with in my list. Be aware that many genetic genealogists have found up to 30% of their distant matches do not match either of their tested parents. That means we can have many “false positives” in our match list. Our DNA only comes from our parents, so if a match does not match either of them, then it is a false positive.
A good blog on this topic from Debbie Kennett
Q: Will I match DNA with all my cousins?
A: No you won’t! Currently it’s never been found that a 2nd cousin will not share DNA , so if you are not sharing DNA with a known second cousin (who has also tested) then you have a mystery to solve, something is not right. But more distant than 2nd cousin and it’s possible to no longer have any DNA from the distant ancestor (see further down for details of how DNA inheritance works). This is generally around 10% of 3rd cousins who will not match you, up to 50% of 4ths and 70% of 5ths it can vary of course, but these are high level numbers, so don’t be surprised if a known 4th cousin does not share DNA with you, it does not mean your paper trail is wrong and more research is needed to confirm the branch.
Info from ISOGG Wiki on Cousin Statistics
Q: Can I inherit DNA from my grandparents that has skipped my parents? Does DNA skip a generation?
A: No this is not biologically possible. Everybody gets a random 50% of their parents, which is a random “recombined” version of their DNA they got from their parents. This means that you do not end up with an exact 25% of your grandparents, and also explains why your siblings will have different DNA than you, in fact they will have a different ethnicity estimate and different matches (there will be some commonality though). Here is my chart showing how DNA inheritance works
And here are the ethnicity estimates, from the Ancestry site, for my Dad’s five siblings (proven full siblings)
Q: None of my matches seem to have family trees attached, and none of them are answering my messages – has DNA testing been a waste of time? Why do people bother to test and then not share anything?
A: No most certainly your test has not been a waste of time, and this has been the most common complaint from genetic genealogists over the past two years. Here are the cold hard facts: when someone got given a DNA test, or paid for a DNA test, or was requested by a family member to do a DNA test … at no time during that process were they buying into communicating with other people, replying to messages, sharing their tree, or for some even bothering to look at it again after they first got it! It’s not for us to wonder why, or have unrealistic expectations that everyone is like us and want to get about researching their tree. Also remember that many testers are adoptees and foundlings, firstly they do not have a tree to share, and secondly they are often warned off communicating too much about being an adoptee, sadly not everyone wants to help them. So what can we do about it? Well as it turns out, actually quite a lot!! If you have a match with a private tree there are tricks to figure those out too, check the next question for private trees .
Researching DNA matches without trees – my long list of things to do:
– open the dna profile page and check if they do have a tree but just not linked (Ancestry) <- about 75% of the people with no trees, do actually have a tree. (If you’ve not linked your DNA to a tree.. go and do it now!)
– go to their main profile page at Ancestry, look for clues as to how long they have been researching (if a long time, then they may have a tree on another site)
– check their main page for age range and location … sometimes people put heaps of really useful info on their profile
– google their name/ profile name using some of the clues you may have found – looking for possible hits on who they are. Learn to do advanced google searches. For ftDNA matches google their email address to see if they have a tree somewhere else, or family information online .
– write all your clues down, tick them off or cross them out as you validate them/invalidate them.
– look for possible trees at wikitree, geni, genes reunited, My Heritage, FMP, etc..
– if you’ve now established a possible name, check 192 dot com, white pages, electoral rolls, pipl, etc … where you might find details about them
– search for Online obituary’s (especially useful if the match is in the US as they often list all their family in detail)
– bear in mind woman testers often use their married name
– on the matches DNA profile page, click on “shared matches”… who else do they match ? You might be able to research the group of shared matches together, working off clues from the shared matches trees. Shared Matches can be gold when trying to identify and solve a match without a tree.
– if they are at Ancestry check if they’ve uploaded to My Heritage, ftDNA or GEDmatch.. can you identify them there (if you can now you will have an email address, for more google searching) .. and also a new set of people who match you both
-start building a tree for them , keep it private and unsearchable .. it’s “quick and dirty” so add hints, extra people, whatever you need to try and establish who your match is and their ancestors. This type of tree is often called a “Quick & Dirty tree” – due to the fact you are doing it quickly (accept hints if using an online tree), and dirty in that you are not looking for sources at this point you are copying other peoples tree or using hints.
This process might take say 2 hours, you identify your match and build their tree and find the common ancestor (quickest I’ve done it is about 45mins, as lots of clues fell into place quickly)
BUT … Keep in mind you may get nowhere, in which case you research hasn’t gone to waste … maybe tomorrow they decide to create a tree, or they get a new shared match that opens up better clues. (Keep your notes! I keep mine in OneNote)
As you start pulling together your hypothesis of who the match is, and how they relate to you, always go back and try and disprove it, don’t get caught out with confirmation bias where you only look for clues to prove your theory, more importantly look for clues to disprove it! This includes once you find the match, to go back and check that the DNA could not have come from another relative!
If you are looking at what the predicted relationship might be, see the earlier questions on how to do this.
Recently some people have challenged the privacy and ethics of researching people in this way. In no way are you doing anything illegal by searching public information to find out about your match (your cousin in fact). But please keep your research private, if you come across something your match (your cousin) probably doesn’t want published, please think of them and be ethical about sharing information you’ve discovered. Any speculative trees you make should be private and unsearchable. If a situation seems delicate, please treat it as such.
Here are some more resources on how to identify your match and/or create quick and dirty trees
- Crista Cowen (Ancestry) using quick and dirty trees https://youtu.be/VP8rUlZbmeA
Q: My match has a tree, but it’s private and they are not responding to messages, how can I work with this match?
A: Now the first thing you need to know about private trees is that when you search your Ancestry matches, the results include private trees. So by quickly doing a few searches, using names in your tree, you can establish which names they also have. This can help to confirm a branch they may be related on. Also use some of the tricks and tips above and especially look at Shared Matches … this can tease out exactly where this match might sit in your tree.
Q: At Ancestry some of my matches have a green leaf … does this show where the DNA connection is?
A: These are Common Ancestor hints – but note the word Hint, because that is all they are. They work based on the tree you have attached your DNA to. If you haven’t attached your DNA to a tree, and you have a tree, go and do it now! It will really help 🙂 The hint picks up an ancestor that it thinks is both in your tree and your ancestors tree, but sometimes the “same name” isn’t actually the same person, so you must check. Also if your match just copied your tree then you will, of course, also have the same name in your tree. My old saying is that if you have Mickey and Minnie Mouse in your tree and your match copies it, then you will get a Common Ancestor hint showing you are descended from the Mouse family – of course you are not.
So all Common Ancestor hints require further research but they are often a very good place to start. Ancestry have taken this one step further now and create their “ThruLines”. These work in the background, stitching together your tree, your matches tree, and other trees in the Ancestry database – looking for how you and your match may be connected. They can be excellent, but they can also be wrong (very wrong) – so please take care to always do more research on a ThruLine. Crista Cowen (Ancestry) has presented a great video on What are Thru lines
Q: I’ve uploaded to GEDmatch, but I’m lost and it’s confusing, what does it all mean?
A: GEDmatch is a third party site that was previously run by volunteers, it is more technical than the test sites themselves and can often appear confusing. At the end of 2019 GEDMatch was sold to Verogen. Details here. During 2019 GEDMatch came under fire for allowing Law Enforcement use of their database – without properly alerting their users. Much of the Law Enforcement advancement into Genetic Genealogy came around the time the Golden State Killer was identified by using genetic genealogy. Many other doe cases & law enforcement cases have followed – often with much discussion from the Genealogy community who have been using DNA databases for genealogy and not expecting their use for law enforcement. Everyone has different opinions on what should & shouldn’t be allowed. Ethically this is something each genetic genealogist needs to determine for themselves. Genetic Genealogists should also act ethically with kits they manage for other people – is there Informed Consent for your testers and have they agreed to what you do with their DNA data. (Note ftDNA also now allow Law Enforcement to match kits in their database). At both ftDNA & GEDMatch you can opt-in or opt-out of Law Enforcement matching.
GEDMatch process: Files can be uploaded from the test sites and you can then compare your DNA to someone who has tested on another site. Download your raw DNA from your test site (usually this needs to be done on a PC or Laptop and not a device or tablet. If you subscribe to a Cloud Service like Dropbox you can download it there, but the file will NOT download to your iPad or other device). Once uploaded it can take up to a day for your file to process (“tokenise”) at GEDMatch but you will immediately get a kit number. Older style kit numbers have one letter at the beginning denoting the site the original test was at, e.g. A=Ancestry, M=23andMe, H=MyHeritage, T=ftDNA). New kits have two random letters at the beginning of the kit number – but the testing company name is asked for on upload and is shown on some GEDMatch reports). As soon as you’ve uploaded you can run a one-to-one report, this means inputting two kit numbers and checking to see if they have DNA in common. After the kit has tokenised you can then run all the other types of reports including the one to many. The one-to-many report will list up to 2000 other kits in the database that you match. Like the test sites it shows shared DNA amount, and longest segment, and a predicted relationship expressed in generations. When you do a one to one match with someone it will show you the chromosomes you match on, and the number of generations back to a common ancestor. For the most part what actual chromsomes you match on are irrelevant, whether it’s chr 1 you match on or chr 9, there is no difference to what the match means. The mrca (most recent common ancestor) is a prediction only. To work out the generation of the most recent common ancestor, count back starting with 1 being your parents, 2 = Grandparents, etc.. I recommend spending time learning more about GEDmatch – there is a lot of good learning resources in this link (including my own blog for beginners). There is also a great video in the link …watching YouTube videos really helps with understanding, I highly recommend it. https://isogg.org/wiki/GEDmatch
Q: What is X DNA and how can I use it in my research?
A: X DNA is the DNA on the the X chromosome (sometimes called chromosome 23) . We all have 22 pairs of chromosomes from our parents (these are called the Autosomes, hence the term Autosomal DNA) – one of each chromosome from our mother and one of each from our father. A mother then also passes a recombined X DNA from her two X chromosomes and the father determines the sex of a baby by either handing down an X chromosome to make a girl, or a Y chromosome to make a boy. This means a male only has one X chromosome and he must have got that from his mother. This makes it a good tool to help sorting out your matches, as if you are a male tester and match someone that shares significant DNA on their X chromosome, then it must come from your maternal side. However the X chromsome can be tricky and may recombine slower than other DNA, therefore caution is advised … most experts say that the X DNA should be at least 15cM before being meaningful, and if you don’t share any DNA other than X, then it may be a very distant match that you would be chasing. When you have a match with X DNA you can fill out an “inheritance chart” to help highlight where the DNA could have come from (as it cannot be passed from a father to son). There are two charts, one for a Female as she will have X DNA from her mother and father) and one for a Male (as he can only have X DNA from his mother) . One of my favourite X DNA blog posts is here, and it includes the inheritance charts. X Marks the Spot …
Q: I’m working with my match and we both have solid trees, why can we not find the common ancestor, or why can we not find a common surname?
A: here is my list of reasons why you may not find a common ancestor.
If you cannot figure out how you are related to somebody, because their tree and your tree do not have the same names in it – then it could be for one of the following reasons:
1. Your tree is wrong or does not go far enough back to find a common ancestor. Keep building out your tree, you should be able to get a rough prediction of where the most recent common ancestor is, based on the amount (cM) of DNA you share.
2. Their tree is wrong or does not go far enough back to find a common ancestor … in this case you need to do the research in their tree yourself. I do this by starting a private (and unsearchable tree), and starting with their parents or grandparents build up the tree trying to find a location in common. Don’t get too hung up on surnames, look for places where your ancestors have crossed paths
3. There is an NPE (non paternal event) in your tree. If you are finding a lot of trees that don’t match your own, or family that have tested that do not match you as expected, then maybe there is a broken line where the father named in the source document is not the actual father.
4. There is an NPE in the matches tree. In this case look at shared matches, and see if you can find where the matches tree might be wrong. In these cases it’s often easier to contact the match and try and research with them.
5. The common ancestor is back before the genealogical time frame, i.e. a segment of DNA has passed through the generations unbroken and you will never find the common ancestor. These segments of DNA are often called sticky segments (although that term is going out of favour). There are specific known areas that are prone to this (called known pile up regions). Ancestry’s matching algorithm does remove those.
6. The DNA match is false, a small match under say 10cM may just be a coincidence rather than a real match.
Q; The same match at another site seems to share a different amount of DNA with me – how can that be?
A: Each test site uses a different matching algorithm to create what they call a match. Ancestry has a sophisticated algorithm they call Timber and it down-weights segments of DNA that are considered to be shared due to a population rather than a recent common ancestor. This means that the Ancestry site might report a lower number than another company. FtDNA set their matching amount quite high but then add in all the tiny amounts >1cM, this means that most companies report a lower number than ftDNA.
Q: What is a Haplogroup? And how do I find mine?
A: Haplgroups come in two types, maternal and paternal, and they pertain to our ‘deep ancestry’ ie they show our maternal (mother’s mother’s mother’s… line) or paternal line (father’s father’s father’s line …). The haplogroup is a type of “family name” for people who have similar DNA “Mutations” (mutations shouldn’t be considered bad here, it just means the code you have at specific parts of your DNA) .
Everybody can be assigned a maternal haplogroup if they test their mitochondrial DNA. This a special type of DNA that surrounds the nucleus of the cell, it is not on a chromosome. Everybody has mitochondrial DNA (MtDNA) but they only get it from their mothers. Men do not pass down any MtDNA . So males and females can take the test, and sometimes a high level haplogroup can be assigned by testing a very small part of your mtDNA and this is often combined with a autosomal DNA test (23andMe and LivingDNA do this). Usually when this is done you would not get any MtDNA matches, just the high level haplogroup. ftDNA do full MtDNA tests that give you a final haplogroup and matches. But one note of caution with MtDNA tests, MtDNA mutates very slowly so when you get an extremely close match, it could mean the most recent common ancestor lived over 1,000 years ago. So this may not be the best test to take if you are looking for matches within the genealogical timeframe. More info here at ISOGG https://isogg.org/wiki/Mitochondrial_DNA_haplogroup
The second type of haplogroup comes from the Y chromosome, and therefore the test is only available for males. Many woman ask their brothers, fathers, or uncles to take the test for them. Like MtDNA a high level haplogroup can be estimated, but more detailed testing is required to get a more detailed haplogroup. FtDNA also offer Y DNA tests, both the marker test (STRs) and SNP tests. When you match someone with the same haplogroup, this does NOT mean you match each other within the genealogical timeframe, or that you even match each other on your father’s father’s father’s line. Within your family tree, you would have many many haplogroups, but when testing yourself you are only seeing the very limited lines of father’s father’s father’s line… or mother’s mother’s mother’s lines. Haplogroups are amazing and interesting, as are Y DNA and MtDNA tests, so it’s well worth finding out more if you want to go down this route. More great information on Y DNA tests here from ISOGG https://isogg.org/wiki/Y_chromosome_DNA_tests
Q: I’ve found several matches that all match me at the same location on the same chromosome – does this mean we all have the same common ancestor?
A: You cannot answer that until you do further steps. You must do one on one matching with each match (with each other) in turn. So for 4 matches you need to check A-B, A-C, A-D, B-C, B-D, C-D. This is because you are looking at a pair of chromosomes stacked on top of one another, your paternal chromosome and your maternal chromosome. Some of the matches will be paternal to you (they will all match each other, but not everyone) and some will be maternal to you (they will match each other but not anyone in the first group). Each group can now be considered to have the same common ancestor, but you need to figure out which group is maternal and which is paternal. They must all overlap for at least 7cM or more to be considered a real match. Note that this can be easily done at GEDMatch, My Heritage and 23andMe where you can check your matches against one another, but at ftDNA you can only check matches against yourself! Ftdna have a tool called the matrix which can be used to see if you all match each other, however you can only make an assumption as the matrix does not show the chromosome number, just high level whether someone matches another. At Ancestry you cannot see any chromosome or segment data, therefore you need to use Shared Matches, but again you will not be able to confirm if they are all the same common ancestor.
Q: How small a match should I go chasing to find a common ancestor?
A: it’s personal preference really. Anything under 7cM is considered to be IBS (Identical By State, or sometimes said Identical By Coincidence). That means it has more likelihood of being a false match. I personally don’t chase matches that are likely more distant than 4th cousin, this is firstly due to the number of matches I have across many kits, but also the chances of actually finding a common ancestor at that range becomes much much harder … bearing in mind that source documents are harder to find, women’s maiden names may not be recorded, and there is more chance of errors on trees, and NPEs causing confusion. Even if you find a common ancestor with a potential 8th cousin (because you have the same tree, bear in mind the Mickey Mouse theory above, but also at such a small amount of shared DNA you cannot confirm the DNA was from that particular ancestor and not another unknown ancestor branch on your tree). I leave small matches alone.
Here is a great blog regarding small matches a small segment round- up
Q: There is no chromosome browser at Ancestry – do I need one?
A: Although a chromosome browser can be useful to solve some particular puzzles, for the majority of work for genealogy purposes you will not need a chromosome browser. Although this can be hotly debated in genetic genealogy groups. In my opinion the best and most productive method of using your DNA matches is to work with your DNA groups of shared matches, build a quick & dirty tree and see how they are connected. The common ancestor of a group of shared matches is most likely your common ancestor.
An interesting blog by Diahan Southerd on using Chromosome Browsers
Q: I’ve heard about Clusters and the Dana Leeds method – what is that about?
A: Using Shared Matches has become the powerhouse of DNA research and “Clustering” is now the validated method of working with your DNA matches.
Generally speaking a group of matches that also match each other is likely to have the same common ancestor. This common ancestor is also likely to be your common ancestor. The theory is that if you solve the group (or cluster) of shared matches, you can work out your connection to them. If the common ancestor of the group is not in your tree, you may have a mystery to solve.
Finding shared matches is quite easy on most of the test sites. Ancestry and My Heritage go a step further and allow you to colour code your matches into groups. This can be really helpful as you can colour code your matches however you like. I suggest you colour code them based on the group of shared matches. This type of methodology was documented by Dana Leeds in a series of her blogs. You take your top match, check the shared matches and then colour code them all the same colour – thereby creating a cluster ready for you to try and identify yours (and their) common ancestor. The Leeds method suggests starting with matches of 400cM and clustering to 90cM. If you don’t have many high matches it is better to cluster down to about 40cM or even 30cM. Any lower and it can become quite complicated.
A great third party tool available to automate this is called Genetic Affairs. By using the Genetic Affairs site you can create specific DNA cluster charts to help your research. This same tool is embedded in the My Heritage tools and also available via the tier 1 tools at GEDMatch. Genetic Affairs is supported by a Facebook User Group.
Clustering your shared matches is going to be the most useful part of your DNA research.
During 2020 I recorded a video on how to do clustering and use this method to solve your matches, and to solve unknown parentage.
Q: What is DNAPainter – I hear this all the time, but what actually is it?
A: One of the most useful third party sites – DNAPainter is a fantastic site that houses not just the DNAPainter tool itself but also the SharedcM Project chart, the WATO (What Are The Odds) tool and more.
- The actual DNAPainter tool is for mapping your chromosomes. This means taking segment data where you share with your match and “painting” it on to a blank chromosome. If you know the common ancestor of that match, then you now know that the DNA on that part of your Chromosome is from that ancestor. You need segment data to do chromosome mapping. Segment data is available at ftDNA, My Heritage, GEDMatch and 23andMe (but not at Ancestry). The tool is supported by a Facebook User Group.
- WATO – this tool is fantastic to help work out the mathematics of placing yourself amongst your DNA matches. Invaluable if you do not know you connect to a group/cluster of matches. You start the tool by adding the common ancestor of the group and charting out how your matches connect to that ancestor. Then you add various hypothesis points to the chart (or click the button to have them automatically added)- each hypothesis gives you a score against the previous hypothesis – as you work on it you can gain more clarity about where you are likely to fit with this cluster. The tool is supported by a Facebook User Group.
Some other helpful links:
– Acronyms – here is a list of them all https://isogg.org/wiki/Abbreviations
– GEDmatch admixture calculators – a great explanatory blog http://genealogical-musings.blogspot.co.uk/2017/04/finally-gedmatch-admixture-guide.html
Some of my favourite blogs and bloggers:
An unscripted interview with me at Family Tree Live
Some of my beginners talks online Please note some of these were filmed with a live audience and there will be some, hopefully minimal, background noise
And of course my Facebook Group that was set up to help both beginners and advanced users, specialising in those with UK ancestry.