How DNA analysis combined with family tree research is revolutionizing criminal investigations
In 2018, a decades-old mystery that had haunted California was finally solved. The Golden State Killer, a man responsible for at least 13 murders and over 50 sexual assaults, was identified after evading capture for over 40 years. The key to his identification wasn't a sudden witness or a misplaced fingerprint, but a revolutionary technique that has since transformed criminal investigations: forensic genetic genealogy (FIGG) 6 .
We all share portions of our DNA with our relatives. The closer the relationship, the more DNA is shared 1 .
By examining genetic markers, experts can identify distant relatives and build extensive family trees 6 .
This powerful method blends modern DNA analysis with traditional family tree research, creating an unprecedented tool for solving cold cases and identifying unknown victims.
The process begins with the collection of biological evidence from a crime scene—this could be blood, skin cells, saliva, or hair follicles containing nucleated cells 2 .
Traditional forensic methods used in the Combined DNA Index System (CODIS) typically analyze about 20 short tandem repeat (STR) markers. While excellent for direct matching, this approach lacks the resolution to identify more distant relatives 6 .
FIGG, in contrast, leverages next-generation sequencing (NGS) technologies to examine hundreds of thousands of single-nucleotide polymorphisms (SNPs)—single-letter changes in the DNA code scattered throughout our genome 1 3 .
The transformed genetic data is then uploaded to genetic genealogy databases such as GEDmatch or Family Tree DNA, which allow users to voluntarily share their DNA data for research and comparison purposes. When the unknown DNA from a crime scene partially matches with multiple individuals in the database, it indicates a familial connection 6 .
Database algorithms identify individuals who share significant stretches of identical DNA with the crime scene sample.
Genealogists use public records to build extensive family trees connecting the genetic matches.
Investigators eliminate branches where no individual fits the profile based on age, location, and case specifics.
A single individual is identified as the potential source, and traditional DNA testing confirms the match.
| Feature | Traditional CODIS Analysis | Forensic Genetic Genealogy |
|---|---|---|
| Genetic Markers | ~20 STR loci | Hundreds of thousands of SNPs |
| Primary Use | Direct matching to known offenders | Identifying distant relatives |
| Database | Convicted offender/arrestee profiles (CODIS) | Consumer/volunteer databases (e.g., GEDmatch) |
| Investigation Path | Direct "hit" or exclusion | Build family trees from genetic matches |
| Typical Timeframe | Relatively fast (if a match exists) | Can take weeks or months of research |
The identification of Joseph James DeAngelo, the Golden State Killer, serves as the quintessential "experiment" that demonstrated the power of forensic genetic genealogy to the world.
The success of this case catalyzed widespread adoption of FIGG in law enforcement.
The success of the Golden State Killer investigation proved that a viable DNA profile could be obtained from old evidence and that even distant familial matches could provide a viable path to a suspect's identity. It highlighted a critical shift in forensic science: from merely matching a known individual's DNA to inferring identity through a web of biological relationships.
This case's immense impact is reflected in the subsequent adoption of the technique. Since 2018, forensic genetic genealogy has been used to solve hundreds of cold cases, both for identifying perpetrators and giving names back to unidentified victims, such as the "Boy in the Box" from Philadelphia, who was identified in 2022, 65 years after his body was discovered 6 .
The practice of FIGG relies on a sophisticated combination of laboratory technology, bioinformatics, and genealogical research.
A laboratory tool used to genotype a DNA sample for known SNPs simultaneously. It is the workhorse for generating the raw genetic data used in genealogy databases 8 .
Online platforms where users can upload their raw DNA data. These databases perform the initial comparison between the unknown forensic profile and the database's users to find relatives 6 .
Computational methods and software for processing raw sequencing data, calling SNP genotypes, and preparing the data for upload in the correct format for genealogy websites 5 .
The power of forensic genetic genealogy is undeniable, but its use raises profound ethical and privacy concerns that society is still grappling with.
When an individual uploads their DNA to a consumer service, they are making a personal choice. However, that choice has implications for their entire biological family. Your genetic data is inherently relational; by testing yourself, you are also revealing partial information about your parents, siblings, children, and even distant cousins.
This creates a "genetic dragnet" where thousands of people who have not consented to law enforcement searches can inadvertently lead investigators to a suspect 8 . Critics argue this constitutes a mass surveillance technique that challenges traditional notions of privacy and probable cause.
The history of genetics is marred by misuse, from eugenics to racial pseudoscience. There are fears that the growth of genetic databases could enable new forms of genetic discrimination 8 .
Currently, the main U.S. law guarding against this is the Genetic Information Nondiscrimination Act (GINA) of 2008, which prohibits the use of genetic information in health insurance and employment. However, GINA does not cover life insurance, disability insurance, or long-term care insurance, leaving significant gaps in protection 8 .
Furthermore, the databases themselves can be targets. The 2023 data breach at 23andMe, which exposed ancestry information of millions, highlights the vulnerability of this sensitive data 8 . The ethical application of FIGG requires a careful balance, using it for serious violent crimes while establishing clear guidelines to prevent misuse and protect civil liberties.
Forensic genetic genealogy is evolving from a novel technique into a sophisticated discipline. Advances in computational genomics and the continuous growth of genomic databases will only enhance its power and resolution 1 .
We are also seeing the emergence of more advanced, and controversial, techniques like forensic DNA phenotyping (FDP), which aims to predict a person's physical appearance (e.g., eye, hair, and skin color) and biogeographic ancestry directly from their DNA 3 .
While FDP can provide investigative leads where none exist, it is fraught with scientific and ethical problems. Predicting complex traits like facial structure is highly imprecise, and critics warn it could reinforce racial biases if not applied with extreme caution .
The future of the field will undoubtedly be shaped not just by technological innovation, but by the ongoing public and legal dialogue about establishing robust frameworks that safeguard privacy and prevent discrimination while pursuing justice.
Key areas for development include:
The future of forensic genealogy will be shaped not just by technological innovation, but by the ongoing public and legal dialogue about establishing robust frameworks that safeguard privacy and prevent discrimination while pursuing justice.
Forensic genetic genealogy has irrevocably altered the landscape of criminal investigation. It serves as a powerful tool to deliver long-awaited justice to victims and closure to families, cracking cases that once seemed unsolvable.
Yet, this powerful genetic blueprint forces us to confront difficult questions about the trade-offs between public safety and individual privacy, between the pursuit of justice and the protection of our most personal information. As this technology continues to integrate into our justice system, its responsible use will depend on a shared commitment to strong ethical standards, transparent practices, and comprehensive legal safeguards that keep pace with the science itself.
References will be populated here with proper citations.