HRAS

A Data Discovery Tool

Data

Publicly available self-reported uniparental line ancestor origins of samples on the YFull YTree (~42000 unique samples) and MTree (~80000 unique samples)
Publicly available branching structure of the YFull YTree and MTree
Publicly available TMRCA estimates for each branching point
(Future) YSEQ customer samples

Controls

Click the link to see that section's documentation.

General advice

1) Be patient waiting for files to load.

There are several large files that your browser must download.
Any time you change the radius, an additional large file must be downloaded.
However, once you have downloaded these, the files stay in the cache for the future.

2) This application is in Beta.

There could be bugs. Before contacting me to report a bug, please first try reloading the page. If that doesn't work, check the Common User Issues section.

Glossary

TMRCA - Time to Most Recent Common Ancestor
formed/formation - The process where one line begins to diverge from its siblings, evidenced by accumulation of unique SNPs.
target haplogroup - The current haplogroup of focus, indicated by the haplogroup / SNPs input field.
downstream haplogroup - Haplogroups that formed after a target haplogroup. Downstream in this sense means further along the flow of time. Children is a synonym.
haplogroup root - Each haplogroup supported by HRAS is a child of one of a set of haplogroup roots. The haplogroup roots are currently determined by how the YFull YTree and MTree are organized on the public YFull website. For the list of haplogroup roots, see Enter SNP

Common User Issues

Enter SNP

Centroids

DNA type

Map type

Relative Frequency Heatmap (the page says mtDNA but methodology applies to both)

Relative Frequency Heatmap (Classic)

Diversity Heatmap

Leaflet heatmap.js plugin

The controls have been streamlined.
The surfaces may look different for the same haplogroup the radius is now configurable.
The relative frequency estimates in classic may be slightly different because the denominators come from the new codebase (computed in js rather than python) and the legend is calibrated with the more accurate relative frequencies computed by the newer heatmap version.
The orginal diversity map always used the same radius for each sample. The HRAS version applies radius and intensity to each sample based on regional code area according to the same logic for each map type as defined in section Sample Minimum Radius.

Enter haplogroup or SNP

YTree

MTree

Ancient vs Modern Samples

Samples

Country Statistics

Pie Chart showing breakdown of modern samples positive for R1b-DF13 with ENG country code on YFull

Pie Chart showing breakdown of ancient samples from 5000-3000 ybp with GBR country code on YFull

Animation

TMRCA slider

Outliers

J2b-L283 migrations to children showing J-Y146401 centroid computed as within Saudi Arabia by default

J2b-L283 migrations to children showing J-Y146401 centroid computed as Syria/Lebanon after declaring India as an outlier

Centroids

Centroid Computation Algorithm

Centroid Metrics

Theory

indicator

possibility

Migration from Haplogroup Root to Target

centroids

approximate

discrepancies in regional sampling rates
lack or paucity of ancient samples
errors in self-reported ancestral locations
lack of geographic consistency of self-reported ancestral locations
the ever-present possibility of mass migration from a homeland where all men in the original location either moved in the same direction or died out

Migrations from Target to Children

centroids

approximate

discrepancies in regional sampling rates
lack or paucity of ancient samples
errors in self-reported ancestral locations
lack of geographic consistency of self-reported ancestral locations
the ever-present possibility of mass migration from a homeland where all men in the original location either moved in the same direction or died out

Diversification Timeline

TMRCA Time Slider

Sample Minimum Radius

Technical

Example effective relative radii (1-10) for various regional codes

Divide the area of the geographic code's region by 10,000 km2 and then take the square root to get a number between 1-15
Translate this number to a 1-10 scale

HRAS

A Data Discovery Tool

Data

Controls

General advice

Glossary

Common User Issues

DNA type

Map type

Enter haplogroup or SNP

Ancient vs Modern Samples

Samples

Country Statistics

Animation

Outliers

Centroids

Centroid Metrics

Theory

Migration from Haplogroup Root to Target

Migrations from Target to Children

Diversification Timeline

TMRCA Time Slider

Sample Minimum Radius

Technical

Intensity

Future

Geodesic Lines

Migration Animation

YSEQ Samples

Research/Project Links