Building Accurate Polygenic Risk Scores from Consumer DNA Data
TL;DR: We scored 3,550 disease risk models from a consumer DNA file. We spent weeks trying to make the scores “better” — Bayesian weight recomputation, Ridge regression corrections, GPU-accelerated validation. Every improvement made things worse. The scores were already good. We just needed to be honest about which ones to trust.
Start Simple, See What Breaks
The PGS Catalog publishes 3,550+ peer-reviewed polygenic score models. Each one is a list of genetic variants with effect weights. The math is straightforward: look up a user’s genotype at each variant, multiply by the weight, sum it up, compare against a reference population.
Our first implementation scored every model and produced percentiles using 1000 Genomes Phase 3 as the reference (2,504 samples, 5 ancestry populations). It ran in under 2 minutes.
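The per-model arithmetic fits in a few lines. Here is a minimal sketch with made-up weights, dosages, and reference scores (none of these numbers come from a real model):

```python
import numpy as np

# Hypothetical effect weights from one PGS Catalog model, and a user's
# allele dosages (0, 1, or 2 copies of the effect allele) at those variants.
weights = np.array([0.12, -0.05, 0.30, 0.08])
dosages = np.array([2, 1, 0, 2])

# Raw score: sum of dosage * weight across the model's variants.
raw_score = float(np.dot(dosages, weights))

# Percentile against a reference population's score distribution
# (a made-up sample standing in for the 2,504 1000 Genomes scores).
reference = np.array([0.10, 0.25, 0.31, 0.42, 0.55, 0.60, 0.71])
percentile = 100.0 * np.mean(reference < raw_score)
```

The only subtlety at this stage is choosing the reference distribution; everything else is a dot product.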
Three Compounding Errors
- No ancestry matching. We compared every user against a single EUR reference distribution, regardless of ancestry.
- Allele alignment errors. Some models report weights for the alternate allele, others for the reference allele; we scored some variants backwards.
- Strand-ambiguous inflation. A/T and C/G SNP pairs were complement-flipped instead of matched directly, inflating scores. Our height PRS reached a z-score above 14.
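The second and third errors reduce to one classification decision per variant. A hypothetical helper (`align` is an illustrative name, not our pipeline's API) sketches the decision table, assuming biallelic SNPs:

```python
# Given the model's effect/other alleles and the genotype file's ref/alt
# alleles, decide whether the weight applies directly, must be sign-flipped,
# or is strand-ambiguous (A/T, C/G) and cannot be strand-resolved at all.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def align(effect, other, ref, alt):
    if {effect, other} in ({"A", "T"}, {"C", "G"}):
        return "ambiguous"          # complementing swaps the alleles, so
                                    # strand cannot be resolved from alleles alone
    if (effect, other) == (alt, ref):
        return "match"              # effect allele is the counted alt allele
    if (effect, other) == (ref, alt):
        return "flip_sign"          # effect allele is ref: negate the weight
    comp = (COMPLEMENT[effect], COMPLEMENT[other])
    if comp == (alt, ref):
        return "match"              # same variant reported on the other strand
    if comp == (ref, alt):
        return "flip_sign"
    return "mismatch"               # alleles don't correspond: skip the variant
```

The bug class we hit was applying the complement branch to ambiguous pairs, where it silently swaps effect and other allele and flips the sign of the contribution.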
174GB Database, GPU-Accelerated Scoring
We built a 174GB SQLite database containing all 2.375 billion variant weights across 3,550 models. We constructed a 6.2GB sparse weight matrix for GPU-accelerated batch scoring.
On a Vast.ai build server (256 vCPUs, 503GB RAM, RTX 3060 Ti), we scored all 4,257 QC’d OpenSNP genomes in 7.4 minutes using chunked sparse CSR tensor multiplication via PyTorch.
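The batch-scoring layout can be sketched on CPU with `scipy.sparse`; the production run used the same rows-by-columns structure as a chunked PyTorch sparse CSR tensor on GPU. Shapes and values here are toys:

```python
import numpy as np
from scipy.sparse import csr_matrix

# Rows = models, columns = variants; a cell holds that model's effect weight
# for that variant (zero where the model doesn't include the variant).
rng = np.random.default_rng(0)
n_models, n_variants, n_samples = 4, 10, 3

dense = rng.normal(size=(n_models, n_variants))
dense[rng.random((n_models, n_variants)) < 0.7] = 0.0  # most weights absent
W = csr_matrix(dense)                                   # sparse weight matrix

# Dosage matrix: variants x samples, entries 0/1/2. In production this is
# chunked over sample columns so each chunk fits in GPU memory.
G = rng.integers(0, 3, size=(n_variants, n_samples))

# One sparse-dense multiply produces every model's score for every sample.
scores = W @ G
```

The appeal is that a single multiply replaces 3,550 × N per-model loops, and CSR storage keeps the 2.375 billion weights tractable.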
Validation Results
| Trait | Metric | Ours | UK Biobank benchmark |
|---|---|---|---|
| Height | Pearson r | 0.107 | 0.45–0.50 |
| Red hair | AUC | 0.67 | — |
| Black hair | AUC | 0.63 | — |
| Eye colour | AUC | 0.54 | ~0.95 |
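Both metrics in the table need no special tooling. A sketch with invented phenotype data; the `auc` helper implements the rank identity AUC = P(case score > control score), with ties counted as one half:

```python
import numpy as np

# Hypothetical validation data: computed PRS vs. measured phenotypes.
prs = np.array([0.1, 0.4, 0.35, 0.8, 0.6])

# Pearson r for a quantitative trait (e.g. height in cm):
height = np.array([165.0, 172.0, 170.0, 185.0, 178.0])
r = np.corrcoef(prs, height)[0, 1]

def auc(scores, labels):
    """AUC as the probability a random case outscores a random control."""
    cases = scores[labels == 1]
    controls = scores[labels == 0]
    wins = (cases[:, None] > controls[None, :]).sum()
    ties = (cases[:, None] == controls[None, :]).sum()
    return (wins + 0.5 * ties) / (len(cases) * len(controls))

# Binary trait (e.g. red hair): 1 = case, 0 = control.
labels = np.array([0, 0, 1, 1, 0])
trait_auc = auc(prs, labels)
```

Self-reported OpenSNP phenotypes add noise on top of the model's own error, which is one reason our numbers sit below the UK Biobank benchmarks.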
Bayesian Weight Recomputation
We ran PRS-CSx on 33 quantitative traits using UK Biobank GWAS summary statistics. After 2–3 weeks of continuous GPU compute, the posterior weights did not improve our validation metrics; in some cases they made things worse.
943 Genomes, 95% Failure Rate
Of the 569 PGP genomes we tried to impute, 538 failed: heterogeneous chip formats, Beagle memory exhaustion, bcftools timeouts. Only 84 survived.
Ridge Regression Pulled Everything to the Mean
We trained Ridge regression correction models. Every condition landed in the 40th–60th percentile. Nothing stood out. Nothing was actionable.
Extreme scores are not bugs. They’re the whole point. A 99th percentile Type 1 Diabetes PRS from a model validated at AUC > 0.80 is genuinely meaningful. The problem was never that scores were “too extreme.” It was that we presented low-confidence and high-confidence scores identically.
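Why ridge pulls everything to the mean is visible in the closed form. A toy demonstration (not our correction model): as alpha grows, `w = (X^T X + alpha*I)^-1 X^T y` shrinks toward zero, predictions collapse onto the intercept (the training mean), and every percentile drifts toward the 50th:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(size=100)

def ridge_predict(X, y, alpha):
    """Closed-form ridge fit and in-sample predictions."""
    Xc, yc = X - X.mean(0), y - y.mean()
    w = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ yc)
    return Xc @ w + y.mean()

# Spread of predictions under light vs. extreme regularisation.
spread_small = np.std(ridge_predict(X, y, alpha=0.1))
spread_huge = np.std(ridge_predict(X, y, alpha=1e6))
# As alpha -> infinity the spread -> 0: every prediction is the mean,
# which is exactly a report where every condition reads 40th-60th percentile.
```

With a small validation cohort, cross-validation picks a large alpha, and a large alpha is indistinguishable from reporting the population average for everyone.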
What It Cost
| Item | Detail | Cost |
|---|---|---|
| GPU compute | Vast.ai instances | $200–400 |
| Claude API | 6 parallel domain agents | $100–300 |
| VPS hosting | Production server | $20/mo |
| Domain + SSL | helixsequencing.com | $15/yr |
| Total | | $400–800 |
Where We Are Now
- 3,550 PGS Catalog models with proper allele alignment and ancestry-matched distributions
- Beagle 5.5 imputation expanding ~700K chip SNPs to ~28M variants
- Ancestry detection with per-population model selection
- Raw percentiles preserved — extreme scores kept when model is well-powered
- Zero data retention — all user data deleted after report generation
Lessons Learned
More data doesn’t automatically mean better scores. Adding PRS-CSx weights, Ridge regression corrections, and ensemble methods actively degraded accuracy when our validation cohort was small.
Extreme percentiles are features, not bugs. The mistake is presenting all models with equal confidence.
The validation bottleneck is the real constraint. We have 3,550 models and 2.375 billion variant weights. What we lack is ground truth.
Go slowly. Validate each improvement individually. If it doesn’t measurably improve predictions, it doesn’t ship.