Open Source · Free Forever

Your Genome,
Interpreted by AI

A genomics analysis server that runs real bioinformatics pipelines on your DNA files and produces structured results that Claude can read, explain, and reason about.

Why This Exists

Your genome lives in formats that are enormous, binary, and completely opaque to AI.

AI

LLMs Cannot Read Your DNA

A whole-genome BAM is 50–100 GB of compressed read alignments. A gVCF packs tens of millions of variant calls in formats optimized for bioinformatics tools, not language models. No AI can open or reason about these files.

?

Consumer Services Are Black Boxes

23andMe and Promethease give you fixed reports. You cannot ask follow-up questions, inspect call confidence, drill into star-allele ambiguity, or compare two variants side by side.

!

Raw Tools Fail Silently

plink2 and bcftools will happily score a GRCh37 VCF against GRCh38 coordinates—no error, clean-looking output, completely wrong results. Both humans and AI agents make these mistakes.

How It Works

The deterministic pipelines do the heavy lifting. The AI does the reasoning.

1

Upload

Register a VCF, gVCF, BAM, or CRAM file from local storage, upload, or URL.

2

Prepare

VCF/gVCF files are converted to plink2 binary format. BAM/CRAM skip this step.

3

Pick a Test

Choose from 280+ polygenic scores, monogenic panels, pharmacogenomics, ancestry, and more.

4

Pipeline Runs

The right validated tool runs with build validation, QC gates, and sanity checks.

5

AI Reads Results

Structured markdown output that Claude can read, explain, and cross-reference.

What It Can Do

38 curated tests, 284 polygenic scores, 51 monogenic panels, 19 pharmacogenomic tests, and 16 validation checks.

P

Polygenic Risk Scores

Score against any PGS Catalog file using plink2 + 1000 Genomes reference panel. Precomputed EUR percentiles with sanity gates: |z|>6 fails, |z|>4 warns, std-collapse detected.

M

Monogenic Screening

ClinVar annotation for Pathogenic and Likely_pathogenic variants. ACMG SF v3.3 panels: Cancer predisposition, Cardiovascular, Metabolism. Auto-cached annotated VCFs.

Rx

Pharmacogenomics

Star-allele calling for CYP2D6 (via Cyrius), CYP2C19, CYP2C9, DPYD, TPMT, NUDT15, SLCO1B1, UGT1A1, and more. Allele verification prevents false positive calls.

A

Ancestry & Haplogroups

PCA projection onto 1000G, ADMIXTURE K=5, Y-DNA haplogroup (ISOGG), mtDNA haplogroup (HaploGrep3), Neanderthal %, runs of homozygosity, HLA typing (T1K).

R

Repeat Expansions

ExpansionHunter v5 calls trinucleotide repeats from BAM/CRAM: FMR1 (Fragile X), HTT (Huntington’s), DMPK (Myotonic Dystrophy). Per-allele counts + clinical classification.

Q

Sample QC & Sex Check

Ti/Tv ratio, Het/Hom ratio, SNP/indel counts via bcftools stats. Sex verification via Y-chromosome reads, SRY coverage, X:Y ratio, chrX het rate.

The Dashboard

A single-page app for managing files, running tests, and browsing results.

Test Dashboard — 23 & Claude
Run Tests
sample.g.vcf.gz (GRCh38, Ready)
PGS: Coronary Artery DiseasePGS000018 · Khera 2018
PGS: Type 2 DiabetesPGS000014 · Mahajan 2018
PGS: Breast CancerPGS000004 · Mavaddat 2019
Monogenic: BRCA1/BRCA2ClinVar · Cancer Panel
Monogenic: CardiovascularClinVar · ACMG SF v3.3
PGx: CYP2D6Cyrius · Star Allele
PGx: CYP2C19Variant Lookup
Ancestry: PCA + ADMIXTURE1000G · 106K sites
QC: Sample Statisticsbcftools stats
PGS Result
Coronary Artery Disease
ScorePGS000018
Raw0.00247
Z-score1.84
Percentile96.7th (EUR)
Variants6,268,514 / 6,630,150
QualityHIGH
Ancestry Composition
Middle Eastern63.4%
South European31.0%
Central/Khoisan African5.6%
mtDNAH1a
Y-DNAJ-M267
F_ROH0.22% (outbred)

What Claude Sees

Every test produces structured markdown that Claude reads natively—no parsing, no guessing.

## Polygenic Risk Score: Coronary Artery Disease
Score ID: PGS000018 (Khera et al. 2018)
Raw score: 0.00247
Z-score: 1.84
Percentile: 96.7 (EUR reference, precomputed)
Variants matched: 6,268,514 / 6,630,150 (94.5%)
Match quality: HIGH
### Sanity Checks
Build validation: PASS (GRCh38 confirmed)
Score SD: 0.00134 (no collapse)
Z-score range: |z| < 4, within normal bounds

Supported Formats & Tools

Battle-tested bioinformatics pipelines under the hood.

Input Formats

FormatPrep TimeBest For
.g.vcf.gz5–15 minHighest accuracy
.vcf.gz5–30 secQuick results
.bamNo prepOn-demand calling
.cramNo prepCompressed BAM

Pipeline Tools

ToolPurpose
plink2PGS scoring, PCA
bcftoolsClinVar, variant queries
CyriusCYP2D6 star alleles
ExpansionHunterRepeat expansions
HaploGrep3mtDNA haplogroup
T1KHLA typing
samtoolsBAM/CRAM processing

Fail Loudly, Never Silently

The pipeline catches the mistakes that both humans and AI agents routinely make.

38

Build Validation

GRCh37 vs GRCh38 mismatch is caught before scoring begins. Sentinel SNP spot-checks confirm coordinate alignment.

z

Sanity Gates

|z|>6 fails the test. |z|>4 triggers a warning. Standard-deviation collapse is detected. Percentiles are capped at [0.5, 99.5].

A/T

Allele Verification

Position-only hits without matching REF/ALT are reported as locus_mismatch, not false positives. gVCF reference blocks correctly resolve to hom-ref.

Free. Open Source.
No Strings Attached.

Not a company. Not a service. Just science you can run on your own hardware.

View on GitHub
284Polygenic Scores
51Monogenic Panels
19PGx Tests
7Pipeline Tools