What is a clinical virtual twin?

A clinical virtual twin is a comprehensive virtual model of a patient built from 500+ biomarkers including genomic, metabolomic, phenotypic, behavioral, and environmental data. It enables predictive healthcare by simulating health outcomes and treatment responses with 87% accuracy.

How accurate are BioTwin's health predictions?

BioTwin achieves 87% prediction accuracy across various health conditions including cancer detection, chronic disease complications, and mental health outcomes. Our models are continuously validated through clinical studies and real-world data.

What biomarkers does BioTwin analyze?

BioTwin analyzes over 500 biomarkers across five categories: genomic markers, metabolomic profiles, phenotypic data, behavioral patterns, and environmental factors. This comprehensive approach enables holistic health assessment and prediction.

Metabolomic Fingerprinting from Dried Blood Spots Enables Individual Identification Across 1,257 Participants at 94% User-Level Accuracy

Preprint by Hauguel, Anctil, and Noel demonstrating that metabolomic profiles from dried blood spots are stable enough to identify individuals across 18,288 samples and 134 analytical batches.

Pierrick Hauguel, Nicolas Anctil, Louis-Philippe Noel · April 11, 2026

biometricsmetabolomicsdried blood spotspreprint

Summary

A 1,257-participant cohort with 18,288 dried blood spot (DBS) samples collected over 134 analytical batches and 15 months is used to test whether untargeted LC-MS metabolomic profiles carry enough individual-level signal to identify a participant from a single fingerprick. After batch-aware normalization, supervised feature selection, biological signal filtering, and user-level majority voting across DBS cards, the model reaches 94.1% user-level accuracy and 85.5% sample-level accuracy under 10-fold GroupKFold (group = batch). On a held-out future-batch set of 17 batches, the model reaches 96.1% user-level and 92.6% sample-level accuracy across 1,134 classes, against a chance baseline of 0.088%.

A second contribution of the paper is methodological: the authors show that naive random splitting inflates accuracy because 92.8% of test samples share their (user, batch) pair with the training set. Group-aware splitting is required to measure real generalization.

Why it matters

Most biomarker science still treats lab values as one-shot snapshots. This preprint lays out the case, with data, for treating biology as a trajectory and for evaluating change against an individual’s own baseline. It also frames the protocol and validation discipline that the rest of BioTwin’s research programme is built on.

Authors

Pierrick Hauguel, Nicolas Anctil, Louis-Philippe Noel. All authors are employees and shareholders of BioTwin Inc. PCT patent pending.