FHE Aging

About

Zama.ai Bounty for Season 8: Implement an FHE-based Biological Age and Aging Pace Estimation ML Model Using Zama Libraries

Glossary

CpG group := (Cytosine - phosphate - Guanine)
- Notable for role in gene regulation through methylation processes

Age Modelling Considerations

Linear Models

Just polynomials of degree 1, very efficient representation in FHE applications

Graphing in Bioinformatics

Manhattan plot

Microarray Platforms

HumanMethylation450 BeadChip (released in 2008)
Targets over 450k sites across the human genome

Context

Human arrays

Age Prediction Context

Models trained to predict chronological age of tissue based on biomarkers
Delta betwen chronological age and real age used as marker to predict
- Mortality risk
- Disease states
- etc.

Inputs and Outputs for Age Prediction / Age Pacing

Inputs (CgPs)
Outputs (Predicted chronological age)

Datasets and Models

Dataset	Model
Horvath	ElasticNet
AltumAge	Deep Learning-based
PCGrimAge	PCA based version of GrimAge
GrimAge2	Latest version of GrimAge ?
DunedinPACE	Biomarker of the pace of aging

Plan of Attack

1. Test Different Models + Assess Performance

Start with simpler models (linear regression-based clocks) as easier to implement in FHE
Balance accuracy vs compute complexity - some models might use hundreds of CpG sites that would be expensive in FHE
Horvath clock well established but uses elastic net regression with many features
DunedinPACE measures aging pace rather than biological age, which might be interesting but more complex

2. Balance FHE Implementation Feasibility

Linear models most straightforward
Avoid non-linear activation functions, if possible
Consider feature count - Less CpG sites means faster FHE computation
Some biological clocks use relatively few CpG sites (~10-50) which would be ideal (NOTE: Need to validate this)

3. Port to Zama.ai's FHE Libraries

Start with Concrete ML for higher-level abstractions
Need to quantise model (using brevitas-nn / concrete-ml, etc.)
Benchmark acc between original + FHE - expect precision loss (though this depends on implementation / model, etc.)
Reduce multiplicative depth

4. Optimise for Efficiency

Use concrete's compiler to analyse circuit depth / bottlenecks
Precision vs performance trade-off (this one is likely key)
Heavily consider preprocessing strategies pre-encryption to offload computation

5. Deploy to HuggingFace Spaces

Client: Encrypts methylation data
Server: Processes encrypted data without decryption
Client: Receives and decrypts the predicted biological age

Demo + sample data

Number of Features per Dataset (for `pyaging`)

Challenge Data

Datasets

The Illumina HumanMethylation450 BeadChip data
GEO datasets like GSE40279 (often used for Horvath's clock)
TCGA (The Cancer Genome Atlas) methylation data

`dnaMethyAge` R Package - Datasets

27k_reference: probeAnnotation21kdatMethUsed
CBL_common: coefs
CBL_specific: coefs
Cortex_common: coefs
DunedinPACE: coefs gold_standard_means
HannumG2013: coefs
HorvathS2013: coefs
HorvathS2018: coefs
LevineM2018: coefs
LuA2019: coefs
McEwenL2019: coefs
ShirebyG2020: coefs
YangZ2016: epiTOCcpgs
ZhangQ2019: coefs
ZhangY2017: coefs
subGSE174422: betas info

betas: Methylation beta values - actual DNA methylation measurements that serve as input features for the model. X

coefs: Coefficient matrices for different biological clock models. Each named entry represents a different published biological age clock with its trained coefficients. Weights?

probeAnnotation21kdatMethUsed: Annotation data for DNA methlyation probes (CpG sites) used in the models.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.artifacts		.artifacts
concrete-ml		concrete-ml
dnaMethyAge		dnaMethyAge
fhe_models/phenoage		fhe_models/phenoage
prior-submissions		prior-submissions
pyaging		pyaging
pyaging_data		pyaging_data
.DS_Store		.DS_Store
.gitignore		.gitignore
CHALLENGE.md		CHALLENGE.md
LICENSE		LICENSE
README.md		README.md
REPORT.md		REPORT.md
app.py		app.py
compile.py		compile.py
fhe.ipynb		fhe.ipynb
out.csv		out.csv
phenoage.csv		phenoage.csv
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FHE Aging

About

Glossary

Age Modelling Considerations

Linear Models

Graphing in Bioinformatics

Microarray Platforms

Context

Age Prediction Context

Inputs and Outputs for Age Prediction / Age Pacing

Datasets and Models

Plan of Attack

1. Test Different Models + Assess Performance

2. Balance FHE Implementation Feasibility

3. Port to Zama.ai's FHE Libraries

4. Optimise for Efficiency

5. Deploy to HuggingFace Spaces

Datasets

`dnaMethyAge` R Package - Datasets

About

Uh oh!

Releases

Packages

Languages

License

MiscellaneousStuff/fhe-aging

Folders and files

Latest commit

History

Repository files navigation

FHE Aging

About

Glossary

Age Modelling Considerations

Linear Models

Graphing in Bioinformatics

Microarray Platforms

Context

Age Prediction Context

Inputs and Outputs for Age Prediction / Age Pacing

Datasets and Models

Plan of Attack

1. Test Different Models + Assess Performance

2. Balance FHE Implementation Feasibility

3. Port to Zama.ai's FHE Libraries

4. Optimise for Efficiency

5. Deploy to HuggingFace Spaces

Datasets

dnaMethyAge R Package - Datasets

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`dnaMethyAge` R Package - Datasets

Packages