Open Source · GPL-3.0

Predictive Models
from Omics Data

Discover minimal-feature biomarker signatures from high-dimensional omics data using evolutionary algorithms — interpretable, parsimonious, and blazing fast.

0x
Faster (Rust)
<1%
Feature selection
0
Search algorithms
Open
Source & Free

From Data to Discovery

Four steps from raw omics data to validated, interpretable predictive models.

1
Upload Data

Import TSV/CSV omics matrices with training and optional test sets

2
Configure

Choose algorithms, model languages, and fine-tune parameters

3
Analyze

Launch evolutionary search with real-time progress monitoring

4
Explore

Inspect models, feature importance, jury voting, export reports

Designed for Discovery

From raw omics matrices to publication-ready models in minutes, with full transparency at every step.

Parsimonious Models

Discovers minimal feature sets (binary, ternary, ratio) that achieve high classification accuracy. Models you can interpret and publish.

🧬

Evolutionary Search

Genetic algorithms, beam search, and MCMC heuristics explore the feature space efficiently to find optimal biomarker signatures.

🗳

Jury Voting

Ensemble of expert models with majority voting, rejection capability, concordance analysis, and per-sample confidence scores.

📊

Rich Visualizations

Prevalence plots, heatmaps, violin plots, SHAP explanations, PCoA, co-presence networks, and 30+ interactive chart types.

🔄

Cross-validation

Built-in k-fold CV with generation-level tracking. Monitor train vs test AUC, complexity, and fit in real time.

Blazing Fast

Core engine rewritten in Rust with Python bindings. Up to 1,000x faster than the original R implementation.

Web Application

A full-stack interface for running analyses, exploring results, and sharing with collaborators.

Dashboard with global statistics, recent jobs, and project overview
Data explorer with prevalence plots and feature filtering
Parameter configuration for algorithms and model types
Job results table with AUC scores and metrics
Best model with coefficients, feature importance, and SHAP
Jury voting with confusion matrices and vote heatmap
Population of models with feature heatmap and violin plots
Co-presence analysis with network visualization
Comparative analysis across multiple jobs

Real-World Use Cases

Predomics has been applied to microbiome and multi-omics classification problems across clinical contexts.

Cirrhosis Prediction

Predict liver cirrhosis from gut microbiome composition using metagenomic species abundance profiles.

0.94 AUCk=8 features232 samples
Qin et al., Nature 2014

Cancer Classification

Classify colorectal cancer status from stool metagenomic data using sparse ternary models.

0.92 AUCk=12 features156 samples
Zeller et al., Mol. Syst. Biol. 2014

Treatment Response

Predict immunotherapy response from baseline gut microbiome in melanoma patients.

0.87 AUCk=5 features112 samples
Gopalakrishnan et al., Science 2018

Metabolic Disease

Identify type 2 diabetes biomarkers from metagenome-wide association studies.

0.89 AUCk=10 features345 samples
Karlsson et al., Nature 2013

The Predomics Suite

Three complementary tools, one unified approach to interpretable omics classification.

Rust + Python

gpredomics

High-performance ML engine rewritten in Rust — up to 1,000x faster than the original R package. Python bindings via gpredomicspy for notebooks and scripts.

View Repository →
FastAPI + Vue.js

PredomicsApp

Full-stack web application for running analyses, exploring results with 30+ interactive visualizations, and sharing with collaborators. Deploy anywhere with Docker.

View Repository →
R Package

gpredomicsR

R bindings for the gpredomics Rust engine. Bringing the 1,000x Rust speedup to the R ecosystem with a familiar interface.

View Repository →

Up and Running in Minutes

Choose your preferred way to use Predomics.

🐳 Docker (Web App)

git clone https://github.com/predomics/predomicsapp-web.git
cd predomicsapp-web
docker compose up -d

# Open http://localhost:8001

🐍 Python

import gpredomicspy

param = gpredomicspy.Param()
param.load("params.yaml")
experiment = gpredomicspy.fit(param)
experiment.display_results()

📊 R (gpredomicsR)

devtools::install_github("predomics/gpredomicsR")

library(gpredomicsR)
result <- gpredomics.fit(params)

Publication

GigaScience, Volume 9, Issue 3, March 2020
Interpretable and accurate prediction models for metagenomics data
Prifti E., Chevaleyre Y., Hanczar B., Belda E., Danchin A., Clément K., & Zucker J.-D.
DOI: 10.1093/gigascience/giaa010 →

Team & Contributors

The researchers, developers, and students behind Predomics.

Core Team
Edi Prifti
Creator & Lead
IRD / UMMISCO
Created Predomics in 2015. Designed the evolutionary algorithms, Family of Best Models, and the web application.
ORCID: 0000-0001-8861-1305
Jean-Daniel Zucker
Co-Creator
IRD / UMMISCO
Co-led the work since 2015. Proposed the beam search heuristic and founding ideas.
ORCID: 0000-0002-5597-7922
Yann Chevaleyre
Mathematical Optimization
LIPN
Worked on balances and the TerDa optimization method.
Blaise Hanczar
Model Selection
IBISC
Contributed to best model selection and general framework ideas.
Eugeni Belda
Interpretability & Validation
IRD
Signature interpretability, microbiome applications, and extensive dataset testing.
ORCID: 0000-0003-4307-5072
gpredomics Rust Engine
Louison Lesage
Lead Rust Developer
GMT Science
Led the Rust engine rewrite delivering up to 1,000x performance gains.
ORCID: 0009-0000-0252-6311
Raynald de Lahondès
Rust Developer
GMT Science
Engine architecture and optimization for the Rust implementation.
ORCID: 0009-0000-2862-9589
Vadim Puller
Scientific Developer
GMT Science
Algorithm implementation and validation in the Rust engine.
ORCID: 0000-0002-3900-8283
Contributors
Lucas Robin
TerGa2 algorithm implementation and code optimization (2016).
Shasha Cui
Feature importance and model stability analysis (2017).
Magali Cousin Thorez
Signature simplification and microbial ecosystem exploration (2019).
Youcef Sklab
Co-led student projects for the R Shiny PredomicsApp.
Gaspar Roy
Evolved version of the R Shiny application (2023).
Fabien Kambu
Testing, documentation, and deployment of PredomicsApp.