Open Source · GPL-3.0

Predictive Models
from Omics Data

Discover minimal-feature biomarker signatures from high-dimensional omics data using evolutionary algorithms — interpretable, parsimonious, and blazing fast.

Get Started → View on GitHub

Faster (Rust)

<1%

Feature selection

Search algorithms

Open

Source & Free

How It Works

From Data to Discovery

Four steps from raw omics data to validated, interpretable predictive models.

⇪1

Upload Data

Import TSV/CSV omics matrices with training and optional test sets

⚙2

Configure

Choose algorithms, model languages, and fine-tune parameters

⚡3

Analyze

Launch evolutionary search with real-time progress monitoring

⌖4

Explore

Inspect models, feature importance, jury voting, export reports

Why Predomics

Designed for Discovery

From raw omics matrices to publication-ready models in minutes, with full transparency at every step.

⚖

Parsimonious Models

Discovers minimal feature sets (binary, ternary, ratio) that achieve high classification accuracy. Models you can interpret and publish.

🧬

Evolutionary Search

Genetic algorithms, beam search, and MCMC heuristics explore the feature space efficiently to find optimal biomarker signatures.

🗳

Jury Voting

Ensemble of expert models with majority voting, rejection capability, concordance analysis, and per-sample confidence scores.

📊

Rich Visualizations

Prevalence plots, heatmaps, violin plots, SHAP explanations, PCoA, co-presence networks, and 30+ interactive chart types.

🔄

Cross-validation

Built-in k-fold CV with generation-level tracking. Monitor train vs test AUC, complexity, and fit in real time.

⚡

Blazing Fast

Core engine rewritten in Rust with Python bindings. Up to 1,000x faster than the original R implementation.

See It In Action

Web Application

A full-stack interface for running analyses, exploring results, and sharing with collaborators.

Dashboard with global statistics, recent jobs, and project overview

Data explorer with prevalence plots and feature filtering

Parameter configuration for algorithms and model types

Job results table with AUC scores and metrics

Best model with coefficients, feature importance, and SHAP

Jury voting with confusion matrices and vote heatmap

Population of models with feature heatmap and violin plots

Co-presence analysis with network visualization

Comparative analysis across multiple jobs

Applications

Real-World Use Cases

Predomics has been applied to microbiome and multi-omics classification problems across clinical contexts.

⚕

Cirrhosis Prediction

Predict liver cirrhosis from gut microbiome composition using metagenomic species abundance profiles.

0.94 AUCk=8 features232 samples

Qin et al., Nature 2014

⚄

Cancer Classification

Classify colorectal cancer status from stool metagenomic data using sparse ternary models.

0.92 AUCk=12 features156 samples

Zeller et al., Mol. Syst. Biol. 2014

⌘

Treatment Response

Predict immunotherapy response from baseline gut microbiome in melanoma patients.

0.87 AUCk=5 features112 samples

Gopalakrishnan et al., Science 2018

☣

Metabolic Disease

Identify type 2 diabetes biomarkers from metagenome-wide association studies.

0.89 AUCk=10 features345 samples

Karlsson et al., Nature 2013

Ecosystem

The Predomics Suite

Three complementary tools, one unified approach to interpretable omics classification.

Rust + Python

gpredomics

High-performance ML engine rewritten in Rust — up to 1,000x faster than the original R package. Python bindings via gpredomicspy for notebooks and scripts.

View Repository →

FastAPI + Vue.js

PredomicsApp

Full-stack web application for running analyses, exploring results with 30+ interactive visualizations, and sharing with collaborators. Deploy anywhere with Docker.

View Repository →

R Package

gpredomicsR

R bindings for the gpredomics Rust engine. Bringing the 1,000x Rust speedup to the R ecosystem with a familiar interface.

View Repository →

Get Started

Up and Running in Minutes

Choose your preferred way to use Predomics.

🐳 Docker (Web App)

          
          git clone https://github.com/predomics/predomicsapp-web.git

          cd predomicsapp-web

          docker compose up -d

          # Open http://localhost:8001

🐍 Python

          
          import gpredomicspy

          param = gpredomicspy.Param()

          param.load("params.yaml")

          experiment = gpredomicspy.fit(param)

          experiment.display_results()

📊 R (gpredomicsR)

          
          devtools::install_github("predomics/gpredomicsR")

          library(gpredomicsR)

          result <- gpredomics.fit(params)

Research

Publication

GigaScience, Volume 9, Issue 3, March 2020

Interpretable and accurate prediction models for metagenomics data

Prifti E., Chevaleyre Y., Hanczar B., Belda E., Danchin A., Clément K., & Zucker J.-D.

DOI: 10.1093/gigascience/giaa010 →

People

Team & Contributors

The researchers, developers, and students behind Predomics.

Core Team

Edi Prifti

Creator & Lead

IRD / UMMISCO

Created Predomics in 2015. Designed the evolutionary algorithms, Family of Best Models, and the web application.

ORCID: 0000-0001-8861-1305

Jean-Daniel Zucker

Co-Creator

IRD / UMMISCO

Co-led the work since 2015. Proposed the beam search heuristic and founding ideas.

ORCID: 0000-0002-5597-7922

Yann Chevaleyre

Mathematical Optimization

LIPN

Worked on balances and the TerDa optimization method.

Blaise Hanczar

Model Selection

IBISC

Contributed to best model selection and general framework ideas.

Eugeni Belda

Interpretability & Validation

IRD

Signature interpretability, microbiome applications, and extensive dataset testing.

ORCID: 0000-0003-4307-5072

gpredomics Rust Engine

Louison Lesage

Lead Rust Developer

GMT Science

Led the Rust engine rewrite delivering up to 1,000x performance gains.

ORCID: 0009-0000-0252-6311

Raynald de Lahondès

Rust Developer

GMT Science

Engine architecture and optimization for the Rust implementation.

ORCID: 0009-0000-2862-9589

Vadim Puller

Scientific Developer

GMT Science

Algorithm implementation and validation in the Rust engine.

ORCID: 0000-0002-3900-8283

Contributors

Lucas Robin

TerGa2 algorithm implementation and code optimization (2016).

Shasha Cui

Feature importance and model stability analysis (2017).

Magali Cousin Thorez

Signature simplification and microbial ecosystem exploration (2019).

Youcef Sklab

Co-led student projects for the R Shiny PredomicsApp.

Gaspar Roy

Evolved version of the R Shiny application (2023).

Fabien Kambu

Testing, documentation, and deployment of PredomicsApp.

Predictive Modelsfrom Omics Data

From Data to Discovery

Designed for Discovery

Parsimonious Models

Evolutionary Search

Jury Voting

Rich Visualizations

Cross-validation

Blazing Fast

Web Application

Real-World Use Cases

Cirrhosis Prediction

Cancer Classification

Treatment Response

Metabolic Disease

The Predomics Suite

gpredomics

PredomicsApp

gpredomicsR

Up and Running in Minutes

🐳 Docker (Web App)

🐍 Python

📊 R (gpredomicsR)

Publication

Team & Contributors

Predictive Models
from Omics Data