FL Reference Implementation

Federated Learning reference implementation with Privacy-Enhancing Technologies (PETs) for cross-silo deployment on AWS/GCC, built on Flower.

This repo provides two things:

Tutorials — hands-on Jupyter notebooks (beginner to advanced) covering FL paradigms, privacy controls, secure inference, and distributed deployment
PET adapter code (fl_pets/) — production-ready modules that plug PETs (DP, SecAgg, PSA, HE, MPC) into a Flower FL pipeline without modifying the core training logic
Multi-server deployment (deploy/distributed/) — Docker Compose configs for distributed FL training across multiple EC2 nodes with mTLS, plus Terraform modules for AWS provisioning
Microservices architecture (deploy/ARCHITECTURE.md) — containerised HFL and VFL deployment with DP accountant, SecAgg orchestrator, PSA service, audit logging, and model registry

Key capabilities:

Horizontal FL — same features, different samples across sites
Vertical FL — different features, same entities (with Private Set Alignment (PSA))
Split Learning — model partitioned across sites
Transfer Learning — pretrained models fine-tuned across sites
Federated LoRA — only adapter weights are federated (for LLMs)

Quick Start

# Clone and install
git clone https://github.com/govtech-data-practice/Fl_deployment.git fl-reference
cd fl-reference
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

# Run smoke test (synthetic data, ~60 seconds)
python runners/run_ec2.py fraud --synthetic

# Validate a data manifest
python tools/validate_manifest.py ~/fl-deploy/data/fraud/manifest.json

# Check DP privacy budget
python tools/dp_budget.py --all --rounds 100

See Tutorial 1: Setup & First Run for the full step-by-step guide.

Prerequisites

Requirement	Version	Notes
Python	3.10+	Required
Flower	>= 1.30	FL framework
PyTorch	>= 2.2	Model training
Docker	24+	Tutorials 8-9, 12 only
Terraform	1.5+	Tutorial 9 only
GPU (CUDA 12.4+)	Optional	Tutorial 11 (LLM), beneficial for 8-9

Install optional PET libraries: pip install -e ".[pets]" (TenSEAL, anonlink, clkhash)

Architecture

Horizontal FL — same features, different samples

Site A              Site B              Site C
[patient records]   [patient records]   [patient records]
     |                   |                   |
     |  model updates    |  model updates    |  model updates
     +--------->---------+--------->---------+
                         |
                  FL Server (aggregates)
                  No raw data leaves any site

Vertical FL — different features, same entities

Org A               Org B               Org C
[transactions]      [credit scores]     [demographics]
     |                   |                   |
     |  partial model    |  partial model    |  partial model
     +--------->---------+--------->---------+
                         |
                  FL Server (combines partial models)
                  Each org only sees its own feature columns

Data

The repo includes real and synthetic datasets. See data/README.md for details.

Dataset	Records	Source	Licence
Credit Card Fraud 2023	25K sample (568K full)	Kaggle	CC BY 4.0
METABRIC Breast Cancer	1,904	cBioPortal	Open access
Singapore PSA Records	1K + hard negatives	Synthetic (multi-ethnic names, HDB addresses)	—
Sepsis, ECG, etc.	500 each	Synthetic generators	—

Models

Model	Parameters	Use Case
MLP	50K	Tabular (fraud, drug)
BiLSTM	500K	Time-series (sepsis, ECG)
DenseNet-121	8M	Medical imaging (chest X-ray)
VFL MLP	50K	Vertical FL (multi-party)
Split BiLSTM	500K	Split learning
Mistral 7B QLoRA	7B (160MB adapter)	Clinical NLP

CLI Tools

Tool	Command	Purpose
FL Server	`python runners/run_ec2.py fraud --synthetic`	Run FL training (simulation or distributed)
FL Client	`python runners/run_client.py --server host:9092`	Connect to a distributed FL coordinator
Data Ingest	`python tools/ingest.py --task sepsis --input data.csv`	Ingest and validate participant data
DP Budget	`python tools/dp_budget.py --all --rounds 100`	Calculate privacy budget (epsilon) for all presets
Manifest Validator	`python tools/validate_manifest.py manifest.json`	Validate data manifest against task requirements
Data Generator	`python data/generators/generate_all.py --task fraud`	Generate synthetic sample data for any task
SG Synthetic	`from data.generators.sg_synthetic import generate_records`	Generate Singapore patient data for PSA testing
Benchmark	`python tests/benchmarks/run_benchmarks.py --tasks fraud`	Run centralised vs FL accuracy comparison
Test Suite	`python tests/run_tests.py fraud`	Run strategy validation tests

FL Strategies

FedAvg, FedProx, SCAFFOLD, FedAdam, FedYogi, SecAgg+, DP-Central, DP-Local, OneOwner

PET Toolkit (`fl_pets/`)

Privacy-Enhancing Technologies organised by FL lifecycle stage:

Stage	PET	Library	What it does
Pre-training	PSA	anonlink + clkhash (Data61)	Fuzzy entity alignment across parties without revealing raw data
During training	DP	Opacus (Meta)	Per-sample gradient clipping + noise (DP-SGD with RDP accounting)
During training	SecAgg	Flower SecAgg+	Pairwise masking so server only sees aggregate updates
Inference	HE vs MPC	TenSEAL + CrypTen	Encrypted inference comparison: polynomial approx vs secret sharing
Post-training	Privacy Attacks	Custom suite	MIA, gradient leakage (DLG), model inversion, canary insertion

Pre-training       During training        Inference          Post-training
+---------+        +----+  +------+       +----+  +-----+    +---------+
|  PSA    |  --->  | DP |  |SecAgg| --->  | HE |  | MPC | -> | Privacy |
| (align) |        |(noise) (mask)|       |(enc)  (split)|   | attacks |
+---------+        +----+  +------+       +----+  +-----+    +---------+

Tutorials

Hands-on tutorials organised by experience level. See tutorials/README.md for the full index.

Level	Tutorials	Format
Beginner	1. Setup, 2. First Model, 3. Data Pipeline	Jupyter Notebooks
Intermediate	4. DP, 5. SecAgg, 6. Strategies, 7. Privacy Attacks	Jupyter Notebooks
Advanced	8. Distributed, 9. Terraform, 10. VFL & PSA, 11. LLM, 12. Operations	Markdown guides
PET Tools	PSA, DP, SecAgg, HE, MPC	Jupyter Notebooks
Reference	Configuration, PET Reference, Deployment Guide	Markdown

Results (Validated on EC2)

Task	Model	Best Strategy	Metric
Chest X-ray (NIH 112K)	DenseNet-121	FedAvg	AUC 0.819
Sepsis (eICU 100K+)	BiLSTM	SCAFFOLD	Acc 0.809
Fraud (50K)	MLP	FedAvg	Acc 0.98
Clinical NLP	Mistral 7B QLoRA	FL LoRA + DP	MIA 1.0->0.83

Deployment

# Microservices (Docker Compose)
cd deploy/microservices
docker compose up

# Multi-node (Terraform + Docker)
cd deploy/terraform
terraform init && terraform apply

See Tutorial 8 and Tutorial 9.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FL Reference Implementation

Quick Start

Prerequisites

Architecture

Horizontal FL — same features, different samples

Vertical FL — different features, same entities

Data

Models

CLI Tools

FL Strategies

PET Toolkit (`fl_pets/`)

Tutorials

Results (Validated on EC2)

Deployment

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
data		data
deploy		deploy
fl_common		fl_common
fl_pets		fl_pets
models		models
privacy		privacy
runners		runners
scenarios		scenarios
secure_inference		secure_inference
tasks		tasks
tests		tests
tools		tools
tutorials		tutorials
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

FL Reference Implementation

Quick Start

Prerequisites

Architecture

Horizontal FL — same features, different samples

Vertical FL — different features, same entities

Data

Models

CLI Tools

FL Strategies

PET Toolkit (fl_pets/)

Tutorials

Results (Validated on EC2)

Deployment

Licence

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

PET Toolkit (`fl_pets/`)

Packages