Principal Data Engineer | ML Platform Architect | PhD
I build the data architectures that power state-of-the-art generative AI. From designing petabyte-scale multimodal data lakes to scaling ML pipelines across hundreds of GPUs, I bridge the gap between deep research and data infrastructure.
- GenAI Infrastructure: Petabyte-scale data management for Video, Image, and 3D data (Stability AI, Synthesia).
- ML Platform Engineering: Scalable pipelines via Kubeflow, Argo, and SLURM; expert in data synthesis and VLM-driven curation.
- HealthTech & RWD: Former Head of Engineering at Arcturis Data; patented ML methods for clinical signal processing and EHR pipelines.
- Academic Foundation: Postdoc at University of Oxford; PhD in Electrical Engineering (Biomedical Signal Processing).





