We are looking for a Senior Geospatial Data Engineer to build the high-performance data backbone for our multitemporal, multimodal Earth observation models. While our ML Engineers focus on model architecture, you will own the full geospatial data lifecycle: discovery, ingestion, standardisation, quality assurance, and delivery of production-ready datasets that combine very high-resolution optical and Synthetic Aperture Radar (SAR) imagery.
This is a high-impact role at the intersection of Earth Observation and AI. You will be the custodian of our geospatial "data engine" — ensuring it is scalable, deterministic, and capable of handling terabytes of multi-sensor satellite data to enable semantic understanding across sensors and time. Beyond data engineering, you will play an active role in ML lifecycle management, from dataset versioning and experiment tracking through to model deployment and monitoring.
LiveEO is a young, dynamic team that thrives on big challenges and fast learning cycles—we move quickly, stay curious, and genuinely enjoy building together. We’re on a mission to break the “curse of Earth Observation”: turning incredible satellite data into reliable, actionable decisions that people can trust and use in real operations. In this role, you’ll work in a fun, high-ownership environment where ambitious technical problems (multimodal SAR/optical foundation models) meet real-world impact—and where your ideas can go from whiteboard to production in tight, collaborative iterations.
At LiveEO, you'll sit with the AI team and partner closely with downstream product teams to translate model capabilities into measurable business value and production-ready workflows. You'll also work hand-in-hand with our dedicated data annotation team to define labelling guidelines, drive feedback loops on data quality, and ensure training and evaluation datasets reflect real-world edge cases.
You will work with:
Geospatial & EO: GDAL, Rasterio, GeoPandas, QGIS, STAC, Cloud-Optimised GeoTIFF, Zarr, PostGIS
Data Orchestration & Compute: Prefect, Ray
Data Stores: PostgreSQL + PostGIS
Cloud: AWS (S3, EC2, and supporting infrastructure)
ML Lifecycle: Databricks, MLflow, PyTorch, PyTorch Lightning
Core Language: Python
