Location: Hybrid - Cape Town / Johannesburg
Employment Type: Full-Time
Department: Data Engineering
As a Senior Data Engineer, you'll design, build, and own modern, cloud-native data platforms across AWS and Azure, leveraging AWS-native services, Databricks, and Microsoft Fabric. You'll lead delivery across the full data lifecycle (ingestion, transformation, modelling, and serving), enabling real-time analytics, reporting, and AI use cases for enterprise clients.
Beyond building, you'll set technical direction, make architectural trade-offs, establish standards, and mentor other engineers. As a senior member of a consulting team, you'll also be a trusted technical voice with clients, translating their needs into robust, secure, cost-effective platforms.
What You'll Do
You'll operate as a technical leader: shaping solutions, raising the bar on quality, and unblocking the team.
Lead architecture and delivery
- Design and own end-to-end data platform architectures across AWS and Azure, making sound trade-offs on reliability, performance, and cost (FinOps).
- Lead the design and delivery of scalable ingestion across hybrid and cloud environments: streaming (AWS Kinesis / MSK, Azure Event Hubs / Kafka) and batch (AWS DataSync, DMS, ADF / Synapse / Fabric Pipelines), plus integration via APIs, JDBC/ODBC, and CDC.
- Architect data lakes (Amazon S3, ADLS Gen2) and lakehouse solutions using Databricks (Delta Lake, Unity Catalog) and Microsoft Fabric (OneLake, Lakehouse, Warehouse).
- Design and review robust ETL/ELT pipelines using Databricks (PySpark, Delta Live Tables, Workflows), AWS Glue / Lambda, and Azure Databricks / Synapse Spark / Fabric Data Engineering, with disciplined use of the medallion (Bronze/Silver/Gold) architecture.
Enable analytics and AI
- Design platforms that support BI, advanced analytics, and ML/AI use cases using Amazon Redshift / Athena, Microsoft Fabric (Semantic Models, Direct Lake), Power BI, and Databricks SQL & ML.
- Support feature engineering, data science workflows, and real-time decisioning systems.
- Establish data quality, observability, and lineage frameworks across the platform.
Own governance, security, and operations