Data Modernization Lead
Bupa · Yeda, Arabia Saudí
Job Description
Role Purpose:
The Data Modernization Lead is the most technically consequential hire in Bupa Arabia's Data Office. The role owns the end-to-end design, build, and operationalization of Bupa Arabia's cloud-native data platform on the cloud starting with Google Cloud Platform (GCP) from real-time Oracle CDC streams to Looker dashboards trusted by actuaries, clinicians, and executives serving over 12 million members.
Cloud starting GCP and Big Query are selected. This role builds the platform, not the strategy — writing code, setting engineering standards, enforcing data quality at every Medallion layer, and holding the system together as it scales. The Lead acts as technical authority over any external implementation vendor, holding them accountable to SLA benchmarks and engineering quality.
The role is responsible for delivering a Vertical Slice (Business-First Agile) implementation: first measurable business value within 90 days, full enterprise scale within 12–18 months, and a platform compliant with NDMO and PDPL from Day 1.
Key Accountabilities:
1-Build & Operate Real-Time Data Ingestion Pipelines:
Design and operate GCP Datastream CDC pipelines from all Oracle sources (CAESAR core insurance system, Oracle EBS) and SQL Server sources (CRM, IPAC, ACCPAC, Edge, Wathiq, and 10+ others)
Build event-driven ingestion using Pub/Sub + Dataflow for MongoDB (Telemedicine, Salma Chatbot), Elasticsearch (Non-NPHIES), JSON (Speech Analytics), and file-based sources
Engineer schema evolution pipelines automatically handle new columns, type changes, and table additions in source systems without failures or manual code changes
Enforce metadata capture: source system, timestamp, job ID, record count, schema version, and lineage marker logged on every ingestion event
2-Design & Deliver the Medallion Architecture (Bronze / Silver / Gold):
Author and maintain all silver and gold layer dbt models in dbt Cloud with Git version control, CI/CD deployment pipelines (GitHub Actions), and automated dbt test suites
Write dbt tests covering completeness, uniqueness, referential integrity, and custom business logic for every Tier-1 KPI: Gross Written Premium, Loss Ratio, Netpaid Claims, Burning Cost, and Lapse Rate
Implement SCD Type 2 for all conformed dimensions: Customer, Member, Product, Contract, Provider, and Channel
Design Analytical MDM layer — golden records for Customer and Member with de-duplication, survivorship rules, and multi-year history preservation for Actuarial models (IBNR, run-off triangles require 3+ years)
3-Build & Govern the Looker BI Semantic Layer:
Build and govern the Looker LookML semantic layer 50+ Tier-1 KPIs with reusable, governed dimensions and measures
Enable self-service exploration: business users must be able to drill from aggregate KPI to individual claim or member record without writing SQL or requesting analyst support
Configure Looker role-based access controls aligned precisely to Big Query column-level policies no user can access data they are not entitled to at any consumption layer Implement embedded analytics for internal portals and clinical dashboards via the Looker REST API; maintain T-1 daily refresh and sub-15-minute micro-batch refresh for operational dashboards
4-Enforce Data Governance, Quality & NDMO / PDPL Compliance:
Configure and operate GCP Data plex for automated data discovery, column-level PII / PHI / SPI classification, data lineage (Bronze Silver Gold
Looker), business glossary, and DQ monitoring across all Medallion layers
Enforce NDMO compliance: all data resident in GCP me-central2 (Dammam, KSA); data classification taxonomy applied and auditable; PDPL retention policies enforced at column level
Build and maintain automated source-to-target reconciliation: daily validation that bronze, silver, and gold data reconciles to source with zero tolerance on Tier-1 financial KPIs before any report is released
Define and enforce the Definition of Done for all data engineering deliverables no dataset is complete until dbt tests pass, documentation is merged, lineage is captured, and DQ gates are green
5-Build AI / ML Infrastructure & Operationalize Vertex AI Use Cases:
Design and populate the Vertex AI Feature Store from Gold layer data — enabling Wave 1 AI use cases: FWA Service Overutilization, FWA Duplicated Claims, FWA Provider Collusion, Document OCR Extraction, and member churn propensity
Build Vertex AI Pipelines for automated model training, evaluation, promotion to Model Registry, and deployment to production inference endpoints no manual notebook-to-production process
Collaborate with data scientists and the Track 2 AI team to operationalize models from prototype stage into scalable, monitored GCP inference pipelines
Enable Big Query ML as a self-service modelling tool for actuarial and finance analysts requiring SQL-based predictive model development
6-Platform Engineering, Vendor Oversight & Internal Knowledge Transfer:
Own the GCP landing zone: multi-environment architecture (Dev / UAT / Prod), GCP IAM with principle of least privilege, Terraform IaC for all infrastructure, BYOK KMS encryption via Thales, and Cloud Composer orchestration
Act as technical authority over the external implementation vendor — enforce engineering standards, review architecture decisions, hold vendor to SLA commitments, and escalate quality issues immediately to CDO
Operate a co-delivery model: Bupa Arabia engineers embedded in vendor squads as co-developers; all code committed to Bupa-owned Git repositories from Day 1; vendor retains no proprietary ownership of any deliverable
Drive embedded knowledge transfer by programme close, a minimum of 3 Bupa Arabia data engineers must be independently capable of developing new dbt models, maintaining pipelines, and operating the platform without vendor dependency
Skills
GCP Big Query (Medallion architecture, partitioning, clustering, cost optimization, query plan tuning
GCP DataStream (CDC from Oracle and SQL Server, initial backfill, schema drift handling)
dbt Cloud (CI/CD deployment, custom generic tests, macros, Semantic Layer, documentation site)
Looker / LookML (governed semantic layer, row-level security, RBAC, Looker API, PDTs)
SQL Big Query dialect (window functions, complex analytics, execution plan optimization)
GCP Pub/Sub + Dataflow (streaming ingestion, exactly-once semantics, dead-letter queues)
GCP Data plex (auto-discovery, column-level classification, lineage, DQ policies)
Vertex AI (Feature Store, Vertex AI Pipelines, Model Registry, batch and online inference)
Terraform + GCP IAM + KMS / BYOK (IaC, security controls, least-privilege architecture)
Cloud Composer / Apache Airflow (DAG design, SLA monitoring, backfill, GCP integration)
Python (Cloud Functions, custom Composer operators, pipeline scripting)
Git + CI/CD (GitHub Actions or equivalent for both infrastructure and data model deployment)
Education
Bachelor’s degree computer science, Data Engineering, Software Engineering] or any related field.
Sobre el empleador

UK, Australia, Spain, Chile, Poland, New Zealand, Hong Kong SAR, Türkiye, Brazil, Mexico, the US, Middle East, Ireland, Saudi Arabia and India. · Reino Unido
Bupa's purpose is helping people live longer, healthier, happier lives and making a better world. We are an international healthcare company serving over 38 million customers worldwide. With no shareholders, we reinvest profits into providing more and better healthcare for the benefit of current and future customers. We directly employ around 85,000 people, principally in the UK, Australia, Spain, Chile, Poland, New Zealand, Hong Kong SAR, Türkiye, Brazil, Mexico, the US, Middle East and Ireland. We also have associate businesses in Saudi Arabia and India. For more information, visit www.bupa.com
Empleos relacionados
- Digital Health Product ManagerMediclinic Middle East · Dubái, Emiratos Árabes Unidos
- Digital Health Product ManagerMediclinic Middle East · Dubái, Emiratos Árabes Unidos
- Project Manager – Digital & ITDallahHealth · Riad, Arabia Saudí
- Project Manager (UAE)Innovaccer · Emiratos Árabes Unidos
- IT with good experience and highly qualifiedfirst telemedicine · La Meca, Arabia Saudí
- Epic Delivery OwnerThe Cigna Group · Riad, Arabia Saudí