Building pipelines that power healthcare & financial decisions at scale. 4+ years engineering real-time streaming, cloud migrations, and ML-powered data platforms.
I'm a Data Engineer who thrives at the intersection of complex data problems and elegant engineering. Currently at CVS Health, I architect real-time streaming platforms and healthcare data lakehouse systems processing millions of events daily across 9,000+ pharmacy locations.
My work spans the full data engineering stack — from ingestion and streaming through transformation, governance, and ML deployment — always with a focus on reliability, compliance, and measurable business impact.
Event-driven Kafka streaming pipeline ingesting pharmacy data from 9,000+ CVS locations with sub-60-second latency, Avro schema validation, and real-time QuickSight dashboards for 1,200+ users.
Migration of 47 SSIS packages to AWS Glue PySpark with automated HIPAA compliance — KMS, Macie, CloudTrail — and Blue/Green CI/CD via GitHub Actions.
Medallion Architecture on Databricks for 50+ hospital partners. MinHashLSH-based Master Patient Index (MPI) for patient identity matching. Zero-copy Delta Sharing to Snowflake.
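For readers unfamiliar with MinHash-based matching, here is a minimal, dependency-free sketch of the general idea: records are shingled, reduced to MinHash signatures, and banded so that similar records land in the same bucket as candidate pairs. It is an illustration only, not the production pipeline. The function names, the 3-character shingles, the 32-hash/8-band settings, and the sample records are all assumptions made for this example.

```python
import hashlib
from collections import defaultdict
from itertools import combinations

NUM_HASHES = 32  # signature length (assumed for illustration)
BANDS = 8        # LSH bands; 32 / 8 = 4 rows per band


def shingles(text, k=3):
    """Character k-grams of the lowercased, whitespace-stripped text."""
    s = "".join(text.lower().split())
    return {s[i:i + k] for i in range(max(len(s) - k + 1, 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    digest = hashlib.md5(f"{seed}:{token}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "little")


def minhash_signature(tokens):
    """One minimum hash value per seed; similar sets share many positions."""
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(NUM_HASHES))


def candidate_pairs(records):
    """Return id pairs whose signatures agree on at least one whole band."""
    rows = NUM_HASHES // BANDS
    buckets = defaultdict(set)
    for rid, text in records.items():
        sig = minhash_signature(shingles(text))
        for b in range(BANDS):
            buckets[(b, sig[b * rows:(b + 1) * rows])].add(rid)
    pairs = set()
    for ids in buckets.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs


# Hypothetical sample records: "a" and "b" differ only in case and spacing.
records = {
    "a": "John Smith 1980-01-01",
    "b": "john  smith 1980-01-01",
    "c": "Maria Gonzalez 1975-06-30",
}
pairs = candidate_pairs(records)
```

Candidate pairs produced this way would normally be passed to a stricter comparison step before any two patient records are linked.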
200+ dbt models, 680+ schema tests, 45 MetricFlow canonical metrics. Slim CI cut build time from 45 min to 7 min with zero dashboard downtime.
Centralized Snowflake data warehouse replacing Oracle and Hadoop silos. GoldenGate CDC, 120+ dbt models, IFRS 9 and Basel III data marts.
TensorFlow Wide & Deep model (AUC 0.88) for credit default prediction, plus an XGBoost and Isolation Forest ensemble for fraud detection, with SHAP explainability. Deployed on GKE with HPA autoscaling.
Open to new opportunities in data engineering, platform engineering, and ML infrastructure — especially in financial services, healthcare, and high-throughput data environments.