Data Engineer/Scientist

I’m Saroj, a Senior Data Engineer and Scientist based in Dallas, Texas. I specialize in building scalable, cloud-native data solutions that turn complex data into actionable insights. With six years of experience across healthcare, aviation, and finance, I’ve led end-to-end data pipeline, advanced analytics, and AI initiatives on AWS, Azure, and GCP. I’m passionate about using machine learning and modern data architectures to solve real-world problems and drive smarter decisions.

Engineering intelligence for scalable healthcare pipelines

I led a full-stack transformation of our data science workflows, from real-time Spark pipelines to predictive ML models for patient risk and fraud detection. We cut ETL times by 50%, automated clinical NLP, and supported operational decisions through Power BI dashboards.

Designing real-time aviation intelligence at scale

I built cloud-native streaming systems across AWS and GCP to predict flight delays, optimize routes, and enable personalized customer offers. With Spark, TensorFlow, and SageMaker, we modernized how data drove flight ops, customer experience, and pricing strategy.

Transforming global finance systems at Western Union

I transformed Western Union’s legacy ETL into a real-time streaming data platform powered by Kafka, Spark, and Flink. The platform enabled rapid fraud detection, global AML compliance, and dynamic analytics for 1,500+ stakeholders, while ensuring high-fidelity processing of more than 200 million transactions each month.

Work

  1. Company: Diverge Health
    Role: Senior Data Scientist/Engineer
    Date:
  2. Company: American Airlines
    Role: Data Engineer
    Date:
  3. Company: Western Union
    Role: Data Engineer
    Date: