Data Engineer with 4+ years of experience building scalable cloud-native data platforms, ETL pipelines, and enterprise software applications. I specialize in designing production-grade data workflows using Python, SQL, Airflow (Cloud Composer), Dataproc, BigQuery, and Google Cloud Platform, with a strong focus on automation, reliability, and software engineering best practices.
In my current role at CVS Health, I develop end-to-end data pipelines involving API integrations, complex data transformations, automated testing, CI/CD, and business reporting. I have also led AI-driven engineering initiatives using Vertex AI and Gemini to modernize legacy applications and improve developer productivity. My experience extends to Java and .NET enterprise applications, giving me a strong software engineering foundation alongside data engineering expertise.
I hold an M.S. in Applied Analytics from Columbia University and a B.Tech. in Computer Science with a minor in Big Data Analytics. My research background includes speech processing and natural language processing, with publications at international conferences.
I am proficient in Python, SQL, Java, TensorFlow, GCP, and AWS, with a proven ability to deliver efficient, high-quality solutions in dynamic environments. I am actively seeking full-time opportunities in data engineering, analytics, machine learning, and software development.
Feel free to reach out for collaborations or job opportunities!