Richa Sinha
Bangalore, Karnataka | P: - |-
SUMMARY
Backend-focused full stack engineer with 4 years of experience building scalable, cloud-native systems. Skilled in Java Spring
Boot, Scala, ReactJS, and AWS, with a strong focus on performance, fault tolerance, and clean, test-driven code. Experienced in
cross-team collaboration, mentoring, and delivering robust solutions that improve reliability and customer experience.
WORK EXPERIENCE
Optum (UnitedHealth Group)
Software Development Engineer - 2
Bangalore, Karnataka
Aug 2021 – Present
Contributed to EIMP, a large-scale healthcare data mastering platform that consolidates fragmented patient records using
probabilistic matching (Fellegi-Sunter). The system handles 1M+ records/hour, 350M+ identities, and supports Kafka, API,
and batch ingestion, with AWS-based microservices and a data steward UI for manual review.
● Scaled a Kafka-based Scala service in EIMP’s matching pipeline 10x (200K → 2M/hour) by resolving partition imbalance
via fanout topic, bypassing a latency-heavy API using Slick ORM, and optimizing JSON parsing for critical downstream ingestion.
● Built an extensible automation tool in Spring Boot for EIMP’s operational bulk tasks, enabling SREs to perform
Kafka/S3-based operations via ECS with plugin-based task configuration — replacing manual Postman workflows and saving 100s
of dev hours.
● Developed a multithreaded testing utility to validate a search API at 100K records/hour, transforming patient CSV data,
applying rule-based evaluations, and generating client-ready summaries to improve onboarding and data confidence.
● Led UI enhancements for the data steward portal, exposing and managing anomalies in mastered records — enabling efficient
human-in-the-loop resolution for uncertain matches identified by EIMP’s matching engine.
● Enhanced EIMP’s probabilistic search API by introducing exact-match toggles and resolving cold-start latency using thread
pool warm-up logic
● Built a Cucumber-based end-to-end test suite (100+ scenarios) for validating EIMP deployments, improving test reliability by
replacing slow data resets with infra-level teardown, and ensuring stakeholder alignment through readable Gherkin syntax.
● Prototyped an ML-based pipeline on SageMaker to detect over-merged identities, a critical privacy concern in EIMP.
Cleaned mislabeled training data, engineered features (e.g., multiple names, DOBs), and evaluated robust models (Random Forest,
XGBoost).
● Mentored junior engineers and interns, improving onboarding time and team velocity
● Designed and implemented a user activity tracking system for EIMP using AWS Lambda and EventBridge to
asynchronously track 1,000+ operations/day, with DLQ handling and a React-based UI for CSV export and traceability.
● Built a JSON diff utility as an Athena UDF to compare millions of nested records between blue and green environments,
surfacing critical mismatches and directly influencing go/no-go decisions during EIMP’s infrastructure migrations.
● Migrated a legacy batch processing component in EIMP to AWS (ECS + Lambda) using Terraform, adding retry logic and
event-driven file monitoring — significantly reducing support incidents post-migration.
● Contributed to disaster recovery design for EIMP’s streaming pipeline, ensuring high availability and fault tolerance
across critical data mastering services.
EDUCATION
Manipal Institute of Technology
Bachelor of Technology - Computer Science and Engineering
Cumulative GPA: 8.69/10.0
SKILLS & CERTIFICATIONS
Languages: Java, Python, Scala, JavaScript
Frameworks & Libraries: Spring Boot, React
Tools and Technologies: Kafka, Elasticsearch, Terraform, Docker, Git, AWS
Databases: MySQL, DynamoDB (NoSQL)
Certifications & Training: AWS Solutions Architect Associate (SAA-C03)
Manipal, Karnataka
Aug 2017 - May 2021