As a highly motivated and results-driven Data Engineer, I have over one year of hands-on experience designing, building, and optimizing scalable, high-throughput data infrastructure, primarily within the demanding financial sector.
My most recent professional role is as a Consultant at Mindcraft Software Pvt. Ltd., serving the client Yes Bank (July 2024 – Present). In this capacity, I focus on transforming complex business needs into reliable, real-time data solutions.
My technical stack is centered around leading big data technologies. I possess advanced proficiency in Python and SQL, paired with deep practical experience in Apache Spark (PySpark) and Apache Kafka for both batch and real-time streaming infrastructure. I am an expert in workflow orchestration using Apache Airflow, where I consistently implement best practices for pipeline reliability.
I have delivered tangible performance improvements. For instance, I optimized critical ETL/ELT workflows, achieving a runtime reduction of 2-3 hours through targeted performance tuning (SQL indexing and Spark partition pruning). I also have a strong focus on data quality and governance, having implemented validation frameworks that cut recurring data quality tickets by 30%.
I have specialized knowledge in MLOps and machine learning infrastructure, having built and automated workflows for model retraining and deployment using MLflow. My pipelines are designed to feed and report on high-value machine learning models used for critical applications like real-time fraud detection and proactive customer retention initiatives, ensuring a stable MLOps environment.
I have hands-on experience with cloud data platforms (AWS: S3, Lambda, Glue, IAM) and am adept at maintaining operational stability using observability tools such as the ELK Stack and Grafana.
I thrive in collaborative, cross-functional team environments and possess near-native English fluency, making me an effective communicator for global stakeholders, analysts, and data scientists. I am committed to leveraging my expertise in data warehousing, dimensional modeling, and agile methodologies (Jira) to deliver data that is timely, reliable, and actionable.