Satyam Singh

$25/hr
Data & ML Engineer: Kafka/Spark real‑time ETL, Airflow orchestration, AWS, MLflow MLOps
Reply rate:
-
Availability:
Hourly ($/hour)
Age:
23 years old
Location:
Varanasi, Uttar Pradesh, India
Experience:
1 year
About

I am a Data Engineer with over a year of hands-on experience designing, building, and optimizing scalable, high-throughput data infrastructure, primarily in the financial sector.

I currently work as a Consultant at Mindcraft Software Pvt. Ltd., serving the client Yes Bank (July 2024 – Present). In this role, I turn complex business requirements into reliable, real-time data solutions.

My technical stack centers on widely used big data technologies. I have advanced proficiency in Python and SQL, paired with practical experience in Apache Spark (PySpark) and Apache Kafka for both batch and real-time streaming infrastructure. I also specialize in workflow orchestration with Apache Airflow, where I consistently apply best practices for pipeline reliability.

I have delivered measurable performance improvements. For instance, I optimized critical ETL/ELT workflows and reduced their runtime by 2-3 hours through targeted performance tuning (SQL indexing and Spark partition pruning). I also focus on data quality and governance, and have implemented validation frameworks that cut recurring data quality tickets by 30%.

I have specialized knowledge in MLOps and machine learning infrastructure, having built and automated workflows for model retraining and deployment using MLflow. My pipelines feed and report on high-value machine learning models used for critical applications such as real-time fraud detection and proactive customer retention initiatives.

I have hands-on experience with cloud data platforms (AWS: S3, Lambda, Glue, IAM) and maintain operational stability using observability tools such as the ELK Stack and Grafana.

I work well in collaborative, cross-functional teams and have near-native English fluency, which helps me communicate effectively with global stakeholders, analysts, and data scientists. I am committed to applying my experience in data warehousing, dimensional modeling, and agile methodologies (Jira) to deliver data that is timely, reliable, and actionable.

Languages