I am a results-driven Data Analyst with a strong foundation in Python, SQL, and advanced Excel, specializing in data cleaning, transformation, and visualization. My experience spans developing automation tools, building predictive models, and creating actionable dashboards that enhance decision-making and operational efficiency.
During my recent projects, I built a Python/Tkinter-based repository change log management tool for TraceLink, automating schema extraction, JSON validation, and dynamic reporting—reducing manual review effort by 30%. I also implemented Matplotlib-powered dashboards for real-time engagement insights.
At PepsiCo, I engineered a Random Forest regression pipeline to forecast freight and storage costs, achieving an RMSE of ~170 and enabling data-driven supply chain optimization. Additionally, I developed a churn prediction model with 89% accuracy, providing key insights into customer behavior to support retention strategies.
Proficient in libraries like Pandas, NumPy, Seaborn, Scikit-learn, and tools such as Power BI, Jupyter, AzureML, and Databricks, I bring end-to-end capabilities from data ingestion to deployment. My strengths lie in statistical analysis, feature engineering, AI automation, and performance optimization.
I am a B.Tech graduate in Computer Science with certifications in Full-Stack Data Science, SQL, and Excel, along with practical experience from freelancing, internships, and certifications. My approach combines analytical thinking, problem-solving, and attention to detail to deliver impactful results in every project.