Taha Rushain-
in/taharushain
-
SUMMARY
Experienced Data Engineer & Scientist with 5+ years of expertise in advanced analytics, data engineering and data sciences. Skilled in
developing data pipelines, and predictive models to drive data-driven strategies for business growth.
EXPERIENCE
Data Engineer Lead
Master Works [Contract]: May 2023 - Present, Riyadh
1. Develop and implement data integration strategies, ensuring the smooth flow of data across different platforms and systems.
2. Design and optimize SQL databases, specifically MSSQL, to support data storage, retrieval, and processing needs.
3. Create ETL processes and jobs, solving potential problems and making improvements to increase ETL performance.
Lead Data Engineer
MBL : May 2022 - Present, Pakistan
1. Collaborate on ETL tasks, maintaining data integrity and verifying pipeline stability using IBM Infosphere Datastage.
2. Develop ETL pipelines in Python and PySpark, orchestrated using Airflow.
3. Develop processes for ETL monitoring and maintaining Data Quality on Data warehouse and Data Lake.
4. Analyze complex data and identified anomalies, trends, and risks to provide useful insights to improve internal controls.
5. Design and develop analytical data structures and Data Models.
Senior Data Scientist & Engineer
Afiniti : March 2019 - April 2022, Bermuda & Pakistan
1. Applied statistical and algebraic techniques to interpret key points from gathered data, using Python.
2. Maintained model accuracy by testing different modeling approaches including Ensemble models.
3. Performed feature engineering to enhance predictability of models.
4. Data Visualizations using Python & PowerBI for data and model accuracy.
5. Designed data models for complex analysis needs.
6. Developed Data Pipelines (ETL) using Talend and Python
7. Leveraged various technologies like Greenplum, Hive, Mysql, PostgreSQL, Parquet, Snowflake
8. Mentored and managed junior data engineers.
Assistant Manager - ETL
HBL : September 2017 - February 2019, Pakistan
1. ETL Development and Process Automation.
2. Development and Deployment of Data Pipelines using SSIS and Microsoft SQL Server.
3. Developed Data Marts for multiple projects.
EDUCATION
Master of Science (MS), Computer Science
Institute of Business Administration • 2022
Bachelor of Science (BS), Computer Science
Institute of Business Administration • 2016
CERTIFICATIONS
AWS Cloud Quest: Cloud Practitioner (AWS • 2023)
Neural Networks and Deep Learning (Coursera • 2018)
Artificial Intelligence Nanodegree (Udacity • 2018)
SKILLS
Industry Knowledge: Data Warehousing, Data Engineering, Big Data Analytics, Data Sciences
Tools & Technologies: Big Query, Snowflake, Airflow, Kafka, Docker, AWS (RedShift, S3, EC2, Glue), IBM Infosphere, SSIS,
Talend, Git, IBM Cognos, Tableau, MS SQL, MySql, PostgreSQL, DB2, Greenplum, MongoDB, Hive, Java, Python, SQL, PLSQL, PowerBI