TAPIWANASHE
MLANGENI
www.github.com/zimthug | www.linkedin.com/in/tamla83
- |-
Experienced Data Engineer with over 12 years of expertise in designing, developing, and maintaining robust data platforms
across marketing, finance, analytics, and SaaS environments. Proven ability to build scalable ETL pipelines, drive data
modeling and warehousing strategies, and deliver impactful business insights through dashboards and visualizations.
EXPERIENCE
LEAD DATA ENGINEER
Untapped Global, USA | Nairobi, Kenya
JAN 2022 – PRESENT
● Architected scalable data infrastructure in AWS (S3, EC2, RDS, Redshift) Snowflake and Databricks, reducing
query response times by 50%.
● Built and optimized ETL pipelines processing millions of transactional records using AWS Glue, PySpark, and EMR,
ensuring high-speed data ingestion and transformation.
● Developed AWS infrastructure with Terraform, managing secure data storage in S3, real-time processing with
Kinesis, and analytics using Redshift.
● Implemented automated vulnerability detection systems, integrating AWS GuardDuty and CloudTrail logs,
improving threat response efficiency.
● Developed SQL queries and Redshift optimizations for security data reporting, reducing query execution time by
40%.
SENIOR DATA ARCHITECT
Zimbabwe Shared Services | Harare, Zimbabwe
AUG 2021 – DEC 2021
● Designed and optimized the company’s core data architecture, significantly improving the scalability and
performance of data-intensive applications.
● Automated data pipelines using Python, Apache Airflow, and SQL, reducing manual data processing time by 70%
and ensuring seamless integration with cloud-based data warehouses.
● Led the implementation of a centralized data warehouse on Azure Synapse, improving data accessibility and
compliance reporting.
● Developed interactive dashboards using Power BI and Tableau, translating complex data insights into actionable
recommendations for business stakeholders.
SENIOR SOFTWARE ENGINEER
Zimbabwe Electricity Transmission and Distribution Co. | Harare, Zimbabwe
APR 2019 – JUL 2021
● Developed real-time analytics pipelines using Apache Kafka, Spark Streaming, and Flink, enabling faster
decision-making and reducing data latency by 60%.
● Built and optimized ETL workflows using SQL, Apache Spark, and Python, improving data processing efficiency
and cutting pipeline execution time by 40%.
● Automated data quality checks and validation frameworks, incorporating anomaly detection algorithms and unit
testing to ensure 99.9% data accuracy.
● Conducted system analysis, documentation, testing, and platform transition support, reducing deployment errors
by 30% through standardized version control and CI/CD automation.
● Led performance tuning initiatives by optimizing database indexing, partitioning strategies, and query execution
plans, improving SQL query response time by 50%.
SENIOR DATA ENGINEER
Indra, Spain | Nairobi, Kenya
APR 2012 – MAR 2019
● Developed and optimized scalable ETL pipelines using Databricks and Spark for real-time data processing.
● Developed a robust data ingestion platform to process electricity smart meter data, integrating Apache Kafka for
●
●
●
●
streaming ingestion and Apache Spark for ETL transformations.
Designed a solution to process messages from streaming platform to RDBMS for both OLTP and
warehousing, enabling efficient querying and reporting.
Designed and implemented a customer analytics platform that integrated marketing data for segmentation and
targeted campaigns.
Implemented A/B testing frameworks to measure marketing campaign effectiveness, leading to an 85%
improvement in campaign performance by refining audience targeting strategies.
Optimized ETL workflows by integrating SQL, Spark, and cloud-based solutions, ensuring real-time data
accessibility and reducing processing times from hours to minutes.
EDUCATION
MASTER OF SCIENCE IN INFORMATION TECHNOLOGY
DIPLOMA, SYSTEMS ANALYSIS
University of Derby, UK
Association of Computer Professionals, UK
JAN 2022
DEC 1999
SKILLS
● Programming: Python, Java, Scala, SQL, Javascript,
Shell Scripting
● ETL & Data Pipelines: Airflow, dbt, Meltano,
Fivetran, Kafka, Spark, AWS Glue
● Databases: Oracle, PostgreSQL, MySQL, MSSQL,
MongoDB, Cassandra
● Cloud Platforms: AWS, GCP, Azure
● Data Warehousing: Snowflake, Databricks, Redshift,
BigQuery, Azure Synapse, MS Fabric, AWS Athena
● Visualization & Reporting: Power BI, Tableau,
Superset, Looker
● CI/ CD Tools: Git, Terraform, ArgoCD
● Other: Marketing segmentation, CRM integration, SaaS
metrics, Revenue recognition logic, JSON/XML, APIs