Horacio Soldman Totobetanimena
Data Engineer
linkedin.com/in/horacio-soldman | Antananarivo, Madagascar
Data Engineer | Driving business growth with scalable and reliable data solutions
Data Engineer with 3 years of experience delivering impactful data analytics and data engineering
solutions. Strong background in software engineering, leveraging Python, SQL, and PySpark to optimize
decision-making and solve business problems.
SKILLS
Technical: Python, SQL, PySpark, ETL/ELT, Big Data, BigQuery, AWS Glue, Airflow, Kafka, ELK, Looker
Studio, Metabase, Docker, Terraform, Bash, Git, CI/CD, JavaScript, PostgreSQL, MongoDB, Redshift
Language: English C1, French C1
EXPERIENCE
The BVA Family
04/2024 - Present
Consultant Data Engineer
- Converted a Pandas-based data workflow to PySpark, cutting execution time by about 50% and
resource consumption by 40%
- Designed and implemented an article summarization tool that leverages OpenAI models
- Implemented call center scheduling optimization that improved efficiency by 70% and reduced manual
scheduling adjustments
- Maintained and enhanced an ETL framework, improving its modularity and code maintainability
- Ensured GDPR compliance across all data engineering projects
SmartOne
03/2022 - 04/2024
Data Engineer
- Built data warehouses on BigQuery using One Big Table (OBT) and dimensional modelling approaches
- Applied data governance policies and ensured efficient data access for authorized users only
- Developed a reporting solution on Looker Studio that enhanced decision-making
- Migrated data workflows from on-premise to AWS Glue to improve efficiency
- Implemented a data integrity report that runs on a daily basis
MyAgency
01/2020 - 09/2020
Full-Stack JavaScript Developer
- Developed and maintained reliable APIs written in Symfony and Node.js
- Created reusable components and optimized pages with Next.js, enhancing front-end performance
- Ensured responsiveness of the frontend app using Bootstrap and CSS
Soatransplus
12/2018 - 12/2019
Full-Stack JavaScript Developer
- Developed and maintained a robust trip booking system for a transportation company
- Built a real-time dashboard for the administration team using Socket.IO
- Deployed the application on a self-managed VPS and handled the full DevOps cycle
PERSONAL PROJECTS
Real-time click event analytics (GitHub link)
10/2023
- Implemented Change Data Capture (CDC) to capture website click events in real time
- Enabled fast decision-making with real-time Kibana dashboards
- Guaranteed scalability of the pipeline by using distributed system tools such as Kafka and ELK
- Orchestrated all dockerized components using Docker Compose
Batch processing on AWS (GitHub link)
03/2022
- Enabled task automation and orchestration with Apache Airflow
- Applied data transformations with PySpark deployed to AWS EMR
- Designed and built a data warehouse on AWS Redshift for efficient data exploration
- Ensured portability and reproducibility of the data infrastructure with Terraform and Docker
EDUCATION
MSc Data Science from Heriot-Watt University in Edinburgh, United Kingdom | 12/2021
MSc Software Engineering from Athénée Saint Joseph Antsirabe, Madagascar | 12/2018
BSc Computer Science from Athénée Saint Joseph Antsirabe, Madagascar | 07/2016
CERTIFICATIONS
Google Cloud Professional Data Engineer Certificate | 06/2023
MLOps Zoomcamp | 09/2022
Data Engineering Zoomcamp | 05/2022
IBM Data Engineering Specialisation | 03/2022
International English Language Testing System (IELTS) | 01/2019
INTERESTS
Data & AI related communities, Data & AI conferences, books, travelling