EKEMINI OTU
Data Engineer | Data Scientist | Python Developer
Abuja, Nigeria | Open to Remote Worldwide
GitHub: github.com/EkeminiImeOtu | YouTube: Programming Tutorials Channel
● AVAILABLE IMMEDIATELY · OPEN TO FULL-TIME, CONTRACT & FREELANCE · REMOTE
WORLDWIDE
PROFESSIONAL SUMMARY
Multi-disciplinary data professional with nearly 5 years of experience combining Data Engineering, Data
Science and Python automation. Recently completed a 4-year, 8-month role at 7seer (Aug 2021 — April
2026) and now fully available for remote roles worldwide. Builds production-grade ETL pipelines, real-time
streaming systems and machine learning prediction models. Certified by ALX Africa (Data Analytics, Python
Programming) and DataTalks.Club (Data Engineering Zoomcamp). Member of AI Saturdays Abeokuta —
part of the global AI6 movement. Programming educator with a YouTube channel teaching Python, Java,
Scratch and MIT App Inventor. Former Teach For Nigeria Fellow. Ready to start immediately.
CORE SKILLS
Data Engineering: Apache Airflow, dbt, Snowflake, AWS (S3, EC2, SSM, SNS, IAM), Docker, Terraform
Streaming: Apache Kafka, Apache Spark, PySpark Structured Streaming
Data Science / ML: Scikit-learn, Random Forest, Classification, Regression, Predictive Modelling
Web Scraping: Selenium, Playwright, BeautifulSoup, Scrapy, Octoparse, Requests
Visualisation: Power BI, Streamlit, Matplotlib, Seaborn
Languages: Python, SQL, Java, Scratch
Libraries: Pandas, NumPy, Boto3, Scikit-learn, Flask
Containerisation: Docker, Docker Compose
Teaching: Python, Scratch, Java, MIT App Inventor for kids, teens and adults
PROFESSIONAL EXPERIENCE
Data Specialist, Automation & Data Engineer
7seer | August 2021 — April 2026 (4 yrs 8 mos)
Started as a Data Specialist focused on web scraping and automation, then progressively took on full data
engineering responsibilities — designing, building and operating production-grade pipelines while completing
the DataTalks.Club Data Engineering Zoomcamp.
Data Engineering work (2025 — 2026):
• Built Netflix ETL Pipeline: automated batch pipeline ingesting 87,000+ rows daily from AWS S3 through
Apache Airflow and layered dbt models into Snowflake. Real-time Slack and AWS SNS alerts.
Containerised with Docker on AWS EC2.
• Built Real-Time Streaming Pipeline: live event processing using Apache Kafka and PySpark
Structured Streaming. Processes events in 15-second micro-batches into Snowflake.
• Built Olist E-Commerce Analytics: end-to-end analytics platform on 1.5 million rows. Python ingestion
from S3, 11 dbt models across staging and mart layers, Power BI executive dashboard.
Automation & data extraction (2021 — 2025):
• Built web scraping pipelines using Selenium, Playwright, BeautifulSoup and Octoparse to extract
structured data from complex websites at scale.
• Developed API integration workflows connecting multiple data sources and automating data transfer
between systems.
• Containerised automation workflows with Docker for reliable, reproducible deployments.
• Delivered Python training to junior developers covering automation and data extraction.
• Transformed raw unstructured web data into clean, analytics-ready datasets for client business
decisions.
All data engineering projects fully documented and open-sourced on GitHub.
Programming Educator & Content Creator
Independent / YouTube | 2021 — Present (Part-time, side activity)
• Teaches Python, Scratch, Java and MIT App Inventor to kids, teens and adults.
• Creates and publishes structured programming tutorials on YouTube channel.
• Demonstrates ability to communicate complex technical concepts clearly to all audiences.
Data Scientist — Community Projects
AI Saturdays Abeokuta (AI6) | 2019 — 2021
Member of the global AI Saturdays movement, a community-driven initiative making AI education accessible
worldwide with 5,000+ participants across 100+ cities. Built and deployed machine learning prediction
models solving real-world problems:
• Diabetes Prediction Web App: ML classification model deployed as interactive Streamlit web
application for early diabetes risk assessment.
• Multiple Disease Prediction System: Predicts multiple diseases from patient data using classification
algorithms.
• Hotel Bookings Cancellation Prediction: Predicts cancellation likelihood to help hotels optimise
revenue and resource allocation.
• Flight Price Prediction: Forecasts flight prices using multiple machine learning algorithms and feature
engineering.
• Flowers Classification Model: Image classification system using Random Forest.
Fellow
Teach For Nigeria | 2019 — 2021
Competitively selected for Teach For Nigeria, part of the global Teach For All network placing high-achieving
graduates in underserved communities.
• Mentored secondary school students with measurable learning outcomes.
• Developed leadership, communication and stakeholder management skills.
Chemistry & Basic Science Teacher (NYSC)
Federal Government Secondary School | 2017 — 2018
Completed Nigeria's mandatory National Youth Service Corps programme as a Chemistry and Basic Science
teacher, developing strong communication and presentation skills.
CERTIFICATIONS
DataTalks.Club — Data Engineering Zoomcamp
Completed April 2026
ALX Africa — Data Analytics (6-Month Programme)
Issued August 2025
ALX Africa — Python Programming (8-Week Training)
Issued November 2025
ALX Africa — Professional Foundations
Issued April 2025
EDUCATION
Bachelor's Degree
University of Uyo | 2012 — 2016
KEY PROJECTS
Netflix ETL Pipeline
Stack: Airflow, dbt, Snowflake, AWS, Docker, Python
Real-Time Streaming Pipeline
Stack: Kafka, Spark, Snowflake, Docker, Python
Olist E-Commerce Analytics
Stack: dbt, Snowflake, Power BI, AWS, Python
Diabetes Prediction Web App
Stack: Python, Scikit-learn, Streamlit, Machine Learning
Multiple Disease Prediction System
Stack: Python, Scikit-learn, Classification, Streamlit
Hotel Bookings Cancellation Prediction
Stack: Python, Pandas, Scikit-learn, Machine Learning