Wroclaw, Poland
--
My github
My linkedin
Jakub Kiełbasiewicz
Technical Skills
Languages:
Java, Python, R, SQL
Frameworks/Libraries: pandas, matplotlib, numpy, pySpark, psycopg2, cassandra, elasticsearch, sqlalchemy,
pytest
Databases: SQL Server, Oracle, MySQL, PostgreSQL
Big Data: C
assandra, Airflow, Hadoop, Spark (pySpark), Databricks, Elasticsearch, Impala, Hive, Data Lakes
Cloud: Azure (advanced), GCP (medium), AWS (medium)
Other: SSIS, Data Warehousing, ETL, SSAS, Power BI, Tableau, Microstrategy
Project Experience
Toptal Freelance - https://www.toptal.com/resume/jakub-kielbasiewicz
●
●
2019
Prepared architecture and implemented data warehouse solution for US-based insurance company [SSIS,
PowerBI, SQL]
Designed DWH solution along with ETL implementation for books start-up from UK [SSIS, SQL, C#]
Wroclaw University of Science and Technology
●
●
January 2020
Developed Data Quality solution for IoT implementation in manufacturing company, which allowed
assessing the data quality of sensors data - re-modeled database, created metamodel for database (Azure
SQL), designed and developed Power BI dashboard (DAX + Python). Project has involved implementation
of machine learning algorithms - like Isolation Forest. Along with the dashboard I have proposed a new
solution for data quality management in companies - the iterative methodology and step-by-step process
how to work with IoT data quality. [Python, PowerBI, SQL, Azure]
I have architected an award-winning project in Azure - datawarehou.se - online application for storing
digital goods (qualified to the finals of “Young Masters” which has taken place on XXV Teleinformatics
Forum) [Azure, Python, Databricks]
Upwork Freelance
●
2020
Already existing ETL process maintenance and performance tuning
Work Experience
Roche - Cloud Data Engineer
●
Preparing IoT PoC solution in Azure cloud
Aberdeen - Data Engineer
●
●
●
●
●
●
Remote | April 2020
- Now
Remote | July 2019
- April 2020
Master Data Management
Performance tuning
Advanced string matching algorithms improvement
Elasticsearch clusters maintenance
Leading migration to AWS (re-designing the whole data product - AWS EMR, s3, RDS, Lambda, Athena,
Glue)
Tests automation
Ryanair - BI Developer
Wroclaw, Poland |
January 2019-June 2019
●
●
Created ETL processes - using hadoop, python scripts for 3rd party API integration
Modeled new data marts
●
●
Developed Data Quality checks
Preparing and running PoC solutions in Azure(Databricks, ADF), graph databases, Power BI
Ryanair - ETL & Automation Analyst
●
●
Wroclaw, Poland |
May 2018-December 2018
Query performance tuning
Data integration automation (R, Python, SQL, MDX)
Tech Data Client Solutions - BI Consultant
Wroclaw, Poland |
June 2016-April 2018
●
●
Current workloads maintenance
Resolved ETL issues
●
Creating end-to-end reporting environment and solutions for Tesco Mobile (Microstrategy)
Education
Udacity - Data Engineer Nanodegree
Udacity - Data Analyst Nanodegree
University of Colorado - Data Warehousing for Business Intelligence
Wroclaw University of Science and Technology -
Computer Science Bachelor’s degree
University of Wroclaw - Economics Bachelor’s degree
Coursera - Tens of cloud/python/big data/machine learning courses
Certifications
GCP: Associate Cloud Engineer
Microsoft Certified: Azure Administrator Associate
MCSE: Data Management and Analytics
MCSA: SQL BI Reporting
MCSA: SQL 2016 Database Development
MCSA: SQL 2016 BI Development
MCSA: SQL 2016 Database Administration