Kyriakos Sideris
Location: Athens, Greece
Nationalities: Greek, British
Pronouns: he/him/his
Date of birth: 15-Nov-1982
E-mail:
Mobile:
Things you should know about me
• For the past 10 years, I have worked as a Data Engineer in companies of various sizes, from
startups with a handful of employees to a large conglomerate like Expedia Group and research
organisations like the University of Bristol.
• I started my Data Engineering journey with big data technologies like Kafka and Spark, and it
has since extended to other areas such as data quality, GDPR compliance and integrations with
3rd party tools (CRMs, marketing and sales platforms etc.).
• In my past roles, I led teams of various sizes while maintaining technical responsibilities.
Education
- MSc in Communications and Signal Processing, Newcastle University, UK
Grade: Distinction (77/100)
- BEng and MSc in Computer Engineering and Informatics, University of Patras, Greece
Grade: 7.7/10
Work Experience
• January 2024 – today: Senior Data Engineer, freelancer (remote)
o Migration of Elasticsearch cluster
o Re-architecting the ingestion pipeline for social media data used for NLP, on AWS
o Leading a team of three engineers
• March 2022 – January 2024: Senior Data Engineer at Hotjar (remote)
o Change Data Capture solution for Postgres with Debezium
o GDPR pipelines for the right to be forgotten and the right to access requests
o Data quality monitoring system with Airflow
o Data ingestion and ELT pipelines with 3rd party tools
o Leading a team of four engineers
• August 2018 – March 2022: Senior Dev/Big Data Engineer at Expedia Group (UK)
o Design, development and testing of
§ features for the Kafka-based Data Highway (DH)
§ Hive Sink project for Kafka-based EG Stream Platform
o Development and productionisation of Data Science models
o Proof of Concept development of customer-facing analytics with Apache Pinot
• June 2017 – August 2018: Senior Data Engineer at DeepCrawl (UK)
o Design and implementation of SEO platform in Scala
o Development and Ops with Apache Spark, Mesos, Kafka and Cassandra on the DC/OS platform
o Migration of legacy Ruby-based platform into map-reduce Spark pipelines
• January 2017 – June 2017: Data Engineer at Adarga Ltd (UK)
o Design and implementation of micro-services in Python, Docker and Kubernetes
o Development on Apache Spark, Kafka and TensorFlow
o Architecture design of the NLP pipeline and data models
• December 2014 – December 2016: Research Associate at the High-Performance Networks Group,
University of Bristol (UK)
o Design of network analytics platform with Apache Spark, Kafka, Cassandra and HDFS
o Application development on network controllers like OpenDaylight and ONOS
o Experimentation on mobility patterns in a Smart City environment
• January 2011 – November 2014: Software Design Engineer at Imagination Technologies Group
plc (UK)
o Analysis, design and implementation (MATLAB, C, C++) and testing (Python) of various parts
of the DVB-T2 IP firmware, including DSP and system components
o Performance optimisation of DVB-T2 IP targeting cycles and memory usage on Ensigma
Series3 PRU in Python
o Development and maintenance of debugging tools, support in IP packaging, release processes
and ticket resolution
• January 2008 – September 2008: Software Engineer at Research Academic Computer Technology
Institute (RA CTI, University of Patras Campus, Greece)
o Web development using PHP, JS, CSS, on Apache and MySQL
o Desktop application development, using Visual Basic .NET, SQL server
Selected Research Activity
• October 2008 – August 2009: Identification of Nonlinear Dynamical Systems using Neural
Networks.
As part of my MSc project I worked on the implementation of the EM algorithm using Extended
Kalman Smoother in the Expectation step and RBF Neural Networks in the Maximisation step in
order to identify Nonlinear Dynamical Systems. (School of Electrical, Electronic and Computer
Engineering, Newcastle University, UK)
• December 2016: Seer: Empowering Software Defined Networking with Data Analytics.
In this published work, we presented a flexible, highly configurable data analytics platform for
network intelligence based on software-defined networking and big data tools. Seer combines a
computational engine with a distributed messaging system to provide a scalable, fault-tolerant and
real-time platform for knowledge extraction. (K. Sideris, R. Nejabati and D. Simeonidou, IUCC-CSS
2016)
Technical Skills
• Python, Scala and Java for backend development in Docker and Kubernetes with Spring Boot or Play
framework, and ML development and data engineering on Apache Spark, Databricks
• Building streaming solutions with Apache Kafka, Connect, Schema Registry, Debezium etc.
• Operations on-premises and in AWS (EC2, S3, Athena, Lambda, Redshift, Glue etc.), and Terraform
• Developing CI/CD pipelines with Jenkins, GitLab pipelines and GitHub Actions
• Monitoring with Datadog, Grafana and Prometheus
Additional Skills and Hobbies
• Fluent speaker of English and Greek. Novice in Spanish.
• Coursera enthusiast (Big Data, Functional Programming, Machine Learning, Cryptography, etc.)
• Regular attendee of data science, engineering and BI meetups
• Runner (10K and half marathons) and squash player
References
Upon request