Andrew Lister
Monitoring and Automation Specialist | Site Reliability, DevOps & Infrastructure Engineer
https://www.linkedin.com/in/andrewlister
apac
https://github.com/AndrewAPAC
SUMMARY
SKILLS
Senior Site Reliability Engineer (SRE) with over 10 years in DevOps,
monitoring and infrastructure optimization, and 30 years in tech and finance.
Expert in monitoring, automation and system reliability, with a proven track
record of reducing operational costs and improving system performance
across global financial and tech environments. Australian citizen and Hong
Kong permanent resident seeking remote opportunities.
Monitoring
PROFESSIONAL SUMMARY
Reliability and Resiliency
I have reduced outages from 15-20 per month to 0-2, shortened time to
resolution and improved observability by introducing accurate monitoring
and alerting. I have done this at multiple companies, most notably
Citibank with 23,000 hosts, transforming the global support landscape.
Automation
Automation is a passion and I automate everything in the office and at
home. I have been identifying repetitive tasks and replacing them with
efficient automated solutions for over 30 years. My attitude is to save
15 minutes a day, then use those 15 minutes to save more time.
Platform Engineering
As a Unix user since 1988 and a Linux user since 1992, I have explored
most aspects of networking, administration, storage and other OS-related
tasks, which I can draw on to come up with innovative solutions.
Technical Leadership
I have acted as a technical lead and mentor for up to 6 team members
since 2012. I provide code reviews, design recommendations and other
guidance, drawing on extensive experience across many areas of IT.
Excellence in Coding, Standards and Documentation
I like to do things the right way and strive to remove any duplication of
code or configuration. Comments, structured documentation and
source control are all critical in a well-functioning IT environment.
Examples at the end of this document.
EXPERIENCE
Career Break
09/2023 - Present
ITRS Geneos
Zabbix
• Used Selenium and Python to access personal financial data, download it,
store it in MySQL and visualise it on Grafana dashboards.
• Fully automated YouTube Shorts content creation using multiple AI
platforms, generating 5 short-form videos with 2 minutes of effort.
• Created a command-line home automation application to control
IoT devices from cron, systemd and similar tools.
• Continuing to develop my Python application framework module:
https://github.com/AndrewAPAC/alx-common.
• Keeping up to date with technology trends and evolving AI platforms.
Sensu
Prometheus
Datadog
Elastic
Development
Python
C/C++/C#
Bash/Shell Scripting
Agile
Jenkins
Confluence
Perl
CI/CD
Bamboo
git
Go
GitLab
Jira
Java
Other
Docker
Kubernetes
Infrastructure As Code
Linux/Unix
Windows
PostgreSQL
Sybase
GCP/AWS
Ansible
Terraform
MySQL
Oracle
SQL Server
STRENGTHS
Monitoring
Automation
Coding
Communication
Bali, Indonesia
Travel and personal development. Remained hands-on with home automation,
content creation and other personal projects:
Grafana
Accurate monitoring is a passion and the
cornerstone of site reliability. Early detection
of issues, metric collection and dashboards,
through to postmortem analysis.
Have the attitude of 'automate everything'
and a strong track record of doing just that
regardless of complexity.
I have been coding since high school in 1985,
starting on the Commodore 64. Developed code in
multiple languages on different platforms and
take pride in modularization and defining best
practices.
Strong believer in structured documentation,
issue tracking and transparency. Always keen
to explore better ways of doing things.
EXPERIENCE
EDUCATION
Senior Lead Engineer, SRE and Innovation
Bachelor of Science
CLSA
Flinders University
01/2022 - 08/2023
Hong Kong
CLSA is a leading brokerage and investment group in Hong Kong
• Designed and implemented an overhaul of the ITRS Geneos monitoring
framework, reducing configuration complexity by 70-90%.
• Oversaw and mentored an offshore team: collaboration, training and
development.
• Developed a Python sampler to monitor hardware using the Prometheus API,
decreasing hardware-only monitoring costs by 45% across 3,000 hosts.
• Created an innovative alerting solution, reducing issue resolution time by 35%.
• Optimized Jira workflows, streamlining issue management by 80%.
Senior Site Reliability Engineer – APAC / EMEA Lead
Slync
09/2020 - 12/2021
Remote
A platform specializing in tracking logistics and shipping
• Led the transition to kube-prometheus-stack, improving system observability
and usability by 45% as well as saving on monitoring costs.
• Reduced deployment errors by 60% through customised Helm charts and
GitLab CI/CD.
• Fully automated monthly client reporting processes, saving leadership a full
day of manual work and significantly reducing errors.
DevOps Officer and Linux Administrator
BFAM Partners
02/2019 - 06/2020
Hong Kong
A Hong Kong based investment manager focusing on equity strategies
• Performed 100% of Linux system administration and DevOps duties for 20
developers and front office support.
• Migrated Zabbix monitoring to ITRS Geneos, enhancing estate visibility by
60%, significantly reducing outages and increasing supportability.
• Centralized Atlassian products with PostgreSQL replication and remediated
the DR solution.
• Automated Linux server build process using Ansible, expediting
deployment and reducing manual configuration.
Global DevOps Engineer and Execution Trader
quantPort Asset Management
04/2016 - 11/2018
Hong Kong
A systematic asset management fund
• Managed 50% of all trading, back office, and 90% of IT functions for the
region specializing in algorithmic trading.
• Boosted developer efficiency by 50% by centralizing common functionality
in a new Python module, removing repetition across the code base.
• Wrote an intuitive front-end for Sensu monitoring, enhancing user
experience and significantly improving issue resolution.
Global Monitoring Lead
Citibank
01/2010 - 01/2013
Hong Kong
One of the major financial services organizations worldwide
• Developed and implemented greenfield monitoring solutions with ITRS
Geneos across global production environments, reducing downtime by up
to 75% and greatly increasing visibility.
• Spearheaded reduction of outages in key financial hubs, employing
strategic methodologies. London went from 15-20 outages per month to
0-2 within one month.
• Engineered bespoke monitoring solutions in Perl, saving the company over
$1 million in licensing costs.
• Engineered and published infrastructure standards creating a new global
initiative.
02/1988 - 11/1990
Adelaide, Australia
• Major in Computer Science and Mathematics
High School Certificate
Gilles Plains High School
02/1983 - 12/1987
Adelaide, Australia
PROJECTS
ALX-common: Open source Python module
2019 - Present
Personal project
I maintain an open source framework which I have
used in multiple companies with excellent results in
standardisation, removal of code duplication and
enhanced productivity.
• https://github.com/AndrewAPAC/alx-common
• https://pypi.org/project/alx-common/
• https://andrewapac.github.io/alxcommon/alx.html