Vivek Singh
India, Dehradun |-| -
Summary
Senior DevOps Engineer with over 10 years of hands-on experience in AWS Cloud and CI/CD
automation. Proficient in provisioning infrastructure with Terraform, optimizing cloud costs, and
handling complex networking solutions. Experienced in collaborating with Data teams and
software developers across banking, security, and gaming industries. Passionate about driving
efficiency, security, and scalability in cloud environments. Seeking a role in a collaborative
organization to leverage my expertise in cloud automation and security best practices.
Technical Summary
•
Cloud & Containerization: AWS, GCP, ECS, Kubernetes, Docker
•
Operating Systems & Scripting: Linux, Windows, Bash, Python, Node.js
•
Infrastructure as Code: Terraform, Ansible, Helm
•
CI/CD & DevOps Tools: Jenkins, GitLab CI, TeamCity, Bitbucket Pipelines, GitHub Actions,
CodePipeline, Octopus, Terraform Cloud
•
Monitoring & Logging: Grafana, New Relic, Prometheus, Elasticsearch, PagerDuty, Sumo
Logic, Graphite, Graylog
•
Databases & Data Platforms: MongoDB, Redis, MySQL, PostgreSQL Databricks
•
Work Management: Jira, Remedy, Snow
Work Experience
Senior DevOps Engineer at A5Labs (Data Division) [April’22 - Present]
•
Optimized AWS ECS services to enhance application performance, ensuring efficient
resource utilization and minimizing latency.
•
Implemented cost optimization strategies, including Savings Plans, resource tagging, and
cost reduction initiatives, achieving up to 70% savings.
•
Managed GPU-accelerated workloads using NVIDIA GPU instances for high-performance
computing.
•
Architect and provisioned AWS infrastructure, including networking, compute, storage,
and monitoring, using a modular approach.
•
Integrated New Relic with AWS services for real-time observability and alerting.
•
Architected and administered Databricks environments using Terraform.
•
Architected and deployed Mongo Atlas Database clusters using Terraform.
•
Developed and maintained CI/CD pipelines with TeamCity and Bitbucket Pipelines,
improving software delivery speed and quality.
•
Enhanced security by enforcing IAM least privilege principles, policy enforcement, WAF
implementation, Security Hub activation, and audit logging.
•
Implemented Hashicorp Vault for secure secrets management in ECS services.
Associate Staff Engineer at Nagarro [Dec’2017 to April’22]
•
Led Kubernetes adoption on AWS EKS, reducing deployment times and improving scalability.
•
Developed and maintained CI/CD pipelines, reducing deployment time by 70% and production
issues by 50%.
•
Strengthened application security by implementing WAF, stricter security policies, and VPN
solutions, significantly reducing security incidents.
•
Researched and evaluated emerging technologies to improve system performance and
reliability.
•
Led cloud migration projects for major South African banks, ensuring seamless transitions with
zero data loss.
•
Initiated Terraform module-based infrastructure provisioning for improved maintainability and
collaboration.
Analyst (Cloud Operations Administrator) at HCL Technologies [Nov’15 to Nov’17]
•
Developed real-time system performance dashboards in Grafana, Kibana, Graylog, Sumo
Logic, and CloudWatch, improving availability by 25%.
•
Reduced AWS costs by 20% through reserved instances, EC2 optimization, and cost
allocation tagging.
•
Implement cloudwatch alerting for desired metrics
Associate (Technical Support Engineer) at WIPRO [March’15 to Oct’15]
•
Managed incident response using ServiceNow.
•
Automated operational tasks with Python scripting.
•
Developed performance monitoring dashboards to track system health.
Notable Projects
•
CI/CD & Secrets Management: Implemented CI/CD pipelines and secrets management in
Databricks while migrating jobs to new workspaces.
•
Bitbucket Runners in Kubernetes: Deployed and maintained self-hosted Bitbucket
runners in Kubernetes.
•
Data Migrations: Migrated production data from AWS DocumentDB to MongoDB Atlas and
from on-prem data centres to AWS FSx using AWS DataSync.
•
Multi-Cloud Cost Optimization: Designed a multi-cloud setup, reducing GPU costs by
70%.
•
Disaster Recovery Strategies: Implemented active-active disaster recovery setups.
•
AWS ECS Optimization: Reduced latencies in ECS services using Global Accelerator and
enhanced auto-scaling mechanisms.
•
Centralized Monitoring System: Developed a unified monitoring system across 8 projects
using Python, Graphite, and Grafana.
•
Cost & Downtime Monitoring: Created a Grafana- and Python-based monitoring system to
track costs and downtime trends for stakeholders.
•
Health Check API: Built RESTful health check APIs using Flask and Express.js.
•
Security & Compliance Tooling: Developed a web application using React, MySQL, and
CVE databases to track plugin vulnerabilities.
•
RabbitMQ Monitoring: Created a custom RabbitMQ monitoring tool to track queue
performance.
Trainings & Certifications
•
•
•
•
•
•
•
•
Kubernetes
AWS
Databricks
Application Security Training
Sumo Logic Administrator – 2020
Sumo Logic Fundamentals – 2020
RedHat Certified Engineer – 2017
RedHat Certified System Administrator– 2017
Education
Bachelor’s degree in technology (Electronics & Telecomm)
University: College of Engineering Roorkee -)
Location: India, Uttarakhand, Roorkee
Languages
• English (Fluent)
• Hindi (Fluent )