DevOps Engineer
Google Cloud Platform infrastructure: • Compute Engine • Red Hat Enterprise Linux (RHEL) virtual machines • Windows virtual machines • Local SSD • Persistent Disk • Cloud Networking • Cloud Logging • Cloud Monitoring • Cloud Storage • Cloud Key Management • Cloud Secret Manager • Dynatrace • GitHub Enterprise • HashiCorp products, including Terraform and Packer • Jenkins • Backstage • Jira Cloud and Jira Align Skillset Required : Google Cloud Platform Infrastructure Engineering: Proven experience designing, building, and operating secure, automated cloud platform capabilities, with a focus on Google compute products and services. • Infrastructure as Code: Proficiency with Terraform (minimum), Jenkins, and modern CI/CD systems (GitHub Actions, Harness, Jenkins). • Networking & Security: Experience with GCP Cloud Armor, GCP Networking, and embedding secure-by-design controls from design to runtime. • Automation & Observability: Implementing actionable observability, performance tuning, and automation to reduce toil. Defining and operating against SLOs/SLIs. • Scripting & Tooling: Scripting in Bash, PowerShell, or Python. Familiarity with HashiCorp Vault, Harness, and Backstage is desirable. • Team Leadership, Collaboration & Mentoring: Ability to lead and motivate a team of infrastructure engineers, ensuring cross-team collaboration and mentoring. • Certifications: Relevant GCP certifications are desirable. Scope of services As an infrastructure engineering lead within the Public Cloud Services Compute (GCP DCX) team, the scope of service includes: • Design, Build, and Operate: Deliver and maintain secure, automated Google compute capabilities, supporting Red Hat Enterprise Linux (RHEL) and Windows virtual machine and broader compute and networking products and configurations. • Platform Enablement: Enable product teams to deliver Google IaaS solutions at pace, leveraging reusable patterns and robust integration tools. • Infrastructure Automation: Develop and maintain Infrastructure as Code (IaC) solutions for provisioning and managing Google resources, ensuring repeatability and compliance. • Security & Compliance: Embed security best practices and controls throughout the platform lifecycle, safeguarding organisational and customer data. • Performance & Reliability: Define, monitor, and operate against service level objectives (SLOs/SLIs), ensuring high availability, performance, and fault tolerance. • Continuous Improvement: Drive automation, observability, and performance tuning to reduce manual effort and improve platform reliability. • Collaboration: Work closely with architecture and feature teams to evolve the cloud roadmap and platform products, contributing to documentation and enablement. • Team Leadership, Mentoring & Standards: Lead, motivate, and mentor a team of infrastructure engineers and uphold engineering standards, fostering a culture of continuous learning and improvement.