At Kiteworks, we are on a mission to empower millions of users to securely share sensitive content with the outside world. Our platform delivers unparalleled enterprise security and governance for critical content exchange — actively preventing data leaks and fortifying defenses against sophisticated cyberattacks.
Role Overview
We are looking for a Cloud Operations & Reliability Manager to lead and evolve our Cloud Operations function across EMEA.
This role is about building and improving CloudOps capabilities globally, not just keeping systems running. Our cloud environment is complex and continues to grow through both organic development and acquisitions — which makes this an exciting space to step into. If you get energy from bringing clarity to evolving environments, aligning teams around shared goals, and helping Cloud Operations scale in a thoughtful and sustainable way, this role is for you.
This is a manager‑level leadership role that combines people leadership, delivery ownership, and technical depth. You will lead a team, work closely with senior engineers, and play a key role in driving CloudOps and platform initiatives forward as the environment continues to mature.
You will partner closely with Engineering, Security, IT, Architecture, and Product teams, acting as a key Cloud Operations leader for EMEA and helping shape how Cloud Operations grows and evolves across the region.
What You’ll Do
Delivery & Agile Execution
· Drive customer‑facing infrastructure initiatives across Kiteworks and acquired entities
· Lead Cloud Operations delivery using Scrum and Agile methodologies
· Own CloudOps delivery end to end, including:
o Requirements gathering, design, and implementation
o Backlog creation and prioritization
o Sprint planning, execution, and retrospectives
o Capacity planning and workload forecasting
o Tracking velocity, burn‑downs, and operational KPIs
· Balance business‑as‑usual operations and incident response with strategic platform and reliability improvements
· Provide transparency and predictable delivery across operational and project‑based work
· Operate effectively in a global environment
Cloud Operations & Reliability
· Own day‑to‑day cloud operations, ensuring platform stability, reliability, and performance
· Lead Site Reliability Engineering (SRE) practices, including:
o Incident management and escalation
o Root cause analysis (RCA) and post‑incident reviews
o Definition and management of SLIs, SLOs, and error budgets
· Ensure production systems are highly available, scalable, and resilient
· Drive continuous improvements in monitoring, alerting, observability, and operational readiness
· Establish and maintain strong on‑call, incident response, and post‑incident review processes
Platform & Technical Leadership
· Provide technical oversight and guidance across the cloud platform, including:
o Kubernetes (EKS, AKS, GKE, or equivalent)
o ArgoCD and GitOps‑based deployment models
o Cloud‑native and managed databases (RDS, Aurora, PostgreSQL, MySQL, NoSQL, etc.)
· Lead architecture decisions related to reliability, security, scalability, and performance
· Partner closely with Engineering, Product, IT, Security, and Architecture teams to execute platform roadmaps
· Maintain strong governance around change management, release processes, and environment stability
· Lead and manage a team of CloudOps / SRE engineers
What You’ll Bring
· Strong experience in Cloud Operations, SRE, or Platform Engineering supporting production systems
· Hands‑on knowledge of cloud infrastructure and Kubernetes‑based platforms
· Experience with Kubernetes monitoring and observability stacks:
o Prometheus, Grafana (metrics)
o Loki, ELK (Elasticsearch, Logstash, Kibana), Fluentd / Fluent Bit (log aggregation)
· Practical experience implementing SRE principles, including SLIs/SLOs, error budgets, incident response, and RCA
· Experience leading Agile delivery for operational or platform teams
· Familiarity with GitOps and modern CI/CD practices (e.g. ArgoCD)
· Solid understanding of cloud‑native and managed database technologies
· Proven ability to balance operational stability with continuous improvement and technical debt reduction
· Clear communicator with a pragmatic, systems‑thinking mindset and a strong bias toward reliability and resilience
Job Benefits
Commitment to Equal Opportunity & Inclusion
Kiteworks is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances.
Other Requirements
Ability to meet Kiteworks, customer, and/or government security screening requirements for this role. These requirements include, but are not limited to, the following specialized security screenings:
Kiteworks Background Check: This position requires passing the Kiteworks background check upon hire/transfer and every two years thereafter.