- Home
- Remote Jobs
- Team Lead, Site Reliability Engineering
Date Posted:
8/27/2025
Remote Work Level:
100% Remote
Location:
Remote in United Kingdom
Job Type:
Employee
Job Schedule:
Full-Time
Career Level:
Manager
Travel Required:
Yes
Education Level:
We're sorry, the employer did not include education information for this job.
Salary:
We're sorry, the employer did not include salary information for this job.
Categories:
IT, System Administrator, Tech Support, Product Manager, Project Manager, Software Engineer
Benefits:
Education Assistance, Paid Illness Leave, Paid Time Off, Career Development, Community Service
About the Role
Location: London
Type: Full Time
Workplace: remote
Category: Site Reliability Engineering
Job Description:
Team Lead, Site Reliability Engineering
Regular travel to Brighton required | United Kingdom | Remote | Work from Home
Why Pythian:
At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the reliability and performance of mission-critical databases. We quickly earned a reputation for solving tough data challenges. We were there when the industry moved from on-premises to cloud environments, and as enterprises sought more from their data, we expanded our competencies to include advanced analytics.
Today, we empower organizations to embrace transformation and leverage advanced technologies, including AI, to stay competitive. We deliver innovative solutions that meet each client’s data goals and have built strong partnerships with Google Cloud, AWS, Microsoft, Oracle, SAP, and Snowflake. The powerful combination of our extensive expertise in data and cloud and our ability to keep on top of the latest bleeding edge technologies make us the perfect partner to help mid and large-sized businesses transform to stay ahead in today’s rapidly changing digital economy.
Why You:
Pythian is building a next-generation Site Reliability Engineering team, and we’re looking for a talented, and experienced Team Lead who thrives in fast-paced, problem-solving environments.
As a Team Lead, you’ll be responsible for leading a team of site reliability engineers that are designing, deploying, and operating large-scale distributed systems across compute, storage, networking, and AI/ML environments. You will act as the primary technical escalation point, oversee day-to-day operational delivery, mentor and coach team members, and ensure adherence to SLAs and quality standards. You may also directly contribute to delivery by leading projects from architecture to automation to intelligent monitoring, collaborating with both clients and teammates to build resilient, high-performing infrastructure.
If this is you, and you wonder what it would be like to work at Pythian, reach out to us and find out! Intrigued to see what a life is like at Pythian? Check out #pythianlife on LinkedIn!
What you will be doing:
-
- Team Leadership & Operational Management:
- Lead and mentor a team of Site Reliability Engineers to ensure technical excellence, timely resolution of incidents, and professional growth of team members.
- Oversee queue management, ticket prioritization, and workload distribution to meet SLA and utilization targets.
- Act as the primary point of contact for critical escalations and severity-1 incidents, providing guidance and technical direction.
- Conduct performance reviews, and knowledge-sharing sessions to strengthen the team’s capabilities.
- Collaborate with management on performance metrics, process adherence, and resource planning.
- Sets specific goals and objectives for team members as part of Pythian’s goal planning program. Provides guidance to team members in regards to training opportunities as part of Pythian’s self-directed training program. Meets regularly with team members for one-on-one sessions to disseminate information and gain feedback on opportunities for improvement.
- Technical Responsibilities:
- Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems.
- Automate workflows using Go, Python, and Shell scripting.
- Build monitoring and observability solutions with Prometheus, Grafana, and Loki.
- Troubleshoot complex networking, storage, and system performance issues.
- Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines.
What you bring:
-
- A minimum of 3 years previous experience leading a team.
- Experience with Google Cloud, plus IaC tools (Terraform).
- Strong knowledge of microservices, containers (Kubernetes, Docker), and networking.
- Hands-on experience with PKI, service mesh, and Linux systems administration.
- SRE mindset with a focus on automation, scalability, and reliability.
What you get in return:
-
- Love your career: Competitive total rewards package. Blog during work hours. Hone your skills or learn new ones with our substantial training allowance; participate in professional development days, attend training, become certified, whatever you like!
- Love your work/life balance: Flexibly work remotely from your home, there’s no daily travel requirement to an office! All you need is a stable internet connection.
- Love your coworkers: Collaborate with some of the best and brightest in the industry!
- Love your workspace: We give you all the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalize your work environment!
- Love yourself: Pythian cares about the health and well-being of our team. You will have an annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness and more). Additionally, you will receive a generous amount of paid vacation and sick days, as well as a day off to volunteer for your favorite charity.