Job Title: Senior Site Reliability Engineer
Department: Technology
Location: Sheffield, London, Talbot Green or Yoevil
Working Pattern: Hybrid, includes 3 days each week in the office
Contract Type: Full time, permanent
Salary: Up to £90,000 per annum
At Tes we are on a mission to power schools and enable great teaching worldwide, by delivering EdTech solutions that give educators the tools to succeed. From safeguarding and compliance to staff and pupil management, our innovative and flexible software and services help teachers and school leaders worldwide to provide the best education to millions of children.
With more than 13 million educators in our community, combined with our working relationships with 25,000 schools in over 100 countries, we have been making a difference for over 100 years.
Role overview
As a Senior SRE Engineer, you will be pivotal in designing and implementing best SRE practices while fostering a culture of continuous improvement and optimization. You will collaborate closely with development and operations teams to improve the platform stability and performance, ensuring that our systems are reliable, secure, and scalable.
Key Responsibilities
Infrastructure Management:
- Manage and scale cloud-based infrastructure (e.g., AWS, Azure, GCP).
- Apply Infrastructure as Code (IaC) principles for provisioning and configuration management.
Security and Compliance:
- Collaborate with the security team to implement best practices for system and data security.
- Ensure systems comply with relevant industry standards and regulations.
Monitoring and Performance:
- Set up and maintain monitoring and alerting systems for early issue detection and resolution.
- Continuously optimize system performance and resource usage.
Documentation:
- Create and maintain thorough documentation for SRE/platform processes, tools, and practices. Exposure to Jira and equivalent tool would be beneficial
What will you need to succeed?
Experience
- Proven experience in a SRE/DevOps/Platform role, with a strong background in both software development or operations.
- Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI/CD, Travis CI).
- Proficiency in scripting and automation (e.g., Bash, Python, Ansible).
- Strong experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
- Strong hands-on experience of at least one major public cloud platforms (e.g., AWS, Azure, GCP).
- Strong problem-solving and troubleshooting abilities in a timebound situation (Major incidents).
- Clear communication and incident management experience.
- Demonstrable strong hands-on experience with Terraform.
- Knowledge of microservices architecture.
- Familiarity with security best practices and tools.
- Demonstrable experience of monitoring / observability tools preferred Grafana, Prometheus, PagerDuty, uptime.
Knowledge
- Cloud Platforms: Strong knowledge of AWS, Azure, or GCP, including cloud architecture, services, and security models.
- Containerization & Orchestration: In-depth understanding of Docker and Kubernetes for deploying and managing containerized applications.
- Infrastructure as Code (IaC): Knowledge of IaC frameworks, particularly Terraform, to manage cloud infrastructure via code.
- Microservices Architecture: Familiarity with microservices design patterns and deployment strategies in a cloud-native environment.
- Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking.
Skills
- CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI/CD pipelines.
- Scripting & Automation: Proficiency in scripting languages like Bash and Python, along with automation tools such as Ansible for managing configurations and deployments.
- Containerization & Orchestration: Practical skills in deploying and managing containers using Docker and orchestrating workloads using Kubernetes.
- Cloud Platform Management: Expertise in managing and scaling cloud environments on AWS, Azure, or GCP, leveraging services for compute, storage, networking, and security.
- Infrastructure as Code (IaC): Skilled in using Terraform to automate provisioning and management of cloud infrastructure.
- Troubleshooting & Problem Solving: Strong analytical skills for identifying and resolving complex system issues, especially in production environments.
- Collaboration & Communication: Excellent ability to work under pressure e.g. in a Major incident.
Qualifications
- Certifications (Preferred): Holding certifications such as AWS Certified DevOps Engineer, CKA (Certified Kubernetes Administrator), or other relevant credentials.
What do you get in return?
- 25 days annual leave rising to 30
- 5% pension after probation
- State of the art offices
- Access to a range of benefits via My Benefits World
- Free eye care cover
- Life Assurance
- Cycle to Work Scheme
- EAP (Employee assistance programme)
- Quarterly Tes Socials
- Access to an extensive Learning and Development menu
Who are Tes?
Tes is a global Edtech leader, on a mission to empower schools and educators to deliver impactful, inspiring learning experiences worldwide. We understand the unique challenges faced by schools, and our ecosystem is specifically designed to address these needs head-on.
Our intuitive technology streamlines complex tasks, enhances learning experiences, and alleviates the administrative burdens that often overwhelm schools.
By working closely with schools, we provide up-to-date resources, expert guidance, and a technology ecosystem dedicated to innovation and excellence in education. Whether simplifying administrative workflows, creating dynamic classrooms, or advancing professional development, Tes is the trusted partner for schools worldwide.
Join the hundreds of schools already benefiting from the Tes ecosystem. Together, we empower educators to achieve more, ensuring every student thrives in a supportive, well-managed learning environment.
With Tes, excellence in education is not just a goal - it's a sustainable and rewarding reality. For more information about Tes, our products, and our team, visit: www.tes.com.