Anson McCade
Site Reliability Engineer
Job Location
Job Description
Site Reliability Engineer (SRE) - Network Automation Team
Salary: £100,000 - £120,000 + 15% Bonus
Location: Fully Remote (UK-based candidates preferred)
Are you passionate about building cutting-edge infrastructure solutions and keen to work for a global leader in securing data and applications? We are partnering with a well-recognized company in the cybersecurity sector, known for safeguarding mission-critical assets and proactively combating emerging threats. With a team comprised of experts from top-tier companies like Netflix, Amazon, and Cloudflare, this organization is on a mission to redefine its infrastructure using modern technologies and automation at every layer.
The Opportunity:
This role offers a chance to join an Infrastructure and Cloud Operations team in its early stages of development. They’re looking for an experienced Site Reliability Engineer (SRE) to support their Network Automation Team, playing a pivotal role in designing and deploying the infrastructure needed for their next-generation automation platform. You will have the unique opportunity to influence infrastructure decisions and shape the operational strategy for a globally distributed network.
What You’ll Do:
- Collaborate with the Network Automation Team to build and deploy infrastructure supporting the company’s new automation platform.
- Implement SRE principles such as defining and measuring SLIs, SLOs, and SLAs.
- Establish metrics for data-driven decision-making to enhance availability, reliability, and overall system performance.
- Build and evolve SLO and SLI baselines for network, system, and application performance.
- Troubleshoot complex infrastructure and network issues, engaging in incident response and conducting blameless post-mortems.
- Participate in a 24x7 on-call rotation, addressing escalations and supporting the organization’s global operations.
What You’ll Bring:
- Minimum of 3 years’ experience working with large-scale cloud or CDN infrastructure.
- Proficiency in programming languages such as Python and Go (knowledge of C/C++ is a plus).
- Deep understanding of Linux systems and network protocols (TCP, UDP, DNS, TLS/SSL, HTTP).
- Experience with BGP and Anycast routing is highly desirable.
- Familiarity with DevOps tools and concepts like Infrastructure as Code (Ansible, SaltStack), CI/CD (Gitlab, Jenkins), and monitoring/visualization tools (Prometheus, Grafana).
- Hands-on experience with Docker and Kubernetes.
- Strong background in software engineering best practices and troubleshooting distributed systems.
- Excellent collaboration and communication skills to work cross-functionally with global teams.
Why Join?
This role is ideal for someone with a strong networking background who is looking to drive change and innovation within a leading cybersecurity organization. You’ll be joining a team where infrastructure excellence and automation are at the heart of every project, with the freedom to work fully remotely and the potential to shape a growing department.
For further information, feel free to reach me at 02895213213, or simply apply!
Reference: AMC/RKI
Location: UK, GB
Posted Date: 10/17/2024
Contact Information
Contact | Human Resources Anson McCade |
---|