Site Reliability Engineer – SRE Consultant

Full time
Location: Manchester
Job offered by: Akkodis
Category: IT & Technology

Akkodis are currently working in partnership with a leading service provider to recruit an experienced Site Reliability Engineer with experience in ensuring the reliability, scalability and efficiency of client platforms.

Please note this is a fully remote role with occasional travel to client sites. You must be eligible to gain security clearance (you do not need to hold it currently).

The Role

As a Site Reliability Engineer (SRE) you will lead site reliability engineering initiatives with a strong emphasis on observability, ensuring high performance and reliability of applications and infrastructure. You will provide strategic insights to shape the overall SRE strategy while collaborating on the design and implementation of scalable and reliable solutions. You will also establish effective monitoring, alerting and incident response strategies to maintain system availability, and promote continuous improvement by working with team members to deliver observability best practices and SRE methodologies.

The Responsibilities

Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to measure and maintain system and application performance, ensuring services meet agreed reliability targets.
Instrument applications to collect key metrics, logs and traces that enable proactive monitoring and troubleshooting.
Create dashboards and configure alerts to provide real-time visibility into system health, enabling teams to quickly detect and resolve issues.
Assess and enhance Kubernetes capabilities, improving DevOps efficiency through innovation, agility and cost optimisation.
Take a holistic approach to modernising the developer experience, focusing on organisational culture, DevOps practices, processes, automation and tooling.
Architect scalable and resilient cloud infrastructure to ensure the seamless deployment and optimisation of containerised applications.
Collaborate with cross-functional teams to implement automation strategies that reduce operational complexity and drive continuous improvement.

The Requirements

Strong understanding of the SRE mindset and principles, including the creation and management of Service Level Indicators (SLIs), Service Level Objectives (SLOs) and error budgets to ensure reliability and performance.
Experience in implementing observability and instrumenting applications to provide insights into system performance. Hands-on experience with tools such as Dynatrace, Prometheus and OpenTelemetry for monitoring, tracing and real-time alerting is highly sought after.
An understanding of microservices and container orchestration, with the ability to optimise containerised applications for reliability and scalability.
Experience enabling continuous delivery pipelines, with a focus on ensuring system reliability, quality and performance through automated deployment, scaling and observability tools.
Understanding of build and deployment pipelines, and experience in collaborating with developers to improve observability and monitoring practices.
Strong collaboration skills, with the ability to work effectively both independently and as part of a team.
Comfortable interacting and engaging with clients, although a consulting background is not a prerequisite.
Enthusiasm for working with a wide range of technology stacks and cloud providers across the many clients and industries we support.

If you are looking for an exciting new challenge with a leading consultancy, please apply now.

