Software Reliability Engineer, Senior

·
Full time
Location: Abingdon
·
Job offered by: Expert Employment
·
Category:
Software Reliability Engineering combines software development and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

Software Reliability Engineers influence the whole lifecycle of services from inception and design, through deployment, operation, and refinement.

Key Skills

Python 3. Understanding of Docker and Kubernetes. Strong in Software Engineering: development lifecycle, DevOps, code release management, and development tools. Ability to debug and optimize code and automate routine tasks. Good to have: Cloud technology (GCP/AWS/Azure/Java).

Responsibilities Maintain and improve services once they are live by measuring and monitoring availability, latency, and overall system health. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Engaged in incident response and blameless postmortems. Maintain a broad knowledge of state-of-the-art computer technology, equipment, and systems; participate in professional development activities as appropriate. Support Software Development tooling such as Rundeck, Pagerduty, Stackdriver, PAM access (Cyber Ark), Operational Readiness (Internal process), DR/Incident Drills, Incident reports, Cost Dashboards, Billing exports, certificates, etc.

#J-18808-Ljbffr

Recent Jobs

London (On site) · Full time

Are you a smart, driven professional who takes pride in making a difference in local communities? Turner & Townsend’s Real Estate division is experiencing significant growth and we’re looking for an experienced industry professional with health project experience to join our high-performing and collaborative Project Management team. Why Join Us? Impactful Work: Contribute to social [...]Read More... from Assistant Project Manager – Healthcare See details

Chasetown (On site) · Full time

My client, Autosmart International are a manufacturing success story! Site Operations Manager – leading fast-paced manufacturing and warehousing About Our Client Autosmart International is a manufacturing success story, leading the field in vehicle cleaning products. We are the No.1 choice of automotive trade customers across the UK. We have doubled in size in the last [...]Read More... from Site Operations Manager See details

London (On site) · Full time

CSS are looking for an experienced duty officer to join our client’s team who are a local council responsible for all areas within the Tendering district. Working hours: All shifts are 8 hours long with various start times available: Monday to Friday – start times between 6AM – 3PM Saturday & Sunday – 6AM – [...]Read More... from Duty Officer See details