Incident Management Engineer, AWS Incident Detection and Response

Full time
Location: Newtownabbey
Job offered by: Amazon.com

ABOUT US

Amazon has built a reputation for excellence with a mission to be the earth’s most customer-centric company, a company that customers from all over the globe will recognize, value, and trust for both our products and our service. Amazon Web Services (AWS) is carrying on that tradition while leading the world in cloud technologies.

The AWS Incident Detection and Response team is part of the Enhanced Support Services (ES2) organisation within AWS Support, and is dedicated to offering eligible AWS Enterprise Support customers proactive engagement and incident management to reduce the potential for failure and to accelerate recovery of critical workloads from disruption. We achieve these objectives by working closely with customers to develop runbooks and response plans customized to the context of each workload onboarded to the service. Onboarded workloads are monitored 24x7 by a team of Incident Management Engineers (IMEs) to detect and engage customers on a call bridge within 5 minutes of a critical alarm.

ABOUT YOU

Incident Management Engineers have a broad skill set with demonstrated career progression and a proven track record of delivering results. The successful candidate will possess strong analytical acumen, solid technology experience, superb business judgment, strategic account ownership and a propensity to dive deep to solve complex problems. You will also have a passion for providing a world-class experience for our customers. The candidate must understand the competitive and industry landscape and must have the leadership presence and communication skills to work effectively with customers at all levels of their organization. You must be a self-starter, able to execute at both a tactical and strategic level with strong attention to detail. This is a global role that requires excellent written and verbal communication skills and a passion for leading the resolution of critical incidents. Your decisions are not only fundamental to helping protect our most critical customers but will also help maintain the health of AWS customers worldwide.

Finally, you are passionate about technology with a desire to learn more and do more with AWS.

ABOUT THE ROLE

AWS Support is looking for a leader with a strong background in Incident Management and customer ownership to be there during the moments that matter for our most critical customers. We are looking for a Major Incident Manager to join our team to provide incident response and account ownership. In this position, you will play a pivotal role in providing communication, emergency response, technical resolver engagement and incident management for our customers.

Key job responsibilities

- Drive the resolution of large-scale, customer-impacting incidents as part of a team rotation
- Drive critical, complex customer escalations, in situations that are sometimes technically challenging, in collaboration with Engineering Teams
- Provide critical incident response/management (including leading calls with internal/external participants) for customers' critical workloads
- Contribute to Problem Records for customers
- Conduct continuous real-time proactive monitoring of customer metrics
- Prioritize, manage, and own emerging and developing customer issues from start to finish
- Monitor and manage communications during high-impact events via relevant channels
- Collaborate with key stakeholders across AWS to improve the customer experience and develop mechanisms that support operational excellence
- Lead projects and virtual teams to drive operational improvements
- Create and review documentation; design/influence new standard operating procedures
- Identify and troubleshoot recurring platform issues and own projects to drive improvements
- Mentor peers in your areas of technical and operational strength
- Perform other duties as required by the organisation

Please note that while this role is open to applicants in Dublin, IMEs work within a follow-the-sun organisation, with core hours of 7am to 3pm GMT or 8am to 4pm GMT+1. Successful applicants will be required to work some weekends (on a Sunday to Thursday or Tuesday to Saturday pattern) and public holidays.

Basic Qualifications

- 1+ year of experience in a similar role
- 2+ years of virtualization, orchestration and cloud computing (e.g. Hypervisors, VMware, Xen) experience
- 1+ year of network and operating system support experience
- Bachelor's degree in computer science or equivalent, or 3+ years of technical support experience

Preferred Qualifications

- Experience creating or designing cloud application architectures with a focus on high availability and fault tolerance
- Experience with data manipulation and/or automation using Python, JavaScript or shell scripting
- Effective prioritization and time management skills and an ability to work in ambiguous environments
- Demonstrated critical thinking and logical problem solving skills
- Familiarity operating or designing distributed architectures with the ability to correlate system behaviours based on known inter-dependencies

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build.
