Senior Engineer- Alerting & Incident Management

First Abu Dhabi Bank (FAB)


Date: 6 days ago
City: Abu Dhabi
Contract type: Full time

Join the UAE’s largest bank and one of the world’s largest and safest financial institutions. Our focus is to create value for our employees, customers, shareholders and communities to grow through differentiation, agility and innovation. We are looking for top talent and your success is our success. Accelerate your growth as you help us reach our goals and advance your career. Be ready to make your mark a top company, in an exciting and dynamic industry.



Job Description

Overall objectives

•To establish and maintain an effective, intelligent, and timely alerting framework across infrastructure, application, and business services.

•To coordinate and continuously improve the incident management lifecycle with a focus on early detection, rapid response, and root cause accountability.

•To integrate observability data (logs, metrics, traces) into a unified alerting and incident response workflow.

•To reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) through automation, clear escalation paths, and operational discipline.

Role specific responsibilities

•Manage and continuously improve the incident response process, including triage, escalation, status communications, and resolution tracking.

•Act as the incident commander during major outages or high-severity issues, coordinating technical teams toward resolution.

•Maintain and govern on-call schedules, escalation paths, and responder playbooks.

•Integrate observability tools with incident management platforms to enable real-time, contextual alerting.

•Lead and document root cause analysis (RCA) and ensure completion of follow-up actions and preventive measures.

•Report on incident metrics and trends, identifying areas for resilience and process improvement.

General functional responsibilities

•Maintain detailed documentation on alert rules, incident workflows, contact rosters, and escalation trees.

•Ensure compliance with regulatory, audit, and risk management requirements related to incident response and system availability.

•Collaborate with monitoring, logging, and APM peers to align telemetry signals with operational response.

•Work with development, infrastructure, and support teams to embed alert and incident management best practices in SDLC and change management.

•Participate in regular incident simulations and on-call readiness drills.

•Drive continuous improvement through retrospective reviews, blameless post-mortems, and incident automation.



Qualifications

Core competencies required

Strong experience with alert management platforms such as Opsgenie, Splunk On-Call, ServiceNow Event Management, or VictorOps.

Familiarity with routing rules, escalation policies, noise suppression, on-call schedules, and alert deduplication.

Deep understanding of the end-to-end incident management process—detection, triage, escalation, communication, and closure.

Proficient in running major incident bridges, documenting timelines, and leading post-incident reviews (PIRs/RCAs).

Calm and assertive in high-pressure incident scenarios.

Excellent communicator—able to coordinate with technical and business stakeholders during incidents..

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Lead Consultant - OT Cybersecurity

CPX, Abu Dhabi
10 hours ago
OverviewIn this role, you will provide management within the Cyber Solutions Consulting team in order to support the Director achieve team objectives:Technical oversight of projects and service deliveryCyber Solutions Delivery contribution across presales pursuitsProvide leadership across cross-business unit presales and project engagementsProvide guidance from a technology architecture perspective across CPX internal initiativesResponsibilitiesProvide management and supervision to Cyber Solutions ConsultingDesign solutions...

Repair & Maintainance Technician III - Completions

Weatherford, Abu Dhabi
11 hours ago
Job Overview JOB DESCRIPTION The R&M Technician – Completions is responsible for the repair, maintenance, inspection, and refurbishment of Weatherford completions tools and equipment, including upper and lower completion systems, liner hangers, and related service tools. The role ensures that all equipment is serviced to the highest safety and quality standards before deployment to the field. Senior technicians may also...

Inside Sales Engineer

Westinghouse Electric Company, Abu Dhabi
1 day ago
Welcome to the future of nuclear energy, where Westinghouse Electric Company is leading the field with expertise and innovation to shape the power of tomorrow. At Westinghouse, innovation is in our DNA. We are creative. We think differently. We reimagine the possible across the nuclear industry every day.As a Inside Sales Engineer you will be the Westinghouse Parts Business (WPB)...