Principal Lead- Observability and Incident Management

First Abu Dhabi Bank (FAB)


Date: 6 days ago
City: Abu Dhabi
Contract type: Full time
Company Description

Are you ready to join us on our exciting transformation journey at the largest bank in the UAE? This is an opportunity to make a real impact on our customers, employees, shareholders, and communities, as part of the FAB team. We're committed to our grow stronger movement, and as a member of our team, you'll have access to everything you need to advance your career and make a meaningful contribution to our shared success. If you're looking for a career that will help you stand out and make a difference, now is the time to join us. Let's work together to achieve great things.

Job Description

Overall objectives

  • Ensure proactive detection, diagnosis, and resolution of service health issues across all IT environments
  • Establish a modern observability function that delivers full visibility into the critical services, applications and infra layers
  • Own and lead the major incident management process, ensuring rapid containment, clear communication and structured resolution
  • Drive actionable insights through metrics and logs (MTL) and ensure system health telemetry is used to improve availability, performance and user experience
  • Support operational risk reduction and continuous improvement through RCA, trend reporting and resilience engineering

Job scope

Role Specific Responsibilities

  • Monitoring and observability engineering
  • Alerting, noise reduction and event correlation
  • Incident management
  • Poset incident review and RCA
  • Dashboarding and health visibility
  • Service reliability metrics

General Functional Responsibilities

  • Define the observability architecture strategy ensuring scalability, data security and cost optimisation
  • Collaborate with app, infra and security teams to ensure instrumentation coverage and logging compliance
  • Maintain operational documentation, runbooks, escalation matrices and incident playbooks
  • Drive blameless culture of improvement and incident learning
  • Align monitoring practices with regulatory and compliance obligations
  • Represent the observability and incident management function at governance forums
  • Engage with vendors, SaaS providers, and cloud platforms to ensure integration with internal monitoring and incident workflows
  • Coach and mentor monitoring and incident managers to raise maturity across people, processes and tooling

Qualifications

Core competencies required

  • Deep expertise in monitoring platforms e.g., ELK, AppDynamics, Grafana, Elastic, Datadog, APM, synthetic monitoring and log aggregation
  • Solid understanding of distributed systems, microservices and hybrid cloud environments
  • Strong command of SRE, telemetry pipelines, SLI/SLO and alerting strategies
  • Experience running 24/7 incident command processes, leading war rooms, managing comms to executives and driving post-mortems
  • Ability to align observability practices to business-critical services and customer impact, not just infra health
  • Mastery of ITIL event management and incitement management with ITSM platforms like ServiceNow
  • Calm decisive leadership in high pressure scenarios, excellent cross functional coordination and communication skills
  • Overall 15+ years of technology experience is desirable

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

IT Administrative Assistant - UAE National

Technip Energies, Abu Dhabi
2 days ago
Job DescriptionWe are currently seeking a IT Administrative Assistant - UAE National , reporting directly to the Head of the Department to join our team based in UAE, Abu Dhabi.About us:Technip Energies is a global technology and engineering powerhouse. With leadership positions in LNG, hydrogen, ethylene, sustainable chemistry, and CO2 management, we are contributing to the development of critical markets...

Specialist, Production Optimization

ADNOC Group, Abu Dhabi
2 days ago
JOB PURPOSE: Supports TL-OS in planning, managing and coordinating activities to ensure that Production and Injection targets are achieved, field operations are conducted safely and efficiently in accordance with Company policies. Meet division objectives, KPl's, standards.KEY ACCOUNTABILITIES:Job Specific AccountabilitiesMonitoring of OperationsParticipates in the planning, coordination and recommendation of long and short-term production programs. Attend departmental meetings, discussing existing field capacities,...

Sr. Construction Engineer Civil

Penspen, Abu Dhabi
2 days ago
We are urgently looking for Sr. Construction Engineer Civil for one of our Offshore PMC projects in UAE.Qualification RequiredThe candidates should have an Bachelor's degree in Civil Engineering.Roles & ResponsibilitiesSupervise civil construction activities such as foundations, piling, concrete works, structural installations, and subsea infrastructure Ensure construction is carried out according to approved drawings, specifications, and safety standardsLead and coordinate multidisciplinary...