Engineer - Machine Learning
Presight
Date: 2 weeks ago
City: Abu Dhabi
Contract type: Full time
Overview
The Company
Presight, an ADX-listed public company limited by shares whose majority shareholder is Abu Dhabi company G42, is the region’s leading big data analytics company powered by Artificial Intelligence (“AI”). It combines big data, analytics, and AI expertise to serve every sector, of every scale, to create business and positive societal impact. With its world-class computer vision, AI and omni-analytics platform as its engine, Presight excels at all-source data interpretation to support insight-driven decision making that shapes policy and creates safer, healthier, happier, and more sustainable societies.
The Opportunity
We are seeking a highly skilled LLM Ops Engineer to lead the deployment, scaling, monitoring, and optimization of large language models (LLMs) across diverse environments. This role is critical to ensuring our machine learning systems are production-ready, high-performing, and resilient. The ideal candidate will have deep expertise in Python programming, a comprehensive understanding of LLM internals, and hands-on experience with various agentic frameworks, inference engines and deployment strategies. This position offers the opportunity to work on cutting-edge AI technologies in a dynamic and collaborative environment.
Responsibilities
- Design, deploy, and scale LLM infrastructure across cloud and on-premises environments, including GPU clusters, containers, and orchestration with Kubernetes, ensuring high performance, reliability, and fault tolerance.
- Build and optimize inference pipelines for low-latency, high-throughput model serving using frameworks such as Triton Inference Server, vLLM, or TensorRT.
- Manage CI/CD pipelines, AI microservices, embeddings storage, and MCP servers, ensuring secure, production-ready deployment of models and tool integrations.
- Deploy and maintain agentic AI frameworks (e.g., Dify, LangFlow) and LLM gateways to manage traffic, enforce audit/compliance controls, and integrate with IAM systems.
- Monitor performance, cost, and resource usage; implement optimization strategies for GPU, CPU, and storage efficiency while maintaining scalability and reliability.
- Conduct hardware sizing and capacity planning to meet current and projected LLM workload requirements.
- Collaborate with data scientists and engineers to operationalize models and workflows into production-grade systems.
- Develop and maintain documentation, runbooks, and deployment playbooks for knowledge sharing and operational consistency.
- Stay current on emerging LLM techniques, including quantization, distillation, distributed inference, and best practices for production deployments.
- Troubleshoot and resolve production issues, continuously improving infrastructure for stability, scalability, and maintainability.
Qualifications
- Bachelor’s or Master’s degree in computer science, machine learning, or a related field, with 2+ years of experience in ML Ops, DevOps, or ML infrastructure, including production deployment of ML/LLM workloads.
- Strong Python and scripting skills, with experience in containerization, orchestration (Docker, Kubernetes, Helm), CI/CD pipelines, monitoring, and observability for ML systems.
- Expertise in GPU cluster management, distributed inference, high-performance model serving, and scalable, fault-tolerant architectures.
- Proficiency with cloud/hybrid environments (AWS, GCP, Azure, on-prem), and knowledge of security, access control, and compliance requirements.
- Experience deploying and maintaining agentic AI frameworks (e.g., Dify, LangFlow) and MCP servers for LLM-tool integration.
- Familiarity with LLM orchestration, RAG pipelines, API integrations, and distributed inference frameworks (e.g., Ray).
- Expertise in hardware sizing, capacity planning, cost optimization, and infrastructure-as-code tools (Terraform) for large-scale ML/LLM deployments.
- Hands-on experience with LLM optimization techniques (quantization, distillation, compression).
- Understanding of compliance and governance standards (ISO, NIST) for operational AI systems.
If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers is at the heart of the Presight community.
How to apply
To apply for this job, you need to log in on our website. If you don't have an account yet, please register.