Infrastructure Operations Specialist (Remote) (m/f/d)
Halian | Managed Services, Recruitment Agency & Contract Staffing
Date: 3 weeks ago
City: Dubai
Contract type: Full time
Remote

Job Title: Infrastructure Operations Specialist (Remote)
Duration: 6 Months
Start Date: August 3rd
Location: Remote (aligned with MENA Standard Time)
Workload: Full-time, 5 days/week (Sunday–Thursday preferred; Monday–Friday acceptable)
Language Requirement: Fluent English
Job Overview:
We are seeking an experienced Infrastructure Operations Specialist to provide remote support, maintenance, and optimization for GPU-based server environments. The ideal candidate brings hands-on experience in cluster management, Ethernet networking, and working with CSP customers.
Responsibilities:
Administration & Operations
Duration: 6 Months
Start Date: August 3rd
Location: Remote (aligned with MENA Standard Time)
Workload: Full-time, 5 days/week (Sunday–Thursday preferred; Monday–Friday acceptable)
Language Requirement: Fluent English
Job Overview:
We are seeking an experienced Infrastructure Operations Specialist to provide remote support, maintenance, and optimization for GPU-based server environments. The ideal candidate brings hands-on experience in cluster management, Ethernet networking, and working with CSP customers.
Responsibilities:
Administration & Operations
- Monitor, review, and manage server infrastructure
- Handle user requests and access management
- Analyze log files and produce regular reports
- Maintain and operate Base Command Manager for GPU clusters
- Perform daily operational tasks
- Support firmware/software updates and change implementations
- Document and implement IT procedures and policy changes
- Manage migrations and related reporting
- Plan transition activities with deployment teams
- Configure additional host and network elements post-deployment
- Transfer knowledge on tools, procedures, and best practices
- Recommend enhancements and upgrades
- Isolate and troubleshoot incidents
- Coordinate service incidents and open support tickets
- Contribute to root cause analyses
- Recommend improvements for process efficiency
- Share best practices from similar projects
- Provide performance tuning input
- Review IT processes (incident, change, performance, etc.)
- Collaborate on documentation and procedural updates
- Support upskilling of internal teams
- Experience with B200 GPU XE9680L server maintenance
- Strong background in Ethernet networking
- Experience supporting Cloud Solution Provider (CSP) environments
- Solid understanding of infrastructure operations and technical leadership
- Strong communication skills and ability to deliver structured knowledge transfer
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Engineering Program Manager, Enterprise AI (Middle East)
Cohere,
Dubai
2 hours ago
Who are we?Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.We obsess over what we build. Each one of us...

Manager
Crisil,
Dubai
19 hours ago
We are seeking a highly skilled individual with 12+ years of experience in corporate banking, data analytics, and business intelligence. The ideal candidate should have strong expertise in banking products, data management, and project execution, ensuring effective collaboration between business and technology teams.Key Responsibilities:Extensive knowledge of corporate banking products, services, and processes. Strong expertise in analytics, business intelligence, SDLC, Data...

Senior Event Producer - Dubai (Freelancer)
Fever,
Dubai
1 day ago
Hey there!We’re Fever, the world’s leading tech platform for culture and live entertainment,Our mission? To democratize access to culture and entertainment. With our proprietary cutting-edge technology and data-driven approach, we’re revolutionizing the way people engage with live entertainment.Every month, our platform inspires over 300 million people in +40 countries (and counting) to discover unforgettable experiences while also empowering event creators...
