JOB TITLE
DevOps Engineer
POSITION
BUSINESS UNIT
Al Ghurair Investment
REPORTS TO (TITLE)
DEPARTMENT
Group IT
NUMBER OF DIRECT REPORTS
PURPOSE
The role purpose is a brief description of the position’s main functionality
The DevOps is a hands-on person responsible for designing, building, and maintaining scalable, secure, and reliable infrastructure and CI/CD pipelines. This role involves deep technical execution while also contributing to team coordination, process optimization, and DevOps strategy. The position strikes a balance between individual contribution (hands-on work) and collaboration across teams, ensuring efficient delivery pipelines, infrastructure automation, and high system reliability to support business agility and operational excellence.
KEY ACCOUNTABILITIES
Key accountabilities are areas of responsibility that are essential of the position
STRATEGIC
Support the continuous evolution of DevOps and SRE practices to enable scalable, secure, and high-performing infrastructure and application delivery.
Contribute to the infrastructure and platform roadmap by evaluating and recommending new tools, technologies, and process enhancements that drive automation, observability, and reliability.
Collaborate closely with engineering and product teams to ensure DevOps practices are fully aligned with development workflows, deployment strategies, and release timelines.
Integrate reliability engineering principles—including SLAs, SLOs, and SLIs—into the software delivery lifecycle to maintain system availability, performance, and user satisfaction.
OPERATIONAL
Hands-on engineering efforts in infrastructure provisioning, CI/CD pipeline implementation, configuration management, and monitoring.
Design, build, and maintain automation and deployment tools using technologies such as Terraform, Ansible, Jenkins, GitLab CI/CD, Docker, and Kubernetes.
Diagnose and resolve production issues, conduct thorough root cause analysis, and implement long-term corrective measures to prevent recurrence.
Manage and optimize cloud environments (Azure/AWS) across development, testing, and production, ensuring efficient provisioning, scaling, monitoring, and cost management.
Implement and maintain a robust observability stack, including tools like Prometheus, Grafana, ELK, and Datadog, to support proactive system monitoring and alerting.
Embed SRE principles into day-to-day operations, focusing on automation, incident management, runbook creation, and system reliability.
Maintain clear, comprehensive documentation for infrastructure components, deployment processes, and operational procedures to ensure team alignment and system transparency.
PEOPLE MANAGEMENT
Mentor and guide engineers, providing hands-on support in adopting tools, implementing best practices, and understanding scalable infrastructure patterns.
Participate in sprint planning, task estimation, and backlog grooming to support effective execution of DevOps initiatives within agile delivery cycles.
Foster a collaborative and knowledge-sharing culture within the team by encouraging continuous learning, peer reviews, and open communication
PRODUCT / PROCESS
IMPROVEMENT
Depending on the Position Level – Responsibilities that pertain to a proactive role in identifying and improving existing business processes or products
Identify and eliminate manual or repetitive tasks by implementing automation, self-service capabilities, and standardized tooling across the DevOps landscape.
Continuously evaluate and optimize CI/CD workflows, focusing on improving build efficiency, reducing release times, and enhancing overall developer experience.
Collaborate with development and architecture teams to improve infrastructure performance, scalability, and cost-efficiency across environments.
Lead regular retrospectives and post-incident reviews to capture lessons learned and implement improvements in operational workflows and reliability practices.
Monitor system telemetry, incident patterns, and performance metrics to proactively identify areas for improvement and drive enhancements in system stability and resilience
COMMUNICATION
The contact groups represent the functions or entities, both internal and external to Al Ghurair, which the position regularly interacts with
INTERNAL
EXTERNAL
1
Group IT Team
1
2
Product Owners, Developers, QA
2
3
Infra & Security team
3
KNOWLEDGE AND EXPERIENCE
This section outlines the education, experience, knowledge, and skills required for the position to be able to deliver upon the job’s duties and responsibilities.
KNOWLEDGE AND SKILL
Hands-on Technical Expertise:
Infrastructure as Code (IaC): Proficiency in tools like Terraform and Ansible.
Containerization & Orchestration: Deep experience with Docker, Kubernetes, and HELM.
CI/CD Tools: Strong command of GitLab CI/CD, Jenkins, or Azure DevOps for managing release workflows.
Cloud Platforms: Hands-on experience with Azure for provisioning, scaling, and monitoring environments is must. AWS or Google Cloud Platform is good to have
Monitoring & Logging: Skilled in tools such as Prometheus, Grafana, ELK stack, and Datadog.
Scripting & Automation:
Proficient in scripting languages such as Python, Bash, and Shell for automation of deployment and operational tasks.
DevOps & SRE Practices:
Strong understanding of DevOps principles, toolchains, and practices.
Working knowledge of SRE concepts including SLAs, SLOs, SLIs, and incident management workflows.
Practical experience integrating DevSecOps and security practices into CI/CD pipelines.
Collaboration & Agile Delivery:
Experience working in agile, cross-functional engineering teams to support continuous delivery and infrastructure reliability.
Ability to work closely with developers, architects, and QA teams to align platform engineering with product needs.
Soft Skills:
Strong problem-solving capabilities with the ability to troubleshoot complex infrastructure and deployment issues.
Excellent leadership, communication, and interpersonal skills, with a proven ability to mentor and guide junior engineers.
EXPERIENCE
10–12 years of overall experience in the IT industry, with at least 5–6 years of deep, hands-on experience in DevOps engineering and understanding of Site Reliability Engineering (SRE).
Proven experience in building and leading DevOps teams, providing technical mentorship, and driving adoption of DevOps and SRE practices.
Hands-on expertise in designing, implementing, and managing CI/CD pipelines and infrastructure as code solutions across cloud and hybrid environments.
Demonstrated success in supporting and scaling large-scale, high-availability systems, with a focus on automation, monitoring, and performance optimization.
Strong background in agile delivery environments, with the ability to thrive in fast-paced, dynamic settings and collaborate effectively across cross-functional teams.