Accepting Applications
Full-time
On-site
Posted 18 hours, 11 minutes ago
0 views
0 applications
Job Description
Job Mode : Full time with Mphasis (Client Location \- UAE \- Contract renewal basis)
LOOKING FOR IMMEDIATE JOINERS
**Responsibilities**
* Ensure reliability, availability, and performance of services running across Azure/AKS and on\-premises air\-gapped Kubernetes (RKE2\) environments, meeting strict SLAs and business requirements.
* Maintain scalable, resilient, and secure Kubernetes platforms, including ingress, storage, and stateful workloads.
* Automate operations and deployments using scripting (Python, Go, Bash), infrastructure\-as\-code (Terraform, Bicep, Ansible), and GitOps with ArgoCD and Kustomize across both cloud and on\-prem environments.
* Operate CI/CD pipelines Azure DevOps /Github Actions and manage container supply chains for both connected and air\-gapped environments, including private registry mirroring and image scanning.
* Monitor system performance using Azure Monitor, Prometheus, Grafana, and OpenTelemetry; proactively detect and resolve issues to prevent disruption.
* Lead incident response, perform root cause analysis, and drive post\-incident reviews with permanent fixes and improvements.
* Develop, document, and enforce best practices for operations, security, and compliance across cloud and on\-prem environments.
* Collaborate with development, security, and operations teams to enhance system design and support modern application platforms (Docker, Kubernetes).
* Participate in on\-call rotations to respond to critical incidents across all environments.
* Use IT Service Management tools or incident, change, and problem management.
* Working knowledge of Scrum, ITIL, Agile methodologies and experience interfacing with external auditors.
**Qualifications**
* Bachelor's degree in Computer Science, Engineering, or related field.
* Minimum 10 years as a Site Reliability Engineer, with significant expertise in Azure cloud environments.
* Strong knowledge of Azure cloud services, networking, and security.
* Hands\-on experience with both managed (AKS) and self\-managed/air\-gapped (Rancher RKE2 or equivalent) Kubernetes distributions.
* Proficiency in scripting languages (Python, Go, Bash) and infrastructure\-as\-code tools (Terraform, Bicep, Ansible).
* Experience with GitOps (ArgoCD, Kustomize), CI/CD pipelines, Docker, and Kubernetes for deployment automation in connected and disconnected environments.
* Hands\-on experience with monitoring tools (Azure Monitor, Prometheus, Grafana).
* Proven track record in incident management and troubleshooting.
* Excellent problem\-solving, communication, and collaboration skills; attention to detail and a commitment to continuous learning.
Login to Apply
Don't have an account? Register