Senior DevOps Engineer

Hays

United Arab Emirates

Accepting Applications Full-time On-site
Posted 1 week, 4 days ago 5 views 0 applications
Job Description
We are seeking a **Senior DevOps** with deep hands\-on expertise in **Rancher Kubernetes on bare metal** , **Kubernetes Operators** , and **high\-availability** design across core data services (PostgreSQL, Kafka, Redis, Vault, Keycloak). The ideal consultant has production experience integrating **hardware load balancers** (preferably **Fortinet** , alternatively **F5** ) with Kubernetes ingress, and can deliver a **repeatable, scripted cluster bootstrap** and **observability stack** (Grafana, Prometheus, Jaeger, plus EFK or VictoriaMetrics Logs). You will work **onsite in Dubai** with the Tejori team to design, implement, document, and hand over a robust, secure, and monitored platform to run Emcode’s commercial workloads. **Key Responsibilities** **Kubernetes \& Rancher** * Architect and deploy a **highly available Rancher\-managed Kubernetes** cluster on **bare metal** (multi\-master, etcd quorum sizing, worker pools). * Implement **cluster provisioning** and **lifecycle automation** (bootstrapping scripts and/or Pulumi/Ansible based flows). * Configure **Rancher projects, namespaces, RBAC, and fleet** for multi\-environment governance. **Networking \& Ingress** * Design and implement **ingress traffic** from **Fortinet hardware load balancers** (or F5\) to the Rancher/Kubernetes ingress layer (Layer 4/7\). * Configure **ingress controllers** (e.g., NGINX or HAProxy), **TLS termination** , **mTLS** where applicable, and **WAF** /security policies at the LB and cluster edge. **Data \& Platform Services on Kubernetes** * Deploy and harden **PostgreSQL, Kafka, Redis, Vault, and Keycloak** using **Operators** and/or well\-supported Helm charts. * Configure **backup/restore** , **secrets management (Vault/KMS)** , **rotations** , and HA (replication, quorum, partitions, failover). * Ensure **data durability** and performance tuning for bare\-metal constraints (storage classes, CSI drivers, network tuning). **Observability \& Logging** * Stand up **Prometheus \+ Alertmanager \+ Grafana** for metrics and dashboards. * Deploy **Jaeger** (or OpenTelemetry Collector → Jaeger) for distributed tracing. * Implement **EFK (Elasticsearch/Fluentd/Kibana)** or **VictoriaMetrics Logs** (e.g., VictoriaLogs/Loki alternative) with retention and index strategy. **Automation \& IaC** * Create **idempotent scripts** and/or **Pulumi** stacks for cluster bootstrap, app provisioning, and infra config. * Develop **Ansible** roles/playbooks for OS hardening, package prep, and repeatable node bring‑up. **Security \& Compliance** * Enforce **network policies** , **RBAC/ABAC** , **PodSecurity** /PSA, **image signing** /scanning, registry policies. * Integrate **Keycloak** for SSO into Rancher, Grafana, and app workloads. * Establish **backup, DR, and secrets management** standards (Vault policies, transit encryption). **Documentation \& Handover** * Produce **as\-built documentation** , **runbooks** , **troubleshooting guides** , and **DR procedures** . * Conduct **knowledge transfer** sessions and **tabletop failover tests** with the team. **Required Qualifications** * **10\+ years** DevOps/SRE with production **Kubernetes** (Rancher experience required). * Strong with **Kubernetes Operators** , **Helm** , **ingress controllers** . * Proven deployments of **PostgreSQL, Kafka, Redis, Vault, Keycloak** on K8s. * **Fortinet load balancers** (highly desirable) or **F5** experience in production. * **Pulumi** (preferred) and **Ansible** for IaC and configuration. * **Monitoring stack** : Grafana, Prometheus, Alertmanager, Jaeger; EFK or Victoria Logs. * **Bash** proficiency and at least one high\-level language ( **Go** or **Python** ). * HA/DR design on **bare metal** , including storage/CNI selection and tuning. * Excellent documentation and team enablement skills.
Login to Apply

Don't have an account? Register

About Company
Share this job