Accepting Applications
Full-time
Hybrid
Posted 2 hours, 43 minutes ago
0 views
0 applications
Job Description
**Grade:**
L3
**Location:**
Islamabad / Rawalpindi
**Last Date to Apply:**
7th May 2026
**What is Lead Cloud Infrastructure Engineer \- Teknosys?**
TekNoSys is seeking an experienced Lead Infrastructure \& Kubernetes Platform Architect to design, build, and operate our on\-premises and private cloud infrastructure, with a strong focus on Kubernetes\-based platforms, reliability engineering, and production operations.
You will lead the architecture and management of highly available, scalable, and secure on\-prem environments, enabling critical applications to run efficiently across data center infrastructure. This role requires deep expertise in Kubernetes cluster design, monitoring, automation, and infrastructure reliability, along with strong leadership to guide engineering teams and drive operational excellence.
**What does Lead Cloud Infrastructure Engineer \- Teknosys do?**
* Design, deploy, and manage on\-premises and private cloud infrastructure supporting mission\-critical applications
* Architect highly available Kubernetes clusters across bare metal or virtualized environments
* Define standards for scalability, resilience, and performance across the platform
* Build, upgrade, and maintain production\-grade on\-prem Kubernetes clusters
* Manage cluster lifecycle including provisioning, scaling, patching, backup, and disaster recovery
* Develop Helm charts, manifests, and platform templates for standardized deployments
* Optimize resource utilization and capacity planning across nodes and workloads
* Implement Infrastructure as Code (IaC) using Terraform, Ansible, or similar tools
* Automate provisioning, configuration, and operational tasks to minimize manual intervention
* Standardize repeatable infrastructure processes and deployment pipelines
* Implement comprehensive monitoring and alerting using Prometheus, Grafana, ELK/EFK, Datadog, or similar
* Establish logging, tracing, and observability best practices across clusters
* Proactively identify bottlenecks, performance issues, and failure risks
* Drive SRE practices including SLAs, SLOs, and incident response
* Build and maintain CI/CD pipelines supporting containerized application deployments
* Integrate DevOps workflows with Kubernetes for seamless releases
* Enable self\-service environments for development teams
* Implement Kubernetes and infrastructure security best practices (RBAC, network policies, secrets management)
* Ensure secure network segmentation, firewalls, encryption, and access controls
* Maintain compliance with organizational and regulatory standards
* Lead incident response, root cause analysis, and service restoration for critical production issues
* Define and test backup, failover, and disaster recovery strategies
* Ensure high availability and business continuity
* Lead and mentor infrastructure and platform engineers
* Collaborate closely with development, QA, and operations teams
* Define platform roadmaps and continuously improve operational maturity
**Requirements**
* Bachelor's degree in Computer Science, IT, or related field
* 6\-8 years of experience in Infrastructure Engineering, Platform Engineering, DevOps, or System Administration
* Proven experience managing production\-grade on\-prem or private cloud environments
* Strong background in Kubernetes platform operations and cluster management
**Technical Skills \& Technologies**
* On\-prem data center environments, virtualization (VMware vSphere, OpenStack, or similar)
* Bare metal server provisioning and management
* Storage systems (SAN/NAS/Ceph or equivalent)
* Advanced Kubernetes (cluster setup, scaling, upgrades, networking, security)
* Docker / container runtimes
* Helm, Kustomize, manifests
* Terraform, Ansible, or similar IaC tools
* Shell/Python scripting
* Prometheus, Grafana
* ELK/EFK stack
* Alerting and logging frameworks
* Jenkins, GitLab CI/CD, or similar
* Git workflows and version control
* TCP/IP, DNS, load balancers, firewalls
* RBAC, IAM, encryption, network policies
* Exposure to AWS/Azure/GCP for hybrid or DR environments (nice to have, not primary)
**Core Competencies**
* Leadership and technical ownership
* Strong troubleshooting and root cause analysis
* Platform reliability mindset
* Automation\-first approach
* Capacity planning and performance tuning
* Excellent collaboration and stakeholder communication
**Benefits**
**Why Join Teknosys?**
At Teknosys, you will be at the forefront of Pakistan's digital transformation journey, shaping solutions across AI, Data, IT, and Managed Services. You'll work alongside some of the brightest minds in the industry, partner with global hyperscalers \& leaders, and close deals that define the future of digital in the region.
Joining us means being part of a fast\-scaling, innovation\-led business where your impact will directly fuel growth, customer success, and leadership.
Login to Apply
Don't have an account? Register