Accepting Applications
Full-time
On-site
Posted 1 hour, 44 minutes ago
0 views
0 applications
Job Description
Join our SRE L2 squad supporting \~1000 AWS\-hosted services. You’ll own operational reliability, rapid triage, and proactive maintenance across production and non\-prod, partnering closely with Cloud Engineering, SOC, and application teams.
**Key Responsibilities**
* Deliver 24×7 monitoring, incident response, and problem management; drive MTTA/MTTR reduction and SLO/SLI adherence.
* Perform preventive health checks; analyze ticket trends to implement continual service improvements and automation to reduce toil.
* Execute blameless postmortems and high\-quality RCA; maintain SOPs/runbooks and reliability dashboards.
* Configure/tune observability (Dynatrace, CloudWatch, ELK); enable self\-healing workflows and workload optimizations.
* Support change/service requests within agreed SLAs; collaborate during transitions and onboard new AWS services.
**Core Skills \& Tools**
* **AWS:**
Lambda, ECS/Fargate/EC2, API Gateway, SNS/SQS, Kinesis, RDS; IAM/KMS foundations.
* **Observability \& ITSM:**
Dynatrace, CloudWatch, ELK; ServiceNow for incidents/changes; SLI/SLO dashboards.
* **Toil Reduction**
* **Reliability Practices:**
Error budgets, capacity/performance benchmarking, automation/runbook execution, FinOps awareness.
**Qualifications**
* 5\+ years SRE/DevOps or L2 operations for cloud\-native stacks; strong AWS production experience.
* Proven incident/change/problem management in 24×7 environments; adept at RCA and postmortems.
* Hands\-on with observability tooling and operational automation; excellent collaboration and documentation skills.
**Shift Coverage \& Locations**
Follow\-the\-sun model with overlapping handoffs across Canada/India to ensure continuous support. Success is measured by uptime, MTTR/MTTD, change failure rate, error\-budget consumption, SLO adherence, RCA quality, and CSI throughput.
More jobs from HCLTech
HCL Tech _ Voice Process - Fresher (2025/2026 Passouts) - Direct walk-in interview - sholinganallur- 11 April 2026
1 day, 1 hour agoData Scientist / Machine Learning Engineer (Predictive Analytics)
2 days, 1 hour agoDevOps Engineer
1 day, 1 hour ago
Login to Apply
Don't have an account? Register