Accepting Applications
Full-time
On-site
Job Description
**Overview**
Haptiq is an AI-native enterprise solutions company with purpose-built technology for public and private companies, governments, institutions, asset managers, and family offices. With headquarters in New York City and four global offices, Haptiq is supported by more than 300 engineers and delivery professionals across the globe. By centralizing and unifying data, automating workflows, and surfacing predictive insights, Haptiq enables organizations to scale operational excellence and generate alpha across complex enterprise environments.
**About the Role**
We are seeking an experienced DevOps Engineer to join our engineering team. This role will drive infrastructure automation, streamline deployments across multiple cloud providers, and ensure the reliability, scalability, and observability of our systems. You’ll work closely with development, QA, and operations teams to build and maintain a world-class DevOps practice.
**Responsibilities**
**Infrastructure Automation**
* Design, implement, and manage infrastructure-as-code (IaC) using Terraform across AWS, GCP, and Azure.
* Maintain reusable, modular infrastructure components that scale with business needs.
**Cloud & Containerization**
* Deploy and manage services across multiple cloud providers (AWS, GCP, Azure).
* Build, package, and orchestrate applications using Docker, Helm charts, and ECS (or equivalent orchestration platforms).
* Ensure high availability, fault tolerance, and cost optimization in cloud environments.
**Release & Deployment Management**
* Develop and maintain CI/CD pipelines using AWS CodePipeline, Jenkins, or similar tools.
* Automate build, test, and release processes to enable frequent and reliable deployments.
* Collaborate with engineering teams to improve release velocity and rollback safety.
**Observability & Reliability**
* Implement and manage observability solutions, including APM tools (e.g., Elastic, New Relic, Datadog) and metrics systems (Prometheus, Grafana).
* Define and monitor SLAs, SLOs, and SLIs to ensure system reliability and availability.
* Establish proactive site reliability monitoring, alerting, and incident response processes.
**Collaboration & Best Practices**
* Partner with developers and architects to design systems that are scalable and maintainable.
* Promote DevOps culture by improving tooling, automation, and operational excellence.
* Document standards, practices, and runbooks for operational consistency.
**Qualifications**
* Strong experience with Terraform or other IaC frameworks.
* Hands-on expertise in AWS, GCP, and Azure cloud environments.
* Solid knowledge of Docker and container orchestration platforms (Helm, ECS); Kubernetes is a plus.
* Proven experience building and maintaining CI/CD pipelines (AWS CodePipeline, Jenkins, GitHub Actions, or similar).
* Familiarity with observability stacks (Prometheus, Grafana, ELK, APM tools).
* Background in site reliability engineering (SRE) practices, including monitoring, incident management, and postmortems.
* Strong scripting/programming skills (Python, Bash, Go, or similar).
* Excellent problem-solving and collaboration skills.
**Nice to Have**
* Kubernetes expertise (EKS, GKE, AKS).
* Experience with secrets management (Vault, AWS Secrets Manager, etc.).
* Security automation and compliance monitoring.
**Why Join Us?**
We value creative problem solvers who learn fast, work well in an open and diverse environment, and enjoy raising the bar for success ever higher. We work hard, but we also choose to have fun while doing it.
***The annual compensation range for this role is $115,000 - $125,000 CAD.***