Platform Engineer

 

Description:

The Platform Engineer ensures the reliability, performance, and operational excellence of cloud-hosted seismic monitoring and data processing services. This role blends cloud infrastructure management and SRE practices to build resilient systems, reduce manual toil through automation, and improve observability across AWS and Kubernetes ecosystems.

The successful candidate will use Terraform or similar Infrastructure-as-Code technologies (Pulumi, AWS CDK, CloudFormation, OpenTofu) to deliver consistent, automated, scalable infrastructure.

Responsibilities

Cloud Reliability & Resilience
 

  • Ensure uptime, performance, and reliability of AWS-hosted services and Kubernetes workloads
  • Implement self-healing patterns, automated rollbacks, health checks, and safe-deployment strategies
  • Participate in the on-call rotation and lead first-response triage for cloud and platform incidents
  • Build and maintain service-level indicators (SLIs) and service-level objectives (SLOs)
     

Automation & Infrastructure Engineering
 

  • Develop automation for cloud operations using Python, Bash, and IaC (Terraform)
  • Reduce operational toil through automated runbooks, event-driven remediation, and system orchestration
  • Improve deployment reliability in collaboration with Platform Engineering and R&D teams
  • Implement and refine configuration standards, CI/CD hygiene, and environment stability
     

Observability & Operational Intelligence
 

  • Maintain and extend the observability stack (Prometheus, Grafana, InfluxDB, OpenTelemetry)
  • Tune alerts for accuracy, reduce noise, and implement actionable alerting tied to SLOs
  • Analyze logs, metrics, and traces to detect reliability issues and validate system behavior
  • Build dashboards that provide real-time visibility into system health and reliability trends
     

Operational Excellence
 

  • Support release processes, platform upgrades, and cloud infrastructure changes
  • Conduct root-cause analysis and drive post-incident corrective actions
  • Maintain operational documentation, runbooks, and environment validation workflows
  • Collaborate cross-functionally with NetOps, Platform Engineering, Field Ops, and R&D
     

Requirements: Education and Experience
 

  • Bachelor's degree or higher in Software Engineering, Computer Science, or related field.
  • 3+ years of hands-on experience with cloud providers such as AWS, cloud-native technologies such as Kubernetes and Helm, and related technologies, including observability platforms.
  • Experience with database operations (MySQL, PostgreSQL, MongoDB, Redis) in cloud and on-prem environments.
     

Cloud & Infrastructure
 

  • Strong experience with AWS (EC2, S3, IAM, VPC, EKS/ECS, CloudWatch)
  • Solid understanding of Kubernetes, Helm charts, and container orchestration
  • Familiarity with hybrid cloud environments (cloud + on-prem integration)

Organization: Nanometrics Inc
Industry: IT / Telecom / Software Jobs
Occupational Category: Platform Engineer
Job Location: Ottawa, Canada
Shift Type: Morning
Job Type: Full Time
Gender: No Preference
Career Level: Experienced Professional
Experience: 3 Years
Posted at: 2026-03-27 5:42 pm
Expires on: 2026-05-11