Description:
This vacant Infrastructure and Platform Management, Lead (Compute and Storage) role will report to the Leader, Infrastructure and Platform Management.
In this role you will lead the enhancement, optimization, and resiliency of our core infrastructure services. This role is instrumental in supporting the engineering and operational excellence of backup, disaster recovery, and storage platforms that underpin our products. The ideal candidate will have deep technical expertise in infrastructure systems, a strong focus on service reliability, and a commitment to continuously improving system performance and availability.
What You Will Do:
- Develop and maintain infrastructure roadmaps with a focus on service resiliency and operational maturity.
- Engineer and optimize backup and disaster recovery infrastructure using Commvault and Dell Data Domain.
- Design and enforce backup and retention policies to align with compliance, audit, and business continuity objectives.
- Conduct regular restore and failover tests to validate readiness and strengthen DR posture.
- Drive initiatives to modernize and automate infrastructure operations, reduce manual effort, and enhance system observability.
- Troubleshoot complex infrastructure issues and lead root cause analysis to improve long-term stability.
- Partner with development, platform, and security teams to ensure infrastructure scalability, security, and performance meet product needs.
- Build and maintaining operational runbooks, SOPs, and knowledge base documentation to improve team readiness.
- Participate in major incident management and on-call rotation to support service uptime and responsiveness.
- Ensure compliance with RPO/RTO, internal control, and security standards across infrastructure systems.
What You bring:
- A university degree in Computer Science, Computer Engineering, Information Systems or similar, or equivalent combination of education and work experience.
- 7+ years of hands-on experience in enterprise infrastructure engineering and operations.
- 5+ years managing and supporting backup and disaster recovery platforms (Commvault, Dell Data Domain, or equivalent).
- Strong scripting and automation skills (Shell, PowerShell, Python) to drive efficiency and reduce toil.
- Solid understanding of hybrid cloud infrastructure and integration (AWS, Azure).
- Familiarity with change, incident, and problem management processes within ITSM frameworks (e.g., ServiceNow).
- Proven experience in infrastructure performance tuning, capacity planning, and operational optimization.
- Strong communication and cross-team collaboration skills
- High attention to detail and disciplined operational mindset.
- Passion for improving infrastructure reliability, scalability, and cost-efficiency.
- Ability to lead initiatives, mentor junior team members, and uphold technical excellence.
- Eligibility to work for Interac Corp. in Canada in a full-time capacity.