Site Reliability Engineer – Storage Engineer – Ontario, Canada
IT & DevOps Department, GoDaddy (Ontario, Canada)

Remote
Full-time
Mid
Permanent

Job description


GoDaddy is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our team. This role focuses on automating and maintaining our storage infrastructure, with an emphasis on Ceph, to ensure the reliability, scalability, and performance of our systems.

Responsibilities include automating and maintaining the day-to-day operations of storage systems to support application demands; developing and maintaining tools and automation scripts that streamline storage operations and improve efficiency; and monitoring system performance, identifying issues, and implementing solutions to ensure high availability and reliability. The role involves agile practices such as daily stand-up meetings, task tracking boards, design and code reviews, automated testing, and continuous integration and deployment. The candidate will continuously improve system reliability, performance, and capacity through proactive monitoring, automation, and optimization.

Required experience includes 2+ years with Ceph in production environments, 2+ years in site reliability engineering or a similar role, deployment and management of Ceph clusters, strong Linux/Unix system knowledge with a focus on automation, proficiency in Python or Bash, familiarity with Ansible, Terraform, or SaltStack, and experience with monitoring tools such as Nagios/Icinga2 and observability tools such as Prometheus, Grafana, Mimir, and Loki. A solid understanding of networking concepts and protocols as they relate to Linux/Unix is also required.

Preferred qualifications include experience with containerization and orchestration tools (Docker, Kubernetes), experience with compute platforms (OpenStack, AWS), and contributions to CI/CD pipelines and automation workflows.

GoDaddy offers a range of benefits, including paid time off, retirement savings plans, bonus eligibility, equity grants, an employee stock purchase plan, competitive health benefits, and family-friendly options such as parental leave.

The company values diversity, equity, and inclusion and integrates these principles into its operations and culture. It is an equal opportunity employer and considers qualified applicants with criminal histories in compliance with local and federal laws. The role is fully remote, with occasional office visits possible, and supports a flexible work-life balance.

Benefits

  • Paid time off
  • Retirement savings (401k, pension schemes)
  • Bonus/incentive eligibility
  • Equity grants
  • Employee stock purchase plan
  • Competitive health benefits
  • Family-friendly benefits including parental leave
  • Diversity and inclusion initiatives

Job requirements

  • 2+ years professional experience with Ceph in production
  • 2+ years site reliability engineering or similar role
  • Experience deploying and managing Ceph clusters
  • Proficiency in Python or Bash
  • Experience with Ansible, Terraform, or SaltStack
  • Experience with Nagios-based monitoring tools (Icinga2)
  • Knowledge of observability tools (Prometheus, Grafana, Mimir, Loki)
  • Solid understanding of networking concepts and protocols as they relate to Linux/Unix