Site Reliability Engineer – Bulgaria

GoDaddy  ·  Bulgaria, Remote
Remote  ·  Full-time  ·  Senior  ·  Permanent  ·  IT & DevOps

Job Description

GoDaddy seeks a Site Reliability Engineer to join the Monitoring and Observability team, focusing on maintaining infrastructure reliability, performance, and availability for millions of customers.

Responsibilities include designing and maintaining monitoring solutions for metrics, logs, and traces using tools such as Grafana Labs, ICINGA2, Site24x7, SNMP, and API integrations. The role involves responding to automated alerts and incidents, participating in on-call rotations, collaborating with engineering teams to resolve issues, automating operational tasks to enhance reliability, developing self-service observability tools, and supporting CI/CD pipelines for monitoring infrastructure.

Candidates should have 5+ years of experience in Site Reliability Engineering, DevOps, or infrastructure operations; advanced Linux/Unix systems skills; extensive experience with observability platforms, configuration management (Ansible, Puppet), scripting languages (Python, Go, Bash, JavaScript), and event correlation or incident platforms; containerization and orchestration experience; hands-on incident response and on-call participation; and CI/CD pipeline implementation experience. Preferred qualifications include experience with modern observability stacks and synthetic monitoring platforms, and with supporting large-scale distributed systems.

GoDaddy offers comprehensive benefits including paid time off, retirement savings plans, bonuses, equity grants, employee stock purchase plans, competitive health benefits, parental leave, and Employee Resource Groups to support a diverse and inclusive culture. The company is committed to diversity, equity, inclusion, and belonging, and is an equal opportunity employer.

Benefits

  • Paid time off
  • Retirement savings plans (401k, pension schemes)
  • Bonus/incentive eligibility
  • Equity grants
  • Employee stock purchase plan
  • Competitive health benefits
  • Parental leave
  • Employee Resource Groups

Requirements

  • 5+ years in Site Reliability Engineering, DevOps, or infrastructure operations
  • 5+ years of Linux/Unix systems expertise and troubleshooting
  • 3+ years with observability platforms (metrics, logging, tracing, visualization)
  • 3+ years of configuration management (Ansible, Puppet, or similar)
  • 3+ years of scripting experience (Python, Go, Bash, JavaScript)
  • 3+ years of experience with event correlation or incident platforms
  • 2+ years of containerization and orchestration experience
  • Hands-on incident response and on-call participation
  • CI/CD pipeline implementation experience