See job offer description.
Join GoDaddys Security Infrastructure Operations team as a Senior Site Reliability Engineer to lead reliability and operational excellence across IDS/IPS, DDoS mitigation, and other security services protecting GoDaddys global footprint. You will define SLOs, reduce toil using automation, and manage incident response for mission-critical security infrastructure. Responsibilities include owning reliability outcomes by defining SLIs/SLOs and error budgets, building alerting, dashboards, and runbooks, architecting high availability, capacity planning, and disaster recovery solutions for security platforms. Design zero/minimal-downtime upgrades for OS, firmware, and signature updates. Automate deployments, configuration, and compliance using SaltStack and Python. Operate and improve heterogeneous stacks including TrendMicro TippingPoint IPS, Suricata, NetScout/Arbor Sightline/TMS, HAProxy, Nginx, Juniper, Palo Alto, Kentik/KProxy. Build observability systems using Icinga, Grafana, InfluxDB, rsyslog, drive SLO-based alerting and noise reduction. Lead incident response in 24/7 on-call rotations, including acting as incident commander and conducting blameless postmortems with durable fixes. Reduce toil through self-service tooling, APIs, automated health checks, champion reliability reviews and chaos testing. Ensure audit-ready operations compliant with WebTrust and PCI-DSS, including change management, configuration baselines, and access controls. Collaborate across teams (Network Engineering, Security Architecture, Hosting, Product) and mentor technical contractors. Maintain high-quality operational documentation, SOPs, and architectural diagrams. Required experience includes 5+ years in SRE or platform engineering supporting large-scale critical systems and network/security platforms; expert-level SaltStack for automation, strong Linux administration, deep networking knowledge (TCP/IP, routing, L4-7, load balancing), proficiency in Python and software engineering practices, observability tools like Icinga, Grafana, InfluxDB, rsyslog, Git workflows, Infrastructure as Code, effective 24/7 on-call and incident management skills, excellent documentation and mentoring abilities. Preferred qualifications include hands-on administration of IDS/IPS and DDoS platforms (TrendMicro TippingPoint, Suricata, NetScout/Arbor Sightline/TMS), experience with HAProxy, Nginx, Juniper, Palo Alto, a relevant Bachelors degree, industry certifications (Security+, CISSP, Linux+), experience in hosting or ISP environments, background in incident response, change management, compliance audits, hybrid cloud/on-premises operations, and knowledge of WebTrust and PCI-DSS requirements. GoDaddy offers a diverse and inclusive work culture with benefits including paid time off, retirement savings, bonus eligibility, equity grants, employee stock purchase plans, competitive health benefits, and family-friendly policies. The role is remote based in India with occasional office visits.