Our offices are located in Paris, Lille, Toulouse, Rennes, Rouen, Bordeaux and Lyon. WHY WE NEED YOU? Our growth is driving us to strengthen our SRE team to support and scale our production environments. Your mission will be to build and maintain reliable, observable, and secure infrastructure in order to ensure optimal service availability for our customers around the world. YOUR FUTURE TEAM We work in a collaborative and international environment where the diversity of Scalers, combined with a spirit of sharing, helps bring new projects to life every day, advancing our ambitions together. You will be part of a team of experienced Site Reliability Engineers. The team is responsible for maintaining and evolving core infrastructure and observability tools, supporting product teams, and improving the reliability of Scaleway’s services. YOUR DAILY ROUTINE
- Build and optimize tooling to automate monitoring, diagnosis, and remediation of production incidents.
- Troubleshoot high-impact production issues in collaboration with other engineering teams.
- Participate in an on-call rotation to handle incidents and ensure service continuity.
- Implement and maintain observability solutions to monitor infrastructure and application health.
- Contribute to infrastructure lifecycle management across different environments.
- Promote and apply best practices in terms of stability, resiliency, scalability, and security.
- Maintain clear technical documentation for tools and procedures.
- Contribute to system and tool evolution based on production feedback.
- Collaborate closely with development teams to ensure infrastructure readiness.
- Participate in team rituals and knowledge-sharing initiatives.
ABOUT YOU SOFTSKILLS :
- Proactive and solution-oriented mindset.
- Passion for automation and continuous improvement.
- Strong collaboration and communication skills.
- Ability to work independently and in a team.
- Willingness to mentor and share knowledge.
HARDSKILLS :
- Experience with Go, Python or Rust.
- Strong scripting skills (Bash, Python).
- Hands-on experience with Linux systems (Ubuntu/Debian).
- Knowledge of networking (TCP/IP, DNS, BGP, load-balancing, IPv6, etc.).
- Experience in cloud environments and infrastructure (bare metal, VMs, containers, orchestrators).
- Familiarity with monitoring and logging tools (Prometheus, Grafana, Elastic, etc.).
- Comfortable with Infrastructure-as-Code (Ansible, Salt, AWX, etc.).
- Experience managing relational databases (PostgreSQL).
- Understanding of CI/CD pipelines (GitLab).
- Comfortable with English (written and spoken).
WHAT YOU WILL FIND AT SCALEWAY ++++
- Hybrid work: Why join the Scaleway adventure ? ✔ A rich and diverse product offering: Scaleway offers over 100 public cloud products in IaaS, PaaS, and AI. ✔ A cutting-edge technical environment: Scaleway provides modern infrastructures, including high-performance bare metal servers, to tackle exciting technical challenges. ✔ Commitment to responsible cloud: Scaleway is dedicated to a more responsible cloud, with data centers powered solely by renewable energy since 2017, minimizing our ecological footprint and holding top-level certification. THE NEXT STEPS …
- Discovery call with a recruiter (30 min).
- Interview with the manager to understand your technical skills and approach to the role (45 min).
- Technical interview to validate your expertise (1h).
- Interview with the Head of the Tribe to deepen your discussions and assess your fit with the team (45 min).
- HR interview to tour our offices and meet your future colleagues.