Ready to dive into the cutting-edge world of generative AI? My client are seeking an experienced Site Reliability Engineer (SRE) to lead our infrastructure and establish the architectural direction for a rapidly growing company. As the first SRE hire, you’ll have the unique opportunity to build and manage a high-performing team while making a massive impact on how they scale and operate.
What’s the job?
- Own the Infrastructure: They're running on AWS, but need someone who can build out the SRE framework with tools like Grafana and Prometheus already in place. The real challenge? Designing systems to handle high-load scenarios, with plenty of room for scale.
- Lead the Charge: As the first member of the SRE team, you’ll set the stage for future growth—helping them scale with reliability and ensuring our systems can handle anything thrown their way.
- Bring the Magic to Incident Response: Help shape and implement our incident response strategy. They’re looking for someone who can create a seamless system, starting with the basics (we’ve got on-call support), and scaling up from there.
- SRE System Building: It's all about systems architecture, monitoring, and reliability. They're not greenfield, but we need you to take things to the next level.
- You’ve got a Kubernetes (or similar) background and can handle high-scale environments like those in streaming media or live broadcasting (think Twitch or YouTubeTV).
- Experience with cloud infrastructure is a must (AWS is their home), but you're ready to add that SRE magic.
- You’re not afraid to build from the ground up—while they’re not starting from scratch, they need someone who’s ready to step in and design for the future.
- You’ll need to debug like a pro and troubleshoot in real-time while helping to scale their systems as we grow.
- Size? Just you to start, but you'll be building and managing a team as we grow.
- Reporting to? The CTO, because we’re all in this together.
- Your Impact? HUGE. You’ll shape the reliability strategy that supports the future of generative AI.
- Be part of a leading-edge AI company with exciting growth and tons of opportunity.
- You'll set the stage for a high-impact team and have a hand in everything that comes next.
- Fast-paced, dynamic environment where your contributions matter.
- Screening call, tech interviews (x2), culture fit discussion, and a chat with the CEO.