⨠AI Insights & Summary
Remote is seeking a Senior Site Reliability Engineer to architect and maintain the critical infrastructure powering their 'Remote Build' platform, which enables AI agents to integrate with global employment compliance systems. This role offers a unique challenge to build robust, secure, and scalable infrastructure across 100+ jurisdictions, directly impacting the future of work. If you are a seasoned SRE with deep expertise in Kubernetes, AWS, and Infrastructure as Code, and you thrive on automating complex systems and ensuring operational excellence in a fully remote, asynchronous environment, this is an exceptional opportunity.
About Remote
Remote is revolutionizing global employment by enabling businesses to compliantly recruit, pay, and manage international teams. Operating with a future-focused, asynchronous work culture across six continents, Remote is dedicated to solving complex HR and finance challenges. Innovation, automation, and AI are core to their strategy, and they encourage diverse talents to build a best-in-class HR platform.
The Position: Senior Site Reliability Engineer for Remote Build
Remote is launching 'Remote Build,' an agentic shift designed to route AI agents through their employment infrastructure layer, covering labor law, payroll, and compliance across over 100 countries. As a Senior Site Reliability Engineer, you will own the operational excellence and infrastructure strategy for this platform, ensuring it is reliable, performant, and secure. You will report to the Engineering Manager and collaborate closely with leadership, product managers, engineers, and customer success teams.
What You'll Do
- Infrastructure as Code at Scale: Design, implement, and maintain IaC patterns using Terraform and Kubernetes to support standard and custom builds, simplifying deployment and operation for engineers.
- Observability and Incident Response: Build and maintain comprehensive monitoring, logging, and alerting systems. Lead incident response, conduct post-mortems, and drive continuous reliability improvements.
- Security and Compliance in Motion: Embed security into infrastructure layers and ensure compliance across 100+ jurisdictions without hindering developers or customers.
- Performance and Cost Optimization: Continuously optimize system performance, resource utilization, and cloud costs, making recommendations for improved reliability and unit economics.
- Automation and Operational Leverage: Systematically eliminate manual operational toil by building tools and processes that enable efficient team operation without headcount scaling.
- Platform Reliability and Developer Experience: Partner with platform teams to ensure APIs, MCP, and CLI are resilient and observable, providing infrastructure feedback to guide platform evolution.
What You'll Bring
- Senior-level SRE experience, including standing up and operating production systems at scale.
- Deep, hands-on experience running Kubernetes in production and solid AWS fundamentals.
- Proficiency with Terraform or similar Infrastructure-as-Code tools.
- Real experience setting up and operating CI/CD pipelines (GitLab, GitHub Actions, Jenkins, etc.).
- Strong bash scripting skills and comfort with Linux systems debugging.
- Excellent communication skills, able to explain complex infrastructure decisions clearly to both technical and non-technical stakeholders.
Nice to Have:
- Experience with at least one backend programming language (Elixir, Python, Go, Java, Node.js, etc.).
- Experience in consultancy settings.
- Familiarity with container registry and artifact management (ECR, Docker Hub, etc.).
- Depth in observability stacks (Datadog, Prometheus, ELK, Grafana, or similar).
- Experience with multi-tenant platforms.
Practicals
- Reporting to: Engineering Manager
- Team: Engineering
- Location: Anywhere in the World
- Start Date: As soon as possible
Application Process
Includes interviews with a recruiter, hiring manager, technical deep dives, executive interviews, and a bar raiser interview, followed by an offer and background check.
Compensation & Benefits
- Salary Range: $54,000ā$150,000 USD annually (base pay depends on location, skills, experience, etc.).
- Benefits & Perks: Work from anywhere, flexible PTO, flexible working hours (async), 16 weeks paid parental leave, mental health support, stock options, learning budget, home office/IT equipment budget, and budget for local social events or co-working spaces. Remote emphasizes fair, unbiased compensation and competitive benefits globally.