✨ AI Insights & Summary
This role at Reddit presents an exciting opportunity to lead the charge in building the next generation of ML Indexing & Retrieval systems, crucial for powering everything from content understanding to GenAI applications. As a technical leader, you'll architect and scale core infrastructure, influencing the platform's evolution and mentoring engineers. If you have deep expertise in large-scale distributed systems, indexing, and retrieval, this is your chance to shape the technological backbone of a globally recognized platform.
About Reddit
Reddit is a community of communities, built on shared interests, passion, and trust. It is the internet's largest source of authentic conversations, hosting over 100,000 active communities and attracting approximately 126 million daily active unique visitors.
About the Team
The ML Indexing & Retrieval Platform team at Reddit is responsible for building and scaling the core infrastructure that powers machine learning-driven recommendations. They design and maintain systems for ML data ingestion, low-latency retrieval services, and end-to-end lifecycle management of data. With a focus on performance, reliability, and scalability, the team enables real-time access to high-quality data supporting applications like Content Understanding, Semantic/Lexical retrieval, and GenAI.
How You'll Have Impact
You will lead the development of next-generation ML Indexing & Retrieval systems, owning the full lifecycle from ideation to production. You'll go beyond incremental improvements to reimagine core platform capabilities. As part of a high-impact, cross-functional team, you will solve complex technical challenges to build scalable, reliable platforms that empower developers to efficiently ship critical ML features.
What You'll Do
- Lead the technical strategy, architecture, and implementation of Reddit’s next-generation ML Indexing & Retrieval engine, integrating capabilities across lexical and vector indexing, low-latency retrieval, and emerging GenAI applications.
- Partner closely with product engineers across Content Understanding, Search, Feeds, Ads, Growth, and Safety to deliver high-quality experiences.
- Define best practices for observability, reliability, and operational excellence in large-scale distributed systems.
- Mentor and guide engineers in designing scalable infrastructure and adopting robust DevOps and SRE principles.
- Collaborate with infrastructure and ML teams to ensure the platform evolves to meet the needs of Reddit’s growing user base and diverse content ecosystem.
Who You Might Be
- 10+ years of experience in software engineering, specializing in Indexing and Retrieval systems.
- 3+ years in technical leadership, architecting and scaling distributed systems in production environments.
- Deep expertise in large-scale data platforms, including batch indexing and stream processing.
- Proven experience designing and operating large-scale, low-latency retrieval services.
- Expertise in lexical and vector search retrieval technologies, such as Milvus, Vespa, or Elasticsearch.
- Skilled in designing cloud-native architectures and managing containerized workloads using Kubernetes and AWS/GCP.
- Adept at translating complex technical challenges into clear, actionable strategies.
- Strong communicator and mentor who leads through collaboration, influence, and technical excellence.
Technical Stack
- Languages: Go, Java, Python, or any object-oriented programming language.
- Frameworks: Flink, Airflow, Spark for large-scale batch & stream processing.
- Databases: Familiarity with Vector, Lexical & Key-Value Databases.
- Tools: Kubernetes, Docker, AWS, GCP.
Benefits
- Comprehensive Healthcare Benefits and Income Replacement Programs
- 401k with Employer Match
- Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
- Family Planning Support
- Gender-Affirming Care
- Mental Health & Coaching Benefits
- Flexible Vacation & Paid Volunteer Time Off
- Generous Paid Parental Leave
Pay Transparency
This job posting may span more than one career level. In addition to base salary, this job is eligible to receive equity in the form of restricted stock units. Reddit offers a wide range of benefits to U.S.-based employees. The base salary range for this position is $279,200 - $390,900 USD.
Additional Information
- #LI-Remote
- In select roles and locations, interviews may be recorded, transcribed, and summarized by AI. You will have the opportunity to opt out.