AI Researcher
Toptal is a global network of top talent in business, design, and technology, enabling companies to scale their teams on demand. As the world's largest fully remote workforce, we combine the best elements of virtual teams with a robust support structure that fosters innovation, social interaction, and fun. We operate globally, move at a fast pace, and aren't afraid to break the mold.
We are building a dedicated AI Research team focused on advancing the frontier of agentic AI systems powered by proprietary real-world interaction data. We are seeking AI Researchers excited to explore how large-scale, real-world signals can enhance reasoning, improve generalization, and create more capable multimodal agents.
In this remote role, you will work at the intersection of model development, multimodal representation learning, and reinforcement learning. You will design new approaches for agents to learn from complex behavioral data, workflows, and multimodal inputs (audio, logs, structured interaction traces). Your focus will be on building and improving learning systems for agents, including methods for RAG, fine-tuning, reinforcement learning (RLHF, DPO, GRPO), joint embedding spaces, and speech/audio intelligence (STT, ASR, audio signal modeling). You will collaborate closely with engineering and product teams to translate research breakthroughs into scalable systems and ensure production feedback continuously improves model behavior.
Responsibilities:
- Advance research on agentic AI systems trained on real-world interaction signals and multimodal data.
- Design and experiment with learning paradigms for large-scale models, including RAG, supervised fine-tuning, RLHF, DPO, and GRPO-style methods.
- Develop multimodal representation learning approaches, including joint embedding spaces across text, audio, logs, and structured interaction traces.
- Improve speech and audio intelligence capabilities, including STT, ASR, and audio-driven learning signals.
- Research methods for enhancing agent reasoning, planning, tool use, and adaptation in real-world environments.
- Define how complex behavioral and interaction signals can be translated into effective training objectives for large-scale models.
- Build and refine evaluation methodologies for agent performance in real-world, domain-specific scenarios.
- Collaborate with engineering and product teams to bring research ideas into production systems.
- Identify patterns in real-world workflows and convert them into generalizable modeling and representation strategies.
- Contribute to the long-term research direction of Toptal’s agentic AI systems and multimodal capabilities.
- Stay current with academic and industry research and integrate relevant advancements into internal systems.
Qualifications and Job Requirements:
- PhD in Computer Science, Machine Learning, AI, Electrical Engineering, or a related field.
- 5+ years of experience in applied AI research or ML systems with production impact.
- Strong background in large-scale machine learning, LLMs, or multimodal AI systems.
- Hands-on experience with RAG systems, fine-tuning large language models, and reinforcement learning methods (RLHF, DPO, or GRPO-style approaches).
- Experience with VLM.
- Strong understanding of representation learning, embeddings, and joint embedding spaces.
- Experience with speech and audio modeling, including STT, ASR, or audio signal processing.
- Proficiency in Python and modern ML frameworks (PyTorch, Hugging Face ecosystem).
- Experience designing or improving evaluation methodologies for LLMs or agentic systems.
- Experience with agentic AI systems, including reasoning, planning, or tool-use architectures.
- Background in multimodal AI systems (text, audio, vision, or structured logs).
- Experience embedding AI into real-world products (browsers, IDEs, enterprise tools).
- Experience with real-time or streaming AI systems.
- Open-source contributions or publications in top-tier ML/AI conferences.
- Strong ability to define research hypotheses from ambiguous, real-world problems.
- Outstanding written and verbal communication skills in English.
- Must be a world-class individual contributor.