AI Research Engineer (Multi-Modal & Vision)

Tether.io · 🌍 Remote Worldwide · Estimated: $80,000 - $120,000

✨ AI Insights & Summary

Join Tether, a pioneering force in digital finance, and contribute to a global financial revolution from a remote, international team. This role offers a unique opportunity to work on the cutting edge of AI, specifically optimizing vision-language models for real-world deployment. You'll be integral to the full model development lifecycle, from data curation to deployment, applying state-of-the-art research in a dynamic environment. If you're passionate about multimodal AI, possess strong engineering discipline, and want to make a tangible impact in a rapidly evolving industry, Tether provides the platform for you to push boundaries and innovate.

About Tether

At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Tether's Innovations:

  • Tether Finance: Features the world’s most trusted stablecoin, USDT, and pioneering digital asset tokenization services.
  • Tether Power: Optimizes excess power for Bitcoin mining using eco-friendly practices.
  • Tether Data: Fuels breakthroughs in AI and peer-to-peer technology with solutions like KEET for secure data sharing.
  • Tether Education: Democratizes access to digital learning, empowering individuals for the digital and gig economies.
  • Tether Evolution: Pushes the boundaries of technology and human potential, merging innovation with human capabilities.

Why Join Us?

Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry. If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.

About the Job

As a member of the AI model team, you will drive innovation in training and optimizing vision-language models with a focus on real-world deployment. Your work will span the full model development lifecycle - from data curation and training pipeline design to model evaluation and optimization - with the goal of building models that are both highly capable and practical to deploy at scale. You will work across a wide spectrum of multimodal architectures integrating text and vision, applying state-of-the-art research to improve model quality, efficiency, and domain-specific performance. We expect you to bring a research-driven mindset combined with strong engineering discipline - someone who can identify the right technique for a given problem, implement it rigorously, and measure its impact clearly. You will work closely with a small, high-caliber team where your contributions will have direct and meaningful impact. If you are passionate about pushing the boundaries of what multimodal AI can achieve in production environments, this is your opportunity.

Responsibilities

  • Conduct end-to-end research and engineering on vision-language models, covering training, evaluation, and optimization across the full model development lifecycle.
  • Design and implement post-training pipelines including supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback.
  • Develop and maintain high-quality multimodal datasets, including data curation, filtering, and balancing for domain-specific tasks.
  • Drive model efficiency and deployability, adapting models for resource-constrained environments using compression and optimization techniques.
  • Design and implement evaluation frameworks and benchmarks to measure model performance, robustness, and real-world task success.
  • Build and scale training workflows across distributed GPU infrastructure.
  • Identify and resolve bottlenecks in training pipelines to achieve state-of-the-art model quality on target benchmarks.
  • Contribute to and leverage open-source ecosystems including models, datasets, and tooling to accelerate development.
  • Stay current with the latest research in multimodal learning and vision-language systems, translating relevant findings into practical improvements.
  • Publish research findings in top-tier AI conferences and journals where applicable.

Requirements

  • Degree in Computer Science, Machine Learning, or a related field; MS/PhD preferred.
  • Strong experience with multimodal post-training workflows including supervised fine-tuning, knowledge distillation, and reinforcement learning from feedback.
  • Hands-on experience with parameter-efficient fine-tuning and distributed training frameworks.
  • Demonstrated ability to build and improve vision-language models with measurable results on standard benchmarks or real-world tasks.
  • Experience adapting models for resource-constrained environments.
  • Proven open-source contributions in multimodal AI on GitHub or HuggingFace.
  • Publications at top AI conferences (NeurIPS, ICML, ICLR, CVPR, ECCV, etc.).

Apply Now

This job is active but will expire soon. Click below to apply on the company's website.

Apply for this role ↗

Job Overview

Posted: 6/17/2026
Category: AI & Machine Learning
Source: JobsCollider

FAQ

Is this position remote?

Yes. The AI Research Engineer (Multi-Modal & Vision) role is remote, and the listed location is Remote Worldwide.

What is the salary?

The employer does not state a salary explicitly; it is described as competitive and based on experience. The listing shows an estimated range of $80,000 - $120,000.

How do I apply?

You can apply by clicking the "Apply for this role" button above to submit your application on the hiring website.
