Is the AI Research Engineer (Multi-Modal & Vision) position remote, hybrid, or on-site?

The AI Research Engineer (Multi-Modal & Vision) role at Tether.io is a remote opportunity. The location specified by the employer is Remote Worldwide.

What is the salary range for the AI Research Engineer (Multi-Modal & Vision) role at Tether.io?

The salary for AI Research Engineer (Multi-Modal & Vision) at Tether.io is not explicitly stated, but is competitive and based on experience.

How do I apply for the AI Research Engineer (Multi-Modal & Vision) position?

You can apply directly by visiting the dynamic application link on FutureTalent at: https://www.futuretalent.online/jobs/10362-ai-research-engineer-multi-modal-and-vision-tetherio.

AI Research Engineer (Multi-Modal & Vision) Job at Tether.io | Remote Opportunity

✨ AI Insights & Summary

Join Tether, a pioneering force in digital finance, and contribute to a global financial revolution from a remote, international team. This role offers a unique opportunity to work on the cutting edge of AI, specifically optimizing vision-language models for real-world deployment. You'll be integral to the full model development lifecycle, from data curation to deployment, applying state-of-the-art research in a dynamic environment. If you're passionate about multimodal AI, possess strong engineering discipline, and want to make a tangible impact in a rapidly evolving industry, Tether provides the platform for you to push boundaries and innovate.

About Tether

At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Tether's Innovations:

Tether Finance: Features the world’s most trusted stablecoin, USDT, and pioneering digital asset tokenization services.
Tether Power: Optimizes excess power for Bitcoin mining using eco-friendly practices.
Tether Data: Fuels breakthroughs in AI and peer-to-peer technology with solutions like KEET for secure data sharing.
Tether Education: Democratizes access to digital learning, empowering individuals for the digital and gig economies.
Tether Evolution: Pushes the boundaries of technology and human potential, merging innovation with human capabilities.

Why Join Us?

Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry. If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.

About the Job

As a member of the AI model team, you will drive innovation in training and optimizing vision-language models with a focus on real-world deployment. Your work will span the full model development lifecycle - from data curation and training pipeline design to model evaluation and optimization - with the goal of building models that are both highly capable and practical to deploy at scale. You will work across a wide spectrum of multimodal architectures integrating text and vision, applying state-of-the-art research to improve model quality, efficiency, and domain-specific performance. We expect you to bring a research-driven mindset combined with strong engineering discipline - someone who can identify the right technique for a given problem, implement it rigorously, and measure its impact clearly. You will work closely with a small, high-caliber team where your contributions will have direct and meaningful impact. If you are passionate about pushing the boundaries of what multimodal AI can achieve in production environments, this is your opportunity.

Responsibilities

Conduct end-to-end research and engineering on vision-language models, covering training, evaluation, and optimization across the full model development lifecycle.
Design and implement post-training pipelines including supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback.
Develop and maintain high-quality multimodal datasets, including data curation, filtering, and balancing for domain-specific tasks.
Drive model efficiency and deployability, adapting models for resource-constrained environments using compression and optimization techniques.
Design and implement evaluation frameworks and benchmarks to measure model performance, robustness, and real-world task success.
Build and scale training workflows across distributed GPU infrastructure.
Identify and resolve bottlenecks in training pipelines to achieve state-of-the-art model quality on target benchmarks.
Contribute to and leverage open-source ecosystems including models, datasets, and tooling to accelerate development.
Stay current with the latest research in multimodal learning and vision-language systems, translating relevant findings into practical improvements.
Publish research findings in top-tier AI conferences and journals where applicable.

Requirements

Degree in Computer Science, Machine Learning, or a related field; MS/PhD preferred.
Strong experience with multimodal post-training workflows including supervised fine-tuning, knowledge distillation, and reinforcement learning from feedback.
Hands-on experience with parameter-efficient fine-tuning and distributed training frameworks.
Demonstrated ability to build and improve vision-language models with measurable results on standard benchmarks or real-world tasks.
Experience adapting models for resource-constrained environments.
Proven open-source contributions in multimodal AI on GitHub or HuggingFace.
Publications at top AI conferences (NeurIPS, ICML, ICLR, CVPR, ECCV etc.).

AI Research Engineer (Multi-Modal & Vision)

✨ AI Insights & Summary

About Tether

Tether's Innovations:

Why Join Us?

About the Job

Responsibilities

Requirements

Apply Now

Share Job

Job Overview

FAQ

Is this position remote?

What is the salary?

How do I apply?

Similar Opportunities

Computer Vision & AI Engineer - N3XT Interceptor C‑UAS (m/f/d)

Trainee Developer / Programmierer für KI-Agenten (m/w/d)

Werkstudent AI Engineer (m/w/d)