Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Applied Scientist / Research Engineer - Edge Devices and Quantization - EMEA

Mistral AI
City of London
1 week ago
Create job alert
About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture onhttps://mistral.ai/careers.

About The Job

Mistral AI is seeking Applied Scientists and Research Engineers focused on model efficiency and edge deployment. You will research and build ultra-efficient models and toolchains for on-device inference across CPUs, GPUs, NPUs, and specialized accelerators. Your work will enable Mistral models to run privately, reliably, and fast on mobile, desktop, and embedded devices.

What you will do
  • Run pre-training, post-training and deploy state of the art models on clusters with thousands of GPUs. You don’t panic when you see OOM errors or when NCCL feels like not wanting to talk.
  • Design and evaluate quantization, pruning, distillation, and sparsity methods for LLMs and multimodal models.
  • Build deployment stacks, optimize kernels and memory layouts.
  • Run large-scale experiments to balance accuracy, latency, throughput, and power under tight memory constraints; profile and fix bandwidth/compute bottlenecks.
  • Develop tooling for calibration data generation, mixed-precision training, quant-aware finetuning, structured/unstructured sparsity, and compilation passes.
  • Manage research projects and communications with client research teams.
About you
  • You are fluent in English, and have excellent communication skills. You are at ease explaining complex technical concepts to both technical and non-technical audiences.
  • You’re not afraid of contributing to a big codebase and can find yourself around independently with little guidance.
  • You’ve a deep understanding of quantization trade-offs, hardware constraints, and compiler stacks
  • You’re expert with PyTorch or JAX; strong C++/CUDA or low-level performance skills a plus; production-grade Python.
  • You don’t need roadmaps: you just do. You don’t need a manager: you just ship.
  • Low-ego, collaborative and eager to learn.
  • You have a track record of success through personal projects, professional projects or in academia.
It would be great if you
  • Hold a PhD / master in a relevant field (e.g., Mathematics, Physics, Machine Learning), but if you’re an exceptional candidate from a different background, you should apply.
  • Have contributed to a large codebase used by many (open source or in the industry).
  • Have a track record of publications in top academic journals or conferences.
  • Contributions to open-source inference/compilers stacks.
  • Love improving existing code by fixing typing issues, adding tests and improving CI pipelines.
  • Have experience optimizing inference on edge devices
Benefits

We have local offices in Paris, London, Marseille, Singapore and Palo Alto.

France

💰 Competitive cash salary and equity

🥕 Food : Daily lunch vouchers

🥎 Sport : Monthly contribution to a Gympass subscription

🚴 Transportation : Monthly contribution to a mobility pass

🧑⚕️ Health : Full health insurance for you and your family

🍼 Parental : Generous parental leave policy

🌎 Visa sponsorship

UK

💰 Competitive cash salary and equity

🚑 Insurance

🚴 Transportation: Reimburse office parking charges, or 90GBP/month for public transport

🥎Sport: 90GBP/month reimbursement for gym membership

🥕 Meal voucher: £200 monthly allowance for its meals

💰Pension plan: SmartPension (percentages are 5% Employee & 3% Employer)


#J-18808-Ljbffr

Related Jobs

View all jobs

Sr. Applied Scientist, Silicon and Systems Group Edge AI

Data Scientist, Silicon and Systems Group Edge AI

Head of Radiotherapy Physics

Head of Radiotherapy Physics

Head of VLA Development

2026 Graduate - Mechanical Engineer - Cambridge

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Edge Computing Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK edge computing hiring has moved from tool‑lists to capability‑driven assessments that emphasise resilient edge architectures, real‑time data pipelines, secure device fleets, container/Kubernetes at the edge, on‑device/near‑edge ML, and measurable business impact (latency, reliability, cost‑to‑serve). This guide explains what’s changed, what to expect in interviews & how to prepare—especially for edge platform engineers, IoT/OT engineers, edge SREs, embedded/firmware engineers, edge AI/ML engineers, network engineers (5G/private LTE), security specialists & product managers. Who this is for: Edge platform/SRE, IoT solution architects, embedded/firmware developers, edge AI/ML engineers, network engineers (5G/SD‑WAN), security engineers (OT/ICS), data/streaming engineers, site deployment/field engineers & edge product managers targeting roles in the UK.

Why Edge Computing Careers in the UK Are Becoming More Multidisciplinary

For years, computing innovation was focused on the cloud. But as demand for real-time analytics, low-latency processing and secure local data handling grows, edge computing has become the next frontier. From autonomous vehicles to healthcare monitoring devices, retail checkout systems to industrial IoT, edge computing is transforming how data is processed and used in the UK. This shift has also changed what it means to work in the field. Edge computing careers are no longer purely technical. They now require knowledge of law, ethics, psychology, linguistics & design, as professionals must consider regulation, human behaviour, communication & usability alongside engineering. In this article, we’ll explore why UK edge computing careers are becoming more multidisciplinary, how these five fields intersect with edge roles, and what job-seekers & employers need to know to thrive in this evolving landscape.

Edge Computing Team Structures Explained: Who Does What in a Modern Edge Computing Department

Edge computing is expanding rapidly in the UK, driven by demands for low latency, on-site processing, IoT proliferation, autonomous systems, 5G, AI inference on devices, and regulatory pressures for data sovereignty. Businesses in sectors such as telecoms, industrial automation, retail, smart cities, autonomous vehicles, and healthcare are pushing computation and intelligence closer to where data is generated. But to design, build, deploy, secure, and maintain edge computing systems requires more than just hardware or software — it requires structured teams with clearly defined roles and responsibilities. If you’re hiring, or applying for roles via EdgeComputingJobs.co.uk, understanding who does what in a mature edge computing department will help you plan better, show relevance in job applications, and build resilient teams. This article covers the key roles in edge computing teams, how they collaborate through the project lifecycle, what skills and qualifications UK employers usually expect, salary benchmarks, challenges and trends, and best practices for structuring effective edge teams.