Mistral AI

Developing the best generative AI models

11 - 50

Site Reliability Engineer (Paris/London)

August 27

🔄 Hybrid – London

⏰ Full Time

🟡 Mid-level

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

Bash

Cloud

Distributed Systems

Docker

Flux

Grafana

Kubernetes

Prometheus

Python

Terraform

Description

• Design, build, and maintain scalable, highly available, and fault-tolerant infrastructure to support our web services and ML workloads
• Keep our platform, inference, and model-training environments highly available, and enable seamless replication of work environments across several HPC clusters
• Operate systems and troubleshoot issues in production environments (interrupts, on-call responses, user administration, data extraction, infrastructure scaling, etc.)
• Implement and improve monitoring, alerting, and incident response systems to ensure optimal system performance and minimize downtime
• Implement and maintain workflows and tools (CI/CD, containerization, orchestration, monitoring, logging, and alerting systems) for both our client-facing APIs and large training runs
• Participate occasionally in on-call rotations to respond to incidents, and perform root cause analysis to prevent future occurrences
• Drive continuous improvement in infrastructure automation, deployment, and orchestration using tools like Kubernetes, Flux, and Terraform
• Collaborate with AI/ML researchers to develop and implement solutions that enable safe and reproducible model-training experiments
• Build a cloud-agnostic platform offering an abstraction layer between science and infrastructure
• Design and develop new workflows and tooling to improve the reliability, availability, and performance of our systems (automation scripts, refactoring, new API-based features, web apps, dashboards, etc.)
• Collaborate with the security team to ensure infrastructure adheres to security best practices and compliance requirements
• Document processes and procedures to ensure consistency and knowledge sharing across the team
• Contribute to open-source projects, research publications, blog articles, and conferences

Requirements

• Master’s degree in Computer Science, Engineering, or a related field
• 5+ years of experience in a DevOps/SRE role
• Strong experience with cloud computing and highly available distributed systems
• Exposure to site reliability issues in critical environments (root cause analysis, in-production troubleshooting, on-call rotations...)
• Experience working against reliability KPIs (observability, alerting, SLAs)
• Hands-on experience with CI/CD, containerization, and orchestration tools (Docker, Kubernetes...)
• Knowledge of monitoring, logging, alerting, and observability tools (Prometheus, Grafana, ELK Stack, Datadog...)
• Familiarity with infrastructure-as-code tools like Terraform or CloudFormation
• Proficiency in scripting languages (Python, Go, Bash...) and knowledge of software development best practices
• Strong understanding of networking, security, and system administration concepts
• Excellent problem-solving and communication skills
• Self-motivated and able to work well in a fast-paced startup environment
• Experience in an AI/ML environment
• Experience with high-performance computing (HPC) systems and workload managers (Slurm)
• Experience working with modern AI-oriented solutions (Fluidstack, CoreWeave, Vast...)

Benefits

• Competitive salary and bonus structure
• Comprehensive benefits package (daily lunch vouchers, Gympass subscription, mobility pass contribution, full health insurance for you and your family, generous parental leave policy...)
