Our mission is to create a world where all relationships are healthy and equitable, through Kind Connections.
August 5
š Hybrid ā London
Our mission is to create a world where all relationships are healthy and equitable, through Kind Connections.
ā¢ Maintain and improve the MLOps platform allowing us to serve predictions at massive scale and iterate faster on all our models ā¢ Administer and manage the GPU-powered MLOps Kubernetes clusters ā¢ Be part of the on call rota to support smooth operation of the MLOps platform and the health of all our ML services ā¢ Improve the MLOps platform in terms of processes, performance and testing ā¢ Research and experiment with the latest MLOps technologies and inference frameworks to unlock capabilities for all ML engineers in the company ā¢ Support the efforts of ML engineering and product teams ā¢ Mentor and coach team members on DevOps and ML engineering best practices
ā¢ Deep understanding of Kubernetes infrastructure and experience administering GPU enabled Kubernetes clusters at scale ā¢ Experience working with Docker and containerised applications ā¢ Experience with at least one programming language such as Python, Golang etc. ā¢ Experience with CI/CD tooling such as ArgoCD, GitHub Actions etc. ā¢ Experience configuring and maintaining monitoring systems such as Grafana, Prometheus etc. ā¢ Experience with IaC tooling, e.g. Terraform ā¢ Comfortable in reacting to incidents and be part of an on call rotation ā¢ Good understanding of machine learning model development life cycle processes and tools: ML model development and experimentation, training pipelines, model serving and monitoring ā¢ Ability to work collaboratively and proactively in a fast-paced environment alongside engineers, scientists and non-technical stakeholders ā¢ A passion for keeping up with the latest ongoings in DevOps and MLOps communities ā¢ A curious mind, self-starter and endlessly keen to learn and develop themselves professionally
Apply Now