The #1 platform for connected data
Graph Database • NoSQL Database • Native Graph Technology • Graph Platform • Graph Analytics
April 27
🏢 In-office - London
• The Site Reliability Engineering team’s mission is to improve the overall reliability of Neo4j’s DBaaS product: Neo4j Aura. Our product operates at scale and spans all 3 major cloud providers, with hundreds of Kubernetes clusters running in production.
• Until recently, the SRE function at Neo4j Aura achieved this by filling the shoes of a more traditional Ops team. We are in the process of transforming the team and need your help to implement what we believe to be a more authentic form of SRE, by:
  - Educating software engineers and product managers on SRE principles such as SLIs and SLOs
  - Reducing the barrier to effective Ops for the engineering department by building abstractions and automating away toily tasks
  - Applying software engineering to solve operational problems - we believe in writing operators rather than bash scripts
  - Encouraging engineering teams to take ownership of running their code in production
• Experience applying SRE practices in the wild: defining SLIs for key software, reducing toil through automation, monitoring applications for success
• The ability to debug large and complex cloud-based systems
• Extensive experience in monitoring systems and their performance
• Experience deploying and working with observability systems such as Prometheus, Grafana, Datadog, and Google Cloud Logging (formerly Stackdriver)
• Extensive experience with deploying and managing applications running on Kubernetes (experience with administering Kubernetes clusters is a plus)
• Knowledge of Go, Kustomize, and Terraform (some knowledge of Python is also a plus)
• Production experience with proxy software (e.g., Envoy, NGINX, HAProxy) and networking in general
• Experience with building CI/CD pipelines - we use GitHub Actions and TeamCity
• Familiarity working with a variety of Cloud Native projects
• Experience being on call is a plus
• 11 paid holidays
• Generous Accrued Time Off increasing with years of service
• Generous paid sick time
• Annual day of service