May 2
🏢 In-office - London
• Galaxy is seeking a Site Reliability Engineer to build Observability and Infrastructure as code to help accelerate the development of innovative software systems for Galaxy Digital. • Be on a PagerDuty rotation to respond to availability incidents and provide support for developers and the business • Build, manage, and maintain our cloud infrastructure with Terraform, Kubernetes, flux/Helm, and other tools • Build and maintain automated configuration management • Help plan the growth trajectory of Galaxy Digital's infrastructure • Help ensure we're following industry best practices • Actively participate in incident response in the wake of production issues • Build and assist with CI/CD deployments and application observability
• BS degree in CS, Software Engineering or related field // or equivalent experience • Implement "Infrastructure as Code" using Terraform and CI/CD • Load balancing applications using including Proxies and CDN • Monitoring and Metrics in Prometheus, Grafana, OpenSearch, and integrations with Slack/PagerDuty • Disaster Recovery and High Availability strategy • Managing Kubernetes clusters and using Helm CI/CD for deployment • Cloud architecture and design • Coding in Python, Ruby, Go, or other high-level languages • Ansible, Puppet, Chef, or other configuration management tooling
• Competitive base salary and discretionary bonus • Flexible Time Off (i.e. unlimited paid vacation days) • Company paid Holidays (11) • Company paid sick leave • Company-paid health and protective benefits for employees, partners, and other dependents • 3% 401(k) company contribution • Generous paid Parental Leave • Free virtual coaching and counseling sessions through Ginger • Opportunities to learn about the Crypto industry • Free daily snacks in-office • Smart, entrepreneurial, and fun colleagues • Employee Resource Groups
Apply Now