
Staff Site Reliability Engineer

Remote @Gemini in Engineering
  • Post Date : 18/09/2022
  • Apply Before : 18/10/2022

Job Detail

  • Job ID 221669
  • Working Hours Full-time
  • Years Experience Required 7

Job Description

Empower the Individual Through Crypto

Gemini is a crypto exchange and custodian that allows customers to buy, sell, store, and earn more than 30 cryptocurrencies like bitcoin, bitcoin cash, ether, litecoin, and Zcash. Gemini is a New York trust company that is subject to the capital reserve requirements, cybersecurity requirements, and banking compliance standards set forth by the New York State Department of Financial Services and the New York Banking Law. Gemini was founded in 2014 by twin brothers Cameron and Tyler Winklevoss to empower the individual through crypto.

Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we want to help you buy, sell, and store your bitcoin and cryptocurrency. Crypto is not just a technology, it’s a movement.

At Gemini, our mission is to empower the individual and that includes giving our employees flexibility of choice — our Office Optional Policy allows employees to choose to work from one of our physical locations or from home.

Select roles that are location-specific will still be eligible for flexible schedules.

The Department: Platform

Our Platform Organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses on building a scalable and secure foundation that enables Engineering to deploy, validate, and operate services in production, improving service resiliency, increasing organizational efficiency by reducing operational toil, and increasing system efficiency through architectural evolution.

THE ROLE: Staff Site Reliability Engineer

Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering teams to ensure all our systems are architected, engineered, and deployed to be resilient, reliable, and performant.

The Embedded SRE team is part of Site Reliability Engineering and engages directly with our other engineering teams: onboarding them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding them on implementing the tooling that the larger Platform organization provides so that systems can scale and react to changing conditions, with continuous improvement loops. You will be integral to leading Gemini’s engineering teams toward modern DevOps practices, both by developing and providing modern automation and operational tooling and by working cross-functionally across those teams to influence and shape our development practices and culture.


RESPONSIBILITIES:

  • Guiding engineering teams onto the various supported services provided by Platform
  • Running ongoing performance evaluations and improvements for Gemini systems
  • Architecture recommendations and engagement as part of the SDLC
  • Creating “Production-ready Scorecards” to evaluate the health of systems pre-launch
  • Implementing and teaching monitoring, alerting and automated resolution best practices
  • Defining SLIs and SLOs with Engineering teams
  • Educating and guiding Engineering teams on reliability and resiliency best practices, such as statelessness and chaos engineering
  • Building operational tooling and automations

QUALIFICATIONS:

  • 7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
  • Experience in a code-first environment, developing automated solutions to solve support and operational issues
  • Experience as a Technical Leader within a team, helping evaluate and make technical decisions for the team
  • Experience working with containerization and orchestration tools such as Nomad, EKS (k8s), Docker, etc.
  • Experience working with Configuration Management tools such as Ansible, Chef, or Puppet
  • Experience writing scripts or CLI tools that help increase Developer Productivity
  • Experience in analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
  • Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
  • Experience working in a code-driven, automation-first public cloud infrastructure

It Pays to Work Here

We take a holistic approach to compensation at Gemini, which includes:

  • Competitive Compensation and Profit-Sharing Equity
  • Flexible vacation policy
  • Retirement Plan Matching
  • Generous Parental leave
  • Comprehensive health plans
  • Training and professional development
