4 months ago

Site Reliability Engineer at Remind

74% 40 hours / week United States (Remote)
Paid health insurance
Paid parental leave
Retirement or pension contribution program
Transport or commuting benefits
Unlimited paid holidays

Remind, the leading communication platform in education, helps educators reach students and parents where they are: their phones. With nearly 30 million active users, we’re one of the fastest-growing companies in education technology, but we have our sights set on something bigger: giving every student the opportunity to succeed. About this role

The Remind Engineering Team collaborates to deliver features for our users and customers while setting and maintaining SLAs to ensure reliable system performance. We prefer strongly typed languages over dynamic for critical business systems, and leverage both relational and non-relational data structures as needed, supporting tens of thousands of requests per second. We bias towards using the right tool for the job, including Typescript, Python, Go, Ruby, Twirp, GraphQL, and many AWS services (Aurora, Lambda, DynamoDB, SQS, Kinesis).

As a Site Reliability Engineer at Remind, you’ll collaborate with our product engineering teams, as well as cross-functional teams, to maximize site availability, performance, and uptime, as well as build systems and features to enable engineers to ship more quickly and more confidently.

Not in San Francisco? No problem! Our team is distributed within +/-3 hours of Pacific Time.

About you:

  • You have consistently shipped high quality code to production as part of a team
  • You collaborate effectively with engineers and product managers to build systems to increase the leverage of our product engineering teams
  • You write clean code and have significant experience with one or more programming languages
  • You understand the value of an appropriately defined SLA for both internal and external systems and services, and have experience building highly available systems and services which scale and perform in accordance with such an SLA
  • Others enjoy working with you because of your positive attitude and technical competence

What you’ll do:

  • Increase the overall availability and performance of our distributed services
  • Support uptime through participation in our eng-wide on-call rotation
  • Help establish, conform to, and audit our SLAs so that the performance of our website exceeds the expectations of students, parents, and educators in even our largest and most demanding school districts
  • Use technologies such as Packer+Ansible, stacker, CloudFormation, Docker, ECS, and Lambda to maintain and improve our foundational infrastructure
  • Improve the deployment process to make it fast and predictable as possible
  • With product engineering teams, debug production issues across services and levels of the stack
  • Partner with product engineering teams, to plan the growth of Remind’s infrastructure
  • Maintain our various active and internally created open source projects, for example:

Compensation:

  • Competitive salary and equity
  • 401K
  • 100% health coverage for you and your dependents
  • Open vacation policy
  • Paid parental leave
  • Parking and commuter benefits