Senior Software Engineer - Reliability (Remote)

  1. Home
  2. Remote jobs
  3. terraform
  • Company Freenome
  • Employment Full-time
  • Location 🇺🇸 United States, California
  • Submitted Posted 1 day ago - Updated 12 minutes ago
<p><strong>About this opportunity:</strong></p><p><span style="font-weight: 300;">Our Site Reliability Engineering (SRE) team is a new and critical function at Freenome. As a founding member of the team, you’ll help define the culture and build the systems that keep our regulated, cloud-based production environments reliable as we transition from research to commercial operations. This is an opportunity to do meaningful engineering work that will directly save lives.<br></span></p><p><span style="font-weight: 300;">We value:</span></p><ul><li><span style="font-weight: 300;">Reliability as a product feature</span></li><li><span style="font-weight: 300;">Continual improvement and learning</span></li><li><span style="font-weight: 300;">Automate all the things!</span></li><li><span style="font-weight: 300;">Technical simplicity and clarity</span></li><li><span style="font-weight: 300;">Blameless postmortems and transparent communication<br></span></li></ul><p><span style="font-weight: 300;">As a Site Reliability Engineer, you will help design, implement, and operate observability, reliability, and incident management systems and practices across our clinical lab systems and regulated commercial workloads. You’ll partner with engineering teams to define service-level indicators (SLIs), objectives (SLOs), and error budgets; build runbooks and operational playbooks; and develop the monitoring and automation needed to ensure that our systems are reliable and compliant. &nbsp;This will also include contributions to system code, Infrastructure deployments and automation.<br></span></p><p><span style="font-weight: 300;">This role is ideal for an engineer with experience running production workloads in the cloud, who is excited to build an SRE practice from the ground up in a regulated environment. &nbsp;<br></span></p><p><span style="font-weight: 300;">The role reports to the Director, Cloud Infrastructure.<br></span></p><p><strong>What you’ll do:</strong></p><ul><li style="font-weight: 300;"><span style="font-weight: 300;">Define and implement observability practices (metrics, traces, dashboards, logs, alerts) for production systems</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Partner with product, engineering, and lab teams to develop and maintain incident response playbooks and escalation procedures</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Partner with engineering teams to define SLIs/SLOs and establish error budgets</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Participate in on-call rotation for production systems, champion a focus on automation and self-healing</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Contribute to production deployment and change-management processes that meet FDA and compliance requirements</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Automate operational tasks, reducing manual intervention</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Contribute to production systems and designs with the goal of improving reliability</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Use Infrastructure as Code (IaC) to manage and deploy team owned infrastructure and subsystems</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Help build out the SRE practice<br></span></li></ul><p><strong>Communication and Collaboration:</strong></p><ul><li><span style="font-weight: 300;">Work closely with engineering, product, and lab teams to understand service reliability needs</span></li><li><span style="font-weight: 300;">Partner with TPMs, RA/QA, and compliance stakeholders to align operational practices with regulatory requirements</span></li><li><span style="font-weight: 300;">Participate in cross-functional incident reviews and postmortems</span></li><li><span style="font-weight: 300;">Share knowledge and document operational standards for consistency and onboarding</span></li><li><span style="font-weight: 300;">Design and run fire drills / tabletop exercises as well as disaster recovery exercises<br></span></li></ul><p><strong>Culture:</strong></p><ul><li><span style="font-weight: 300;">Model Freenome’s values and principles in your work and interactions</span></li><li><span style="font-weight: 300;">Promote a collaborative, reliable engineering culture across product, infra, and lab engineering teams</span></li><li><span style="font-weight: 300;">Contribute to documentation, runbooks, and operational standards</span></li><li><span style="font-weight: 300;">Foster a culture of accountability, learning, and psychological safety<br></span></li></ul><p><strong>Technical Leadership:&nbsp;</strong></p><ul><li><span style="font-weight: 300;">Independently drive reliability improvements in scoped systems or services</span></li><li><span style="font-weight: 300;">Provide mentorship to peers on observability, incident management, and operational best practices</span></li><li><span style="font-weight: 300;">Help build and evolve Freenome’s reliability practices and contribute to team strategy discussions<br></span></li></ul><p><strong>Must haves:</strong></p><ul><li style="font-weight: 300;"><span style="font-weight: 300;">Bachelor’s degree in Computer Science, Engineering, or equivalent experience</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">5+ years in software engineering or Infra/DevOps/SRE roles (Python or Go are what we currently use)</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Experience deploying cloud infrastructure via automation (e.g. Terraform, Pulumi, Bicep/ARM, etc.)</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Incident management experience in cloud/software engineering as well as familiarity with incident management platforms (e.g., Incident.io, ServiceNow, Opsgenie, Pagerduty, etc.)</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Hands-on experience operating production workloads in cloud environments</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Familiarity with Kubernetes (AKS, GKE, or EKS)</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Strong troubleshooting and root-cause analysis skills in distributed systems</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Experience with observability platforms (e.g., DataDog, Prometheus/Grafana, OpenTelemetry)</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Ability to define and implement metrics, dashboards, and alerting</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Demonstrated ability to work autonomously and own technical outcomes</span></li><li style="font-weight: 300;"><span style="font-weight: 300;">Strong understanding of cloud Infrastructure and Networking architectures and automation<br></span></li></ul><p><strong>Nice to haves:</strong></p><ul><li><span style="font-weight: 300;">Experience supporting regulated environments (healthcare, biotech, financial)</span></li><li><span style="font-weight: 300;">Familiarity with compliance-driven change management and release processes (FDA, HIPAA)</span></li><li><span style="font-weight: 300;">Knowledge of CI/CD deployment strategies and change automation</span></li><li><span style="font-weight: 300;">Experience with both GCP and Azure cloud platforms</span></li><li><span style="font-weight: 300;">Interest in mentorship and system reliability practices at scale<br></span></li></ul><p><strong>Benefits and additional information:</strong></p><p><span style="font-weight: 300;">The US target range of our</span><span style="font-weight: 300;"> base salary</span><span style="font-weight: 300;"> for new hires is $131,325 - $201,000.</span><span style="font-weight: 300;">&nbsp;You will also be eligible to receive pre-IPO equity, cash bonuses, and a full range of medical, financial, and other benefits depending on the position offered.&nbsp; Please note that individual total compensation for this position will be determined at the Company’s sole discretion and may vary based on several factors, including but not limited to, location, skill level, years and depth of relevant experience, and education. We invite you to check out our career page @ <span style="font-weight: 300;"><em><a href="http://freenome.com/job-openings/" target="_blank" data-saferedirecturl="https://www.google.com/url?q=http://freenome.com/job-openings/&amp;source=gmail&amp;ust=1708620295766000&amp;usg=AOvVaw0K-a4lTJ0RquYagpqS-yp4">freenome.com/job-openings/</a></em></span></span><span style="font-weight: 300;">&nbsp;for additional company information.&nbsp;&nbsp;</span></p><p><span style="font-weight: 300;">Freenome is proud to be an equal-opportunity employer, and we value diversity. Freenome does not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local </span><span style="font-weight: 300;">law.</span></p><p><em><span style="font-weight: 300;">Applicants have rights under Federal Employment Laws.&nbsp;&nbsp;</span></em></p><ul><li><a href="https://www.dol.gov/agencies/whd/posters/fmla" target="_blank"><em><span style="font-weight: 300;">Family &amp; Medical Leave Act (FMLA)</span></em></a></li><li><a href="https://www.dol.gov/agencies/ofccp/posters" target="_blank"><em><span style="font-weight: 300;">Equal Employment Opportunity (EEO)</span></em></a></li><li><a href="https://www.dol.gov/agencies/whd/posters/employee-polygraph-protection-act" target="_blank"><em><span style="font-weight: 300;">Employee Polygraph Protection Act (EPPA)</span></em></a></li></ul><p><span style="color: #ecf0f1;"><em><span style="font-weight: 300;">#LI-REMOTE</span></em></span></p>

Loading similar jobs...

USA Remote Jobs

Discover fully remote job opportunities in the United States at USA Remote Jobs. Apply for roles like Software Developer, Customer Service Specialist, Project Manager, and more!

© 2025 Created by USA Remote Jobs. All rights reserved.