Location: Remote. Must overlap with US Eastern time hours
Company: Introzy (subsidiary of Sanguine Technology Solutions)
Type: Full-time or contractor-to-hire
Sanguine is the parent company behind Introzy, our core product — a multi-app platform (including Funlz and Profylz) designed to unify networking, workflow, and productivity. We’re an early-stage company moving fast to deliver value, with a lean engineering team, clear priorities, and a culture that embraces AI as a core development partner.
We’re standardizing on a simple, vanilla, AI-friendly stack: React, Node.js with Express, PostgreSQL, AWS, and GitHub Actions for CI/CD. Our goal is to build reliable, scalable systems that are easy to extend and maintain while shipping quickly.
About Sanguine Technology Solutions
Sanguine Technology Solutions is the technology engine of the Sanguine portfolio, 2024 Inc. 5000 #94. Its mission is to build category-defining software that addresses high-friction, high-value problems for small and mid-sized businesses. From partner operations (Introzy) and intelligent surveys (Funlz) to AI-powered automation and custom SaaS, STS delivers tools that help businesses grow faster, with more clarity and less chaos.
We’re looking for a Senior DevOps Engineer who will own developer operations and platform reliability across Introzy’s product stack. You’ll lead how we run infrastructure on Render, design and evolve our observability and alerting, shape our CI/CD and release practices, and continuously improve internal developer experience so the engineering team can ship quickly and safely.
This is a hands-on, high-ownership role with direct impact on uptime, incident response, and overall engineering velocity. You’ll partner closely with engineering leadership to define standards, roadmaps, and guardrails for how we build and operate services.
Quality first: Reliability and observability are core product features, not afterthoughts.
Bias to action: We ship in small, safe increments with strong feedback loops.
AI as a teammate: We lean on AI tools to accelerate infra-as-code, runbooks, documentation, and debugging.
Simplicity over cleverness: We prefer straightforward infrastructure and pipelines that are easy to reason about and operate.
Collaboration: Dev, DevOps, and product work as one team; no "throw it over the wall" mentality.
Operational excellence: Incidents are opportunities to improve systems, runbooks, and defaults.
Infrastructure and Reliability (Render)
Own configuration and operation of services running on Render (web services, background jobs, workers, and scheduled tasks).
Implement and maintain scalable, cost-aware infrastructure patterns (autoscaling, right-sizing, environment strategy).
Ensure high availability and reliability through sane defaults, health checks, and capacity planning.
Infrastructure and Reliability (Render)
Own configuration and operation of services running on Render (web services, background jobs, workers, scheduled tasks).
Implement and maintain scalable, cost-aware infrastructure patterns (autoscaling, right-sizing, environment strategy).
Ensure high availability and reliability through health checks, capacity planning, and sane defaults.
Monitoring, Logging, and Alerting
Design, implement, and maintain observability across the stack (metrics, logs, traces).
Configure and tune logging pipelines and alerting rules to minimize noise while catching real issues early.
Own SLOs/SLIs for key services; work with engineering to define and track them.
Monitoring, Logging, and Alerting
Design, implement, and maintain observability across the stack (metrics, logs, traces).
Configure and tune logging pipelines and alerting rules to minimize noise while catching real issues early.
Define and track SLOs/SLIs for key services in partnership with engineering.
Tooling and Integrations (Sentry, Segment, etc.)
Configure and maintain Sentry for error monitoring, release tracking, and performance insights.
Configure and maintain Segment for analytics events, destinations, and environments.
Standardize patterns for how application teams instrument logs, metrics, events, and traces.
Tooling and Integrations (Sentry, Segment, etc.)
Configure and maintain Sentry for error monitoring, release tracking, and performance insights.
Configure and maintain Segment for analytics events, destinations, and environments.
Standardize patterns for how teams instrument logs, metrics, events, and traces.
CI/CD and Deployment Pipelines (GitHub)
Own GitHub Actions workflows for build, test, and deploy.
Improve pipeline reliability, speed, and developer feedback (caching, parallelization, reusable workflows).
Implement safe deployment practices (feature flags, canary / phased rollouts, rollbacks).
Maintain consistent branching, tagging, and release practices in collaboration with engineering.
CI/CD and Deployment Pipelines (GitHub)
Own GitHub Actions workflows for build, test, and deploy.
Improve pipeline reliability, speed, and developer feedback (caching, parallelization, reusable workflows).
Implement safe deployment practices (feature flags, canary or phased rollouts, rollbacks).
Maintain consistent branching, tagging, and release practices in collaboration with engineering.
Security, Compliance, and Operational Hygiene
Implement and maintain secrets management and access controls for infra and CI.
Partner with engineering on security baselines, dependency management, and patching.
Contribute to SOC 2–aligned operational practices (logging, change management, incident response), as applicable.
Internal Developer Experience
Reduce friction in local and cloud dev environments (environment parity, simple setup, docs, scripts).
Maintain shared tooling and templates for new services, jobs, and pipelines.
Gather feedback from engineers and continuously improve internal workflows, tools, and documentation.
Infrastructure and hosting: Render, AWS (as needed), modern PaaS
Application: TypeScript, Node.js/Express, React, PostgreSQL
CI/CD: GitHub Actions
Observability: Sentry, logging/metrics/tracing tools (to be standardized)
Product and collaboration: Notion, Linear, Slack, Segment, and related tooling
5+ years in a DevOps, Site Reliability Engineering, or Platform Engineering role.
Experience managing cloud infrastructure and application runtimes (Render experience is a plus, but similar PaaS or IaaS is acceptable).
Strong understanding of Linux-based systems, containerization, and networking fundamentals.
Hands-on experience designing and running CI/CD systems (ideally GitHub Actions) and modern deployment practices.
Experience with logging, metrics, and tracing tools; able to design and evolve observability standards.
Proficient with Git and modern branching/review workflows.
Comfortable operating in a fast-paced, early-stage environment with high autonomy.
Experience in a high-growth B2B SaaS environment.
Prior exposure to SOC 2 or similar security/compliance frameworks.
Experience building internal tools, CLIs, or dashboards for engineering teams.
Familiarity with Render, Sentry, Segment, Notion, and Linear.
Experience leading incident response, postmortems, and reliability reviews.
Direct ownership of a critical layer of the Introzy platform with executive visibility.
Opportunity to define our reliability, observability, and deployment standards from the ground up.
Work closely with a small, senior team and ship improvements that meaningfully impact developer velocity and customer trust.
Chance to shape how we use AI to operate and improve infrastructure and developer workflows.
Loading similar jobs...
Discover fully remote job opportunities in the United States at USA Remote Jobs. Apply for roles like Software Developer, Customer Service Specialist, Project Manager, and more!