Stord is The Consumer Experience Company, powering seamless checkout through delivery for today's leading brands. Stord is growing rapidly and is on track to double its revenue in the next 18 months. To meet and exceed this target, we are strategically scaling teams across the entire company and seeking energetic experts to help us achieve our mission.
By combining comprehensive commerce-enablement technology with high-volume fulfillment services, Stord provides brands a platform to compete with retail giants. Stord manages over $10 billion of commerce annually through its fulfillment, warehousing, transportation, and operator-built software suite including OMS, Pre- and Post-Purchase, and WMS platforms. Stord is leveling the playing field for all brands to deliver the best consumer experience at scale.
With Stord, brands can increase cart conversion, improve unit economics, and drive sustained customer loyalty. Stord’s end-to-end commerce solutions combine best-in-class omnichannel fulfillment and shipping with leading technology to ensure fast shipping, reliable delivery promises, easy access to more channels, and improved margins on every order.
Hundreds of leading DTC and B2B companies like AG1, True Classic, Native, Seed Health, quip, goodr, Sundays for Dogs, and more trust Stord to deliver industry-leading consumer experiences on every order. Stord is headquartered in Atlanta with facilities across the United States, Canada, and Europe. Stord is backed by top-tier investors including Kleiner Perkins, Franklin Templeton, Founders Fund, Strike Capital, Baillie Gifford, and Salesforce Ventures.
We are seeking an experienced and strategic SRE Manager to lead our growing site reliability engineering team. This role combines deep technical expertise with strong leadership skills to drive the reliability, scalability, and performance of our production systems at scale. You'll be responsible for building and mentoring a high-performing team of SREs while setting the technical vision and strategy for our infrastructure platform.
Team Leadership & People Management
Build, lead, and scale a team of SREs
Provide career development, mentoring, and technical guidance to team members
Establish hiring practices and interview processes to attract top SRE talent
Foster a culture of reliability, automation, and continuous improvement
Manage team performance, conduct reviews, and facilitate professional growth
Define on-call practices and ensure sustainable operational load across the team
Strategic Planning & Technical Vision
Develop and execute the long-term infrastructure and reliability strategy
Establish reliability standards, SLOs, and engineering practices across the organization
Drive architectural decisions for scalable, multi-region infrastructure on GCP
Partner with engineering leadership to align infrastructure roadmap with business objectives
Evaluate and introduce new technologies, tools, and practices to improve team effectiveness
Lead capacity planning and infrastructure cost optimization initiatives
Cross-Functional Collaboration
Work closely with development teams to embed reliability practices into the software development lifecycle
Collaborate with Product, Security, and Compliance teams on infrastructure requirements
Represent the SRE team in engineering leadership meetings and strategic planning sessions
Drive incident response processes and lead major incident coordination
Establish SLAs and communication protocols with internal stakeholders
Technical Excellence & Oversight
Maintain hands-on technical involvement in critical infrastructure decisions
Review and approve major architectural changes and infrastructure proposals
Ensure implementation of best practices for Infrastructure as Code, monitoring, and automation
Drive the adoption of chaos engineering, disaster recovery, and business continuity practices
Oversee security hardening and compliance efforts across infrastructure systems
Leadership & Management Experience
3+ years of experience managing and leading technical teams (5+ people)
Proven track record of building and scaling SRE, platform, or infrastructure teams
Experience with hiring, performance management, and career development of technical staff
Strong ability to balance technical hands-on work with people management responsibilities
Experience leading incident response and managing high-stakes technical escalations
Technical Expertise
8+ years of experience in site reliability, platform engineering, or infrastructure roles
Deep expertise with cloud platforms, particularly Google Cloud Platform (GCP)
Strong proficiency in multiple programming languages (Python, Go, Java, etc.)
Extensive experience with containerization (Docker), orchestration (Kubernetes), and microservices
Expert-level knowledge of Infrastructure as Code (Terraform, CloudFormation, Pulumi)
Advanced understanding of monitoring, observability, and distributed systems architecture
Experience with CI/CD pipelines, automation frameworks, and DevOps practices
Strategic & Communication Skills
Ability to translate technical concepts into business value and communicate with executive leadership
Experience developing technical roadmaps and long-term strategic planning
Strong project management skills and experience with agile/scrum methodologies
Excellent written and verbal communication skills for technical and non-technical audiences
Experience with budget management and vendor relationships
Experience managing teams in high-growth startup or scale-up environments
Background in managing distributed teams and remote-first engineering cultures
Advanced GCP certifications (Professional Cloud Architect, Cloud DevOps Engineer)
Experience with multi-cloud architectures and cloud migration strategies
Knowledge of modern data infrastructure (BigQuery, streaming platforms, data pipelines)
Previous experience as a technical lead or principal engineer before transitioning to management
Familiarity with functional programming languages and event-driven architectures