This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Principal Software Engineer, Infrastructure & Operations in New York (USA).
As a Principal Software Engineer in Infrastructure & Operations, you will lead the architecture, development, and scaling of cloud-native infrastructure supporting enterprise-grade SaaS platforms. You will design resilient systems for compute, networking, data, and CI/CD pipelines while driving operational excellence and automation. This role requires close collaboration across engineering, security, and product teams to ensure high availability, observability, and performance at scale. You will mentor engineers, set technical standards, and influence platform-wide strategies that enable innovation and developer velocity. Working in a fast-paced, growth-oriented environment, you’ll integrate emerging technologies—including AI tools—to optimize workflows and improve efficiency. This position is ideal for someone who thrives in hands-on leadership roles with broad technical impact.
Responsibilities
· Lead the architecture and development of scalable infrastructure spanning compute, data, networking, and CI/CD.
· Design and maintain infrastructure-as-code systems using Kubernetes, Terraform, and Vault.
· Build and operate high-availability foundations including messaging systems (Kafka, SQS), databases (Postgres, ClickHouse, Redis), and service networking.
· Guide observability and reliability practices, including post-incident reviews and process improvements.
· Collaborate with security, product, and data teams to align infrastructure initiatives with business priorities.
· Drive platform-wide strategies for automation, operational efficiency, and developer productivity.
· Mentor engineers and cultivate a culture of ownership, technical rigor, and continuous learning.
· Evaluate and integrate new technologies with a focus on long-term scalability and resilience.
Requirements
· 10–15+ years of experience scaling backend and infrastructure systems in cloud environments (AWS, GCP).
· Deep expertise in distributed systems, infrastructure automation, and site reliability engineering (SRE).
· Hands-on proficiency with Kubernetes, CI/CD pipelines, Terraform, and observability tooling.
· Strong experience with messaging systems, databases, and scripting languages (Python, Go, etc.).
· Proven ability to lead cross-team initiatives and influence platform architecture decisions.
· Aptitude for solving complex problems in ambiguous and fast-paced environments.
· Excellent communication skills and demonstrated ability to mentor and collaborate across teams.
· Willingness to explore and integrate AI tools into infrastructure workflows.
Preferred Qualifications
· Experience in regulated or financial cloud environments.
· Contributions to open-source infrastructure projects.
· Familiarity with event-driven microservices or distributed SQL.
· Knowledge of data observability practices.
Benefits
· Competitive salary range ($239,000 – $281,000; $255,000 – $300,000 in NY/NV/CA markets).
· Participation in company stock plan for all employees.
· Unlimited vacation and flexible time-off policy.
· Education and wellness reimbursements.
· $0-cost employee insurance plans.
· Remote-first work with collaboration opportunities in NY, Reno, and San Ramon.
· Career development and mentorship in a high-performing, collaborative culture.
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the three candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.
Thank you for your interest!