This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Sr. Software Engineer in the United States.
As a Sr. Software Engineer, you will play a critical role in designing, building, and maintaining scalable observability platforms that provide deep insights into system performance and reliability. You will collaborate closely with cross-functional engineering teams, integrating real-time metrics pipelines, time-series databases, and monitoring tools to optimize distributed systems. This role requires a strong sense of ownership, problem-solving skills, and the ability to influence platform strategy while contributing to high-impact, global-scale solutions. You will work in a flexible remote or hybrid environment, shaping the future of observability and ensuring that operational data is reliable, actionable, and accessible to internal teams. Your work will directly improve system reliability, operational efficiency, and the overall developer experience.
· Architect, implement, and maintain observability platforms using tools such as Prometheus, Grafana, OpenTelemetry, InfluxDB, and AWS CloudWatch.
· Design real-time metrics pipelines and time-series data processing systems to support monitoring and analytics.
· Develop scalable APIs and services to expose observability data to internal teams and applications.
· Integrate observability tooling into CI/CD pipelines and deployment workflows for enhanced operational visibility.
· Collaborate with SRE, DevOps, and application teams to define SLIs, SLOs, alerting strategies, and performance dashboards.
· Troubleshoot complex distributed systems and contribute to reducing mean time to detect (MTTD) and resolve (MTTR) incidents.
· Stay current with emerging observability technologies and advocate for adoption of best practices.
Requirements
· 4+ years of experience in software engineering, platform engineering, or SRE roles.
· Deep expertise in observability tools and practices (e.g., Prometheus, Grafana, OpenTelemetry, Splunk, Coralogix).
· Strong programming skills in Python or similar languages.
· Experience designing fault-tolerant systems and optimizing performance at scale.
· Proficiency with real-time systems, metrics collection, and time-series databases (e.g., InfluxDB, AWS Timestream).
· Familiarity with AWS services, especially CloudWatch, Lambda, and related infrastructure.
· Experience with infrastructure as code (Terraform, CloudFormation) and container orchestration (Kubernetes).
· Solid understanding of distributed systems, microservices, and event-driven architectures.
· Excellent communication skills and a collaborative mindset.
· Nice to have: knowledge of APM tools, distributed tracing, PromQL/LogQL, or contributions to open-source observability projects.
Benefits
· Competitive salary ranging from $89,620 to $156,340 depending on location and experience.
· 401(k) with employer match.
· Comprehensive medical, dental, and vision coverage.
· Life, disability, accident, and illness insurance.
· Paid holidays, parental leave, and flexible time off.
· Health and dependent care flexible spending accounts and wellness benefits.
· Opportunities for professional growth, development, and certifications.
· Flexible remote or hybrid work arrangements depending on office proximity.
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the three candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.
Thank you for your interest!
#LI-CL1
Loading similar jobs...
Discover fully remote job opportunities in the United States at USA Remote Jobs. Apply for roles like Software Developer, Customer Service Specialist, Project Manager, and more!