• Company Bobsled
  • Employment Full-time
  • Location 🇺🇸 United States nationwide
  • Submitted Posted 1 week ago - Updated 18 hours ago
<div class="content-intro"><p><span style="font-size: 12pt;"><strong>About Bobsled</strong></span></p><p><span style="font-size: 10pt;">Bobsled is building AI-powered analytics experiences that turn natural language into accurate, production-grade insights. Our mission is to enable enterprise customers to leverage the full power of AI and data agents, transforming how they access and act on their data. As we scale our AI product, we’re seeking hands-on specialists to ensure our customers’ deployments are robust, contextually tuned, and delivering measurable value.</span></p></div><p><span style="font-size: 12pt;"><strong>What You’ll Do</strong></span></p><ul><li style="font-size: 10pt;"><span style="font-size: 10pt;">Own the text-to-SQL accuracy problem end-to-end: design evals, iterate prompts, and improve retrieval/routing</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Build and operate the experimentation and evaluation loop (automatic evals, regression suites, dataset curation)</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Design pragmatic LLM application architectures (RAG, agent routing, tool-use orchestration) optimized for accuracy and latency</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Ship production-grade code and support deployments; instrument, monitor, and troubleshoot model behavior in real customer environments</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Partner closely with engineering and customers to improve semantic models, SQL generation, and data alignment</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Create feedback loops from users to systematically capture issues and convert them into measurable improvements</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Contribute to automation of environment provisioning and dev workflows to enable fast iteration</span></li></ul><p><span style="font-size: 
12pt;"><strong>What We’re Looking For</strong></span></p><ul><li style="font-size: 10pt;"><span style="font-size: 10pt;">2+ years in ML/AI, data-focused engineering, or data science roles building production data or AI systems</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Demonstrated experience tuning LLM applications: prompt engineering, evals, retrieval, agent design, or similar</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Strong hands-on coding in Python or TypeScript (TypeScript familiarity a plus; willingness to work across the stack required)</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">ML engineering mindset beyond notebooks: testing, CI, observability, performance, and deployment in production</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Comfort with SQL and complex data modeling; familiarity with data warehouses and pipelines</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Pragmatic, product-oriented approach: optimize for impact over novelty; complement existing systems rather than rebuild from scratch</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Ability to design experiments, quantify improvements, and communicate trade-offs clearly</span></li></ul><p><span style="font-size: 12pt;"><strong>Nice to Have</strong></span></p><ul><li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience with text-to-SQL systems, semantic layers, or BI/analytics workflows</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Exposure to RAG frameworks, knowledge graphs, vector stores, and evaluation tooling</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Prior work in analytics engineering or data engineering environments</span></li></ul><p><span style="font-size: 12pt;"><strong>Success Looks Like</strong></span></p><ul><li style="font-size: 10pt;"><span 
style="font-size: 10pt;">Measurable improvements in text-to-SQL accuracy across target datasets and partners</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Reliable eval pipeline and regression suite running in CI to catch degradations</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Clear architecture and documentation for context/agent systems that others can contribute to</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Short feedback cycles with partners leading to fast, meaningful product wins</span></li></ul><p><span style="font-size: 12pt;"><strong>Compensation</strong></span></p><ul><li style="font-size: 10pt;"><span style="font-size: 10pt;">Competitive salary and meaningful equity</span></li><li style="font-size: 10pt;"><span style="font-size: 10pt;">Comprehensive benefits</span></li></ul>
