San Francisco, California, United States

Together AI is hiring a Staff Engineer

Responsibilities

  • Own the evolution of the product runtime and application architecture, ensuring systems are resilient, scalable, and production-ready
  • Drive structural improvements within the existing web application, improving system boundaries, performance, and long-term maintainability
  • Lead initiatives that strengthen production stability, performance, and reliability across the application layer
  • Define and evolve runtime architecture patterns for server-side behavior, request handling, and scaling
  • Improve deployment safety, release confidence, and environment consistency across product engineering
  • Establish strong observability practices across the application stack, including logging, metrics, tracing, and debuggability
  • Identify and address structural bottlenecks that slow development or introduce operational risk
  • Partner with the API Platform team to help extract and separate API responsibilities from the application layer
  • Collaborate with the UI Platform team on runtime performance, framework behavior, and production characteristics of the web stack
  • Drive performance optimization efforts across the application tier, including latency reduction, scaling behavior, and resource efficiency
  • Improve CI/CD architecture and operational maturity to support fast, safe iteration
  • Mentor engineers and influence architectural direction across teams

Requirements

  • 8+ years of experience building and operating large-scale production systems
  • Strong hands-on experience with Node.js in production environments
  • Strong proficiency with TypeScript in large, complex codebases
  • Proven experience evolving real-world systems into stable, scalable platforms
  • Deep understanding of system design, performance, and reliability at production scale
  • Experience improving runtime stability, performance, and operational maturity of server-side systems
  • Experience working across application and infrastructure layers
  • Demonstrated ability to drive architectural change across teams without formal authority
  • Strong experience with CI/CD systems, deployment workflows, and production operations
  • Experience establishing or improving observability (logging, metrics, tracing) in production environments
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or equivalent practical experience

Nice to Have

  • Experience with Next.js and/or React in large-scale production environments
  • Experience evolving monolithic web applications into modular or service-oriented architectures
  • Experience operating production workloads in Kubernetes-based environments
  • Experience optimizing SSR performance and server-side runtime behavior
  • Experience improving system performance under high request volume and growth
  • Experience with Golang
Required Skills
Next.jsGo
About company
Together AI
Together AI is a research-driven artificial intelligence company that believes open and transparent AI systems will drive innovation. They are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models, and have contributed to leading open-source research, models, and datasets.
All jobs at Together AI Visit website
Job Details
Department Engineering
Category backend
Posted 4 months ago