United States Remote (Global)

Sumo Logic, Inc. is hiring a Staff Machine Learning Engineer

Responsibilities

Collaborate with technical leaders to assess and integrate next-generation agentic AI platforms such as Claude, LangChain, and AWS Bedrock.
Design and deploy multi-agent AI solutions for security operations use cases, covering threat detection, triage, investigation, and automated response.
Lead the development of core agent components including planning modules, execution logic, tool integration, memory systems, and persistent workflows.
Develop comprehensive evaluation frameworks for AI agents using offline and online testing, curated datasets, synthetic data, and human or LLM-based assessment.
Oversee fine-tuning and alignment of large language models to enhance accuracy and reasoning in security and observability domains.
Build scalable infrastructure for AI operations, including inference management, latency reduction, cost efficiency, and system monitoring.
Work closely with product, security, and data teams to transition AI prototypes into production-grade customer-facing systems.
Guide and mentor AI engineering teams in the development of agentic systems and advanced LLM applications.
Establish standards for safety, reliability, evaluation, and monitoring of AI agents in production environments.
Serve as a technical leader in uncertain or complex domains, defining strategy, decomposing challenges, and coordinating cross-team execution.

Work Arrangement

Remote (Worldwide)

Team

Lead and partner with fellow leadership members and teams

Other

Must be authorized to work in the United States at the time of hire and for the duration of employment.
We are unable to provide non-immigrant visa sponsorship for this role.

Not offering non-immigrant visa sponsorship

Required Skills

PythonPytorchAirflowDockerKubernetesAWSMachine LearningDistributed Systems

About company

Sumo Logic helps make the digital world secure, fast, and reliable by unifying critical security and operational data through its Intelligent Operations Platform. Built to address the increasing complexity of modern cybersecurity and cloud operations challenges, the company empowers digital teams to move from reaction to readiness—combining agentic AI-powered SIEM and log analytics into a single platform to detect, investigate, and resolve modern challenges. The platform enables organizations to protect against security threats, ensure reliability, and gain powerful insights into their digital environments.

All jobs at Sumo Logic, Inc. Visit website

Job Details

Department Software Development

Category data

Posted 5 months ago