Mountain View, California, United States USD 174,000 - 252,000 Yearly

Google is hiring a Senior Software Engineer

Google is hiring a Senior Software Engineer to join a horizontal machine learning infrastructure and efficiency team. You will support the training framework for our foundation recommender model and its customers, with a mission to accelerate product innovations through ML for recommendations and user modeling.

What You'll Do

  • Architect and implement the transition from data-parallel to model-parallel training paradigms.
  • Design and manage large-scale training runs across multi-pod environments, maximizing data center network bandwidth and minimizing communication bottlenecks.
  • Research and integrate transformer model optimizations and novel architectural variants to reduce training time and resource consumption.
  • Write and optimize low-level model code, including custom Pallas kernels, to get maximum performance out of the hardware.
  • Work cross-functionally with the team and the Kernel optimization team to co-design and implement compiler-level optimizations that accelerate model execution.

What We're Looking For

  • Bachelor’s degree or equivalent practical experience.
  • 5 years of experience programming in Python or C++.
  • 3 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

Nice to Have

  • Master’s degree or PhD in Computer Science, Machine Learning, Computer Engineering, or a related technical field.
  • Experience scaling machine learning models (e.g., large language models (LLMs) or foundation models), including managing the complexities of transitioning architectures from data-parallel to model-, tensor-, or pipeline-parallel configurations.
  • Experience with deep learning frameworks (e.g., JAX, PyTorch, or TensorFlow), including a track record of contributing to or modifying their core internals to support novel and emerging use cases.
  • Experience with co-designing hardware-aware optimizations to accelerate model execution.
  • Knowledge of machine learning compilers (e.g., Accelerated Linear Algebra (XLA) or Multi-Level Intermediate Representation (MLIR)).

Technical Stack

  • Python, C++
  • JAX, PyTorch, TensorFlow
  • XLA, MLIR

Team & Environment

This role is part of the RecML team within Core ML's Applied ML organization.

Benefits & Compensation

  • Base salary range: $174,000-$252,000
  • Equity included

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.

Required Skills
Python, C++, JAX, PyTorch, TensorFlow, XLA, MLIR, ML Infrastructure, Model Deployment, Model Evaluation, Optimization, Data Processing, Debugging
About company
Google
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Google Cloud accelerates every organization’s ability to digitally transform its business and industry, delivering enterprise-grade solutions that leverage Google’s cutting-edge technology.
Job Details
Department: Software Development
Category: Data
Posted: 2 months ago