Apple · Cupertino

On-device ML Infrastructure Engineer (ML Modeling Semantics & Representation)

(f/m/d) · 24.3.2025

Description

As an engineer in this role, you will focus primarily on the interplay between higher-level ML authoring frameworks (such as PyTorch, JAX, and MLX) and Apple’s on-device ML infrastructure. The role requires an understanding of ML modeling (architectures, training vs. inference trade-offs, etc.) and ML deployment optimizations (compression, distillation, quantization, hardware optimizations, etc.). We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling, and analysis. ML representation, translation, and optimization form the entry point of this infrastructure stack.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related discipline.
  • High proficiency in Python programming; familiarity with C++ is required.
  • Proficiency in at least one ML authoring framework, such as PyTorch, TensorFlow, JAX, or MLX.
  • Strong understanding of ML fundamentals, including common architectures such as Transformers.

Preferred Qualifications

  • Hands-on experience working with and/or developing ML optimization techniques such as quantization.
  • Experience with accelerators; GPU programming is a strong plus.
  • Experience with any on-device ML stack, such as TFLite or ONNX.
  • Experience with the MLIR compiler stack is a strong plus.
  • Excellent communication and interpersonal skills, including the ability to communicate with cross-functional audiences.
