Apple · Austin

AIML - Machine Learning Engineering, Machine Learning Platform and Infrastructure

(f/m/d) · 1/15/2025

Description

Work closely with product teams to build production grade solutions to launch models serving millions of customers in real time. * Work along side Foundation Model Research team to prototype and develop inference for cutting edge model architectures. * Build tools to understand bottlenecks in Inference for different hardwares and use cases. * Mentor and guide engineers in the organization.

Qualifications

  • 8+ years of experience leading and driving complex, ambiguous projects.
  • Strong industry background and experience in ML technologies (LLMs, Machine Learning, NLP, Information Retrieval, Statistics).
  • Rich experience with high throughput services particularly at supercomputing scale.
  • Proficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker etc.
  • Proficient in building and maintaining systems written in modern languages (eg: Golang, python)

Preferred Qualifications

  • Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.
  • Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
  • Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc.

Application

View listing at origin and apply!