Description
As an engineer in this role, you will be primarily focused on the ingestion and optimization of ML programs from different authoring frameworks (such as PyTorch) into CoreML using a combination of graph capture, conversion, and compilation pipelines.
We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. The ML model representation and frontend compilation is the entrypoint to this stack.
KEY RESPONSIBILITIES:
- Develop technologies to quickly onboard new ML models to our on-device stack, including contributions to ML authoring frameworks.
- Understand different ML operations, architectures, and graph representations in different authoring frameworks. Keep abreast of latest innovations in this space.
- Architect and build CoreML’s model representation that can efficiently represent program semantics from the authored frameworks, while allowing for peak execution performance.
- Define and develop optimizations such as quantization, operator transformations, fusions, etc. to make models more amenable to efficient on-device deployment