Description
- Create optimized implementations of ML workloads on Apple silicon, including the Neural Engine, GPU, and CPU.
- Collaborate with IP and SoC architecture teams to develop performance models and simulations of future hardware.
- Conduct performance studies to inform and validate architecture decisions.
- Collaborate with the system team to create high-level performance models of emerging ML techniques and analyze system architecture trade-offs.