Description
We develop compiler technology to accelerate deep learning applications for Apple products. In this role, you will be empowered to:
- Architect and develop the compiler for Apple's proprietary Neural Engine Accelerator architecture, enabling inference of deep learning networks on this architecture with an emphasis on performance and power
- Bring up new silicon and add compiler support for its hardware features
- Bring the compiler code to production quality and enable a wide range of deep learning applications for internal clients and third-party developers
- Evaluate existing hardware blocks and work closely with the platform architecture team to define new hardware features and review hardware specifications
- Work with the micro-architecture design team to understand the functional and performance goals of the design
- Architect and lead complex compiler features