Description
* Work closely with product teams to build production-grade solutions that launch models serving millions of customers in real time.
* Work alongside the Foundation Model Research team to prototype and develop inference for cutting-edge model architectures.
* Build tools to understand inference bottlenecks across different hardware and use cases.
* Mentor and guide engineers in the organization.