The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societal benefit.
About the Role
As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large-scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines, collaborate closely with researchers to translate requirements into robust systems, and harden pipelines that serve as the backbone for Sora’s rapid iteration cycles.
We’re looking for engineers who are detail-oriented, have strong experience with distributed systems, and excel at building reliable infrastructure in high-stakes environments.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security.
Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
Partner with researchers to deeply understand requirements and translate them into production-ready systems.
Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation.
You might thrive in this role if you:
Have strong experience with distributed systems and large-scale infrastructure, along with a strong interest in data.
Are detail-oriented and bring rigor to building and maintaining reliable systems.
Demonstrate excellent software engineering fundamentals and organizational skills.
Are comfortable with ambiguity and rapid change.