Business Data Science & Analytics is OpenAI’s hub for applying data to growth, revenue, and go-to-market strategy. We leverage data and analytics to understand the business, identify new opportunities, and evaluate impact — all in service of OpenAI’s mission to maximize the benefits of AGI for all of humanity. We build reliable, centralized data models that power decision-making across Marketing, Sales, Partnerships, and more. Our work guides strategic investments, uncovers growth levers, and supports OpenAI’s most important business bets.
About the Role
We’re seeking a Marketing Data Engineer to take the lead in building the data pipelines and core marketing datasets that power OpenAI’s growth. These pipelines are essential to understanding and optimizing our marketing and partnership performance—helping us measure ROI, guide investment decisions, and accelerate adoption of our products around the world.
If you’re passionate about working with data that drives business outcomes and want to build systems from the ground up, this is the role. You’ll collaborate closely with Marketing, Partnerships, Finance, and Data Science teams—as well as the researchers and engineers behind ChatGPT—to shape how we measure and scale OpenAI’s reach.
In this role, you will:
Design, build, and manage pipelines that integrate marketing, and partnership data into our data warehouse.
Develop canonical datasets to track key business metrics such as spend, LTV, CAC, ROI, and incremental performance.
Partner with Marketing, Partnerships, Data Science, Finance, and Product teams to understand data needs and deliver scalable solutions.
Implement robust systems for data ingestion and processing across multiple channels.
Participate in data architecture and engineering decisions that define the foundation for marketing analytics.
Ensure the security, integrity, and compliance of data according to industry and company standards.
Have 3+ years of experience as a Data Engineer and 8+ years of overall software engineering experience (including data engineering).
Are proficient in Python, Scala, or Java for data engineering.
Have experience with distributed processing technologies (e.g., Hadoop, Flink) and distributed storage systems (e.g., HDFS, S3).
Are skilled with ETL orchestration tools such as Airflow, Dagster, or Prefect.
Have a solid understanding of Spark, including writing, debugging, and optimizing Spark code.
Bring familiarity with marketing data sources (e.g., ad platforms, attribution systems, CRM, web analytics).
Thrive in ambiguity, love to build from 0→1, and want your work to directly shape how OpenAI grows.