Job Highlights
AI-extracted key information
The Staff Software Engineer, Ads ML Inference Infrastructure at Pinterest is responsible for leading the development of next-generation model inference and feature serving systems that enhance the company's monetization efforts. This role involves designing low-latency, high-throughput inference pipelines and collaborating with cross-functional teams to integrate new technologies and optimize performance.
Salary Range
$208k - $365k/year
Experience Level
Senior Level
Benefits & Perks
Education Requirements
bachelor degree
Staff Software Engineer, Ads ML Inference Infrastructure
Posted 1 weeks ago
Full-Time
Employment Type
Remote
Work Location
$208,454 - $364,795
per year
About This Role
About Pinterest
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.
Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the
flexibility
to do your best work. Creating a career you love? It’s Possible.
Staff Software Engineer, Ads ML Inference Infrastructure
The Ads ML Inference Infra team owns the online inference and feature serving systems that power real-time model scoring and delivery for all Ads models at Pinterest. The team is looking for a staff engineer with strong hands-on experience in large-scale ML inference systems, as well as capabilities in solving ambiguous technical problems and driving strategic, cross-functional efforts.
What You’ll Do
Lead and drive efforts to build next-generation
model inference and feature serving systems
that power up to
100x larger models
and directly uplevel Pinterest’s monetization business.
Design and optimize
low-latency, high-throughput inference pipelines
to meet strict SLOs while improving
performance, efficiency, and cost
.
Partner with Ads ML and product teams to
productionize new model architectures
(including LLMs and multi-stage ranking models) and scale them reliably to global traffic.
Evolve the
online feature platform
(feature computation, caching, and retrieval) to improve coverage, freshness, and consistency for Ads models.
Evaluate and integrate new technologies (e.g.,
GPU acceleration, model compression, Triton, vLLM, Dynamo
) to advance our inference stack.
Build strong partnerships with other infra and ML teams to improve
end-to-end reliability, observability, and developer velocity
for Ads ML.
Mentor and coach other engineers, guiding them through technical decisions, system design, and career development.
What We’re Looking For
BS (or higher) degree in
Computer Science
or a related field.
~8+ years of relevant industry experience designing and operating
large-scale, production ML or distributed infra systems
.
Deep knowledge of at least one programming language (
Java, C++, Python
).
Deep experience with
distributed systems or recommendation / ads serving infrastructure
(e.g., request routing, online storage, caching, feature serving, APIs).
Hands-on experience with at least one deep learning framework (
PyTorch
or
TensorFlow
) and bringing models from offline experimentation to production.
[Preferred] Experience with
model / hardware accelerator libraries
(e.g., CUDA, quantization, distillation, low-precision inference).
[Preferred] Experience with
inference optimization and serving frameworks
such as
Triton, vLLM, or Dynamo
.
Proven track record of
leading complex projects
, setting technical direction, and
collaborating across functions and orgs
; experience mentoring and coaching other engineers.
In-office Requirement Statement
We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
This role will need to be in the office for in-person collaboration 1-2 times per week and therefore needs to be in a commutable distance from one of the following offices Palo Alto, CA; San Francisco, CA; Seattle, WA.
Relocation Statement
This position is not eligible for relocation assistance. Visit our
PinFlex
page to learn more about our working model.
8
At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.
Information regarding the culture at Pinterest and benefits available for this position can be found
here
.
US based applicants only
$208,454
—
$364,795 USD
Our Commitment To Inclusion
Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please complete
this form
for support.
Compensation
$208,454 - $364,795
Annual salary
Ready to Apply?
Click the button below to submit your application directly to Pinterest. Make sure your resume is up to date and highlights relevant experience for this role.
Apply Now at PinterestApply to Multiple Jobs with AI
Let our AI automatically apply to hundreds of remote jobs on your behalf. Just upload your resume and set your preferences.
500+
Jobs Applied
24/7
Auto-Apply
5 min
Setup Time
You Might Also Like
At Talkspace, we are committed to fostering a diverse, equitable, inclusive, and belonging-centered workplace where everyone can thrive while making a...
Who we are At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of th...
