AI Software Engineer
Company: Zoom
Location: Seattle
Posted on: April 1, 2026
|
|
|
Job Description:
AI Software Engineer What you can expect The AI Infra team at
Zoom is dedicated to building a world-class inference
infrastructure that powers all of Zoom’s AI services. Our mission
is to deliver high efficiency, scalability, and cost optimization
across a wide range of AI applications, including large language
models (LLM), vision-language models (VLM), automatic speech
recognition (ASR), and machine translation. We focus on creating a
seamless collaboration between small and large models, ensuring
cost-effective, privacy-preserving, and high-quality AI services
for our customers. About the Team As an AI Software Engineer on
Zoom’s AI Infra team, you will design, optimize, and scale the
runtimes and services that power our AI models. Your work will
directly improve efficiency, reduce latency, and lower costs across
Zoom’s AI stack, ensuring reliable, high-performance AI experiences
for millions of users. Responsibilities Develop and optimize AI
runtimes for LLMs, ASR, and MT systems with a focus on performance
and cost efficiency. Apply GPU-level optimization techniques
including CUDA , kernel fusion, and memory throughput improvements.
Implement inference optimizations such as TorchCompile, graph
optimization, KV cache, and continuous batching. Build scalable,
highly available infrastructure services to support
enterprise-grade AI workloads. Optimize models for edge devices
(laptops , PCs and mobile devices ) as well as large-scale cloud
deployments. Continuously improve latency, throughput, and
efficiency across serving pipelines. Rapidly integrate and optimize
new industry models to stay ahead in AI infrastructure. What we’re
looking for Track record of building scalable, reliable AI
infrastructure under real-world production constraints. Strong
expertise in GPU programming and optimization ( CUDA , kernel-level
development). Deep e xperience with transformer-based models and
inference frameworks (vLLM, TensorRT-LLM, SGLang, ONNX Runtime).
Proficiency in Python and C++ (Java is a plus). Hands-on experience
with PyTorch (TorchCompile, graph-level optimization) and/or
TensorFlow. Knowledge of low-level hardware concepts (GPU memory
hierarchy, caching, vectorization). Familiarity with cloud
platforms (AWS, GCP, Azure) and AI deployment tools (Docker,
Kubernetes, MLflow). Salary Range or On Target Earnings: Minimum:
Maximum: In addition to the base salary and/or OTE listed Zoom has
a Total Direct Compensation philosophy that takes into
consideration; base salary, bonus and equity value. Note: Starting
pay will be based on a number of factors and commensurate with
qualifications & experience. We also have a location based
compensation structure; there may be a different range for
candidates in this and other locations. Ways of Working Our
structured hybrid approach is centered around our offices and
remote work environments. The work style of each role, Hybrid,
Remote, or In-Person is indicated in the job description/posting.
Benefits As part of our award-winning workplace culture and
commitment to delivering happiness, our benefits program offers a
variety of perks, benefits, and options to help employees maintain
their physical, mental, emotional, and financial health; support
work-life balance; and contribute to their community in meaningful
ways. Click Learn for more information. About Us Zoomies help
people stay connected so they can get more done together. We set
out to build the best collaboration platform for the enterprise,
and today help people communicate better with products like Zoom
Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and
Zoom Webinars. We’re problem-solvers, working at a fast pace to
design solutions with our customers and users in mind. Find room to
grow with opportunities to stretch your skills and advance your
career in a collaborative, growth-focused environment. Our
Commitment? At Zoom, we believe great work happens when people feel
supported and empowered. We’re committed to fair hiring practices
that ensure every candidate is evaluated based on skills,
experience, and potential. If you require an accommodation during
the hiring process, let us know—we’re here to support you at every
step. We welcome people of different backgrounds, experiences,
abilities and perspectives including qualified applicants with
arrest and conviction records and any qualified applicants
requiring reasonable accommodations in accordance with the law. If
you need assistance navigating the interview process due to a
medical disability, please submit an Accommodations Request Form
and someone from our team will reach out soon. This form is solely
for applicants who require an accommodation due to a qualifying
medical disability. Non-accommodation-related requests, such as
application follow-ups or technical issues, will not be addressed.
Think of this opportunity as a marathon, not a sprint! We're
building a strong team at Zoom, and we're looking for talented
individuals to join us for the long haul. No need to rush your
application – take your time to ensure it's a good fit for your
career goals. We continuously review applications, so submit yours
whenever you're ready to take the next step.
Keywords: Zoom, Bellingham , AI Software Engineer, IT / Software / Systems , Seattle, Washington