posted Jun 12

Senior Software Engineer, Training, ML Infrastructure

AWS Azure Cloud Distributed Systems Google Cloud Platform Python PyTorch Tensorflow Go senior

Job Location: Bay Area, California

Salary: $192,000 - $243,000 a year

Job Description

• Report into the Head of ML Training • Design distributed systems tailored for machine learning workloads of different sizes, considering factors such as scalability, fault tolerance, and resource use • Optimize system performance by identifying bottlenecks and implementing efficient algorithms for distributed training • Increase training efficiency of different neural network architectures • Improve the developer experience and performance of our scalable ML framework • Collaborate with machine learning engineers and other partners to understand their requirements and provide infrastructure support for their experiments and projects.

Qualifications

• BS in Computer Science, Math, or 5+ years equivalent real-world experience • Python or C++ industry experience • Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms • Practical familiarity using ML accelerator profiling tools to uncover performance bottlenecks • Understand ML compiler infrastructure, such as of HLO and MLIR • Familiarity with cloud computing platforms including AWS, Azure, GCP and experience deploying and managing distributed systems in cloud environments

Benefits

• Health and wellness: Our people are at the heart of everything we do. At Waymo, you can enjoy top-notch medical, dental and vision insurance, mental wellness support, a Flexible Spending Account (FSA), a Health Saving Account (HSA), and special wellness programs. • Financial wellness: Your financial peace of mind is important to us. At Waymo, we offer competitive compensation, bonus opportunities, equity, a generous 401(k) plan, 1-on-1 financial coaching, a 529 College Savings Plan and lots of other perks and employee discounts. • Flexibility and time off: Take the time you need to relax and recharge. Enjoy the flexibility to work from another location for four weeks per year. We support an on-site or hybrid work model and offer remote working opportunities, paid time off, bereavement, sick, and parental leave. • Supporting families: When it comes to growing your family or caring for your loved ones, you have our full support. Enhanced leave options include paid parental leave (birthing parent gets 24 weeks of paid leave with up to 4 weeks of additional leave before their due date, and non-birthing parent gets 18 weeks of paid leave), and 20 days of subsidized backup childcare or adult/elder care.. Access to fertility care or adoption support as you grow your family. • Community and personal development: At Waymo, you'll find a range of opportunities to grow, connect, and give back. We offer education reimbursement, personal and professional development, mentorship, and other ways to connect through Employee Resource Groups (ERGs), other internal groups, and even time off to volunteer. • Cool perks: Access to Google offices, cafes, wellness centers, massages, and so much more. To support your wellbeing at home, you can enjoy at-home fitness and cooking classes, and more.

logo
Company
Spring Health
Post Date
New
Title
Machine Learning Engineer II
Type
$120,000 - $150,250 a year
Location
Remote
logo
Company
Zocdoc
Post Date
New
Title
Engineering Manager
Type
$190,000 - $255,000 a year
Location
Manhattan, New York
logo
Company
SupportLogic
Post Date
New
Title
Machine Learning Engineer
Type
$120,000 - $170,000 a year
Location
Remote
logo
Company
WelbeHealth
Post Date
New
Title
Outreach & Enrollment Coordinator
Type
$25 - $30 a year
Location
Bay Area, California
logo
Company
Cruise
Post Date
New
Title
Senior Machine Learning Engineer II, Behaviors
Type
$161,200 - $237,000 a year
Location
San Francisco, California