Staff Software Engineer - ML Acceleration

July 31

Description

• Analyze and profile ML models to identify performance bottlenecks
• Collaborate with ML researchers to balance model accuracy and speed
• Develop efficient model export and optimization solutions

Requirements

• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
• 5+ years of experience (including experience with GPU programming and optimization)
• Strong programming skills in C++ and Python
• Proven experience in GPU programming and optimization
• Familiarity with deep learning frameworks, especially PyTorch
• CUDA programming
• Triton language for GPU kernels
• PyTorch optimization techniques
• TensorRT implementation
• ONNX model conversion and deployment
• Custom GPU kernel development
• Deep understanding of GPU architectures and performance optimization
• Strong analytical and problem-solving skills
• Excellent verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders

Benefits

• Equal opportunity workplace
• Commitment to diversity and inclusion
