Software Engineer - Supercomputing Platform

Yesterday

Apply Now

Description

• Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. • Design and build resilient and optimized solutions for AI workloads on massive Computing Clusters. • Work closely with the training and inference teams to deliver high performance and reliability across storage, networking, and distributed computing designs. • Build the software stack to run massive-scale (thousands of GPUs), highly available supercomputing infrastructure. • Troubleshoot and resolve complex issues across hardware accelerated devices, networking, storage subsystems (local NVMe/Block Storage/NFS), OS, drivers and cloud environments, and automate detection and recovery processes. • Operate data-intensive workloads at petabyte-scale. • Increase the ease-of-use and self-serviceability of the compute platforms at Magic through top-notch documentation and developer workflow design. • Investigate and resolve incidents across security and availability.

Requirements

• Experience working with production GPU deployments, data-intensive applications, large-scale model training and HPC • Strong understanding of networking-, storage- and data-related technologies • Experience with GCP, AWS, Azure, OCI or similar cloud platforms • Strong software engineering skills • Strong IaC knowledge with extensive experience in Terraform, Pulumi, AWS CDK/CloudFormation or similar

Benefits

• Annual salary range: $100K - $550K • Equity is a significant part of total compensation, in addition to salary • 401(k) plan with 6% salary matching • Generous health, dental and vision insurance for you and your dependents • Unlimited paid time off • Visa sponsorship and relocation stipend to bring you to SF, if possible • A small, fast-paced, highly focused team

Apply Now

Similar Jobs

2 days ago

Pulley

11 - 50

Join Pulley as a Full-Stack Engineer to develop tools for startup equity management.

2 days ago

Meetsta

2 - 10

Seeking Full Stack Developer at Meetsta, focusing on React, TypeScript, and gRPC for social networking applications.

3 days ago

Meetsta

2 - 10

Join Meetsta as a Full Stack Developer, creating web and mobile applications. Contribute to building a dynamic social networking platform.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com