Solutions Architect - HPC AI Infiniband Network Engineering

October 9

Apply Now
Logo of NVIDIA

NVIDIA

GPU-accelerated computing • artificial intelligence • deep learning • virtual reality • gaming

10,000+

Description

• Primary responsibilities will include building and validating AI/HPC infrastructure for new and existing customers. • Support operational and reliability aspects of large-scale AI clusters with a focus on performance at scale, real-time monitoring, logging and alerting. • Engage in and improve the whole lifecycle of services as domain expert—from inception and design through deployment, operation, and refinement. • Create and handover related documentation and perform knowledge transfers required to support customers as they roll out some of the most sophisticated systems in the world. • Participate in continuous improvement processes to internal teams such as opening bugs, documenting workarounds, and suggesting enhancements.

Requirements

• 5+ years of experience with InfiniBand. • Experience in solving problems in large-scale InfiniBand network environments. • Driven focus on customer needs and satisfaction. • Self-motivated with excellent leadership skills. • Strong written, verbal, and listening skills are essential. • Proven customer-facing expertise. • Ability to travel to customer sites within the United States. • Minimum of a four-year degree from an accredited university or college in Computer Science, Electrical or Computer Engineering, or equivalent experience.

Benefits

• equity • benefits

Apply Now

Similar Jobs

October 9

OpenX

201 - 500

Solutions Architect for OpenX's multi-screen ad technology platform integration.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com