Senior Solution Architect - HPC and AI

October 9

Apply Now
Logo of NVIDIA

NVIDIA

GPU-accelerated computing • artificial intelligence • deep learning • virtual reality • gaming

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Description

• Building robust AI/HPC infrastructure for new and existing customers. • Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting. • Engage in and improve the whole lifecycle of services from inception and design through deployment, operation, and refinement. • Understanding the AI workload and how it interacts with other parts of the system. • Help maintain services once they are live by measuring and monitoring progress of AI jobs. • Provide feedback to internal teams like opening bugs, documenting workarounds, and suggesting improvements.

Requirements

• BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields with at least 8 years work or research experience with Python/ C++ / other software development. • Track record of medium to large scale AI training and understanding of key libraries used for NLP/LLM/VLA training (NeMo Framework, DeepSpeed etc.) • Experience with integration and deployment of software products in production enterprise environments, and microservices software architecture. • Experience working with multiple levels and teams across organisations (Engineering, Product, Sales and Marketing team) • Ability to multitask in a fast-paced environment. • Driven with strong analytical and problem-solving skills. • Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects. • You are a self-starter with demeanour for growth, passion for continuous learning and sharing findings across the team. • Technical leadership and strong understanding of NVIDIA technologies, and success in working with customers. • Excellent verbal, written communication, and technical presentation skills in English.

Benefits

• equity • benefits

Apply Now

Similar Jobs

October 8

Senior Solutions Architecture Manager educating enterprise customers on AI testing solutions.

October 4

Senior Solutions Architect for Skyven’s decarbonization heat pump technology projects.

September 27

Senior Solutions Architect at Britive focuses on cloud security and privileged access management.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com