Senior Linux Kernel Engineer – High-Performance Computing

🕒 April 29

The Next Chapter

1 - 10 employees

Founded 2021

🎯 Recruiter

Recruitment • IT • Engineering

The Next Chapter is a specialized recruitment agency focusing on international job opportunities in the IT and Engineering sectors. Based in the Netherlands, it helps English-speaking professionals find permanent positions and provides support with work permits and relocation. Its approach emphasizes transparency and expertise in navigating the Dutch job market for technical roles.

📋 Description

• Tuning the performance of clusters and InfiniBand networks to ensure optimal operation in HPC and GPU-based environments.
• Analyzing and troubleshooting the root cause of issues related to GPUs and InfiniBand networks, and proposing corrective actions.
• Integrating new hardware into the existing infrastructure, including support for new GPU hardware through software stacks like Kubernetes, QEMU, and KVM.
• Enhancing automation systems for proactive monitoring, detecting, and resolving issues in GPU and InfiniBand environments.
• Configuring and managing GPU devices and InfiniBand fabrics, ensuring efficient and reliable operation.

🎯 Requirements

• 5+ years of professional experience in system-level software development (focused on performance optimization and low-level programming).
• 3+ years of hands-on experience with Linux systems (administration, troubleshooting, and/or performance tuning).
• Experience with relevant "tools of the trade" for kernel profiling and tuning: perf, ftrace, (e)BPF, etc.
• In-depth understanding of server architecture, including PCIe devices, NICs, the Linux OS/kernel, etc.
• Strong proficiency in one or more performance-oriented programming languages (C/C++, Go, Python).

It would be a plus (but not essential) if you have:

• Experience with GPU end-to-end testing in a cluster environment using InfiniBand networking.
• Proven track record of analyzing and optimizing the performance of HPC workloads (e.g., simulations, data analysis, AI/ML workloads).
• Familiarity with RDMA, RoCE, and InfiniBand protocols for high-performance communication.
• Background in Software-Defined Networking (SDN) and experience with HPC cluster networking.
• Understanding of QEMU/KVM virtualization and managing virtualized environments.
• Experience with deep learning frameworks such as PyTorch and TensorFlow, and their integration with HPC systems.
• Familiarity with collective communication libraries like MPI and NCCL for distributed computing.

🏖️ Benefits

• Flexible working arrangements.
• A dynamic and collaborative work environment that values initiative and innovation.
