Poolside

Website LinkedIn All Job Openings

Blockchain • Accelerator • Incubator • Hub • Fundraising

11 - 50 employees

🌐 Web 3

💳 Fintech

🎮 Gaming

Member of Engineering - Inference

October 31

🇪🇺 Anywhere in Europe – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🖥 Software Engineer

AWS

Python

PyTorch

Rust

Apply Now

Poolside

Website LinkedIn All Job Openings

Blockchain • Accelerator • Incubator • Hub • Fundraising

11 - 50 employees

🌐 Web 3

💳 Fintech

🎮 Gaming

Description

• ABOUT POOLSIDE • In this decade, the world will create artificial intelligence that reaches human level intelligence (and beyond) by combining learning and search. • poolside exists to be one of these companies - to build a world where AI will drive the majority of economically valuable work and scientific progress. • We believe our applied research needs to culminate in products that are put in the hands of people. • ABOUT OUR TEAM • We are a remote-first team that sits across Europe and North America. • Our R&D and production teams are a combination of more research and more engineering-oriented profiles. • ABOUT THE ROLE • You will be focused on building out our multi-device inference of Large Language Models. • You will be working on improvements for both NVIDIA and AWS hardware. • YOUR MISSION • To develop and continuously improve the inference of LLMs for source code generation. • RESPONSIBILITIES • Follow the latest research on LLMs, inference and source code generation • Propose and evaluate innovations in the quality and efficiency of the inference • Monitor and implement LLM inference metrics in production • Write high-quality high-performance Python, Cython, C/C++, Triton, ThunderKittens, native CUDA, Amazon Neuron code. • Work in the team: plan future steps, discuss, and always stay in touch.

Requirements

• Experience with Large Language Models (LLM) • Confident knowledge of the computational properties of transformers • Knowledge/Experience with cutting-edge inference tricks • Knowledge/Experience of distributed and lower precision inference • Knowledge of deep learning fundamentals • Strong engineering background • Theoretical computer science knowledge is a must • Experience with programming for hardware accelerators • SIMD algorithms • Expert in matrix multiplication bottlenecks • Know hardware operation latencies by heart • Research experience (nice to have) • Nice to have: Author of scientific papers on topics: applied deep learning, LLMs, source code generation, etc • Can discuss the latest papers and descend to fine details • Programming experience • Linux • Git • Python with PyTorch or Jax • C/C++, CUDA, Triton, ThunderKittens • Use modern tools and always looking to improve • Strong critical thinking and ability to question code quality policies when applicable • Prior experience in non-ML programming is a nice to have

Benefits

• Fully remote work & flexible hours • 37 days/year of vacation & holidays • Health insurance allowance for you and dependents • Company-provided equipment • Wellbeing, always-be-learning and home office allowances • Frequent team get togethers • Great diverse & inclusive people-first culture

Apply Now

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com