September 18
• Design, train and improve upon cutting-edge models. • Help us develop new techniques to train and serve models safer, better, and faster. • Train extremely large-scale models on massive datasets. • Explore continual and active learning strategies for streaming data. • Learn from experienced senior machine learning technical staff. • Work closely with product teams to develop solutions.
• Proficiency in Python and related ML frameworks such as Tensorflow, TF-Serving, JAX, and XLA/MLIR. • Experience using large-scale distributed training strategies. • Familiarity with autoregressive sequence models, such as Transformers. • Strong communication and problem-solving skills. • A demonstrated passion for applied NLP models and products. • Bonus: experience writing kernels for GPUs using CUDA. • Bonus: experience training on TPUs. • Bonus: papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).
Apply Now