November 7
• Lead multimodal research including text-to-image, video, 3D, and audio generation • Architect, train, and evaluate foundational models • Work in exploratory phases to gain new insights and improve model outcomes
• Experience in text-to-image, video synthesis, 3D generation, and audio processing • Skill in building foundational models from scratch, leveraging GPU technology for performance and scaling • Background in GANs, transformers, and diffusion models, especially for multimodal applications • Record of peer-reviewed publications and contributions to open-source projects • Expertise in Python, deep learning frameworks, and cloud computing • Strong team player experienced in cross-functional collaboration • Ability to stay current with tools and advancements in AI • PhD in AI with research experience in ML or equivalent experience
• We offer a generous package including paid parental leave • A generous home office and wellness budget • Remote working from abroad • Free lunch on Fridays in the Sydney office
Apply Now