Proven engineering leadership ability: Track record of 5+ years in software or ML engineering, focusing on building robust, scalable systems. Demonstrated success leading teams, fostering collaboration and achieving significant milestones. Skilled in creating and managing multi-instance clusters for data and model parallel training on GPUs/TPUs, preferably using DeepSpeed or PyTorch FSDP. Experience with orchestration systems like SLURM or Ray, along with MLOps tools such as Kubernetes, Vertex, or Sagemaker. Proficient in serving large machine learning models at scale, including quantization, distributed computing, and using frameworks like vLLM or Ray Serve. Nice to have Experience at a leading machine learning company (Mistral, Anthropic, X.ai, HuggingFace, Neuralink, OpenAI, etc.) Publications in top AI conferences. Interested in and thoughtful about the impacts of AI technology. About you
You are going to thrive at Atla with the following mindset: Collaborative and team-oriented, with strong communication skills. Comfortable with the uncertainty of a hyper-growth startup. Unpretentious and hard working; find the best ideas wherever they come from. Compensation
£150K - £300K Significant stake in equity as one of our core technical leaders Pension plan with employer contributions Medical, dental, and vision benefits Join us in making a dent in the universe by engineering safe, beneficial AI systems!
#J-18808-Ljbffr