ML Infrastructure Engineer

·
Full time
Location: London
·
Job offered by: spAItial AI
·
Category: IT & Technology
SpAItial is pioneering the development of a frontier 3D foundation model, pushing the boundaries of AI, computer vision, and spatial computing. Our mission is to redefine how industries, from robotics and AR/VR to gaming and movies, generate and interact with 3D content. We’re looking for individuals who are bold, innovative, and driven by a passion for pushing the boundaries of what’s possible. You should thrive in an environment where creativity meets challenge and be fearless in tackling complex problems. Our team is built on a foundation of dedication and a shared commitment to excellence, so we value people who take immense pride in their work and place the collective goals of the team above personal ambition. As a part of our startup, you’ll be at the forefront of the AI revolution in 3D technology, and we want you to be excited about shaping the future of this dynamic field. If you’re ready to make an impact, embrace the unknown, and collaborate with a talented group of visionaries, we want to hear from you. Responsibilities Create and maintain ML and cloud infra for nascent AI company.

Design and Deploy Infrastructure: Develop and maintain scalable, high-performance cloud-based infrastructure for ML workloads and serving ML APIs or client endpoints.

Cloud Platforms: Deploy, manage, and optimize cloud-based infrastructure (AWS, Azure, GCP). Setup ML nodes for local development and distributed training workloads, maintain compatibility between the two.

System Management: Install, configure, and monitor servers.

Storage management: Optimize various types of shared / local storage maintaining big data for ML workloads.

Containerization and Orchestration: Manage and scale containerized applications using Docker, Kubernetes, Terraform, etc.

Collaboration: Work closely with the rest of the technical team to ensure smooth orchestration of the ML and production workloads.

Incident Response: Respond to cloud / production incidents, perform analysis, and implement solutions to prevent recurrence.

Key Qualifications: 3 years professional experience in a cloud-related role, preferred ML-related.

Proficiency in writing scripts (Bash, PowerShell, Python, …) to automate tasks.

Proficiency in cloud platforms (e.g., AWS, GCP, Azure).

Proficiency in containerization (e.g., Docker, Kubernetes).

Proficiency in orchestrating a cloud.

Preferred Qualifications Familiarity with Python (Jupyter) and ML frameworks (PyTorch).

Familiarity with cloud monitoring tools (e.g., Prometheus, Grafana).

Familiarity with cloud-based database systems (Amazon RDS, Aurora, Redshift, Google Cloud SQL, Spanner, …) and data-visualisation tools (Amazon QuickSight, Apache Superset).

Familiarity with CI/CD tools (e.g., CircleCI).

At SpAItial, we are committed to creating a diverse and inclusive workplace. We welcome applications from people of all backgrounds, experiences, and perspectives. We are an equal opportunity employer and ensure all candidates are treated fairly throughout the recruitment process.

#J-18808-Ljbffr

Recent Jobs

London (On site) · Full time

Are you a smart, driven professional who takes pride in making a difference in local communities? Turner & Townsend’s Real Estate division is experiencing significant growth and we’re looking for an experienced industry professional with health project experience to join our high-performing and collaborative Project Management team. Why Join Us? Impactful Work: Contribute to social [...]Read More... from Assistant Project Manager – Healthcare See details

Chasetown (On site) · Full time

My client, Autosmart International are a manufacturing success story! Site Operations Manager – leading fast-paced manufacturing and warehousing About Our Client Autosmart International is a manufacturing success story, leading the field in vehicle cleaning products. We are the No.1 choice of automotive trade customers across the UK. We have doubled in size in the last [...]Read More... from Site Operations Manager See details

London (On site) · Full time

CSS are looking for an experienced duty officer to join our client’s team who are a local council responsible for all areas within the Tendering district. Working hours: All shifts are 8 hours long with various start times available: Monday to Friday – start times between 6AM – 3PM Saturday & Sunday – 6AM – [...]Read More... from Duty Officer See details