You will optimise large scale systems delivering an extremely fast inference engine developed for large language models (LLMs).
Skills
Experience developing large distributed systems. Debugging skills, getting to root cause efficiently. Interest in complex systems and continuous optimisation.
Responsibilities
Write elegant, concise, performance code in multiple languages: Go or Golang, Rust, C++, or Python. Adapt quickly to new languages and technologies as needed. Eliminate latency across all systems. Work with components from TCP packets to the Linux kernel scheduler.
#J-18808-Ljbffr