Software AI Engineer

·
Full time
Location: London
·
Job offered by: microTECH Global Limited
·
Category: IT & Technology
As a Senior Software Engineer in the AI Processor Software & Hardware Co-design Lab, you will be responsible for designing and implementing both compile-time and run-time optimizations to enable real-time AI applications on Huawei AI processors. You will collaborate closely with cross-functional teams to integrate and deploy AI solutions on the Ascend platform, leveraging your expertise to shape the performance, functionality, and efficiency of our AI models and systems.

Required: Rich experience in optimizing AI chip architectures and AI systems, familiar with mainstream heterogeneous computing software and hardware architectures in the industry, and have comprehensive capabilities from applications to basic software to chips. Hands-on experience with one of the following technologies: Numerical Calculation, Compilation, Algorithm & chip co-design, Runtime, Shared Memory. Knowledge of AI industry application scenarios, familiar with mainstream models and algorithm development trends, and able to extract requirements for the chip layer. Experience in analyzing workload sensitivity to micro-architecture features, evaluating performance trade-offs, and recommending improvements to both micro-architecture and application software for optimal efficiency. Familiarity with the performance impact of different compute, memory, and communication configurations, as well as hardware and software implementation choices, on AI acceleration. Experience with GPU compute APIs such as CUDA or OpenCL, and the ability to utilize GPU/NPU-optimized libraries to enhance performance. Experience in the development of deep learning frameworks, compilers, or system software. Strong background in compilers and optimization techniques; experience with LLVM-MLIR is a plus, but not required. Experience in software development using C/C++ and Python.

Desired: Relevant experience in several sub-fields of AI application algorithms, frameworks, runtime, modeling and simulation, and compilers. In-depth understanding of the innovative methods, platforms, and tools of AI head manufacturers, and experience in transforming application and academic research achievements into commercial products. Experience with GPU acceleration using AMD or Nvidia GPUs. Experience in developing inference backends and compilers for GPU or NPU. Experience with AI/ML inference frameworks like ONNXRuntime, IREE, or TVM. Experience with deploying AI models in production environments.

#J-18808-Ljbffr

Recent Jobs

London (On site) · Full time

Are you a smart, driven professional who takes pride in making a difference in local communities? Turner & Townsend’s Real Estate division is experiencing significant growth and we’re looking for an experienced industry professional with health project experience to join our high-performing and collaborative Project Management team. Why Join Us? Impactful Work: Contribute to social [...]Read More... from Assistant Project Manager – Healthcare See details

Chasetown (On site) · Full time

My client, Autosmart International are a manufacturing success story! Site Operations Manager – leading fast-paced manufacturing and warehousing About Our Client Autosmart International is a manufacturing success story, leading the field in vehicle cleaning products. We are the No.1 choice of automotive trade customers across the UK. We have doubled in size in the last [...]Read More... from Site Operations Manager See details

London (On site) · Full time

CSS are looking for an experienced duty officer to join our client’s team who are a local council responsible for all areas within the Tendering district. Working hours: All shifts are 8 hours long with various start times available: Monday to Friday – start times between 6AM – 3PM Saturday & Sunday – 6AM – [...]Read More... from Duty Officer See details