Posted May 5, 2026

AI Inference Engineer


Be part of the team creating the software foundation for next-generation AI compute platforms. In this role, you’ll work across the full stack, from low-level kernels and hardware-optimized operators to large-scale ML deployment frameworks, in close collaboration with compiler developers, ML scientists, and hardware specialists. This position offers the chance to contribute to state-of-the-art AI infrastructure, optimize software for custom hardware, and deepen your expertise in system software and machine learning.

Responsibilities (some of the following)

Minimum qualifications

You are a great fit if you have experience in at least one of the following areas:

Contribution to open-source projects (e.g., LLVM, PyTorch, TensorFlow, ONNX Runtime, xDSL, IREE) is a big plus.

Interested in this role? Apply on iHire