Inferact03.02.26
AI SCORE 8.5

Inference Runtime Engineer - Remote Position

$200K–$400K/year

About the Role

We're hiring an Inference Runtime Engineer to join our team at Inferact. This remote position offers an exciting opportunity to work on cutting-edge AI inference technologies. As an Inference Runtime Engineer, you will play a crucial role in optimizing the execution of large language models (LLMs) across diverse hardware and architectures, making inference cheaper and faster.

What You'll Do

  • Develop and optimize inference engines for LLMs and diffusion models.
  • Implement and enhance model architectures and inference techniques based on the latest research.
  • Collaborate with cross-functional teams to push the boundaries of AI inference capabilities.
  • Contribute to the vLLM core, ensuring performant and maintainable code.
  • Debug complex ML codebases and improve the overall efficiency of the inference process.

Requirements

  • Bachelor's degree or equivalent experience in computer science, engineering, or a related field.
  • Strong programming skills in Python, with extensive experience in PyTorch internals.
  • Deep understanding of transformer architectures and their variants.
  • Experience with LLM inference systems such as vLLM, TensorRT-LLM, or SGLang.
  • Ability to read and implement model architectures from research papers.

Nice to Have

  • Familiarity with KV-cache memory management and hybrid model serving.
  • Experience with multimodal inference (audio/image/video/text).
  • Contributions to open-source ML projects.

What We Offer

  • Annual salary range of $200,000 - $400,000, depending on experience.
  • Equity options to share in the company's success.
  • Comprehensive health, dental, and vision benefits.
  • 401(k) company match to help you save for the future.
  • Flexible remote work options.
Why This Job8.5 of 10

This role offers a unique opportunity to work at the forefront of AI inference technology. With a competitive salary and equity options, it's an attractive position for skilled engineers.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

About Inferact

Explore Inferact careers in 2026. Discover a wide range of remote, hybrid, and office positions tailored to your skills. Utilize our advanced filters, application tracking, and gain valuable company insights to enhance your job search experience. Stay informed with the latest industry news and seize the best career opportunities at Inferact. Start your journey towards an exciting future today!

Industry
Tech
Location
Remote

Generating success profile...

Analyzing job requirements and market data

Loading market overview...

Analyzing market trends and skill demands

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.