Red Hat, Inc.18.02.26
AI SCORE 8.5

Remote Forward Deployed Engineer - AI Inference Specialist

$190K–$313K/year

About the Role

Join Red Hat as a Remote Forward Deployed Engineer specializing in AI Inference. In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D and vLLM) and our customers' most critical production environments. As a Remote Forward Deployed Engineer, you will interface directly with engineering teams to deploy, optimize, and scale distributed Large Language Model (LLM) inference systems.

What You'll Do

  • Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters, setting up advanced deployments to maximize hardware utilization.
  • Optimize for Production: Run performance benchmarks, tune vLLM parameters, and configure intelligent inference routing policies to meet SLOs for latency and throughput.
  • Code Side-by-Side: Collaborate with customer engineers to write production-quality code (Python/Go/YAML) that integrates our inference engine into their existing Kubernetes ecosystem.
  • Solve the "Unsolvable": Debug complex interactions between model architectures, hardware accelerators, and Kubernetes networking.
  • Feedback Loop: Act as the "Customer Zero" for core engineering teams, channeling field learnings back to product development.

Requirements

  • 8+ years of engineering experience in Backend Systems, SRE, or Infrastructure Engineering.
  • Deep Kubernetes expertise, fluent in K8s primitives and high-performance networking.
  • Proficiency in Python and Go for systems programming.
  • Experience with Infrastructure as Code tools like Helm and Terraform.
  • Understanding of AI inference, including KV Caching and continuous batching.

Nice to Have

  • Experience contributing to open-source AI infrastructure projects.
  • Knowledge of Envoy Proxy or Inference Gateway (IGW).
  • Familiarity with model optimization techniques like Quantization.

What We Offer

  • Comprehensive medical, dental, and vision coverage.
  • 401(k) with employer match.
  • Paid time off and holidays.
  • Paid parental leave plans for all new parents.
  • Flexible work environment with remote options.
Why This Job8.5 of 10

This Remote Forward Deployed Engineer role at Red Hat offers an exciting opportunity to work with cutting-edge AI technologies and Kubernetes. With a competitive salary range and flexible work environment, it's a great chance to make an impact.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

Generating success profile...

Analyzing job requirements and market data

Loading market overview...

Analyzing market trends and skill demands

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.