Remote Forward Deployed Engineer - AI Inference Specialist
About the Role
Join Red Hat as a Remote Forward Deployed Engineer specializing in AI Inference. In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D and vLLM) and our customers' most critical production environments. As a Remote Forward Deployed Engineer, you will interface directly with engineering teams to deploy, optimize, and scale distributed Large Language Model (LLM) inference systems.
What You'll Do
- Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters, setting up advanced deployments to maximize hardware utilization.
- Optimize for Production: Run performance benchmarks, tune vLLM parameters, and configure intelligent inference routing policies to meet SLOs for latency and throughput.
- Code Side-by-Side: Collaborate with customer engineers to write production-quality code (Python/Go/YAML) that integrates our inference engine into their existing Kubernetes ecosystem.
- Solve the "Unsolvable": Debug complex interactions between model architectures, hardware accelerators, and Kubernetes networking.
- Feedback Loop: Act as the "Customer Zero" for core engineering teams, channeling field learnings back to product development.
Requirements
- 8+ years of engineering experience in Backend Systems, SRE, or Infrastructure Engineering.
- Deep Kubernetes expertise, fluent in K8s primitives and high-performance networking.
- Proficiency in Python and Go for systems programming.
- Experience with Infrastructure as Code tools like Helm and Terraform.
- Understanding of AI inference, including KV Caching and continuous batching.
Nice to Have
- Experience contributing to open-source AI infrastructure projects.
- Knowledge of Envoy Proxy or Inference Gateway (IGW).
- Familiarity with model optimization techniques like Quantization.
What We Offer
- Comprehensive medical, dental, and vision coverage.
- 401(k) with employer match.
- Paid time off and holidays.
- Paid parental leave plans for all new parents.
- Flexible work environment with remote options.
This Remote Forward Deployed Engineer role at Red Hat offers an exciting opportunity to work with cutting-edge AI technologies and Kubernetes. With a competitive salary range and flexible work environment, it's a great chance to make an impact.
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months