xAI03.02.26
AI SCORE 8.5

Senior Model Evaluation Engineer - AI/​ML Focus

$180K–$440K/year

About the Role

We are looking for a Senior Model Evaluation Engineer to join our team at xAI. This Senior Model Evaluation Engineer remote position is an exciting opportunity to work on cutting-edge AI systems that aim to understand the universe and aid humanity. You will be part of a small, highly motivated team focused on engineering excellence and innovation.

What You'll Do

  • Conduct comprehensive assessments of AI models to ensure their quality and performance.
  • Deep dive into model training and data to identify weaknesses revealed during evaluations.
  • Collaborate with modeling and data teams to develop plans for improving model quality.
  • Build infrastructure and frameworks for user-friendly model evaluation, utilizing inference frameworks like SGlang and vLLM.
  • Develop model assessment and evaluation tasks, including public and in-house benchmarking.

Requirements

  • Strong experience with Python, JAX, XLA, Rust, or C++.
  • Proven track record in model evaluation and assessment.
  • Ability to synthesize data for new evaluations and benchmarks.
  • Excellent communication skills to share insights and collaborate effectively with team members.
  • Experience in building evaluation frameworks and infrastructure.

Nice to Have

  • Familiarity with Spark and large-scale data processing.
  • Experience in AI/ML model development and deployment.
  • Knowledge of best practices in model evaluation and benchmarking.

What We Offer

  • Competitive salary ranging from $180,000 to $440,000 annually.
  • Equity options as part of the compensation package.
  • Comprehensive medical, vision, and dental coverage.
  • Access to a 401(k) retirement plan with employer contributions.
  • Short and long-term disability insurance and life insurance.
  • Relocation support for candidates willing to move to the Bay Area.
  • A flat organizational structure that encourages initiative and leadership.
Language Requirements
EnglishC1
BasicIntermediateAdvancedNative
Why This Job8.5 of 10

This Senior Model Evaluation Engineer role at xAI offers a competitive salary, equity options, and the chance to work on groundbreaking AI technologies.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

About xAI

Explore xAI careers in 2026 and discover exciting job openings in remote, hybrid, and office roles. Our platform offers tailored application tracking, insightful company information, and advanced filters to help you find the perfect position at xAI. Stay updated with the latest industry news and vacancy scores to enhance your job search experience and unlock your potential in the innovative world of artificial intelligence.

Industry
Tech
Location
Remote

Who Will Succeed Here

Proficient in Python and experienced with JAX and XLA for optimizing machine learning models, demonstrating a strong understanding of numerical computing and performance tuning.

Self-motivated and capable of working independently in a remote setting, showing a proactive approach to problem-solving and collaboration with cross-functional teams through effective use of digital communication tools.

Possesses a growth mindset with at least 3-5 years of experience in AI/ML model evaluation, showcasing an ability to adapt to evolving technologies like Rust and C++ in a fast-paced tech environment.

Learning Resources

Python for Data Science Handbookguide

Career Path

Senior Model Evaluation Engineer - AI/ML Focus(Now)Lead AI Model Engineer(1-2 years)Director of AI Model Evaluation(3-5 years)

Market Overview

Python Market Size 2024
$20B
Annual Growth
11.5%
AI/ML Adoption Rate
75%
Investment in AI/ML
+150%
Labour Demand for AI/ML Engineers
+30%
Avg Salary for Senior AI/ML Engineers
$145K

Skills & Requirements

Required
PythonJAXXLA
Growing in Demand
TensorFlowPyTorchData Engineering
Declining
MATLABR

Domain Trends

Increased Demand for Explainable AI
With 60% of organizations prioritizing explainability in AI systems, engineers skilled in model evaluation are in high demand.
Shift Towards Edge AI
By 2025, it's expected that 30% of AI workloads will be processed at the edge, driving demand for engineers proficient in lightweight frameworks like JAX and Rust.
Integration of AI in Traditional Industries
Industries like healthcare and finance are adopting AI technologies at a rate of 40% annually, increasing the need for skilled model evaluators.

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.