Software Engineer – Code QA for AI Evaluation (Remote)
About the Role
We are seeking a talented Software Engineer – Code QA for AI Evaluation to join our remote team. In this role, you will craft realistic developer prompts and evaluate AI outputs, ensuring they reflect real-world developer workflows. This position offers the flexibility of remote work and a commitment of 10–40 hours per week.
What You'll Do
- Develop chat-style developer prompts focusing on code review, debugging, and error diagnosis.
- Source and adapt real pull requests to create authentic evaluation tasks.
- Write clear, technically accurate model responses that demonstrate strong reasoning.
- Evaluate AI outputs based on logic, clarity, and technical judgment.
- Collaborate asynchronously with research teams to refine task quality.
Requirements
- Bachelor’s degree in Software Engineering, Computer Science, or a related field.
- Strong experience in software engineering or technical research.
- Proficiency in languages such as Python, JavaScript, Java, or C++.
- Hands-on experience with debugging, testing, and validating code.
- Strong technical writing skills with high attention to detail.
- Ability to work independently in a fully remote environment.
Nice to Have
- Experience in educational content development.
- Familiarity with AI evaluation metrics and methodologies.
What We Offer
- Competitive hourly compensation ranging from $70 to $120.
- Flexible working hours with a remote-first approach.
- Opportunity to work on innovative AI projects.
- Collaborative team environment with asynchronous communication.
- Professional development opportunities in AI and software engineering.
This remote Software Engineer role offers a unique opportunity to work on AI evaluation projects with flexible hours and competitive pay.
About Crossing Hurdles
Explore Crossing Hurdles careers in 2026, featuring a wide array of job openings including remote, hybrid, and office roles. Utilize our advanced filters to streamline your job search and tailor your resume for the best fit. Gain valuable insights into company culture and track your applications seamlessly. Discover exciting career opportunities at Crossing Hurdles and take the next step in your professional journey today.
Who Will Succeed Here
Proficient in Python and JavaScript with a strong ability to debug and write clean, efficient code, specifically in the context of AI evaluation and testing frameworks.
Self-motivated individual who thrives in a remote work environment, demonstrating excellent time management skills and the ability to work independently while meeting deadlines for code evaluations.
Middle-level experience with a growth mindset, eager to learn about AI systems and their practical applications in development, while also possessing strong technical writing skills to document findings and recommendations.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months