Posted 12 May 2026, 5:57 pm

Freelance Agent Evaluation Engineer

at Mindrift

language Canada schedule Full-time travel_explore Nomad Score: Low (8) ads_click Moderate (48) payments $90k work Other

We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You'll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities.

Requirements

  • Degree in Computer Science, Software Engineering, or related fields
  • 5+ years in software development, primarily Python
  • Background in full-stack development, with experience building React-based interfaces and robust back-end systems
  • Experience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure tools

Benefits

  • Opportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hour

Originally posted on Himalayas

The offering company is responsible for the content on this page / the job offer. Mindrift · Source: Himalayas

Similar Job Offers

8 Jun 2026, 3:21 am

Evolent

language Serbia schedule Full-time travel_explore Nomad Score: Medium ads_click Moderate
business
8 Jun 2026, 3:19 am

Elbit America

language Canada schedule Full-time travel_explore Nomad Score: Low ads_click Moderate payments $135k – $145k
8 Jun 2026, 3:18 am

Bjak

language South Korea schedule Full-time travel_explore Nomad Score: Low ads_click Hidden Gem

Ready to unplug?

Join thousands of nomads receiving curated job alerts weekly. We only send jobs that respect your freedom.