Freelance AI Evaluation Engineer (Python/Full-Stack)

Mindrift•March 14, 2026

Create challenging coding test cases for AI systems, review and refine production codebases, and analyze AI failures. Work on part-time, non-permanent projects for leading tech companies.

Requirements

Degree in Computer Science, Software Engineering, or related fields
5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
Experience writing tests (functional, integration – not just running them)
Docker containers (running evaluations locally in containers)
CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
English proficiency - B2

Benefits

Flexible work schedule
Opportunity to work on challenging projects with leading tech companies
Potential earnings of up to $30 per hour equivalent

Originally posted on Himalayas

Apply Now

You will be redirected to the company's application page.

Job Details

Locations

Portugal

Role

Fullstack

Duration

Full Time

Experience

Mid

Salary

unknown salary

Tech Stack & Tags

Freelance-AI-EngineerMid-Level-Full-Stack-AI-EngineerFreelance-AI-ML-DeveloperMid-Level-Full-Stack-AI-Developer-(Python-Azure)Evaluation-Engineer

Benefits

flexible