Freelance AI Evaluation Engineer (Python/Full-Stack)
Mindrift•March 14, 2026
Create challenging coding test cases for AI systems, review and refine production codebases, and analyze AI failures. Work on part-time, non-permanent projects for leading tech companies.
Requirements
- Degree in Computer Science, Software Engineering, or related fields
- 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
- Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
- Experience writing tests (functional, integration – not just running them)
- Docker containers (running evaluations locally in containers)
- CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency - B2
Benefits
- Flexible work schedule
- Opportunity to work on challenging projects with leading tech companies
- Potential earnings of up to $30 per hour equivalent
Originally posted on Himalayas
Apply Now
You will be redirected to the company's application page.
Job Details
Locations
Portugal
Role
Fullstack
Duration
Full Time
Experience
Mid
Salary
unknown salary
Tech Stack & Tags
Freelance-AI-EngineerMid-Level-Full-Stack-AI-EngineerFreelance-AI-ML-DeveloperMid-Level-Full-Stack-AI-Developer-(Python-Azure)Evaluation-Engineer
Benefits
- flexible