Freelance AI Evaluation Engineer (Python/Full-Stack)
Mindrift•March 14, 2026
Freelance AI Evaluation Engineer (Python/Full-Stack) opportunity to create challenging coding test cases, review and refine codebases, write functional tests, and analyze AI failures for leading tech companies.
Requirements
- Degree in Computer Science, Software Engineering or related fields
- 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
- Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
- Experience writing tests (functional, integration – not just running them)
- Docker containers (running evaluations locally in containers)
- CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency - B2
Benefits
- Flexibility to choose projects and work schedule
- Opportunity to work with leading tech companies
- Potential to earn up to $21 per hour equivalent
- Variety of projects with different scopes and complexities
Originally posted on Himalayas
Apply Now
You will be redirected to the company's application page.
Job Details
Locations
Colombia
Role
Fullstack
Duration
Full Time
Experience
Mid
Salary
unknown salary
Tech Stack & Tags
Senior-Full-Stack-AI-EngineerPython-AI-EngineerFreelance-AI-EngineerFreelance-AI-DeveloperMid-Level-Full-Stack-AI-EngineerMid-Level-Full-Stack-AI-Developer-(Python-Azure)Software-Engineer-(AI)