Freelance AI Evaluation Engineer (Python/Full-Stack)

Mindrift•March 14, 2026

Freelance AI Evaluation Engineer (Python/Full-Stack) opportunity to create challenging coding test cases, review and refine codebases, write functional tests, and analyze AI failures for leading tech companies.

Requirements

Degree in Computer Science, Software Engineering or related fields
5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
Experience writing tests (functional, integration – not just running them)
Docker containers (running evaluations locally in containers)
CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
English proficiency - B2

Benefits

Flexibility to choose projects and work schedule
Opportunity to work with leading tech companies
Potential to earn up to $21 per hour equivalent
Variety of projects with different scopes and complexities

Originally posted on Himalayas

Apply Now

You will be redirected to the company's application page.

Job Details

Locations

Colombia

Role

Fullstack

Duration

Full Time

Experience

Mid

Salary

unknown salary

Tech Stack & Tags

Senior-Full-Stack-AI-EngineerPython-AI-EngineerFreelance-AI-EngineerFreelance-AI-DeveloperMid-Level-Full-Stack-AI-EngineerMid-Level-Full-Stack-AI-Developer-(Python-Azure)Software-Engineer-(AI)