انتهت صلاحية هذا الإعلان الوظيفي
انتهت بتاريخ ١ أبريل ٢٠٢٦
Evaluation Scenario Writer - AI Agent Testing Specialist
وصف الوظيفة
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.
المسؤوليات
- Create structured test cases simulating human workflows
- Define gold-standard behavior and scoring logic
- Analyze agent logs and failure modes
- Iterate on prompts and instructions
- Work with code repositories to validate scenarios
المؤهلات
- 3+ years of software development experience with strong Python focus.