Job Description
Job Title:
AI Agent Evaluation Engineer
Experience Level:
5-7 Years (6+ years in Software QA required)
Work Split:
70% Automation / 30% Manual Testing
Key Focus:
Responsible AI, Safety Evals, and Google ADK
AI/LLM Testing Experience:
Minimum 2 years focused specifically on testing/evaluating AI systems, conversational agents, or LLMs.
Safety & Red Teaming:
Direct experience in Safety Evals is mandatory. This includes red teaming, adversarial testing, jailbreaking, and measuring toxicity/bias.
Google ADK Knowledge:
Must have direct experience or a strong conceptual understanding of the
Google Agent Development Kit (ADK)
and Vertex AI.
Technical Stack:
o Strong proficiency in
Python
for scripting and automation.
o Experience with
PyTest .
o
Prompt Injection
experience.
Tooling Familiarity:
Experienc...
AI Agent Evaluation Engineer
Experience Level:
5-7 Years (6+ years in Software QA required)
Work Split:
70% Automation / 30% Manual Testing
Key Focus:
Responsible AI, Safety Evals, and Google ADK
AI/LLM Testing Experience:
Minimum 2 years focused specifically on testing/evaluating AI systems, conversational agents, or LLMs.
Safety & Red Teaming:
Direct experience in Safety Evals is mandatory. This includes red teaming, adversarial testing, jailbreaking, and measuring toxicity/bias.
Google ADK Knowledge:
Must have direct experience or a strong conceptual understanding of the
Google Agent Development Kit (ADK)
and Vertex AI.
Technical Stack:
o Strong proficiency in
Python
for scripting and automation.
o Experience with
PyTest .
o
Prompt Injection
experience.
Tooling Familiarity:
Experienc...
Apply for this Position
Ready to join Brillio? Click the button below to submit your application.
Submit Application