Job Description
Rate model outputs for quality, helpfulness, safety and instruction-fidelity; perform pairwise comparisons and preference judgments used to train reward models. This is central to RLHF and instruction tuning workflows.
Responsibilities
<...
Apply for this Position
Ready to join Levit8 Technologies? Click the button below to submit your application.
Submit Application