Job Description
How do you make a large language model genuinely human‑centred, capable of reasoning, empathy, and nuance rather than just pattern‑matching?
This team is built to answer that question. They’re a small, focused group of researchers and engineers working on the post‑training challenges that matter most: RLHF, RLAIF, continual learning, multilingual behaviour, and evaluation frameworks designed for natural, reliable interaction.
You’ll work alongside a team from NVIDIA, Meta, Microsoft, Apple, and Stanford, in an environment that combines academic rigour with production‑level delivery. Backed by over $400 million in funding, they have the freedom, compute, and scale to run experiments that push beyond the limits of standard alignment research.
This is a role where your work moves directly into deployed products. The team’s models are live, meaning every insight you develop, every method you refine, and every experiment you run has immediate, measurable impact...
Apply for this Position
Ready to join Trades Workforce Solutions? Click the button below to submit your application.
Submit Application