Job Description
LLM Architect
Edinburgh (on-site)
£100k-120k + exceptional benefits
A rare chance to drive the future of AI infrastructure at one of the world's leading R&D tech organisations.
This is a senior opportunity with a global research leader, where you’ll architect and optimise the platforms that deliver large-scale language models to production. You’ll be working on some of the hardest challenges in distributed AI systems: building ultra-reliable, ultra-scalable environments for inference and deployment.
What you’ll be doing
Designing cloud-native architectures to run large language models on serverless platforms (e.g. Knative on Kubernetes, or custom-built FaaS).
Developing approaches to minimise cold-start latency through advanced container snapshotting, weight pre-loading, and graph partitioning.
Building distributed inference pipelines with tensor parallelism, model sharding, and efficient memory scheduling to serve LLMs at scal...