Job Description

We are looking for a Python Backend Developer to build a custom Retrieval-Augmented Generation (RAG) chat application. Our goal is to build a lean, high-performance system from the ground up, focusing on custom logic for data retrieval and AI orchestration rather than relying on heavy third-party frameworks.

Key Responsibilities

  • Custom RAG Development: Build the end-to-end RAG pipeline, including custom document parsing, text chunking strategies, and context injection logic.
  • API Architecture: Develop and scale high-concurrency, asynchronous RESTful APIs using FastAPI (preferred) or Flask.
  • Vector Search Implementation: Integrate and optimize vector databases (e.g., pgvector, Pinecone, or Weaviate) to ensure high-precision retrieval for AI grounding.
  • Core AI Logic: Implement memory management, window handling, and custom prompt engineering natively in Python.
  • System Performance: Optimize the latency of the Retrieve-the...

Apply for this Position

Ready to join Authority Partners? Click the button below to submit your application.

Submit Application