Job Description
Role Overview
We are seeking a hands-on Testing Lead to own quality and documentation for our Deep Learning, LLM, and Vision-Language Model (VLM) products. You will define how we test, measure, document, and communicate AI quality—working closely with ML, Engineering, and Product teams in a fast-paced startup environment.
This role is ideal for someone who believes clear documentation is as critical as good testing , especially for non-deterministic AI systems.
What You’ll Do
Own Quality & Documentation End-to-End
- Define testing strategy for LLMs, VLMs, and DL pipelines .
- Create and maintain clear, lightweight documentation covering:
- Model testing strategies and assumptions
- Evaluation metrics and acceptance criteria
- Known limitations, risks, and failure modes
- Release readiness and quality sign-off
- Ensure documentation evolves with models, data, and prompts.
LLM / GenAI Testing
- Design tests for:
- Prompt templates and prompt changes
- RAG pipelines (retrieval quality, grounding, hallucination control)
- Multi-turn conversations and long-context behaviour
- Maintain golden datasets , regression test suites, and test result summaries.
- Document prompt behaviour, edge cases, and known model quirks.
Vision & Multimodal Testing
- Test VLMs for image-text alignment, OCR, captioning, and reasoning.
- Document model performance across different image types, quality levels, and domains.
- Track and publish model behaviour changes between versions.
Automation, MLOps & Reporting
- Build Python-based automation for evaluation and regression testing.
- Integrate tests into CI/CD and MLOps pipelines .
- Produce readable quality reports and dashboards for engineers and leadership.
- Monitor and document production issues such as model/data drift and degradation .
Build a Quality-First Culture
- Establish QA and documentation standards that scale with a startup.
- Mentor engineers on writing testable code and meaningful documentation.
- Act as the single source of truth for AI quality, testing, and known risks.
What we’re looking For
Must-Have
- Strong background in software testing with lead or ownership experience .
- Hands-on experience testing LLMs, DL models, or GenAI systems .
- Strong Python skills for test automation and data validation.
- Proven ability to write clear, structured technical documentation .
- Understanding of:
- Transformer-based models and DL workflows
- Model evaluation metrics and non-deterministic system testing
- Comfortable working in ambiguity and moving fast in a startup.
Nice-to-Have
- Experience with VLMs, multimodal models, or computer vision .
- Exposure to RAG architectures , vector databases, and embeddings.
- Familiarity with tools like LangChain, LlamaIndex, MLflow, or similar.
- Experience documenting AI risks, limitations, or compliance requirements.
Interested can apply to
Requirements
Job description: 1) Perform basic clinical nursing practices in accordance with scope of practice and standard as determined by RxDx. · Taking of specimens. · Sterilizing the instruments. · Monitor vital signs and observe reactions to medication and treatment (EMR). 2) Maintain accurate patients’ records and statistics. 3) Provide nursing care to Health Check patients. 4) Provide assistance to Doctors. 5) Educate patients on their health and required precautions to be taken (for certain cases). 6) Utilize equipment proficiently and promote its use and safekeeping. 7) Essentially, an appropriate (as per our mentioned guidelines) disposal of used equipments. 8) Maintaining the confidentiality of all information pertaining to patients. 9) Ensuring to be present all times at the designated area / work station. 10) Entering all the pending jobs of the day in the duty handover register and getting it signed by the person assuming duty in the next shift. 11) Keeping all the records for all house visits. 12) Constantly work on the ways to improve the quality of service. 13) Dealing with all queries from patients. 14) Ensures efficient and smooth flow of all clinic activities thru coordination and planning. 15) Assesses patient’s physical, psychosocial and/or emotional needs. 16) Informing physician of patient status. 17) Handling emergency cases / situations.
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application