Job Description
Core Technical Requirements
Backend & API Development
Expert-level proficiency in modern API frameworks (FastAPI, Django, Flask)
Experience building robust, scalable backend services for AI/ML applications
Understanding of API rate limiting, caching strategies, and performance optimization
Knowledge of microservices architecture and event-driven design patterns
Foundation Models & LLM Integration
Hands-on experience integrating commercial LLM APIs (OpenAI, Anthropic, Google, Azure OpenAI, etc.)
Understanding of prompt engineering techniques and best practices
Experience with model selection, evaluation, and performance monitoring
Knowledge of token management, cost optimization, and response streaming
Open Source AI/ML
Experience deploying and fine-tuning open source models (Llama, Mistral, etc.)
Familiarity with model hosting solutions (Databricks serving, Hugging Face, Ollama, vLLM, or similar)
Understanding of model quantization and optimiza...
Backend & API Development
Expert-level proficiency in modern API frameworks (FastAPI, Django, Flask)
Experience building robust, scalable backend services for AI/ML applications
Understanding of API rate limiting, caching strategies, and performance optimization
Knowledge of microservices architecture and event-driven design patterns
Foundation Models & LLM Integration
Hands-on experience integrating commercial LLM APIs (OpenAI, Anthropic, Google, Azure OpenAI, etc.)
Understanding of prompt engineering techniques and best practices
Experience with model selection, evaluation, and performance monitoring
Knowledge of token management, cost optimization, and response streaming
Open Source AI/ML
Experience deploying and fine-tuning open source models (Llama, Mistral, etc.)
Familiarity with model hosting solutions (Databricks serving, Hugging Face, Ollama, vLLM, or similar)
Understanding of model quantization and optimiza...
Apply for this Position
Ready to join Crescendo Global? Click the button below to submit your application.
Submit Application