Local AI inference with OpenAI-compatible API
Interactive chat with streaming responses, conversation tracking, and system prompt injection.
Streaming RESTSpeech-to-text via file upload or live microphone streaming with real-time results.
WebSocket RESTAI-powered CSV column mapping with confidence scoring, date detection, and async processing.
Async RESTRAG-powered ICD-10 code suggestions from visit summaries using vector search and LLM clinical analysis.
RAG REST