Python Client: llama-stack-client for Python 3.12+ with full agent and model APIs
Vector Store APIs: Create and query vector stores from the client, including PGVector-backed stores with provider_id="pgvector" and Milvus-backed stores with provider_id="milvus-remote"
REST-Friendly: Server exposes APIs for inference, agents, and tool runtime; can be wrapped in FastAPI or other web frameworks for production use