Providers | liteLLM (original) (raw)
Learn how to deploy + call models from different providers on LiteLLM
📄️ Integrate as a Model ProviderQuick Start for OpenAI-Compatible Providers
📄️ OpenAI (Text Completion)LiteLLM supports OpenAI text completion models
📄️ AnthropicLiteLLM supports all anthropic models.
📄️ AWS SagemakerLiteLLM supports All Sagemaker Huggingface Jumpstart Models
📄️ LiteLLM Proxy (LLM Gateway)| Property | Details |
📄️ AI21LiteLLM supports the following AI21 models:
📄️ AI/ML APIhttps://aimlapi.com/
📄️ Aleph AlphaLiteLLM supports all models from Aleph Alpha.
📄️ Amazon Nova| Property | Details |
📄️ Anyscalehttps://app.endpoints.anyscale.com/
📄️ Apertis AI (Stima API)Overview
📄️ BasetenLiteLLM supports both Baseten Model APIs and dedicated deployments with automatic routing.
📄️ BytezLiteLLM supports all chat models on Bytez!
📄️ Cerebrashttps://inference-docs.cerebras.ai/api-reference/chat-completions
📄️ Cloudflare Workers AIhttps://developers.cloudflare.com/workers-ai/models/text-generation/
📄️ CompactifAIhttps://docs.compactif.ai/
📄️ Custom API Server (Custom Format)Call your custom torch-serve / internal LLM APIs via LiteLLM
📄️ Dashscope API (Qwen models)https://dashscope.console.aliyun.com/
📄️ DatabricksLiteLLM supports all models on Databricks
📄️ DeepgramLiteLLM supports Deepgram's /listen endpoint.
📄️ DeepInfrahttps://deepinfra.com/
📄️ Deepseekhttps://deepseek.com/
📄️ Docker Model RunnerOverview
📄️ Featherless AIhttps://featherless.ai/
📄️ Galadrielhttps://docs.galadriel.com/api-reference/chat-completion-API
📄️ Githubhttps://github.com/marketplace/models
📄️ GitHub Copilothttps://docs.github.com/en/copilot
📄️ GradientAIhttps://digitalocean.com/products/gradientai
📄️ Infinity| Property | Details |
📄️ Jina AIhttps://jina.ai/embeddings/
📄️ LangGraphCall LangGraph agents through LiteLLM using the OpenAI chat completions format.
📄️ LlamafileLiteLLM supports all models on Llamafile.
📄️ LM Studiohttps://lmstudio.ai/docs/basics/server
📄️ ManusUse Manus AI agents through LiteLLM's OpenAI-compatible Responses API.
📄️ Meta Llama| Property | Details |
📄️ Milvus - Vector StoreUse Milvus as a vector store for RAG.
📄️ Mistral AI APIhttps://docs.mistral.ai/api/
📄️ MorphLiteLLM supports all models on Morph
📄️ Nebius AI Studiohttps://docs.nebius.com/studio/inference/quickstart
📄️ NLP CloudLiteLLM supports all LLMs on NLP Cloud.
📄️ Novita AI| Property | Details |
📄️ Nscale (EU Sovereign)https://docs.nscale.com/docs/inference/chat
📄️ OllamaLiteLLM supports all models from Ollama
📄️ OpenRouterLiteLLM supports all the text / chat / vision / embedding models from OpenRouter
📄️ Sarvam.aiLiteLLM supports all the text models from Sarvam ai
📄️ 🆕 OVHCloud AI EndpointsLeading French Cloud provider in Europe with data sovereignty and privacy.
📄️ PetalsPetals//github.com/bigscience-workshop/petals
📄️ PredibaseLiteLLM supports all models on Predibase
📄️ Pydantic AI AgentsCall Pydantic AI Agents via LiteLLM's A2A Gateway.
📄️ RAGFlowLitellm supports Ragflow's chat completions APIs
📄️ Recrafthttps://www.recraft.ai/
📄️ ReplicateLiteLLM supports all models on Replicate
📄️ SambaNovahttps://cloud.sambanova.ai/
📄️ SAP Generative AI HubLiteLLM supports SAP Generative AI Hub's Orchestration Service.
📄️ ScalewayLiteLLM supports all models available on Scaleway Generative APIs ↗.
📄️ Stability AIhttps://stability.ai/
📄️ Together AILiteLLM supports all models on Together AI.
📄️ Topaz| Property | Details |
📄️ Triton Inference ServerLiteLLM supports Embedding Models on Triton Inference Servers
📄️ Volcano Engine (Volcengine)https://www.volcengine.com/docs/82379/1263482
📄️ Voyage AIhttps://docs.voyageai.com/embeddings/
📄️ Weights & Biases Inferencehttps://weave-docs.wandb.ai/quickstart-inference
📄️ Xiaomi MiMohttps://platform.xiaomimimo.com/#/docs
📄️ Xinference [Xorbits Inference]https://inference.readthedocs.io/en/latest/index.html