API Reference — llama-stack documentation (original) (raw)

Llama Stack
Quickstart
Detailed Tutorial
Why Llama Stack?
- Our Solution: A Universal Stack
- Our Philosophy
Core Concepts
OpenAI API Compatibility
- Server path
- Clients
  * Llama Stack Client
  * OpenAI Client
- APIs implemented
  * Models
  * Responses
  * Simple inference
  * Structured Output
  * Chat Completions
  * Simple inference
  * Structured Output
  * Completions
  * Simple inference
Providers Overview
- External Providers
- Agents
- DatasetIO
- Eval
- Inference
- Post Training
  * Post Training Providers
  * External Providers
  * HuggingFace SFTTrainer
  * TorchTune
  * NVIDIA NEMO
- Safety
- Scoring
- Telemetry
- Tool Runtime
- Vector IO
  * Vector IO Providers
  * External Providers
  * Faiss
  * SQLite-Vec
  * Chroma
  * Postgres PGVector
  * Qdrant
  * Milvus
  * Weaviate
Distributions Overview
Building AI Applications (Examples)
Llama Stack Playground
- Key Features
  * Playground
  * Chatbot
  * Evaluations
  * Inspect
- Starting the Llama Stack Playground
Contributing to Llama-Stack
References