Phi Models — NVIDIA NeMo Microservices

This page provides detailed technical specifications for the Phi model family supported by the NVIDIA NeMo Customizer microservice. For information about supported features and capabilities, refer to the Support Matrix in the Model Catalog.

Microsoft Phi-4

Property	Value
Creator	Microsoft
Architecture	Decoder-only Transformer
Description	Phi-4 is Microsoft’s most advanced small language model, designed to deliver strong reasoning capabilities while being efficient to deploy.
Max I/O Tokens	16K
Parameters	14 billion
Training Data	High-quality data with emphasis on reasoning and code
Recommended GPUs for Customization	2
Default Name	microsoft/phi-4
Version	nvidia/nemo/phi-4:1.0
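
As a rough illustration of how the values in this table might be used when preparing customization data, the sketch below stores them in a small record and checks whether a sample fits the token limit. It is plain Python and does not call any NeMo Customizer API. The `ModelSpec` class and the `fits_token_budget` helper are hypothetical names introduced here. Two things are assumed rather than stated in the table: that "16K" means 16 * 1024 tokens, and that "Max I/O Tokens" is a combined limit on input plus output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical container for the table values; not a NeMo Customizer type."""

    default_name: str
    version: str
    parameters: int
    max_io_tokens: int
    recommended_gpus: int


# Values copied from the table above.
# Assumption: "16K" is read as 16 * 1024; the table gives no exact figure.
PHI_4 = ModelSpec(
    default_name="microsoft/phi-4",
    version="nvidia/nemo/phi-4:1.0",
    parameters=14_000_000_000,
    max_io_tokens=16 * 1024,
    recommended_gpus=2,
)


def fits_token_budget(spec: ModelSpec, input_tokens: int, output_tokens: int) -> bool:
    """Return True if input plus output tokens stay within the model's limit.

    Assumption: the limit applies to the sum of input and output tokens.
    """
    return input_tokens + output_tokens <= spec.max_io_tokens
```

For example, `fits_token_budget(PHI_4, 12_000, 4_000)` evaluates to `True` under these assumptions, while a 16,000-token prompt with a 1,000-token completion would exceed the limit.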