Phi Models — NVIDIA NeMo Microservices (original) (raw)

This page provides detailed technical specifications for the Phi model family supported by the NVIDIA NeMo Customizer microservice. For information about supported features and capabilities, refer to the Support Matrix in the Model Catalog.

Microsoft Phi-4#

Property Value
Creator Microsoft
Architecture Decoder-only Transformer
Description Phi-4 is Microsoft’s most advanced small language model, designed to deliver strong reasoning capabilities while being efficient to deploy.
Max I/O Tokens 16K
Parameters 14 billion
Training Data High-quality data with emphasis on reasoning and code
Recommended GPUs for Customization 2
Default Name microsoft/phi-4
Version nvidia/nemo/phi-4:1.0