NKI API Reference Manual — AWS Neuron Documentation (original) (raw)

Search Engine: Default Google

Overview

ML Frameworks

NeuronX Distributed (NxD)

Additional ML Libraries

Developer Flows

Runtime & Tools

Compiler

Neuron Compiler
- NeuronX Compiler for Trn1 & Inf2
  * API Reference Guide
  * Neuron Compiler CLI Reference Guide
  * Developer Guide
  * Mixed Precision and Performance-accuracy Tuning (neuronx-cc)
  * Misc
  * FAQ
  * What's New
- Neuron Compiler for Inf1
  * API Reference Guide
  * Neuron compiler CLI Reference Guide (neuron-cc)
  * Developer Guide
  * Mixed precision and performance-accuracy tuning (neuron-cc)
  * Misc
  * FAQ
  * What's New
  * Neuron Supported operators
Neuron Kernel Interface (Beta)
Neuron C++ Custom Operators

Learning Neuron

Legacy Software

Apache MXNet
- MXNet Neuron Setup
- Inference (Inf1)
  * Tutorials
  * Computer Vision Tutorials
  * Natural Language Processing (NLP) Tutorials
  * Utilizing Neuron Capabilities Tutorials
  * API Reference Guide
  * Neuron Apache MXNet Compilation Python API
  * Developer Guide
  * Flexible Execution Group (FlexEG) in Neuron-MXNet
  * Misc
  * Troubleshooting Guide for Neuron Apache MXNet
  * What's New
  * Neuron Apache MXNet Supported operators

About Neuron

This document is relevant for: Inf2, Trn1, Trn2

NKI API Reference Manual#

Summary of different NKI API sets:

nki top-level module contains APIs to decorate and simulate NKI kernels as well as NKI object types.
nki.language consists of high-level compute and data movement APIs designed for ease-of-use. nki.languageallows NKI programmers to transition from NumPy/Triton implementation to NKI quickly without the need to fully understand underlying NeuronDevice architecture. Most language APIs invoke one or more nki.isa APIs (that is, NeuronDevice hardware instructions) under the hood.
nki.isa consists of low-level APIs that highly resemble hardware instructions in NeuronDevice ISA (instruction set architecture) designed to provide fine control over the hardware. These APIs expose all the programmable input parameters of the corresponding hardware instructions and also enforce the same tile-size and layout requirements as specified in NeuronDevice ISA.
nki.compiler consists of features that control the compilation process of a NKI kernel.
Other documents:
- NKI API Common Fields documents common NKI API input parameters such as data types and masks, as well as common API behavior such as type promotion.
- NKI API Errors captures common error types that are thrown by the NKI kernel compilation frontend, including syntax, tile-size and layout violation errors.
nki
- Decorators
- Types
nki.language
nki.isa
nki.compiler
- Allocation Control
- Kernel Decorators
NKI API Common Fields
NKI API Errors

This document is relevant for: Inf2, Trn1, Trn2