LLVM: llvm::MCSchedModel Struct Reference (original) (raw)

Machine model for scheduling, bundling, and heuristics. More...

#include "[llvm/MC/MCSchedule.h](MCSchedule%5F8h%5Fsource.html)"

Public Member Functions
bool	hasExtraProcessorInfo () const
unsigned	getProcessorID () const
bool	hasInstrSchedModel () const
	Does this machine model include instruction-level scheduling.
const MCExtraProcessorInfo &	getExtraProcessorInfo () const
bool	isComplete () const
	Return true if this machine model data for all instructions with a scheduling class (itinerary class or SchedRW list).
bool	isOutOfOrder () const
	Return true if machine supports out of order execution.
unsigned	getNumProcResourceKinds () const
const MCProcResourceDesc *	getProcResource (unsigned ProcResourceIdx) const
const MCSchedClassDesc *	getSchedClassDesc (unsigned SchedClassIdx) const
StringRef	getSchedClassName (unsigned SchedClassIdx) const
LLVM_ABI int	computeInstrLatency (const MCSubtargetInfo &STI, unsigned SClass) const
LLVM_ABI int	computeInstrLatency (const MCSubtargetInfo &STI, const MCInstrInfo &MCII, const MCInst &Inst) const
template<typename MCSubtargetInfo, typename MCInstrInfo, typename InstrItineraryData, typename MCInstOrMachineInstr>
int	computeInstrLatency (const MCSubtargetInfo &STI, const MCInstrInfo &MCII, const MCInstOrMachineInstr &Inst, llvm::function_ref< const MCSchedClassDesc (const MCSchedClassDesc )> ResolveVariantSchedClass=[](const MCSchedClassDesc *SCDesc) { return SCDesc;}) const
LLVM_ABI double	getReciprocalThroughput (const MCSubtargetInfo &STI, const MCInstrInfo &MCII, const MCInst &Inst) const

Static Public Member Functions
static LLVM_ABI int	computeInstrLatency (const MCSubtargetInfo &STI, const MCSchedClassDesc &SCDesc)
	Returns the latency value for the scheduling class.
static LLVM_ABI double	getReciprocalThroughput (const MCSubtargetInfo &STI, const MCSchedClassDesc &SCDesc)
static LLVM_ABI double	getReciprocalThroughput (unsigned SchedClass, const InstrItineraryData &IID)
static LLVM_ABI unsigned	getForwardingDelayCycles (ArrayRef< MCReadAdvanceEntry > Entries, unsigned WriteResourceIdx=0)
	Returns the maximum forwarding delay for register reads dependent on writes of scheduling class WriteResourceIdx.
static LLVM_ABI unsigned	getBypassDelayCycles (const MCSubtargetInfo &STI, const MCSchedClassDesc &SCDesc)
	Returns the bypass delay cycle for the maximum latency write cycle.

Public Attributes
unsigned	IssueWidth
unsigned	MicroOpBufferSize
unsigned	LoopMicroOpBufferSize
unsigned	LoadLatency
unsigned	HighLatency
unsigned	MispredictPenalty
bool	PostRAScheduler
bool	CompleteModel
bool	EnableIntervals
unsigned	ProcID
const MCProcResourceDesc *	ProcResourceTable
const MCSchedClassDesc *	SchedClassTable
unsigned	NumProcResourceKinds
unsigned	NumSchedClasses
const StringTable *	SchedClassNames
const InstrItinerary *	InstrItineraries
const MCExtraProcessorInfo *	ExtraProcessorInfo

Static Public Attributes
static const unsigned	DefaultIssueWidth = 1
static const unsigned	DefaultMicroOpBufferSize = 0
static const unsigned	DefaultLoopMicroOpBufferSize = 0
static const unsigned	DefaultLoadLatency = 4
static const unsigned	DefaultHighLatency = 10
static const unsigned	DefaultMispredictPenalty = 10
static LLVM_ABI const MCSchedModel	Default
	Returns the default initialized model.

Friends
class	InstrItineraryData

Machine model for scheduling, bundling, and heuristics.

The machine model directly provides basic information about the microarchitecture to the scheduler in the form of properties. It also optionally refers to scheduler resource tables and itinerary tables. Scheduler resource tables model the latency and cost for each instruction type. Itinerary tables are an independent mechanism that provides a detailed reservation table describing each cycle of instruction execution. Subtargets may define any or all of the above categories of data depending on the type of CPU and selected scheduler.

The machine independent properties defined here are used by the scheduler as an abstract machine model. A real micro-architecture has a number of buffers, queues, and stages. Declaring that a given machine-independent abstract property corresponds to a specific physical property across all subtargets can't be done. Nonetheless, the abstract model is useful. Futhermore, subtargets typically extend this model with processor specific resources to model any hardware features that can be exploited by scheduling heuristics and aren't sufficiently represented in the abstract.

The abstract pipeline is built around the notion of an "issue point". This is merely a reference point for counting machine cycles. The physical machine will have pipeline stages that delay execution. The scheduler does not model those delays because they are irrelevant as long as they are consistent. Inaccuracies arise when instructions have different execution delays relative to each other, in addition to their intrinsic latency. Those special cases can be handled by TableGen constructs such as, ReadAdvance, which reduces latency when reading data, and ReleaseAtCycles, which consumes a processor resource when writing data for a number of abstract cycles.

TODO: One tool currently missing is the ability to add a delay to ReleaseAtCycles. That would be easy to add and would likely cover all cases currently handled by the legacy itinerary tables.

A note on out-of-order execution and, more generally, instruction buffers. Part of the CPU pipeline is always in-order. The issue point, which is the point of reference for counting cycles, only makes sense as an in-order part of the pipeline. Other parts of the pipeline are sometimes falling behind and sometimes catching up. It's only interesting to model those other, decoupled parts of the pipeline if they may be predictably resource constrained in a way that the scheduler can exploit.

The LLVM machine model distinguishes between in-order constraints and out-of-order constraints so that the target's scheduling strategy can apply appropriate heuristics. For a well-balanced CPU pipeline, out-of-order resources would not typically be treated as a hard scheduling constraint. For example, in the GenericScheduler, a delay caused by limited out-of-order resources is not directly reflected in the number of cycles that the scheduler sees between issuing an instruction and its dependent instructions. In other words, out-of-order resources don't directly increase the latency between pairs of instructions. However, they can still be used to detect potential bottlenecks across a sequence of instructions and bias the scheduling heuristics appropriately.

Definition at line 258 of file MCSchedule.h.

◆ computeInstrLatency() [1/4]

◆ computeInstrLatency() [2/4]

Definition at line 433 of file MCSchedule.h.

References computeInstrLatency(), llvm::MCInstrInfo::get(), llvm::MCSubtargetInfo::getCPU(), llvm::MCSubtargetInfo::getInstrItineraryForCPU(), llvm::InstrItineraryData::getOperandCycle(), llvm::MCInstrDesc::getSchedClass(), getSchedClassDesc(), hasInstrSchedModel(), InstrItineraryData, llvm::MCSchedClassDesc::isValid(), and llvm::Latency.

◆ computeInstrLatency() [3/4]

◆ computeInstrLatency() [4/4]

◆ getBypassDelayCycles()

◆ getExtraProcessorInfo()

◆ getForwardingDelayCycles()

◆ getNumProcResourceKinds()

unsigned llvm::MCSchedModel::getNumProcResourceKinds ( ) const	inline

◆ getProcessorID()

unsigned llvm::MCSchedModel::getProcessorID ( ) const	inline

◆ getProcResource()

◆ getReciprocalThroughput() [1/3]

Definition at line 123 of file MCSchedule.cpp.

References llvm::MCInstrInfo::get(), llvm::MCInst::getOpcode(), getProcessorID(), getReciprocalThroughput(), llvm::MCInstrDesc::getSchedClass(), getSchedClassDesc(), IssueWidth, llvm::MCSchedClassDesc::isValid(), llvm::MCSchedClassDesc::isVariant(), llvm_unreachable, and llvm::MCSubtargetInfo::resolveVariantSchedClass().

◆ getReciprocalThroughput() [2/3]

Definition at line 98 of file MCSchedule.cpp.

References assert(), getProcResource(), llvm::MCSubtargetInfo::getSchedModel(), llvm::MCSubtargetInfo::getWriteProcResBegin(), llvm::MCSubtargetInfo::getWriteProcResEnd(), I, IssueWidth, llvm::MCSchedClassDesc::NumMicroOps, and llvm::MCProcResourceDesc::NumUnits.

Referenced by llvm::TargetSchedModel::computeReciprocalThroughput(), llvm::TargetSchedModel::computeReciprocalThroughput(), getReciprocalThroughput(), and llvm::AArch64TTIImpl::hasKnownLowerThroughputFromSchedulingModel().

◆ getReciprocalThroughput() [3/3]

◆ getSchedClassDesc()

◆ getSchedClassName()

StringRef llvm::MCSchedModel::getSchedClassName ( unsigned SchedClassIdx) const	inline

◆ hasExtraProcessorInfo()

bool llvm::MCSchedModel::hasExtraProcessorInfo ( ) const	inline

◆ hasInstrSchedModel()

bool llvm::MCSchedModel::hasInstrSchedModel ( ) const	inline

◆ isComplete()

bool llvm::MCSchedModel::isComplete ( ) const	inline

Return true if this machine model data for all instructions with a scheduling class (itinerary class or SchedRW list).

Definition at line 350 of file MCSchedule.h.

References CompleteModel.

◆ isOutOfOrder()

bool llvm::MCSchedModel::isOutOfOrder ( ) const	inline

◆ InstrItineraryData

◆ CompleteModel

bool llvm::MCSchedModel::CompleteModel

◆ Default

Initial value:

= {DefaultIssueWidth,

false,

true,

false,

nullptr,

nullptr}

static const unsigned DefaultLoopMicroOpBufferSize

static const unsigned DefaultHighLatency

static const unsigned DefaultLoadLatency

static const unsigned DefaultMicroOpBufferSize

static const unsigned DefaultMispredictPenalty

static const unsigned DefaultIssueWidth

Returns the default initialized model.

Definition at line 424 of file MCSchedule.h.

Referenced by llvm::MCSubtargetInfo::getSchedModelForCPU(), and llvm::MCSubtargetInfo::InitMCProcessorInfo().

◆ DefaultHighLatency

◆ DefaultIssueWidth

◆ DefaultLoadLatency

◆ DefaultLoopMicroOpBufferSize

const unsigned llvm::MCSchedModel::DefaultLoopMicroOpBufferSize = 0	static

LLVM: llvm::MCSchedModel Struct Reference (original) (raw)

◆ computeInstrLatency() [1/4]

◆ computeInstrLatency() [2/4]

◆ computeInstrLatency() [3/4]

◆ computeInstrLatency() [4/4]

◆ getBypassDelayCycles()

◆ getExtraProcessorInfo()

◆ getForwardingDelayCycles()

◆ getNumProcResourceKinds()

◆ getProcessorID()

◆ getProcResource()

◆ getReciprocalThroughput() [1/3]

◆ getReciprocalThroughput() [2/3]

◆ getReciprocalThroughput() [3/3]

◆ getSchedClassDesc()

◆ getSchedClassName()

◆ hasExtraProcessorInfo()

◆ hasInstrSchedModel()

◆ isComplete()

◆ isOutOfOrder()

◆ InstrItineraryData

◆ CompleteModel

◆ Default

◆ DefaultHighLatency

◆ DefaultIssueWidth

◆ DefaultLoadLatency

◆ DefaultLoopMicroOpBufferSize

◆ DefaultMicroOpBufferSize

◆ DefaultMispredictPenalty

◆ EnableIntervals

◆ ExtraProcessorInfo

◆ HighLatency

◆ InstrItineraries

◆ IssueWidth

◆ LoadLatency

◆ LoopMicroOpBufferSize

◆ MicroOpBufferSize

◆ MispredictPenalty

◆ NumProcResourceKinds

◆ NumSchedClasses

◆ PostRAScheduler

◆ ProcID

◆ ProcResourceTable

◆ SchedClassNames

◆ SchedClassTable