如何使用指令映射

简介

本文档包含关于为目标添加指令映射支持的信息。此功能背后的动机来自于在各种优化期间需要在不同指令格式之间切换的需求。一种方法可能是使用 switch 语句,其中列出所有指令以及它们可以转换成的格式。然而,由于硬编码的指令名称,它具有很大的维护开销。此外,每当在 .td 文件中添加新指令时,都应相应地修改所有相关的 switch 语句。相反,相同的功能可以通过 TableGen 和来自 .td 文件的一些支持来实现,维护成本仅为一小部分。

InstrMapping 类概述

TableGen 使用关系模型来映射指令。这些模型使用 InstrMapping 类作为基类进行描述。每个模型设置 InstrMapping 类的各种字段,以便它们可以唯一地描述使用该模型的所有指令。TableGen 解析所有关系模型,并使用这些信息来构建关系表,这些表将指令相互关联。这些表与查询它们的函数一起在 XXXInstrInfo.inc 文件中发出。以下是在 Target.td 文件中定义的 InstrMapping 类的定义

class InstrMapping {
  // Used to reduce search space only to the instructions using this
  // relation model.
  string FilterClass;

  // List of fields/attributes that should be same for all the instructions in
  // a row of the relation table. Think of this as a set of properties shared
  // by all the instructions related by this relationship.
  list<string> RowFields = [];

  // List of fields/attributes that are same for all the instructions
  // in a column of the relation table.
  list<string> ColFields = [];

  // Values for the fields/attributes listed in 'ColFields' corresponding to
  // the key instruction. This is the instruction that will be transformed
  // using this relation model.
  list<string> KeyCol = [];

  // List of values for the fields/attributes listed in 'ColFields', one for
  // each column in the relation table. These are the instructions a key
  // instruction will be transformed into.
  list<list<string> > ValueCols = [];
}

示例

假设我们想要有一个函数 int getPredOpcode(uint16_t Opcode, enum PredSense inPredSense),它接受一个非谓词指令,并根据一些输入标志 inPredSense 返回其谓词真或假形式。此过程的第一步是定义一个关系模型,通过为 InstrMapping 字段分配适当的值,将谓词指令与其非谓词形式相关联。对于此关系,非谓词指令被视为键指令,因为它们是用于查询接口函数的指令。

def getPredOpcode : InstrMapping {
  // Choose a FilterClass that is used as a base class for all the
  // instructions modeling this relationship. This is done to reduce the
  // search space only to these set of instructions.
  let FilterClass = "PredRel";

  // Instructions with same values for all the fields in RowFields form a
  // row in the resulting relation table.
  // For example, if we want to relate 'ADD' (non-predicated) with 'Add_pt'
  // (predicated true) and 'Add_pf' (predicated false), then all 3
  // instructions need to have same value for BaseOpcode field. It can be any
  // unique value (Ex: XYZ) and should not be shared with any other
  // instruction not related to 'add'.
  let RowFields = ["BaseOpcode"];

  // List of attributes that can be used to define key and column instructions
  // for a relation. Key instruction is passed as an argument
  // to the function used for querying relation tables. Column instructions
  // are the instructions they (key) can transform into.
  //
  // Here, we choose 'PredSense' as ColFields since this is the unique
  // attribute of the key (non-predicated) and column (true/false)
  // instructions involved in this relationship model.
  let ColFields = ["PredSense"];

  // The key column contains non-predicated instructions.
  let KeyCol = ["none"];

  // Two value columns - first column contains instructions with
  // PredSense=true while second column has instructions with PredSense=false.
  let ValueCols = [["true"], ["false"]];
}

TableGen 使用上述关系模型来发出关系表,该表将非谓词指令与其谓词形式映射。它还输出接口函数 int getPredOpcode(uint16_t Opcode, enum PredSense inPredSense) 来查询表。在这里,函数 getPredOpcode 接受两个参数:当前指令的操作码和所需指令的 PredSense,并返回指令的谓词形式(如果在关系表中找到)。为了将指令添加到关系表中,它需要在其定义中包含相关信息。例如,考虑以下 ADD、ADD_pt (true) 和 ADD_pf (false) 指令的当前定义

def ADD : ALU32_rr<(outs IntRegs:$dst), (ins IntRegs:$a, IntRegs:$b),
            "$dst = add($a, $b)",
            [(set (i32 IntRegs:$dst), (add (i32 IntRegs:$a),
                                           (i32 IntRegs:$b)))]>;

def ADD_Pt : ALU32_rr<(outs IntRegs:$dst),
                       (ins PredRegs:$p, IntRegs:$a, IntRegs:$b),
            "if ($p) $dst = add($a, $b)",
            []>;

def ADD_Pf : ALU32_rr<(outs IntRegs:$dst),
                       (ins PredRegs:$p, IntRegs:$a, IntRegs:$b),
            "if (!$p) $dst = add($a, $b)",
            []>;

在此步骤中,我们修改这些指令以包含关系模型 <tt>getPredOpcode</tt> 所需的信息,以便可以将它们关联起来。

def ADD : PredRel, ALU32_rr<(outs IntRegs:$dst), (ins IntRegs:$a, IntRegs:$b),
            "$dst = add($a, $b)",
            [(set (i32 IntRegs:$dst), (add (i32 IntRegs:$a),
                                           (i32 IntRegs:$b)))]> {
  let BaseOpcode = "ADD";
  let PredSense = "none";
}

def ADD_Pt : PredRel, ALU32_rr<(outs IntRegs:$dst),
                       (ins PredRegs:$p, IntRegs:$a, IntRegs:$b),
            "if ($p) $dst = add($a, $b)",
            []> {
  let BaseOpcode = "ADD";
  let PredSense = "true";
}

def ADD_Pf : PredRel, ALU32_rr<(outs IntRegs:$dst),
                       (ins PredRegs:$p, IntRegs:$a, IntRegs:$b),
            "if (!$p) $dst = add($a, $b)",
            []> {
  let BaseOpcode = "ADD";
  let PredSense = "false";
}

请注意,以上所有指令都使用 PredRel 作为基类。这非常重要,因为 TableGen 将其用作筛选器,用于选择 getPredOpcode 模型的指令。任何未从 PredRel 派生的指令都将从分析中排除。BaseOpcode 是另一个重要字段。由于它被选为模型的 RowFields,因此要求所有 3 个指令都具有相同的值才能关联。接下来,PredSense 用于通过将其值与 KeyColValueCols 进行比较来确定其列位置。如果指令将其 PredSense 值设置为关系模型中未使用的值,则不会在关系表中为其分配列。