Reading notes on the paper A Linear Time Algorithm for Placing phi-Nodes

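Introduction

This paper (Sreedhar and Gao, "A Linear Time Algorithm for Placing phi-Nodes", POPL '95) proposes a simple and efficient way to insert $\phi$-nodes, and points out some drawbacks of the traditional $\phi$-node insertion algorithm.
Note: the paper builds on several earlier papers, which I did not bother to read.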

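The problem being solved

The paper targets the potential $O(N^2)$ cost of computing dominance frontiers. It shows that the $\phi$-node insertion points can be computed in linear time. The key is the order in which the dominator tree is processed, and this approach also lets dominance frontiers be computed on the fly.

The whole algorithm is built on a structure called the DJ-graph. A DJ-graph is essentially the dominator tree with J-edges (join edges) added: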

The tree skeleton is augmented with J-edges (join edges) that correspond to all edges of the CFG whose source does not strictly dominate its destination. - Static Single Assignment Book

Note: the figure above is from the Static Single Assignment Book.

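Review of the traditional $\phi$-node placement algorithm

The $\phi$-node insertion algorithm described in "Constructing SSA" (構造SSA) is fairly crude: it does not take liveness information into account, and it is not very efficient.

Note: the figure above is from Data Flow Analysis: Theory and Practice.

This algorithm has two characteristics: it precomputes the dominance frontier of every node, and it inserts $\phi$-nodes iteratively, which is relatively inefficient.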
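To make the iterative scheme concrete, here is a minimal Python sketch of a classic worklist placement. It assumes the dominance frontiers are already available as a table. The `DF` table below belongs to a small hypothetical CFG (r→a, a→b, a→c, b→d, c→d, d→a, d→e), not to any figure referenced in this post, and the function name is made up for illustration.

```python
def place_phis_worklist(df, defs):
    """Worklist placement: df maps each node to its precomputed frontier set."""
    phis = set()              # nodes that receive a phi-node
    worklist = list(defs)     # nodes whose frontier still has to be scanned
    enqueued = set(defs)      # nodes that were ever put on the worklist
    while worklist:
        x = worklist.pop()
        for y in df[x]:
            if y not in phis:
                phis.add(y)
                # A phi-node is itself a new definition, so y must be scanned too.
                if y not in enqueued:
                    enqueued.add(y)
                    worklist.append(y)
    return phis


# Precomputed frontiers of the hypothetical CFG described above.
DF = {"r": set(), "a": {"a"}, "b": {"d"}, "c": {"d"}, "d": {"a"}, "e": set()}

print(sorted(place_phis_worklist(DF, {"b"})))  # ['a', 'd']
```

Note that the whole `DF` table has to exist before the loop starts, which is exactly the precomputation cost the paper wants to avoid.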

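Background

Two pieces of background were new to me. The first is the extension of the dominance frontier from a single node, $DF(x)$, to a set of nodes, $DF(S)$.

$DF(S) = \bigcup_{x \in S} DF(x)$

The second is the iterated dominance frontier $IDF(S)$, also written $DF^+(S)$ (not knowing this is why I could not follow LLVM's IDFCalculatorBase code 😃). $IDF(S)$ is obtained by iterating $DF$ until the result stops growing. In other words, it is the transitive closure of $DF$.

$IDF_1(S) = DF(S) \\ IDF_{i+1}(S) = DF(S \cup IDF_i(S))$

In the traditional $\phi$-node insertion algorithm, the iteration exists precisely to compute this $IDF(S)$.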
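The two definitions above translate almost literally into code. The following is a sketch only: the `DF` table is a hypothetical example (frontiers of a CFG with edges r→a, a→b, a→c, b→d, c→d, d→a, d→e), and the helper names `df_of_set` and `idf` are my own.

```python
def df_of_set(df, nodes):
    """DF(S): the union of DF(x) over all x in S."""
    out = set()
    for x in nodes:
        out |= df[x]
    return out


def idf(df, s):
    """IDF(S): iterate IDF_{i+1}(S) = DF(S | IDF_i(S)) until nothing changes."""
    current = df_of_set(df, s)                    # IDF_1(S)
    while True:
        nxt = df_of_set(df, set(s) | current)     # IDF_{i+1}(S)
        if nxt == current:
            return current
        current = nxt


# Hypothetical per-node frontiers, assumed to be known already.
DF = {"r": set(), "a": {"a"}, "b": {"d"}, "c": {"d"}, "d": {"a"}, "e": set()}

print(sorted(df_of_set(DF, {"b", "c"})))  # ['d']
print(sorted(idf(DF, {"b"})))             # ['a', 'd']
```

For `S = {b}` the sequence is `IDF_1 = {d}` and `IDF_2 = DF({b, d}) = {a, d}`, after which it no longer changes.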

In addition, for a J-edge $(a, b)$, every ancestor of $a$ in the DT (including $a$ itself) that does not strictly dominate $b$ has $b$ in its DF set. For example, in Fig. 3.3, $(F, G)$ is a J-edge, so $(F, G)$, $(E, G)$ and $(B, G)$ are all DF-edges.

So the relationship between J-edges and $DF$ is that $DF$ can be derived directly from the J-edges.
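One way to read this relationship as code: for each J-edge $(a, b)$, walk from $a$ up the dominator tree and stop on reaching $idom(b)$, which is the first ancestor that strictly dominates $b$. This is a sketch on a hypothetical dominator tree, with `idom` mapping each node to its immediate dominator. The names and data are assumptions, not taken from the paper or the book.

```python
def df_from_j_edges(idom, j_edges):
    """Derive every DF set by pushing each J-edge target up the dominator tree."""
    df = {n: set() for n in idom}
    for a, b in j_edges:
        runner = a
        # Every ancestor of a below idom(b) does not strictly dominate b.
        while runner != idom[b]:
            df[runner].add(b)
            runner = idom[runner]
    return df


# Hypothetical dominator tree (the root has no immediate dominator) and J-edges.
idom = {"r": None, "a": "r", "b": "a", "c": "a", "d": "a", "e": "d"}
j_edges = [("b", "d"), ("c", "d"), ("d", "a")]

df = df_from_j_edges(idom, j_edges)
print(sorted(df["d"]), sorted(df["a"]))  # ['a'] ['a']
```

The J-edge `d→a` contributes `a` to both `DF(d)` and `DF(a)`, because neither `d` nor `a` strictly dominates `a`. The walk stops at `r`, which does.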

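Core implementation

First, the DJ-graph has a few properties worth emphasizing.

Building the DJ-graph in linear time

Note: the figure above is from the paper.

The DJ-graph uses the dominator tree as its skeleton, and the first step is to add join edges to it. For example, to attach the join edges for node 2 in Figure 2, first find the flowgraph edges whose destination is node 2, namely $1 \rightarrow 2$ and $6 \rightarrow 2$. Since $1$ strictly dominates $2$, $1 \rightarrow 2$ is not a join edge, so only $6 \rightarrow 2$ is added to the dominator tree. Once every flowgraph edge has been examined this way and combined with the dominator tree, the DJ-graph is complete.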
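The construction can be sketched in a few lines, assuming the dominator tree is already known. A flowgraph edge $x \rightarrow y$ is a J-edge exactly when $x$ is not the immediate dominator of $y$. The CFG below is a small hypothetical example rather than the paper's Figure 2, and `build_dj_graph` is an illustrative name.

```python
def build_dj_graph(cfg, idom):
    """Return (D-edges, J-edges) with one pass over the tree and one over the CFG."""
    # D-edges are simply the dominator tree edges idom(n) -> n.
    d_edges = [(p, n) for n, p in idom.items() if p is not None]
    # A CFG edge x -> y is a J-edge when x does not strictly dominate y,
    # which for a CFG edge is the same as x != idom(y).
    j_edges = [(x, y) for x, succs in cfg.items() for y in succs if idom[y] != x]
    return d_edges, j_edges


# Hypothetical flowgraph (successor lists) and its immediate dominators.
cfg = {"r": ["a"], "a": ["b", "c"], "b": ["d"], "c": ["d"], "d": ["a", "e"], "e": []}
idom = {"r": None, "a": "r", "b": "a", "c": "a", "d": "a", "e": "d"}

d_edges, j_edges = build_dj_graph(cfg, idom)
print(j_edges)  # [('b', 'd'), ('c', 'd'), ('d', 'a')]
```

Each flowgraph edge is inspected once, so given the dominator tree the extra work is linear in the number of edges.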

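The DJ-graph has the following three properties:

The relationship between J-edges and dominance frontiers discussed earlier: for a J-edge $(a, b)$, $b$ is in the $DF$ set of $a$ and of every ancestor of $a$ that does not strictly dominate $b$.
For $y \in DF(x)$ (and likewise $y \in IDF(x)$), the level of $y$ in the dominator tree is always less than or equal to the level of $x$. This is the key to the whole paper: to find the dominance frontier of $x$, it is enough to look at nodes whose level is at most that of $x$.
$y \in DF(x)$ if and only if there exists $z \in SubTree(x)$ with a J-edge $z \rightarrow y$ such that the level of $y$ is less than or equal to the level of $x$.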

Computing dominance frontiers

The paper derives the following lemma:

Lemma 1 : A node $z \in DF(x)$ iff there exists a $y \in SubTree(x)$ with $y \rightarrow z$ as a $J-edge$ and $z.level \le x.level$

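From this lemma the paper derives an algorithm for computing dominance frontiers.

For example, to compute the dominance frontier of node $3$ in Figure 2: $SubTree(3) = \{3, 9, 10, 11, 12, 13, 14\}$, and the J-edges leaving these nodes are $\{10 \rightarrow 12, 11 \rightarrow 12, 13 \rightarrow 3, 13 \rightarrow 15, 14 \rightarrow 12\}$. Of their targets, only nodes $3$ and $15$ satisfy the level condition of the lemma, so $DF(3) = \{3, 15\}$.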
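Lemma 1 can be turned into code fairly directly: walk $SubTree(x)$ and keep every J-edge target whose level is at most the level of $x$. The sketch below runs on a small hypothetical DJ-graph (not the paper's Figure 2). `children` holds the dominator tree, `j_succ` the J-edges, and `level` the depth of each node in the dominator tree. All three names are assumptions made for the example.

```python
def dominance_frontier(x, children, j_succ, level):
    """DF(x) via Lemma 1: J-edge targets reachable from SubTree(x) with level <= x.level."""
    frontier = set()
    stack = [x]
    while stack:                       # depth-first walk over SubTree(x)
        y = stack.pop()
        for z in j_succ.get(y, ()):
            if level[z] <= level[x]:   # the level condition of the lemma
                frontier.add(z)
        stack.extend(children.get(y, ()))
    return frontier


# Hypothetical DJ-graph: dominator tree children, J-edge successors, and levels.
children = {"r": ["a"], "a": ["b", "c", "d"], "d": ["e"]}
j_succ = {"b": ["d"], "c": ["d"], "d": ["a"]}
level = {"r": 0, "a": 1, "b": 2, "c": 2, "d": 2, "e": 3}

print(sorted(dominance_frontier("a", children, j_succ, level)))  # ['a']
print(sorted(dominance_frontier("b", children, j_succ, level)))  # ['d']
```

For `x = a` the J-edges `b→d` and `c→d` are rejected because `d` sits one level below `a`, while `d→a` passes the level test.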

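The other core idea of the paper is ordering. Suppose we want $DF(\{9, 12\})$. Because $12 \in SubTree(9)$, computing the two frontiers independently would walk $SubTree(12)$ twice. To avoid this, nodes are processed bottom-up by dominator-tree level.

As the figure below shows, $DF(x)$ has already been computed by the time $DF(w)$ is processed.

Note: the figure above is from the Static Single Assignment Book.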

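Inserting $\phi$-nodes

Once we have the DJ-graph, we can compute where $\phi$-nodes must be inserted. The algorithm here follows the description in the Static Single Assignment Book.

For example, suppose $v$ is defined in nodes $1$, $3$, $4$ and $7$. The algorithm first puts these nodes into an $OrderedBucket$, then processes them from deepest to shallowest, examining the J-edges that leave each node's subtree. If a J-edge satisfies Lemma 1, its target is added to the result and, if it is new, put back into the bucket, so the final result is the iterated frontier $DF^+(\{1, 3, 4, 7\})$.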
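The following is a rough Python sketch of this bucket-driven traversal, written from the description above rather than copied from the book or the paper. The ordered bucket is modelled as one list per dominator-tree level. The example DJ-graph and the names `iterated_df`, `children`, `j_succ` and `level` are all hypothetical.

```python
def iterated_df(defs, children, j_succ, level):
    """Compute DF+(defs) by visiting subtrees from the deepest level upward."""
    max_level = max(level.values())
    buckets = [[] for _ in range(max_level + 1)]   # ordered bucket, indexed by level
    for n in defs:
        buckets[level[n]].append(n)
    in_bucket = set(defs)    # nodes that were ever inserted into a bucket
    visited = set()          # nodes whose J-edges have already been examined
    result = set()
    for depth in range(max_level, -1, -1):
        while buckets[depth]:
            root = buckets[depth].pop()
            visited.add(root)
            stack = [root]
            while stack:
                y = stack.pop()
                for z in j_succ.get(y, ()):
                    # Lemma 1: keep z only if it is not deeper than the current root.
                    if level[z] <= level[root] and z not in result:
                        result.add(z)
                        if z not in in_bucket:     # z now acts as a definition too
                            in_bucket.add(z)
                            buckets[level[z]].append(z)
                for child in children.get(y, ()):
                    if child not in visited:       # never walk a subtree twice
                        visited.add(child)
                        stack.append(child)
    return result


# Hypothetical DJ-graph: dominator tree children, J-edge successors, and levels.
children = {"r": ["a"], "a": ["b", "c", "d"], "d": ["e"]}
j_succ = {"b": ["d"], "c": ["d"], "d": ["a"]}
level = {"r": 0, "a": 1, "b": 2, "c": 2, "d": 2, "e": 3}

print(sorted(iterated_df({"b"}, children, j_succ, level)))  # ['a', 'd']
```

A newly found node always has a level no deeper than the current root, so it lands either in the bucket being drained or in one that has not been reached yet. That is what lets the `visited` check skip subtrees that were already walked.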

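Compared with the algorithm in "Constructing SSA" (構造SSA), this paper's algorithm improves on the following points:

It extends the granularity of the dominance frontier computation from a single node to a set of nodes. For example, the $def$ sites of a variable $x$ usually form a set of nodes.
It does not need to precompute dominance frontiers; they can be computed on the fly.
By following J-edges bottom-up, it ensures that each node and each edge is processed only once, which improves efficiency.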

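Results and adoption

According to the authors, the algorithm achieves a 5x speedup. LLVM originally used Cytron's algorithm and later switched to the algorithm from this paper; see GenericIteratedDominanceFrontier.h.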

//===- IteratedDominanceFrontier.h - Calculate IDF ------------*- C++ -*-===//
//
// Compute iterated dominance frontiers using a linear time algorithm.
//
// The algorithm used here is based on:
//
//  Sreedhar and Gao. A linear time algorithm for placing phi-nodes.
//  In Proceedings of the 22nd ACM SIGPLAN-SIGACT Symposium on Principles of
//  Programming Languages
//  POPL '95. ACM, New York, NY, 62-73.
//
// It has been modified to not explicitly use the DJ graph data structure and
// to directly compute pruned SSA using per-variable liveness information.
//
//===--------------------------------------------------------------------===//
