0x00 前言

作爲學術生涯的最後一門課，選了一門據說是最難的，上下來的感覺也確實是難得不行，不太懂……
決定照着ppt和上課的筆記整理一下，以此爭取達到複習的目的。
（意思是有些雖然寫出來了，但自己都不見得明白，有的部分存疑後續去詢問之後再做修改）

Useful Inequalities

在隨機算法的問題中有大量不等式常被使用，爲了在運用時能想得起來，有些甚至要背熟。

0x01 Union Bound

Randomized Algorithm - Chapter 3.2 (P45)
n個隨機事件各自發生的概率之和，不小於這n個事件中至少有一個發生的概率

Let $E_i$ be a random event, then we have
$Pr[\cup_{i=1}^{n}E_i] \le \sum_{i=1}^{n}Pr(E_i)$

0x02 馬爾可夫不等式 (Markov Inequality)

Let $Y$ be a random variable assuming only non-negative values. Then
$\text{for all } t>0,~Pr[Y \ge t]\le \frac{E[Y]}{t}$

0x03 切比雪夫不等式 (Chebyshev’s Inequality)

Let $X$ be a random variable with expectation $\mu_X$ and standard deviation $\sigma_X$ , then
$\text{for any }t>0,~Pr[|X-\mu_X|\ge t\sigma_X] \le \frac{1}{t^2}$

0x04 切爾諾夫約束 (Chernoff’s Bound)

Randomized Algorithm - Chapter 4.1 (P67)
切爾諾夫約束有三種表現方式，在多個獨立的泊松實驗中

Let $X_1, X_2, \cdots, X_n$ be independent Poisson trials such that,
for $1 \le i \le n,~Pr[X_i=1]=p_i$ , where $0<p_i<1$ . Then

Chernoff’s Bound(1)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } \delta>0,$
$Pr[X>(1+\delta)\mu]<\left[ \frac{e^{\delta}}{(1+\delta)^{(1+\delta)}} \right]^{\mu}$

Chernoff’s Bound(2)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } 0<\delta<1,$
$Pr[X<(1-\delta)\mu]<\left[ \frac{e^{-\delta}}{(1-\delta)^{(1-\delta)}} \right]^{\mu}$

Chernoff’s Bound(3)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } 0<\delta<1,$
$Pr[|X-\mu| >\delta\mu]<2e^{-\frac{\delta^2}{3}\mu}$

0x05 Prove in detail

Chebyshev’s Inequality in 0x03

Let $X$ be a random variable with expectation $\mu_X$ and standard deviation $\sigma_X$ , then
$\text{for any }t>0,~Pr[|X-\mu_X|\ge t\sigma_X] \le \frac{1}{t^2}$

$\begin{aligned} Pr \left( |X-\mu_X| \ge t\sigma_X \right) \\ = Pr \left( (X-\mu_X)^2 \ge (t\sigma_X)^2 \right) \\ \textbf{set } Y \triangleq (X-\mu_X)^2 \ge 0 \\ Pr \left( Y \ge (t\sigma)^2 \right) \le \frac{E(Y)}{(t\sigma_X)^2} \\ \because E(Y) = E\left( (X-\mu_X)^2 \right) = \sigma_X^2 \\ \therefore Pr \left( Y \ge (t\sigma)^2 \right) \le \frac{\sigma_X^2}{(t\sigma_X)^2} = \frac{1}{t^2} \\ \end{aligned}$

Chernoff’s Bound in 0x04

Let $X_1, X_2, \cdots, X_n$ be independent Poisson trials such that,
for $1 \le i \le n,~Pr[X_i=1]=p_i$ , where $0<p_i<1$ . Then

Chernoff’s Bound(1)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } \delta>0,$
$Pr[X>(1+\delta)\mu]<\left[ \frac{e^{\delta}}{(1+\delta)^{(1+\delta)}} \right]^{\mu}$

對於隨機變量 (RandomVariable):

$\begin{aligned} & R.V. ~x_1, x_2, \cdots, x_n \\ & Pr(X_i=1) = p_i, Pr(X_i=0) = 1-p_i \\ & \mu = \sum_{i=1}^{n}p_i, X = \sum_{i=1}^{n}x_i, E(X)=\mu \\ & Pr(X>(1+\delta)\mu) \le \frac{E(X)}{(1+\delta)\mu} = \frac{1}{1+\delta} \\ =~& Pr(e^{\lambda X}>e^{\lambda(1+\delta)\mu}) \le \frac{E(e\lambda X)}{e^{\lambda(1+\delta)\mu}}\le \frac{e^{\mu(e^{\lambda}-1)}}{e^{\lambda(1+\delta)\mu}} \\ \end{aligned}$

令 $\lambda = ln(1+\delta)$ ，則上式化爲 $\left( \frac{e^{\delta}}{(1+\delta)^{(1+\delta)}} \right)^{\mu}$ ，得證。

Chernoff’s Bound(2)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } 0<\delta<1,$
$Pr[X<(1-\delta)\mu]<\left[ \frac{e^{-\delta}}{(1-\delta)^{(1-\delta)}} \right]^{\mu}$

其中：

$\begin{aligned} E(e^{-\lambda X}) &= E(e^{-\lambda(\sum_{i=1}^{n}X_i)}) \\ &= E(\prod_{i=1}^{n} e^{-\lambda X_i}) = \prod_{i=1}^{n}E(e^{-\lambda X_i}) \\ &= \prod_{i=1}^{n}(p_i \cdot e^{-\lambda} + (1-p_i)) \\ &= \prod_{i=1}^{n}( 1 + p_i (e^{-\lambda}-1)) \\ &= e^{\mu(e^{-\lambda}-1)} \end{aligned}$

代入原式子，有：

$\begin{aligned} Pr[X < (1-\delta)\mu] &\le \frac{E(e^{-\lambda X})}{e^{-\lambda (1-\delta) \mu}} \\ &= \frac{e^{\mu(e^{-\lambda}-1)}}{e^{-\lambda (1-\delta) \mu}} \\ &= e^{\mu(e^{-\lambda}-1+\lambda-\lambda\delta)} \end{aligned}$

令 $f(\lambda) = e^{-\lambda}-1+\lambda-\lambda\delta$ ,
當 $f'(\lambda) = -e^{-\lambda} + 1 - \delta = 0$ 時, $\lambda = -\ln (1-\delta)$
故 $Pr[X<(1-\delta)\mu] < e^{\mu f(-ln(1-\delta))} = \left( \frac{e^{-\delta}}{(1-\delta)^{(1-\delta)}} \right)^{\mu}$

Chernoff’s Bound(3)

$\text{for }X=\sum_{i=1}^{n}X_i,~\mu=E[X]=\sum_{i=1}^{n}p_i, \text{ and any } 0<\delta<1,$
$Pr[|X-\mu| >\delta\mu]<2e^{-\frac{\delta^2}{3}\mu}$

首先去掉絕對值符號：
$Pr[|X-\mu| > \delta\mu] = Pr[X-\mu > \delta\mu] + Pr[X-\mu < -\delta\mu]$
對於第一個部分：
$\begin{aligned} Pr[X-\mu > \delta\mu] &= Pr[X > (\delta+1)\mu] \\ &< \left( \frac{e^{\delta}}{(1+\delta)^{(1+\delta)}} \right)^{\mu} \\ &= e^{\mu \cdot (\delta - (1+\delta) \ln (1+\delta))} \\ &< e^{-\frac{3}{\delta^2}\mu} \end{aligned}$
同理可證 $Pr[X-\mu < -\delta\mu] < e^{-\frac{3}{\delta^2}\mu}$
$\begin{aligned} Pr[|X-\mu| > \delta\mu] &= Pr[X-\mu > \delta\mu] + Pr[X-\mu < -\delta\mu] \\ &< e^{-\frac{3}{\delta^2}\mu} + e^{-\frac{3}{\delta^2}\mu} \\ &= 2e^{-\frac{3}{\delta^2}\mu} \end{aligned}$
故 $Pr[|X-\mu|>\delta\mu]<2e^{-\frac{3}{\delta^2}\mu}$ 得證

Balls and Bins

原先以爲往盒子裏放球取球只是個抽屜原理或者排列組合的問題，
高等算法裏把這研究得還要更深刻一些……

0x01 Balls and Bins

$m$ balls, $n$ bins. You randomly throw each ball to some bin.
$X_i$ : number of balls in the $i$ -th bin.
Let $k \triangleq max(X_1, X_2, \cdots, X_n)$ .
Question: expectation and distribution of $k$ ?

$m = o(\sqrt{n})$ ; (Case 1)
- prove $Pr(k>1)=o(1)$ .
- $k=1~w.h.p$
$m = \Theta(\sqrt{n})$ ; (Case 2, Birthday Paradox)
- compute $Pr(k>1)$ again.
- $k=1~or~2~w.h.p$
$m=n$ ; (Case 3)
- find suitable $x$ , such that $Pr(k \le x)=1-o(1)$
- $k=\Theta(\frac{\ln n}{\ln \ln n})~w.h.p$
$m \ge n\ln n$ ; (Case 4)
- $k=\Theta (\frac{m}{n})~w.h.p$

0xFF Prove in detail

Case 1

$m = o(\sqrt{n})$

prove $Pr(k>1)=o(1)$ .
$k=1~w.h.p$
$m=1, Pr(k=1) = 1-o(1)$
$m=2, \begin{cases} Pr(k=1)=1-1/n \\ Pr(k=2)=1/n \end{cases}$
$m= ? ~, Pr(k=1)=1-o(1)$

對於這個 $Pr(k=1)=1-o(1)$ ，我們可以等價地視作：
$Pr(max(X_1, X_2, \cdots, X_n)\ge 2) = o(1)$

那麼，根據 Useful Inequalities 中提到過的 Union Bound，有：
$\begin{aligned} Pr(X_1 \ge 2~or~X_2 \ge 2~or~\cdots~or~X_n \ge 2) ~&\le \sum_{i=1}^{n}Pr(X_i \ge 2) \\ & = n \cdot Pr(X_1 \ge 2) \end{aligned}$

其中，
$\begin{aligned} Pr(X_1 \ge 2) ~&\le \binom{m}{2} \left(\frac{1}{n} \right)^2 = \Theta(\frac{m^2}{n^2}) \\ Pr(X_1 \ge 2) ~&= \sum_{k=2}^{m}Pr(X_1=k) \\ &= \sum_{k=2}^{m} \binom{m}{k}\cdot(\frac{1}{n})^k(1-\frac{1}{n})^{m-k} \\ &= 1- Pr(X_1=0) - Pr(X_1=1) \\ &= 1-(1-\frac{1}{n})^m - m\cdot \frac{1}{n} \cdot (1-\frac{1}{n})^{m-1} \\ & = \Theta(\frac{m^2}{n^2}) \end{aligned}$

代入原式子，故有：
$n \cdot Pr(X_1 \ge 2) = \Theta(m^2/n) = o(1) \\ \therefore m = o(\sqrt{n})$

Case 2

$m = \Theta(\sqrt{n})$ ; (Birthday Paradox)
+ compute $Pr(k>1)$ again.
+ $k=1~or~2~w.h.p$

$\begin{aligned} m = \Theta(\sqrt{n})~&=c\sqrt{n} \\ Pr(X_1 \ge 2) ~&\le \binom{m}{2} \left(\frac{1}{n} \right)^2 \approx \frac{c^2}{2n} \\ Pr(k > 1) ~&\le n \cdot Pr(X_1 \ge 2) \le \frac{c^2}{2} \\ Pr(k = 1) ~& = \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-m+1}{n} \\ &= Pr(E_1 \cdots E_m) ~, E_i \triangleq Pr(E_1)Pr(E_2|E_1)Pr(E_3|E_1E_2)\cdots \\ &= (1-\frac{1}{n}) \cdot (1-\frac{2}{n}) \cdot (1-\frac{3}{n}) \cdots (1-\frac{m-1}{n}) \end{aligned}$

根據 Union Bound：
$\begin{aligned} Pr(k = 1) ~&= (1-\frac{1}{n}) \cdot (1-\frac{2}{n}) \cdot (1-\frac{3}{n}) \cdots (1-\frac{m-1}{n})\\ &\ge (1-\frac{m-1}{n})^{m-1} ~~~~\textbf{ (Union Bound)} \\ &\sim (1-\frac{m-1}{n})^{\frac{n}{m-1}\cdot{\frac{(m-1)^2}{n}}} \sim (\frac{1}{e})^{\frac{m^2}{n}} \end{aligned}$

又因爲 $1-x \le e^{-x}$ :
$\begin{aligned} &(1-\frac{1}{n}) \cdot (1-\frac{2}{n}) \cdot (1-\frac{3}{n}) \cdots (1-\frac{m-1}{n}) \\ \le~ & e^{-1/n} \cdot e^{-2/n} \cdot e^{-3/n} \cdots e^{-(m-1)/n} \\ \approx~ & e^{-m^2/2n} < 1 \\ \therefore ~ & Pr(k \ge 2) = 1 - Pr(k = 1) \ge 1- e^{-c^2/2} \end{aligned}$

而對於 $k \ge 3$ 時：
(這段的板書順序較爲混亂，資質愚鈍足足半個小時仍無法看懂，暫且擱置)

Prepare for case 3

爲了 case 3 的證明，我們需要事先準備一個階乘的近似界
$(\frac{m}{x})^x \le \binom{m}{x} \le (\frac{em}{x})^x$

先證 $\tbinom{m}{x} = \frac{m!}{x!(m-x)!} \sim \frac{m^x}{x!}$
$\begin{aligned} \lim\limits_{m \rightarrow \infty}\frac{\tbinom{m}{x}}{\frac{m^x}{x!}} &= \lim\limits_{m \rightarrow \infty}\frac{m(m-1)(m-2)\cdots(m-x+1)}{m^x} \\ &= \lim\limits_{m \rightarrow \infty} 1\cdot(1-\frac{1}{m})(1-\frac{2}{m})\cdots(1-\frac{x-1}{m}) \\ &= 1 \end{aligned}$

這裏，我們需要引入階乘的逼近公式：斯特林公式(Stirling’s formula):
$n! \sim \sqrt{2 \pi n}(\frac{n}{e})^n$

$\frac{m^x}{x!} \sim \frac{m^x}{\sqrt{2\pi x}(\frac{x}{e})^x}=\frac{e^xm^x}{\sqrt{2\pi x}x^x}=\frac{e^x}{\sqrt{2\pi x}}(\frac{m}{x})^x \le (\frac{em}{x})^x$
並且
$\frac{e^x}{\sqrt{2\pi x}} > 1$
所以
$\frac{e^x}{\sqrt{2\pi x}}(\frac{m}{x})^x \ge (\frac{m}{x})^x$
即
$(\frac{m}{x})^x \le \binom{m}{x} \le (\frac{em}{x})^x$

Case 3

$m=n$
+ find suitable $x$ , such that $Pr(k \le x)=1-o(1)$
+ $k=\Theta(\frac{\ln n}{\ln \ln n})~w.h.p$

令 $x = \frac{\ln n}{\ln ln n}$ ，先證下界:
$Pr(k \le x) = 1-o(1)$

即證：
$Pr(k \ge x) = o(1)$

於是，根據 Union Bound 有：
$Pr(k \ge x) \le n \cdot Pr(X_1 \ge x) \le n \cdot \binom{m}{x}\left( \frac{1}{n} \right)^x = n \cdot \binom{n}{x}\left( \frac{1}{n} \right)^x$

上一小節我們通過斯特林公式(Stirling’s formula) 得到:
$(\frac{m}{x})^x \le \binom{m}{x} \le (\frac{em}{x})^x$

代入，有：
$n \cdot \binom{n}{x}\left( \frac{1}{n} \right)^x \le n\cdot \left( \frac{en}{x} \right)^x \left( \frac{1}{n} \right)^x = n\cdot \left( \frac{e}{x} \right)^x = o(1)$

再證上界：
$Pr(k \ge c \cdot x) = 1-o(1)$

即證：
$Pr(k \le c \cdot x) = Pr(E_1 \land \cdots \land E_n)$

其中， $E_i$ 表示：
$x_i \le c \cdot x,~Y_i=\begin{cases} 1, ~E_i\text{ 沒發生}\\ 0, ~E_i\text{ 發生} \end{cases}$

則有：
$Pr(k \le c \cdot x) = Pr(k \le c \cdot x)=Pr(\forall i, Y_i=0) = Pr(\sum_{i=1}^{n}Y_i=0)$

而上式不大於：
$Pr \left( \left|\sum_{i=1}^{n} - E(\sum_{i=1}^{n}Y_i) \right| \ge E(\sum_{i=1}^{n}Y_i) \right) \le \frac{\sigma^2(\sum_{i=1}^{n}Y_i)}{(E(\sum_{i=1}^{n}Y_i))^2}$

(期望與方差的推導較長，暫時擱置，事後有時間再補)，故：
$Pr(k<cx)=Pr(Y_1+Y_2+\cdots+Y_n=0)$
$\le \frac{Var(\sum_{i=1}^{n}Y_i)}{E^2(\sum_{i=1}^{n}Y_i)} = O\left(\frac{n}{(n^{1-c})^2}\right) \sim \frac{1}{n^{1/3}},~~~\therefore c=1/3$

$\frac{\ln n}{3\ln\ln n}<k<\frac{\ln n}{\ln\ln n}$

Consider the case with $n$ balls and $n$ bins,
let $X$ be the random variable of the number of empty bins. Compute $E(X)$ , and the deviation between $X$ and $E(X)$ .
the result should be in the form $Pr(|X-E(X)|>a)<b$

令 $Z_i$ 表示第 $i$ 個盒子裏是否沒有球: 沒有球時爲 $Z_i=1$ ，反之爲 $Z_i=0$
則有
$Y=\sum_{i=1}^{n}Z_i$
$E(Y)=E(\sum_{i=1}^{n}Z_i)=\sum_{i=1}^{n}E(Z_i)=nE(Z_1)$
其中
$E(Z_1)=p(Z_1=0)\cdot 1 + p(Z_1=1)\cdot 0 = 1 - (1-\frac{1}{n})^n = 1-e^{-1}$
所以
$E(X) = E(n-Y) = n-E(Y) = e^{-1}n$
對於 $\lambda > 0$
$\mu = E[Z] = n(1-\frac{1}{n})^n \sim ne^{-1}$
$Pr[|Z-\mu|\ge \lambda]\le 2\cdot exp(-\frac{\lambda^2}{2n})$

特別地, 當 $m \gg n$ 時:
$\mu = E[Z] = n(1-\frac{1}{n})^m \sim ne^{-m/n}$
$Pr[|Z-\mu|\ge \lambda]\le 2\cdot exp(-\frac{\lambda^2(n-1/2)}{n^2-\mu^2})$

Case 4

$m \ge n\ln n$
+ $k=\Theta (\frac{m}{n})~w.h.p$

要證：
$Pr(k \ge c \cdot \frac{m}{n}) = o(1)$

即證：
$Pr(x_1 \ge c\frac{m}{n}~~or~~x_2 \ge c\frac{m}{n}~~or~\cdots~or~~x_n \ge c\frac{m}{n})$

而根據 Union Bound，
$Pr(k \ge c \cdot \frac{m}{n}) \le n \cdot Pr(x_1 \ge c \frac{m}{n})$

先證上界：
$Pr \left(x_1 \ge c\frac{m}{n} \right) \le \binom{m}{c\frac{m}{n}} \left( \frac{1}{n} \right)^{c\frac{m}{n}} \le \left( \frac{em}{c\frac{m}{n}} \right)^{c\frac{m}{n}} \left( \frac{1}{n} \right)^{c\frac{m}{n}} = \left( \frac{e}{c} \right)^{c\frac{m}{n}}$

由於 $m \ge n\ln n$ ，
$Pr(k \ge c\frac{m}{n})= \left( \frac{e}{c} \right)^{c\frac{m}{n}} \le \left( \frac{e}{c} \right)^{c\ln n} = o(1/n)$

再證下界，根據 Chernoff’s Bound:
$Pr\left( \left| Y_1 + \cdots + Y_n - E(Y_1 + \cdots + Y_n) \right| \right) \le~?$

其中， $Y_i$ 指 $i$ -th ball 扔進了第一個盒子， $X_1 = \sum_{i=1}^{m}Y_i,~~Y_i=\begin{cases} 1,~~1/n \\ 0,~~1-1/n \end{cases}$

$Pr( |X_1 - m/n| > c_1\frac{m}{n} ) \le 2 \cdot exp(-\frac{c_1^2}{3}\cdot\frac{m}{n}) \le 2\cdot exp(-\frac{c_1^2}{3}\ln n) = 2 \frac{1}{n^{\frac{c1^2}{3}}} = o(\frac{1}{n})$

Advanced Algorithm 聽課筆記（Useful Inequalities & Balls and Bins）

0x00 前言

Useful Inequalities

0x01 Union Bound

0x02 馬爾可夫不等式 (Markov Inequality)

0x03 切比雪夫不等式 (Chebyshev’s Inequality)

0x04 切爾諾夫約束 (Chernoff’s Bound)

Chernoff’s Bound(1)

Chernoff’s Bound(2)

Chernoff’s Bound(3)

0x05 Prove in detail

Chebyshev’s Inequality in 0x03

Chernoff’s Bound in 0x04

Chernoff’s Bound(1)

Chernoff’s Bound(2)

Chernoff’s Bound(3)

Balls and Bins

0x01 Balls and Bins

0xFF Prove in detail

Case 1

Case 2

Prepare for case 3

Case 3

Case 4

【Tensorflow】用於處理checkpoint中參數名稱與矩陣數值的工具類

Advanced Algorithm 聽課筆記（Useful Inequalities & Balls and Bins）

【GraphLite】同步圖運算初試-數三角形

【Pytorch】Windows10下配置Pytorch環境

【selenium】Windows平臺下使用python自動登陸網關 (更新至 v1.1.0)

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結