Weakness of adversarial training: the model overfits to the attack used during training and hence does not generalize to test data
Curriculum adversarial training (CAT)
Idea: train the model from weak attacks to strong attacks
Method
Let $l$ denote the attack strength and $L$ denote the maximal attack strength. $A(l)$ denotes an attack class parameterized with $l$.
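The paper instantiates $A(l)$ with PGD, taking the number of PGD iterations as the attack strength $l$. A minimal PyTorch sketch under that reading; the model interface, inputs in $[0, 1]$, and the step sizes eps and alpha are illustrative assumptions, not the paper's settings:

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, l, eps=8/255, alpha=2/255):
    # A(l): PGD whose strength l is its number of iterations;
    # l = 0 returns the clean input (no attack).
    x_adv = x.clone().detach()
    for _ in range(l):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        # One signed-gradient ascent step, then project back into
        # the L-infinity ball of radius eps around x.
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = torch.clamp(x_adv, x - eps, x + eps).clamp(0.0, 1.0)
    return x_adv.detach()
```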
Basic curriculum learning
i). start from no attack, i.e., $l = 0$;
ii). train the model for one epoch and, once finished, calculate the $l$-accuracy (the accuracy on adversarial examples generated with attack strength $l$);
iii-a). if the $l$-accuracy increases at least once over the last 10 epochs, continue training;
iii-b). if the $l$-accuracy does not increase over the last 10 epochs, set the parameters of the model to be the best ones (i.e., those from 10 epochs ago), and increase $l$ by 1;
iv). stop when $l$ reaches the maximal attack strength $L$ (a code sketch of this loop follows).
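A minimal sketch of the basic curriculum loop. The helpers train_one_epoch and l_accuracy are hypothetical stand-ins (not from the paper) for one epoch of adversarial training with $A(l)$ and for evaluating the $l$-accuracy:

```python
import copy

def curriculum_training(model, train_loader, val_loader, L, patience=10):
    l = 0
    while l <= L:
        best_acc, best_state, stall = -1.0, None, 0
        while True:
            # Hypothetical helpers: one epoch of adversarial training with
            # A(l), then l-accuracy measured on held-out data.
            train_one_epoch(model, train_loader, attack_strength=l)
            acc = l_accuracy(model, val_loader, attack_strength=l)
            if acc > best_acc:
                best_acc = acc
                best_state = copy.deepcopy(model.state_dict())
                stall = 0
            else:
                stall += 1
            if stall >= patience:  # no improvement over the last 10 epochs
                model.load_state_dict(best_state)  # roll back to the best parameters
                break
        l += 1  # move on to the next attack strength
```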
Benefit: Training efficiency
Additional optimization technique: batch mixing
Motivation: Although basic curriculum training significantly reduces the training time, it does not increase robustness. One issue is forgetting: when the model is trained with a larger $l$, it forgets the adversarial examples generated for a smaller $l$.
Solution: Generate adversarial examples using $A(l')$ for each $l' \in \{0, 1, \dots, l\}$, and combine them to form a batch. The loss function is updated accordingly as:
$\mathcal{L}(\theta) = \sum_{l'=0}^{l} \lambda_{l'} \, \mathbb{E}_{(x, y)} \big[ L\big(\theta, A(l')(x), y\big) \big]$
where the $\lambda_{l'}$'s are hyperparameters such that $\sum_{l'=0}^{l} \lambda_{l'} = 1$. The authors set $\lambda_{l'} = \frac{1}{l+1}$ and generate the same amount of adversarial examples for each attack strength.
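A sketch of the mixed loss with the authors' uniform weights $\lambda_{l'} = 1/(l+1)$, reusing the pgd_attack sketch above (the helper name is illustrative):

```python
def batch_mixing_loss(model, x, y, l):
    # Uniform weights lambda_{l'} = 1/(l+1), as chosen by the authors.
    lam = 1.0 / (l + 1)
    loss = 0.0
    for lp in range(l + 1):
        x_adv = pgd_attack(model, x, y, lp)  # A(l') examples; lp = 0 is clean
        loss = loss + lam * F.cross_entropy(model(x_adv), y)
    return loss
```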
Additional optimization technique: quantization
Motivation: The model trained with CAT may not defend against attacks that are stronger than the strongest attack used during training.
Solution: Employ quantization, i.e., restrict each component of the input to a $b$-bit integer.
Rationale: Quantization reduces the space of adversarial examples. Specifically, let $x'$ denote the adversarial example for an input $x$. The difference $x' - x$ takes values from an infinite space if $x'$ is real-valued; in contrast, it takes values from a finite space if $x'$ is quantized to an integer vector.
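A minimal sketch of $b$-bit input quantization, assuming inputs normalized to $[0, 1]$; the bit width below is illustrative, not the paper's setting:

```python
def quantize(x, bits=4):
    # Restrict each input value (assumed in [0, 1]) to a b-bit integer
    # grid, shrinking the space of realizable adversarial perturbations.
    levels = 2 ** bits - 1
    return torch.round(x * levels) / levels
```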
Remark: Quantization is a generic inference-time defense technique. This technique alone is not shown to provide resilience against strong white-box attacks. However, it is effective when used together with CAT, since the model remembers adversarial examples generated by weak attacks. Although a stronger attack can better optimize the loss function, the adversarial examples it generates are highly likely to coincide with those generated by a weaker attack, because the entire adversarial-example space is small.
Experiments: CAT improves both efficiency and empirical worst-case accuracy against adversarial examples (termed resilience)
References:
Cai, Qi-Zhi, Chang Liu, and Dawn Song. “Curriculum adversarial training.” In Proceedings of the 27th International Joint Conference on Artificial Intelligence, pp. 3740-3747. 2018.