Notes

要实现 [1] 的 piece-wise threshold function，类似于 Htanh，也需要自定义梯度，用到 @tf.custom_gradient。
函数是： $g(s)=\begin{cases} 0, & s < 0.5-\epsilon \\ s, & 0.5-\epsilon\leq s < 0.5+\epsilon \\ 1, & s \geq 0.5+\epsilon \end{cases}$
定义其导数： $\frac{\partial g(s)}{\partial s}=\begin{cases} 1, & 0.5-\epsilon\leq s < 0.5+\epsilon \\ 0, & else \end{cases}$
其中 $\epsilon$ 是超参，训练时会变，用 placeholder 传参。

Codes

import tensorflow as tf
import numpy as np

@tf.custom_gradient
def pw_threshold(x, epsilon):
	"""piece-wise threshold"""
    cond_org = ((0.5 - epsilon) <= x) & (x < (0.5 + epsilon))
    cond_one = x >= (0.5 + epsilon)
    ones = tf.ones_like(x)
    zeros = tf.zeros_like(x)
    y = tf.where(cond_org, x, zeros) + \
            tf.where(cond_one, ones, zeros)

    def grad(dy):
        cond = ((0.5 - epsilon) <= x) & (x < (0.5 + epsilon))
        zeros = tf.zeros_like(dy)
        # 返回的 epsilon 没用，但需要这样，有几个输入就对应返回几个梯度
        return tf.where(cond, dy, zeros), epsilon

    return y, grad


# 测试
epsilon = tf.placeholder("float64", [])
x = tf.constant(np.arange(-0.25, 1.26, 0.25))
y = pw_threshold(x, epsilon)
grad = tf.gradients(y, x)

with tf.Session() as sess:
    print("x:", sess.run(x))
    print("y:", sess.run(y, feed_dict={epsilon: 0.25}))
    print("grad:", sess.run(grad, feed_dict={epsilon: 0.25}))

输出：

x: [-0.25  0.    0.25  0.5   0.75  1.    1.25]
y: [0.   0.   0.25 0.5  1.   1.   1.  ]
grad: [array([0., 0., 1., 1., 0., 0., 0.])]

References

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

tensorflow自定义梯度

Notes

Codes

References

10分钟搞定Mysql主从部署配置

如何使用 JS 判断用户是否处于活跃状态

「Pygors跨平台GUI」2：安装MinGW-w64、MSYS2还是WSL2

[转帖]

python列出centos7内存使用前50的进程信息

「Pygors跨平台GUI」1：Pygors跨平台GUI应用研究

一键自动化博客发布工具,用过的人都说好(掘金篇)

lightdb数据库超时相关控制参数

lightdb秒级增加列和删除列（not null带默认值）

Java ThreadPoolShutdown

lasagne embedding layer理解

tensorflow實現triplet loss

NUS-WIDE數據集劃分

pickle讀文件解碼問題

tensorflow用gather/scatter實現advanced indexing

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結