1.target encoding
先整理一下链接,之后会看。
简介入门:
https://zhuanlan.zhihu.com/p/40231966
一个各种category 变量编码的库:
https://github.com/scikit-learn-contrib/categorical-encoding
说明文档:
http://contrib.scikit-learn.org/categorical-encoding/targetencoder.html
一个英文的关于它的说明
http://www.saedsayad.com/encoding.htm
kaggle上实现target encoding的一个kernel:
https://www.kaggle.com/ogrellier/python-target-encoding-for-categorical-features
how lightgbm deals with categorical features:
https://www.kaggle.com/c/home-credit-default-risk/discussion/58950
2.count encoding
离散变量与较少取值的连续变量可用