台部落ssswill

本文標題較長，主要內容分爲兩部分。一是說明對層操作的add與concatenate方法的原理與應用。二是，在keras使用了這兩個方法後，在model.summary中會出現[0][0]的層，在此作出解釋。一。關於add與c

2020-06-04 10:06:57

1 賽題描述 link: https://www.kesci.com/home/competition/5c77ab9c1ce0af002b55af86/content/1 本練習賽所用數據，是名爲「Roman Urdu Data

2020-06-04 10:06:57

建議先看：如何使用glove,fasttext等詞庫進行word embedding?（原理篇）再看本篇。先睹爲快：本文會用到的全部代碼： def get_coefs(word, *arr): return word

2020-06-04 10:06:47

大牛們雲集分享思路： https://www.kaggle.com/c/santander-customer-transaction-prediction/discussion/89320#latest-524314

2020-06-04 10:06:47

忙裏偷閒~記錄一些筆記。你多次fit，只會覆蓋。並不會記住以前fit的數據。來自： https://stackoverflow.com/questions/49841324/what-does-calling-fit-mul

2020-06-04 10:06:47

1.target encoding 先整理一下鏈接，之後會看。簡介入門： https://zhuanlan.zhihu.com/p/40231966 一個各種category 變量編碼的庫： https://github.com

2020-06-04 10:06:47

import tensorflow as tf from sklearn.metrics import roc_auc_score def auroc(y_true, y_pred): return tf.py_func

2019-07-30 09:23:58

講到nlp，我們常用的都是lstm/gru。舉個例子，因爲我們總會說，因爲句子經過embdding後，句子爲一個三維張量，假設爲：（None,20,300）。其中20爲timestep,也就是一個句子的單詞個數，300爲embd

2019-06-25 11:17:25

請參見: https://blog.csdn.net/ddydavie/article/details/82667890

2019-06-12 00:49:22

寫在前面：HashVectorizer與tfidf類似，都是講文本向量化的表示方法，但它節省內存，也更快。當數據集較大時，可以作爲tfidf的替代。 from：https://www.cnblogs.com/pinard/p/6

2019-06-12 00:49:21

基礎知識： python自帶的file函數只能存儲和讀取字符串格式的數據. pickle可以存儲和讀取成其他格式比如list dict的數據, 來自：https://www.zhihu.com/question/38355589

2019-06-12 00:49:21

代碼： #include <iostream> using namespace std; int max_ = 0; int sum = 0; int sum_temp = 0; int count_ = 0; //n is t

2019-06-12 00:49:21

https://github.com/slundberg/shap

2019-06-12 00:49:21

0.寫在前面 0.1本文配套github: https://github.com/willinseu/kaggle-Jigsaw-Unintended-Bias-in-Toxicity-Classification-solution

2019-05-14 19:29:02

1.二者初步介紹在keras的中文官方文檔中，寫到：可以結合着一起看，出自：https://stackoverflow.com/questions/48315094/using-sample-weight-in-keras-fo

2019-05-14 19:29:02