Kaggle | Digit Recognizer

原創

widon1104

2018-08-27 02:59

Digit Recognizer題目地址

使用的就是mnist data, train set 42000, test set 28000

1) 使用random forest決策樹來實現，準確率0.966左右

library(randomForest)

set.seed(0)

numTrain <- 42000
numTrees <- 200

train <- read.csv("train.csv")

rows <- sample(1:nrow(train), numTrain)
labels <- as.factor(train[rows, 1])

print(head(labels))

train <- train[rows, -1]

gc()
print(memory.size())
print(memory.limit())

rf <- randomForest(train, labels, ntree = numTrees)
rm(train)

test <- read.csv("test.csv")
pre <- predict(rf, newdata = test)

print(head(pre))

predictions <- data.frame(ImageId = 1:nrow(test), Label = pre)

write.csv(predictions, "predict.csv", row.names = FALSE)

Fri, 28 Aug 2015 12:41:01

Edit description

predict.csv

0.96600

2) 使用DeepLearnToolbox中的cnn庫來實現，參數都沒怎麼改，numepochs設置的比較大，準確率大約0.98829

widon@widon-X401A:~$ ls lib/DeepLearnToolbox/
CAE CONTRIBUTING.md data LICENSE README_header.md REFS.md tests
CNN create_readme.sh DBN NN README.md SAE util

%function test_example_CNN
%load mnist_uint8;

%test = csvread('test.csv', 1, 0);

clear ; close all; clc

load('digitdata.mat')

casenum = 42000
tmp = randperm(size(train, 1), casenum);
train_x = train(tmp, 2:end);
label = train(tmp, 1);
test_x = test;

m = size(label, 1)
train_y = zeros(m, 10);
for i=1:m
	train_y(i, label(i)+1) = 1;
end
train_x = double(reshape(train_x',28,28,casenum))/255;
test_x = double(reshape(test_x',28,28,28000))/255;
train_y = double(train_y');
%test_y = double(test_y');

%% ex1 Train a 6c-2s-12c-2s Convolutional neural network 
%will run 1 epoch in about 200 second and get around 11% error. 
%With 100 epochs you'll get around 1.2% error

rand('state',0)

cnn.layers = {
    struct('type', 'i') %input layer
    struct('type', 'c', 'outputmaps', 6, 'kernelsize', 5) %convolution layer
    struct('type', 's', 'scale', 2) %sub sampling layer
    struct('type', 'c', 'outputmaps', 12, 'kernelsize', 5) %convolution layer
    struct('type', 's', 'scale', 2) %subsampling layer
};


opts.alpha = 1;
opts.batchsize = 50;
opts.numepochs = 200;

cnn = cnnsetup(cnn, train_x, train_y);
cnn = cnntrain(cnn, train_x, train_y, opts);

clear train train_x train_y
test_y = cnnff(cnn, test_x);
[~, y] = max(test_y.o);
y = y - 1;
y = y'
csvwrite('pre.csv', y);

Sat, 29 Aug 2015 00:01:20

Edit description

pre.csv

0.98829

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Kaggle | Digit Recognizer

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

golang 線段樹實現

leaf protobuf demo例子

ubuntu (linux) 字體優化方法

如何修改gnome evince 文檔查看器的設置

交叉編譯 dhcp-4.2.5-P1

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結