轉載自：http://blog.csdn.net/zoro_lov3/article/details/74550735

FCN製作自己的數據集、訓練和測試全流程

**
花了兩三週的時間，在導師的催促下，把FCN的全部流程走了一遍，期間走了很多彎路，現在記錄一下。系統環境：ubuntu 16.04LTS

一、數據集的製作
注：我的數據集是仿照VOC數據集進行製作的

1.resize 數據集

我的GPU顯存4G，跑過大的圖片帶不動，需要resize圖片大小，放幾個修改圖片大小的程序。
（1）單張圖片resize

# coding = utf-8  
import Image  

def  convert(width,height):
    im = Image.open("C:\\xxx\\test.jpg")
    out = im.resize((width, height),Image.ANTIALIAS)
    out.save("C:\\xxx\\test.jpg")
if __name__ == '__main__':
    convert(256,256)

（2）resize整個文件夾裏的圖片

# coding = utf-8
import Image
import os

def convert(dir,width,height):
    file_list = os.listdir(dir)
    print(file_list)
    for filename in file_list:
        path = ''
        path = dir+filename
        im = Image.open(path)
        out = im.resize((256,256),Image.ANTIALIAS)
        print "%s has been resized!"%filename
        out.save(path)

if __name__ == '__main__':
   dir = raw_input('please input the operate dir:')
   convert(dir,256,256)

（3）按比例resize

# coding = utf-8
import Image
import os

def convert(dir,width,height):
    file_list = os.listdir(dir)
    print(file_list)
    for filename in file_list:
        path = ''
        path = dir+filename
        im = Image.open(path)
        out = im.resize((256,256),Image.ANTIALIAS)
        print "%s has been resized!"%filename
        out.save(path)

if __name__ == '__main__':
   dir = raw_input('please input the operate dir:')
   convert(dir,256,256)

2.製作索引圖
（1）下載labelme

下載地址：https://github.com/wkentaro/labelme
下載後按提示打開軟件，進行標註，保存會生成後綴爲json的文件。

（2）生成dataset文件夾
在終端輸入指令：

labelme_json_to_dataset _static/apc2016_obj3.json //這裏的文件名根據自己的實際情況更改

（3）爲文件夾下的label.png着色
首先需要對照VOC分割的顏色進行着色，一定要保證顏色的準確性。Matlab代碼:

function cmap = labelcolormap(N)

if nargin==0
    N=256
end
cmap = zeros(N,3);
for i=1:N
    id = i-1; r=0;g=0;b=0;
    for j=0:7
        r = bitor(r, bitshift(bitget(id,1),7 - j));
        g = bitor(g, bitshift(bitget(id,2),7 - j));
        b = bitor(b, bitshift(bitget(id,3),7 - j));
        id = bitshift(id,-3);
    end
    cmap(i,1)=r; cmap(i,2)=g; cmap(i,3)=b;
end
cmap = cmap / 255;

對應的VOC數據集中的顏色類別：

類別名稱 R G B 
background 0 0 0 背景 
aeroplane 128 0 0 飛機 
bicycle 0 128 0 
bird 128 128 0 
boat 0 0 128 
bottle 128 0 128 瓶子 
bus 0 128 128 大巴 
car 128 128 128 
cat 64 0 0 貓 
chair 192 0 0 
cow 64 128 0 
diningtable 192 128 0 餐桌 
dog 64 0 128 
horse 192 0 128 
motorbike 64 128 128 
person 192 128 128 
pottedplant 0 64 0 盆栽 
sheep 128 64 0 
sofa 0 192 0 
train 128 192 0 
tvmonitor 0 64 128 顯示器

然後使用Python的skimage庫進行顏色填充，具體函數是skimage.color.label2rgb(),代碼較長，有需要可私信。

————————-2017年9月26日新增————————-
太多人和我私信要代碼，前一段時間很忙都沒有及時回覆大家，所以這裏放個鏈接，需要自取。
https://github.com/hitzoro/FCN-ColorLabel

3.將填充後的label.png轉爲灰度圖
如果不轉，在訓練的時候回報錯，轉換matlab代碼如下：

dirs=dir('F:/xxx/*.png');
for n=1:numel(dirs)
     strname=strcat('F:/xxx/',dirs(n).name);
     img=imread(strname);
     [x,map]=rgb2ind(img,256);
     newname=strcat('F:/xxx/',dirs(n).name);
     imwrite(x,map,newname,'png');
end

轉化後，在python中檢查一下圖片的格式：

In [23]: img = PIL.Image.open('000001_json/label.png')
In [24]: np.unique(img)
Out[24]: array([0, 1, 2], dtype=uint8)

如果輸出一致，則索引圖製作正確。

二、FCN訓練自己的數據集

1.前期準備
默認已經安裝好顯卡驅動，cuda，cudnn，opencv。
最新版caffe下載： https://github.com/BVLC/caffe
fcn源代碼下載：https://github.com/shelhamer/fcn.berkeleyvision.org
caffe的配置安裝可以參考我的另一篇博客：http://blog.csdn.net/zoro_lov3/article/details/60581174

2.數據集準備
這裏我們需要兩個數據集包，benchmark和VOC2012，進入fcn/data，新建sbdd文件夾（如果沒有），將benchmark解壓到sbdd中，將VOC2012解壓到data下的pascal文件夾下。
這兩個在網上都可以找得到。
這兩個數據集有什麼用呢？在FCN中VOC數據集的訓練需要他倆，benchmark中的dataset用來存放訓練時的數據，VOC2012存放測試時的數據。

先製作訓練時的數據
進入dataset中的img文件夾，這裏存放訓練用的原圖，把原來的原圖替換爲你自己的原圖。接着修改train.txt ，寫入你訓練圖片的名字，注意不要加後綴。如下即可：

進入cls文件夾，這裏原本需要存放mat格式的文件，但是製作mat文件有點麻煩，參考了網上的資料，修改代碼，使得這裏也可以直接存放索引圖。
方式如下：

修改fcn目錄下的voc_layers.py

註釋掉原本的load_label ，修改爲新的

#    def load_label(self, idx):
#        """
#        Load label image as 1 x height x width integer array of label indices.
#        The leading singleton dimension is required by the loss.
#        """
#        import scipy.io
#        mat = scipy.io.loadmat('{}/cls/{}.mat'.format(self.sbdd_dir, idx))
#        label = mat['GTcls'][0]['Segmentation'][0].astype(np.uint8)
#        label = label[np.newaxis, ...]
#        return label

    def load_label(self, idx):
        """
        Load label image as 1 x height x width integer array of label indices.
        The leading singleton dimension is required by the loss.
        """
        im = Image.open('{}/cls/{}.png'.format(self.sbdd_dir, idx))
        label = np.array(im, dtype=np.uint8)
        label = label[np.newaxis, ...]
        return label

製作測試集數據

測試集的製作簡單一寫，進入VOC2012，進入JPEGImages文件夾，裏面存放測試用的原圖，然後進入SegmentationClass，裏面存放測試用的索引圖，最後進入ImageSets/Segmentation，有一個名爲seg11valid.txt的文件，它和train.txt的性質一樣，存放測試用的圖片名。

到此，數據集就準備完成了。

3.修改網絡參數

下載VGG16的預訓練模型並放在FCN源碼文件夾中的ilsvrc-nets文件夾下
https://pan.baidu.com/s/1qYJeFfQ

爲了避免運行程序時候出現no module named caffe
在代碼中包含import caffe的py文件（solve.py）的第一行加入

import sys  
sys.path.append('/home/hitoia/caffe/python')

其中，/home/hitoia/caffe/python爲你下載的caffe源碼中python文件夾的路徑

cd進入fcn源碼路徑
以個人路徑爲例：/home/hitoia/fcn.berkeleyvision.org/
將其中所有的py文件，例如surgery.py等等，全部複製到voc-fcn32s文件夾中

solver.prototxt文件修改
進入voc-fcn32s文件夾打開solver.prototxt
其中snapshot:10000 表示訓練10000次保存一次模型
snapshot_prefix:”/home/hitoia/fcn.berkeleyvision.org/voc-fcn32s/snapshot/train”
表示訓練得到的模型，也就是model存放的路徑
在此，我附上個人的solver.prototxt供大家參考

train_net: "/home/hitoia/fcn.berkeleyvision.org/voc-fcn32s/train.prototxt"
test_net: "/home/hitoia/fcn.berkeleyvision.org/voc-fcn32s/val.prototxt"
test_iter: 736
# make test net, but don't invoke it from the solver itself
test_interval: 999999999
display: 20
average_loss: 20
lr_policy: "fixed"
# lr for unnormalized softmax
base_lr: 1e-10
# high momentum
momentum: 0.99
# no gradient accumulation
iter_size: 1
max_iter: 100000
weight_decay: 0.0005
snapshot: 4000
snapshot_prefix: "/home/hitoia/fcn.berkeleyvision.org/voc-fcn32s/snapshot/train"
test_initialization: false

solve.py的修改
在這裏鄭重聲明一下：如果訓練fcn32s的網絡模型，一定要修改solve.py利用transplant的方式獲取vgg16的網絡權重。
具體操作爲：

import sys  
sys.path.append('/home/hitoia/caffe/python')
import caffe
import surgery, score

import numpy as np
import os
import sys

try:
    import setproctitle
    setproctitle.setproctitle(os.path.basename(os.getcwd()))
except:
    pass

vgg_weights = '../ilsvrc-nets/vgg16-fcn.caffemodel'  
vgg_proto = '../ilsvrc-nets/VGG_ILSVRC_16_layers_deploy.prototxt'  
weights = '../ilsvrc-nets/vgg16-fcn.caffemodel'
#weights = '../ilsvrc-nets/vgg16-fcn.caffemodel'

# init
#caffe.set_device(int(sys.argv[1]))
caffe.set_mode_gpu()
caffe.set_device(0)

#solver = caffe.SGDSolver('solver.prototxt')
#solver.net.copy_from(weights)
solver = caffe.SGDSolver('solver.prototxt')
vgg_net=caffe.Net(vgg_proto,vgg_weights,caffe.TRAIN) 
surgery.transplant(solver.net,vgg_net)  
del vgg_net

# surgeries
interp_layers = [k for k in solver.net.params.keys() if 'up' in k]
surgery.interp(solver.net, interp_layers)

# scoring
val = np.loadtxt('/home/hitoia/fcn.berkeleyvision.org/data/pascal/VOCdevkit/VOC2012/ImageSets/Segmentation/seg11valid.txt', dtype=str)

for _ in range(25):
    solver.step(1000)
    score.seg_tests(solver, False, val, layer='score')

關於VGG_ILSVRC_16_layers_deploy.prototxt 可以在http://pan.baidu.com/s/1geLL6Sz下載。

如果訓練fcn16s，則可以直接copy自己的fcn32s的model的權重，不需要transplant，也就是不需要修改solve.py
如果訓練fcn8s，則可以直接copy自己的fcn16s的model的權重，不需要transplant,也就是不需要修改solve.py
只有如此，才能避免loss高居不下的情況
這裏的：

 for _ in range(25):
    solver.step(1000)
    score.seg_tests(solver, False, val, layer='score')

奇怪的現象：修改solver.prototxt中的max_iter: 100000沒有改變最大迭代次數，只有改變這個step裏的數字纔有用，這裏最大迭代次數等於25*1000 = 25000次。

train.prototxt / val.prototxt 修改
所有num_output 爲21 的地方都修改爲自己分類數 + 1 （加的1是背景），最開始的param_str需要根據自己的情況修改，放一下我自己的

train.prototxt:

param_str: "{\'sbdd_dir\': \'/home/hitoia/fcn.berkeleyvision.org/data/sbdd/benchmark/benchmark_RELEASE/dataset\', \'seed\': 1337, \'split\': \'train\', \'mean\': (104.00699, 116.66877, 122.67892)}"

val.prototxt:

param_str: "{\'voc_dir\': \'/home/hitoia/fcn.berkeleyvision.org/data/pascal/VOCdevkit/VOC2012\', \'seed\': 1337, \'split\': \'seg11valid\', \'mean\': (104.00699, 116.66877, 122.67892)}"

準備完成，在voc-fcn32s路徑下輸入

python solve.py

就可以開始訓練

三、單張測試
在fcn源碼文件夾，找到infer.py。

import numpy as np
from PIL import Image
import matplotlib.pyplot as plt


import sys  
sys.path.append('/home/hitoia/caffe/python')
import caffe

# load image, switch to BGR, subtract mean, and make dims C x H x W for Caffe im = Image.open('000030.jpg') in_ = np.array(im, dtype=np.float32) in_ = in_[:,:,::-1] in_ -= np.array((104.00698793,116.66876762,122.67891434)) in_ = in_.transpose((2,0,1)) # load net #net = caffe.Net('voc-fcn8s/deploy.prototxt', 'voc-fcn8s/fcn8s-heavy-pascal.caffemodel', caffe.TEST) net = caffe.Net('voc-fcn32s/deploy.prototxt', 'voc-fcn32s/snapshot/train_iter_24000.caffemodel', caffe.TEST) #net = caffe.Net('voc-fcn8s/deploy.prototxt', 'siftflow-fcn32s/train_iter_100000.caffemodel', caffe.TEST) # shape for input (data blob is N x C x H x W), set data net.blobs['data'].reshape(1, *in_.shape) net.blobs['data'].data[...] = in_ # run net and take argmax for prediction net.forward() out = net.blobs['score'].data[0].argmax(axis=0) #plt.imshow(out,cmap='gray'); plt.imshow(out); plt.axis('off') plt.savefig('000030_out32.png')

plt.show()

其中，net = caffe.Net(‘voc-fcn32s/deploy.prototxt’, ‘voc-fcn32s/snapshot/train_iter_24000.caffemodel’, caffe.TEST)，其中train_iter_24000.caffemodel’是我訓練後得到的模型。

如果沒有deploy文件，可以參考如下方法：
首先，根據你利用的模型，例如模型是voc-fcn32s的，那麼你就去voc-fcn32s的文件夾，裏面有train.prototxt文件，將文件打開，全選，複製，新建一個名爲deploy.prototxt文件，粘貼進去，
然後ctrl+F 尋找所有名爲loss的layer 只要有loss 無論是loss還是geo_loss 將這個layer統統刪除,這就是此次的deploy.prototxt。

大功告成，至此整個流程全部完成。整個過程心酸不斷，fcn的資料不多，求助了很多人，在此感謝

無奈的小心酸，深度學習思考者

對我的幫助。

數據下載參考地址：

http://blog.csdn.net/weixin_38437404/article/details/78089035?fps=1&locationNum=3 中有benchmark下載地址；

參考博客：
http://blog.csdn.net/wangkun1340378/article/details/70238290
http://blog.csdn.net/u010402786/article/details/72883421
http://blog.csdn.net/supe_king/article/details/55657136
http://blog.csdn.net/a244513086/article/details/72520630

FCN製作自己的數據集、訓練和測試全流程

FCN製作自己的數據集、訓練和測試全流程

plt.show()

通過HPA+CronHPA組合應對業務複雜彈性伸縮場景

數據庫技術與併發（筆記）

車牌識別系統概述

安裝ROS時，rosdep init出錯的解決辦法

模式識別之特徵提取算法

卷積神經網絡CNN經典模型整理Lenet，Alexnet，Googlenet，VGG，Deep Residual Learning

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結