以resnet作爲前置網絡的ssd目標提取檢測

轉自：http://blog.csdn.net/zhangjunbob/article/details/53119959

以resnet作爲前置網絡的ssd目標提取檢測

1.目標

本文的目標是將resnet結構作爲前置網絡，在imagenet數據集上進行預訓練，隨後將ssd目標提取檢測網絡（一部分）接在resnet前置網絡之後，形成一個完整的ssd網絡。

ssd網絡下載和配置參考點擊打開鏈接

2.resnet前置網絡pretrain

2.1 利用imagenet數據生成lmdb，採用create_imagenet.sh生成，內容如下：

[plain]view
plain copy

#!/usr/bin/env sh  

# Create the imagenet lmdb inputs  

# N.B. set the path to the imagenet train + val data dirs  

set -e  

EXAMPLE=models/resnet  

DATA=/home/jzhang/data/VOCdevkit/VOC2007  

TOOLS=build/tools  

TRAIN_DATA_ROOT=/home/jzhang/data/VOCdevkit/VOC2007/JPEGImages/  

# Set RESIZE=true to resize the images to 256x256. Leave as false if images have  

# already been resized using another tool.  

RESIZE=true  

if $RESIZE; then  

  RESIZE_HEIGHT=224  

  RESIZE_WIDTH=224  

else  

  RESIZE_HEIGHT=0  

  RESIZE_WIDTH=0  

fi  

if [ ! -d "$TRAIN_DATA_ROOT" ]; then  

  echo "Error: TRAIN_DATA_ROOT is not a path to a directory: $TRAIN_DATA_ROOT"  

  echo "Set the TRAIN_DATA_ROOT variable in create_imagenet.sh to the path" \  

       "where the ImageNet training data is stored."  

  exit 1  

fi  

echo "Creating train lmdb..."  

GLOG_logtostderr=1 $TOOLS/convert_imageset \  

    --resize_height=$RESIZE_HEIGHT \  

    --resize_width=$RESIZE_WIDTH \  

    --shuffle \  

    $TRAIN_DATA_ROOT \  

    $DATA/train.txt \  

    $EXAMPLE/resnet_train_lmdb  

echo "Done."

生成的過程採用TRAIN_DATA_ROOT下的圖片，具體的圖片目錄在train.txt中：

train.txt的內容大致如下：

[plain]view
plain copy

000001.jpg 0  

000002.jpg 1  

000003.jpg 2  

000004.jpg 3  

000005.jpg 4  

000006.jpg 5  

000007.jpg 6  

000008.jpg 7  

000009.jpg 8  

000010.jpg 9

前面的爲TRAIN_DATA_ROOT下的圖片文件名，後面的數字代表其標籤label。

運行create_imagenet.sh後就會在EXAMPLE目錄下生成lmdb文件夾，其中包含data.mdb和lock.mdb。這些都是caffe需要使用的數據格式。

2.2 編寫solver和prototxt

先寫各層網絡結構的定義res_pretrain.prototxt：

[plain]view
plain copy

name: "ResNet-50"  

layer {  

  name: "imagenet"  

  type: "Data"  

  top: "data"  

  top: "label"  

  include {  

    phase: TRAIN  

  }  

  data_param {  

    source: "models/resnet/resnet_train_lmdb"         //剛纔產生的train的lmdb  

    batch_size: 8  

    backend: LMDB  

  }  

}  

layer {  

  name: "imagenet"  

  type: "Data"  

  top: "data"  

  top: "label"  

  include {  

    phase: TEST  

  }  

  data_param {  

    source: "models/resnet/resnet_test_lmdb"          //同理可以產生的test的lmdb  

    batch_size: 1  

    backend: LMDB  

  }  

}  

/////////////////////////////////////////////////////////////////  

////                 resnet結構                              ////  

/////////////////////////////////////////////////////////////////  

layer {  

    bottom: "data"  

    top: "conv1"  

    name: "conv1"  

    type: "Convolution"  

    convolution_param {  

        num_output: 64  

        kernel_size: 7  

        pad: 3  

        stride: 2  

    }  

}  

layer {  

    bottom: "conv1"  

    top: "conv1"  

    name: "bn_conv1"  

    type: "BatchNorm"  

    batch_norm_param {  

        use_global_stats: true  

    }  

}  

layer {  

    bottom: "conv1"  

    top: "conv1"  

    name: "scale_conv1"  

    type: "Scale"  

    scale_param {  

        bias_term: true  

    }  

}  

layer {  

    bottom: "conv1"  

    top: "conv1"  

    name: "conv1_relu"  

    type: "ReLU"  

}  

layer {  

    bottom: "conv1"  

    top: "pool1"  

    name: "pool1"  

    type: "Pooling"  

    pooling_param {  

        kernel_size: 3  

        stride: 2  

        pool: MAX  

    }  

}  

layer {  

    bottom: "pool1"  

    top: "res2a_branch1"  

    name: "res2a_branch1"  

    type: "Convolution"  

    convolution_param {  

        num_output: 256  

        kernel_size: 1  

        pad: 0  

        stride: 1  

        bias_term: false  

    }  

}  

layer {  

    bottom: "res2a_branch1"  

    top: "res2a_branch1"  

    name: "bn2a_branch1"  

    type: "BatchNorm"  

    batch_norm_param {  

        use_global_stats: true  

    }  

}  

//...............................  

layer {  

    bottom: "res5c_branch2a"  

    top: "res5c_branch2a"  

    name: "bn5c_branch2a"  

    type: "BatchNorm"  

    batch_norm_param {  

        use_global_stats: true  

    }  

}  

layer {  

    bottom: "res5c_branch2a"  

    top: "res5c_branch2a"  

    name: "scale5c_branch2a"  

    type: "Scale"  

    scale_param {  

        bias_term: true  

    }  

}  

layer {  

    bottom: "res5c_branch2a"  

    top: "res5c_branch2a"  

    name: "res5c_branch2a_relu"  

    type: "ReLU"  

}  

layer {  

    bottom: "res5c_branch2a"  

    top: "res5c_branch2b"  

    name: "res5c_branch2b"  

    type: "Convolution"  

    convolution_param {  

        num_output: 512  

        kernel_size: 3  

        pad: 1  

        stride: 1  

        bias_term: false  

    }  

}  

layer {  

    bottom: "res5c_branch2b"  

    top: "res5c_branch2b"  

    name: "bn5c_branch2b"  

    type: "BatchNorm"  

    batch_norm_param {  

        use_global_stats: true  

    }  

}  

layer {  

    bottom: "res5c_branch2b"  

    top: "res5c_branch2b"  

    name: "scale5c_branch2b"  

    type: "Scale"  

    scale_param {  

        bias_term: true  

    }  

}  

layer {  

    bottom: "res5c_branch2b"  

    top: "res5c_branch2b"  

    name: "res5c_branch2b_relu"  

    type: "ReLU"  

}  

layer {  

    bottom: "res5c_branch2b"  

    top: "res5c_branch2c"  

    name: "res5c_branch2c"  

    type: "Convolution"  

    convolution_param {  

        num_output: 2048  

        kernel_size: 1  

        pad: 0  

        stride: 1  

        bias_term: false  

    }  

}  

layer {  

    bottom: "res5c_branch2c"  

    top: "res5c_branch2c"  

    name: "bn5c_branch2c"  

    type: "BatchNorm"  

    batch_norm_param {  

        use_global_stats: true  

    }  

}  

layer {  

    bottom: "res5c_branch2c"  

    top: "res5c_branch2c"  

    name: "scale5c_branch2c"  

    type: "Scale"  

    scale_param {  

        bias_term: true  

    }  

}  

layer {  

    bottom: "res5b"  

    bottom: "res5c_branch2c"  

    top: "res5c"  

    name: "res5c"  

    type: "Eltwise"  

}  

layer {  

    bottom: "res5c"  

    top: "res5c"  

    name: "res5c_relu"  

    type: "ReLU"  

}  

layer {  

    bottom: "res5c"  

    top: "pool5"  

    name: "pool5"  

    type: "Pooling"  

    pooling_param {  

        kernel_size: 7  

        stride: 1  

        pool: AVE  

    }  

}  

layer {  

    bottom: "pool5"  

    top: "fc1000"  

    name: "fc1000"  

    type: "InnerProduct"  

    inner_product_param {  

        num_output: 1000  

    }  

}  

//loss function  

layer {  

  name: "accuracy"  

  type: "Accuracy"  

  bottom: "fc1000"  

  bottom: "label"  

  top: "accuracy"  

  include {  

    phase: TEST  

  }  

}  

layer {  

  name: "loss"  

  type: "SoftmaxWithLoss"  

  bottom: "fc1000"  

  bottom: "label"  

  top: "loss"  

}

寫好了網絡層的prototxt之後，寫solver，res_pretrain_solver.prototxt內容如下：

[plain]view
plain copy

net: "models/resnet/res_pretrain.prototxt"             //上一步中寫的網絡層次結構  

test_iter: 10  

test_interval: 10  

base_lr: 0.01                                          //基礎學習率 learning-rate  

lr_policy: "step"                                      //學習策略  

gamma: 0.1  

stepsize: 100000  

display: 20  

max_iter: 450000                                       //迭代次數  

momentum: 0.9                                          //學習率衰減係數  

weight_decay: 0.0005                                   //權重衰減係數，防止過擬合  

snapshot: 1000                                         //每1000次迭代保存一次參數中間結果  

snapshot_prefix: "models/resnet/resnet_train"  

solver_mode: CPU

2.3 進行pretrain訓練

在caffe目錄下運行

[plain]view
plain copy

./build/tools/caffe train --solver=models/resnet/res_pretrain_solver.prototxt  

solver=之後寫的是上面的prototxt地址。

至此，在imagenet上的預訓練到此爲止。訓練之後會生成一個caffemodel，這就是之後需要接到ssd之前網絡的參數。

3.接入ssd網絡

ssd網絡finetuning的流程與之前pretrain基本一致。

3.1產生lmdb

ssd使用的lmdb與之前略有不同。

其train.txt文件下不再是圖片對應類型，因爲有boundingbox的存在，所以一個圖片對應一個xml文件，如下：

[plain]view
plain copy

VOC2007/JPEGImages/000001.jpg VOC2007/Annotations/000001.xml  

VOC2007/JPEGImages/000002.jpg VOC2007/Annotations/000002.xml  

VOC2007/JPEGImages/000003.jpg VOC2007/Annotations/000003.xml  

VOC2007/JPEGImages/000004.jpg VOC2007/Annotations/000004.xml  

VOC2007/JPEGImages/000006.jpg VOC2007/Annotations/000006.xml  

VOC2007/JPEGImages/000008.jpg VOC2007/Annotations/000008.xml  

VOC2007/JPEGImages/000010.jpg VOC2007/Annotations/000010.xml  

VOC2007/JPEGImages/000011.jpg VOC2007/Annotations/000011.xml  

VOC2007/JPEGImages/000013.jpg VOC2007/Annotations/000013.xml  

VOC2007/JPEGImages/000014.jpg VOC2007/Annotations/000014.xml

其create_data.sh腳本內容大致如下：

[plain]view
plain copy

cd $root_dir  

redo=1  

data_root_dir="$HOME/data/VOCdevkit"  

dataset_name="VOC0712"  

mapfile="$root_dir/data/$dataset_name/labelmap_voc.prototxt"  

anno_type="detection"  

db="lmdb"  

min_dim=0  

max_dim=0  

width=0  

height=0  

extra_cmd="--encode-type=jpg --encoded"  

if [ $redo ]  

then  

  extra_cmd="$extra_cmd --redo"  

fi  

for subset in test trainval  

do  

  python $root_dir/scripts/create_annoset.py --anno-type=$anno_type --label-map-file=$mapfile --min-dim=$min_dim --max-dim=$max_dim   

--resize-width=$width --resize-height=$height --check-label $extra_cmd $data_root_dir $root_dir/data/$dataset_name/$subset.txt   

$data_root_dir/$dataset_name/$db/$dataset_name"_"$subset"_"$db examples/$dataset_name  

done

至此可以產生新的lmdb，假定爲ssd_train_lmdb用於整體網絡的數據輸入。

3.2 編寫solver和prototxt

首先定義ssd網絡層次結構ssd_finetuning.prototxt：

[cpp]view
plain copy

//ssd中輸入層的定義非常複雜，但其中只有一些需要改動，其餘的照搬就行  

layer {  

  name: "data"  

  type: "AnnotatedData"  

  top: "data"  

  top: "label"  

  include {  

    phase: TRAIN  

  }  

  transform_param {  

    mirror: true  

    mean_value: 104  

    mean_value: 117  

    mean_value: 123  

    resize_param {  

      prob: 1  

      resize_mode: WARP  

      height: 300  

      width: 300  

      interp_mode: LINEAR  

      interp_mode: AREA  

      interp_mode: NEAREST  

      interp_mode: CUBIC  

      interp_mode: LANCZOS4  

    }  

    emit_constraint {  

      emit_type: CENTER  

    }  

  }  

  data_param {  

    source: "models/resnet/<span style="font-size:14px;">ssd_train_lmdb</span>"               //剛纔生成的新的lmdb  

    batch_size: 32  

    backend: LMDB  

  }  

  annotated_data_param {  

    batch_sampler {  

      max_sample: 1  

      max_trials: 1  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        min_jaccard_overlap: 0.1  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        min_jaccard_overlap: 0.3  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        min_jaccard_overlap: 0.5  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        min_jaccard_overlap: 0.7  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        min_jaccard_overlap: 0.9  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    batch_sampler {  

      sampler {  

        min_scale: 0.3  

        max_scale: 1.0  

        min_aspect_ratio: 0.5  

        max_aspect_ratio: 2.0  

      }  

      sample_constraint {  

        max_jaccard_overlap: 1.0  

      }  

      max_sample: 1  

      max_trials: 50  

    }  

    label_map_file: "data/VOC0712/labelmap_voc.prototxt"  

  }  

}  

//resnet結構  

layer {  

bottom: "data"  

top: "conv1"  

name: "conv1"  

type: "Convolution"  

convolution_param {  

num_output: 64  

kernel_size: 7  

pad: 3  

stride: 2  

}  

}  

layer {  

bottom: "conv1"  

top: "conv1"  

name: "bn_conv1"  

type: "BatchNorm"  

batch_norm_param {  

use_global_stats: true  

}  

}  

layer {  

    bottom: "data"  

    top: "conv1"  

    name: "conv1"  

    type: "Convolution"  

    convolution_param {  

        num_output: 64  

        kernel_size: 7  

        pad: 3  

        stride: 2  

    }  

}  

//省略很多resnet層  

layer {  

  bottom: "res5c"  

  top: "res5c"  

  name: "res5c_relu"  

  type: "ReLU"  

}  

layer {  

  bottom: "res5c"  

  top: "pool5"  

  name: "pool5"  

  type: "Pooling"  

  pooling_param {  

    kernel_size: 7  

    stride: 1  

    pool: AVE  

    }  

}

[cpp]view
plain copy

//至此resnet主體結構完成，隨後接上ssd的結構  

//用pool5作爲bottom分別產生mbox_loc/mbox_conf/mbox_priorbox  

layer {  

  name: "pool5_mbox_loc"  

  type: "Convolution"  

  bottom: "pool5"                               //選取pool5作爲bottom，產生mbox_loc  

  top: "pool5_mbox_loc"  

  param {  

    lr_mult: 1  

    decay_mult: 1  

  }  

  param {  

    lr_mult: 2  

    decay_mult: 0  

  }  

  convolution_param {  

    num_output: 24  

    pad: 1  

    kernel_size: 3  

    stride: 1  

    weight_filler {  

      type: "xavier"  

    }  

    bias_filler {  

      type: "constant"  

      value: 0  

    }  

  }  

}  

layer {  

  name: "pool5_mbox_loc_perm"                     //將上一層產生的mbox_loc重新排序  

  type: "Permute"  

  bottom: "pool5_mbox_loc"  

  top: "pool5_mbox_loc_perm"  

  permute_param {  

    order: 0  

    order: 2  

    order: 3  

    order: 1  

  }  

}  

layer {  

  name: "pool5_mbox_loc_flat"                    //將上一層展平（例如7*7的展平成1*49，方便之後的拼接）  

  type: "Flatten"  

  bottom: "pool5_mbox_loc_perm"  

  top: "pool5_mbox_loc_flat"  

  flatten_param {  

    axis: 1  

  }  

}  

layer {  

  name: "pool5_mbox_conf"  

  type: "Convolution"  

  bottom: "pool5"                               //選取pool5作爲bottom，產生mbox_conf（之後的排序展平同理）  

  top: "pool5_mbox_conf"  

 param {  

    lr_mult: 1  

    decay_mult: 1  

  }  

  param {  

    lr_mult: 2  

    decay_mult: 0  

  }  

  convolution_param {  

    num_output: 126  

    pad: 1  

    kernel_size: 3  

    stride: 1  

    weight_filler {  

      type: "xavier"  

    }  

    bias_filler {  

      type: "constant"  

      value: 0  

    }  

  }  

}  

layer {  

  name: "pool5_mbox_conf_perm"  

  type: "Permute"  

  bottom: "pool5_mbox_conf"  

  top: "pool5_mbox_conf_perm"  

  permute_param {  

    order: 0  

    order: 2  

    order: 3  

    order: 1  

  }  

}  

layer {  

  name: "pool5_mbox_conf_flat"  

  type: "Flatten"  

  bottom: "pool5_mbox_conf_perm"  

  top: "pool5_mbox_conf_flat"  

  flatten_param {  

    axis: 1  

  }  

}  

layer {  

  name: "pool5_mbox_priorbox"  

  type: "PriorBox"  

  bottom: "pool5"                                //選取pool5作爲bottom，產生mbox_priorbox（之後排序展平）  

  bottom: "data"  

  top: "pool5_mbox_priorbox"  

  prior_box_param {  

    min_size: 276.0  

    max_size: 330.0  

    aspect_ratio: 2  

    aspect_ratio: 3  

    flip: true  

    clip: true  

    variance: 0.1  

    variance: 0.1  

    variance: 0.2  

    variance: 0.2  

  }  

}  

//同理用res5c作爲bottom分別產生mbox_loc/mbox_conf/mbox_priorbox  

layer {  

  name: "res5c_mbox_loc"  

  type: "Convolution"  

  bottom: "res5c"  

  top: "res5c_mbox_loc"                                           

  param {  

    lr_mult: 1  

    decay_mult: 1  

  }  

  param {  

    lr_mult: 2  

    decay_mult: 0  

  }  

  convolution_param {  

    num_output: 24  

    pad: 1  

    kernel_size: 3  

    stride: 1  

    weight_filler {  

      type: "xavier"  

    }  

    bias_filler {  

      type: "constant"  

      value: 0  

    }  

  }  

}  

layer {  

  name: "res5c_mbox_loc_perm"  

  type: "Permute"  

  bottom: "res5c_mbox_loc"  

  top: "res5c_mbox_loc_perm"  

  permute_param {  

    order: 0  

    order: 2  

    order: 3  

    order: 1  

  }  

}  

layer {  

  name: "res5c_mbox_loc_flat"  

  type: "Flatten"  

  bottom: "res5c_mbox_loc_perm"  

  top: "res5c_mbox_loc_flat"  

  flatten_param {  

    axis: 1  

  }  

}  

layer {  

  name: "res5c_mbox_conf"  

  type: "Convolution"  

  bottom: "res5c"  

  top: "res5c_mbox_conf"  

  param {  

    lr_mult: 1  

    decay_mult: 1  

  }  

  param {  

    lr_mult: 2  

    decay_mult: 0  

  }  

  convolution_param {  

    num_output: 126  

    pad: 1  

    kernel_size: 3  

    stride: 1  

    weight_filler {  

      type: "xavier"  

    }  

    bias_filler {  

      type: "constant"  

      value: 0  

    }  

  }  

}  

layer {  

  name: "res5c_mbox_conf_perm"  

  type: "Permute"  

  bottom: "res5c_mbox_conf"  

  top: "res5c_mbox_conf_perm"  

  permute_param {  

    order: 0  

    order: 2  

    order: 3  

    order: 1  

  }  

}  

layer {  

  name: "res5c_mbox_conf_flat"  

  type: "Flatten"  

  bottom: "res5c_mbox_conf_perm"  

  top: "res5c_mbox_conf_flat"  

  flatten_param {  

    axis: 1  

  }  

}  

layer {  

  name: "res5c_mbox_priorbox"  

  type: "PriorBox"  

  bottom: "res5c"  

  bottom: "data"  

  top: "res5c_mbox_priorbox"  

  prior_box_param {  

    min_size: 276.0  

    max_size: 330.0  

    aspect_ratio: 2  

    aspect_ratio: 3  

    flip: true  

    clip: true  

    variance: 0.1  

    variance: 0.1  

    variance: 0.2  

    variance: 0.2  

  }  

}  

//Concat層將剛纔的res5c和pool5產生的mbox_loc/mbox_conf/mbox_priorbox拼接起來形成一個層  

layer {  

  name: "mbox_loc"  

  type: "Concat"  

  bottom: "res5c_mbox_loc_flat"  

  bottom: "pool5_mbox_loc_flat"  

  top: "mbox_loc"  

  concat_param {  

    axis: 1  

  }  

}  

layer {  

  name: "mbox_conf"  

  type: "Concat"  

  bottom: "res5c_mbox_conf_flat"  

  bottom: "pool5_mbox_conf_flat"  

  top: "mbox_conf"  

  concat_param {  

    axis: 1  

  }  

}  

layer {  

  name: "mbox_priorbox"  

  type: "Concat"  

  bottom: "res5c_mbox_priorbox"  

  bottom: "pool5_mbox_priorbox"  

  top: "mbox_priorbox"  

  concat_param {  

    axis: 2  

  }  

}  

<span style="color:#ff0000;">//mbox_loc,mbox_conf,mbox_priorbox一起做的loss-function</span>  

layer {  

  name: "mbox_loss"  

  type: "MultiBoxLoss"  

  bottom: "mbox_loc"  

  bottom: "mbox_conf"  

  bottom: "mbox_priorbox"  

  bottom: "label"  

  top: "mbox_loss"  

  include {  

    phase: TRAIN  

  }  

  propagate_down: true  

  propagate_down: true  

  propagate_down: false  

  propagate_down: false  

  loss_param {  

    normalization: VALID  

  }  

  multibox_loss_param {  

    loc_loss_type: SMOOTH_L1  

    conf_loss_type: SOFTMAX  

    loc_weight: 1.0  

    num_classes: 21  

    share_location: true  

    match_type: PER_PREDICTION  

    overlap_threshold: 0.5  

    use_prior_for_matching: true  

    background_label_id: 0  

    use_difficult_gt: true  

    do_neg_mining: true  

    neg_pos_ratio: 3.0  

    neg_overlap: 0.5  

    code_type: CENTER_SIZE  

  }  

}

ssd中，mbox_loc層產生x,y,w,h四個值，mbox_conf對於每一個分類都有一個值，如果有20個分類，那就會產生20個值。

對於剛纔的prototxt中，res5c層的尺寸爲7*7，每一個像素會產生6個boundingbox，pool5層的尺寸爲1*1，每一個像素會產生6個boundingbox。總共是7*7*6+1*1*6個候選的boundingbox。

如果需要增加候選的數量，那麼就和pool5一樣，在resnet中任意選取中間層randomlayer，在這些層之後加入randomlayer_mbox_loc/randomlayer_mbox_conf/randomlayer_mbox_priorbox，最終將這些層都展平並拼接在一起。

至此，ssd的整體網絡結構prototxt已經編寫完成。

對於solver，與之前沒有什麼區別，ssd_finetuning_solver：

[plain]view
plain copy

net: "models/resnet/ssd_finetuning.prototxt"  

base_lr: 0.01  

lr_policy: "step"  

gamma: 0.1  

stepsize: 100000  

display: 20  

max_iter: 450000  

momentum: 0.9  

weight_decay: 0.0005  

snapshot: 10000  

snapshot_prefix: "models/resnet/resnet_train"  

solver_mode: CPU

3.3 訓練網絡

在caffe目錄下運行：

[plain]view
plain copy

./build/tools/caffe train --solver=models/resnet/ssd_finetuning_solver.prototxt -weights models/resnet/res_pretrain.caffemodel  

solver=之後加solver地址， weights參數後加預訓練pretrain中res_pretrain.caffemodel的參數。

至此，就將pretrain好的resnet網絡接入了ssd前面。

jasonzzj

發佈了41 篇原創文章 · 獲贊 192 · 訪問量 73萬+

私信關注

以resnet作爲前置網絡的ssd目標提取檢測

tensorflow 使用多塊GPU同時訓練多個模型

Python的 range()函數解析

交叉熵代價函數(損失函數)及其求導推導 (Logistic Regression)

以resnet作爲前置網絡的ssd目標提取檢測

softmax 帶 tempreture 參數

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結