最近基於VGG-16縮進了網絡做了一個CNN模型用於處理圖像分類，實際項目，訓練對象是448×32的長條試紙圖片。

項目源碼百度雲

項目源碼百度雲鏈接：https://pan.baidu.com/s/1aWLeh4Kaft7NPlB0GxBZMg
提取碼：vjhu

裏面項目名字沒改，VGG16因爲是改造的，名字也沒好好取，能用就行。。

model	儲存模型文件，爲了方便下載，已經刪除了，可以自己訓練
logs	存放日誌文件，已經刪除，本文後續有圖片展示tensorboard日誌
data	數據文件格式如上右圖，test1和train2爲原始圖片，test和train爲處理後圖片，統一爲448×32大小，用於網絡訓練。每個下有lh1、lh2等爲類別，每個類別下分別存放了圖片注：test/lh1,test1/lh1,train/lh1,train2/lh1下各有一張圖片供參考，數據就不大量泄漏了，其餘文件爲空
VGG16_RAW.py	VGG16模型源文件
VGG16_mini2.py	改造的小型CNN模型，訓練出來模型大概7M左右
tf009_predition.py	預測文件
tf009.py	訓練文件
image_pre_deal.py	圖像預處理文件，將原始圖片轉爲統一大小格式的圖片
calculate_mean.py	計算圖片平均值的文件，用於後續減均值處理

tensorboard可視化展示

tensorboard:（不要在意波折的細節，只要看清準確率，損失值數值就行了，採用的Mini-batch的迭代方式，這次訓練有點亂）

模型：

源代碼

tf009.py源碼，訓練主文件：（寫的遷移學習不要介意，懶得改了。。）

'''
VGG16遷移學習訓練主函數
tensorboard --logdir=D:\python\vgg16\logs

'''

import tensorflow as tf
import os
os.environ["CUDA_VISIBLE_DEVICES"]="-1"  # 由於出現顯卡內存不足問題，所以。。。
import numpy as np
from time import time
import vgg16.VGG16_mini2 as model


def get_batch(image_list,label_list,img_width,img_height,batch_size,capacity):#通過讀取列表來載入批量圖片及標籤
    image = tf.cast(image_list,tf.string)
    label = tf.cast(label_list,tf.int32)
    input_queue = tf.train.slice_input_producer([image,label],shuffle=True)
    label = input_queue[1]
    image_contents = tf.read_file(input_queue[0])

    image = tf.image.decode_jpeg(image_contents,channels=3)
    image = tf.cast(image,tf.float32)
    image -= [42.79902,42.79902,42.79902] # 減均值
    # image = preprocess_for_train(image,img_height,img_width)
    image.set_shape((img_height,img_width,3))
    image_batch,label_batch = tf.train.batch([image,label],batch_size=batch_size,num_threads=64,capacity=capacity)
    label_batch = tf.reshape(label_batch,[batch_size])

    return image_batch,label_batch

def get_file(file_dir):
    images = []
    for root,sub_folders,files in os.walk(file_dir):
        for name in files:
            images.append(os.path.join(root,name))
    labels = []
    for label_name in images:
        letter = label_name.split("\\")[-2]
        if letter =="lh1":labels.append(0)
        elif letter =="lh2":labels.append(1)
        elif letter == "lh3":labels.append(2)
        elif letter == "lh4":labels.append(3)
        elif letter == "lh5":labels.append(4)
        elif letter == "lh6":labels.append(5)
        elif letter == "lh7":
            labels.append(6)

    print("check for get_file:",images[0],"label is ",labels[0])
    #shuffle
    temp = np.array([images,labels])
    temp = temp.transpose()
    np.random.shuffle(temp)
    image_list = list(temp[:,0])
    label_list = list(temp[:,1])
    label_list = [int(float(i)) for i in label_list]
    return image_list,label_list

#標籤格式重構
def onehot(labels):
    n_sample = len(labels)
    n_class = 7  # max(labels) + 1
    onehot_labels = np.zeros((n_sample,n_class))
    onehot_labels[np.arange(n_sample),labels] = 1
    return onehot_labels


if __name__ == '__main__':
    startTime =time()
    batch_size = 8
    record_epoch = 70000/batch_size
    small_loop = int(7000/batch_size)
    capacity = 256  # 內存中存儲的最大數據容量
    pic_height,pic_width = 32,448   # 修改圖片大小參數，應當爲32的倍數！不然會導致錯誤

    xs,ys = get_file('./data/train')#獲取圖像列表與標籤列表
    image_batch,label_batch = get_batch(xs,ys,img_width=pic_width,img_height=pic_height,batch_size=batch_size,capacity=capacity)

    # 驗證集
    xs_val,ys_val = get_file('./data/test')#獲取圖像列表與標籤列表
    image_val_batch,label_val_batch = get_batch(xs_val,ys_val,img_width=pic_width,img_height=pic_height,batch_size=455,capacity=capacity)

    x = tf.placeholder(tf.float32,[None,pic_height,pic_width,3])
    y = tf.placeholder(tf.int32,[None,7])#7分類

    vgg = model.vgg16(x)
    fc8_fineuining = vgg.probs #即softmax(fc8)
    prediction_out = tf.argmax(fc8_fineuining,1)
    real_out = tf.argmax(y,1)
    correct_prediction = tf.equal(prediction_out,real_out)#檢查預測類與實際類別是否匹配
    accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32))#準確率
    loss_function = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=fc8_fineuining,labels=y))#損失函數
    optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.001).minimize(loss_function)

    sess = tf.Session()
    sess.run(tf.global_variables_initializer())
    # vgg.load_weights('vgg16_weights.npz',sess)
    saver = tf.train.Saver()

    # # 斷點續訓
    # ckpt_dir = "./model/"
    # ckpt = tf.train.latest_checkpoint(ckpt_dir)
    # if ckpt != None:
    #     saver.restore(sess, ckpt)
    #     print('saver restore finish')
    # else:
    #     print("training from scratch")

    #啓動線程
    coord = tf.train.Coordinator()#使用協調器管理線程
    threads = tf.train.start_queue_runners(coord=coord,sess=sess)
    # 日誌記錄
    summary_writer = tf.summary.FileWriter('./logs/', graph=sess.graph, flush_secs=15)
    summary_writer2 = tf.summary.FileWriter('./logs/plot2/', flush_secs=15)
    tf.summary.scalar(name='loss_func', tensor=loss_function)
    tf.summary.scalar(name='accuracy', tensor=accuracy)
    merged_summary_op = tf.summary.merge_all()


    epoch_start_time = time()

    # 採用Mini-batch迭代
    step = 0
    epoch = 10000
    for i in range(epoch):
        for j in range(small_loop):

            images,labels = sess.run([image_batch,label_batch])
            labels = onehot(labels)

            # # 可視化
            # plt.subplot(221)
            # plt.imshow(images[0, :, :, 0])
            # plt.show()
            # print(1)

            sess.run(optimizer,feed_dict={x:images,y:labels})
            merged_summary,loss,real_train_out = sess.run([merged_summary_op,loss_function,real_out],feed_dict={x:images,y:labels})
            summary_writer.add_summary(merged_summary, global_step=step)
            # print(i,j,"to see train data:",real_train_out[:10])
            step += 1

        images_val, labels_val = sess.run([image_val_batch, label_val_batch])
        labels_val = onehot(labels_val)
        merged_summary_val, loss_val,accuracy_val,prediction_val_out,real_val_out = sess.run([merged_summary_op, loss_function,accuracy,prediction_out,real_out], feed_dict={x: images_val, y: labels_val})
        summary_writer2.add_summary(merged_summary_val, global_step=step)

        # 輸出每個類別正確率
        lh1_right, lh2_right, lh3_right, lh4_right, lh5_right, lh6_right, lh7_right = 0, 0, 0, 0, 0, 0, 0
        lh1_wrong, lh2_wrong, lh3_wrong, lh4_wrong, lh5_wrong, lh6_wrong, lh7_wrong = 0, 0, 0, 0, 0, 0, 0
        for ii in range(len(prediction_val_out)):
            if prediction_val_out[ii] == real_val_out[ii]:
                if real_val_out[ii] == 0:lh1_right+=1
                elif real_val_out[ii] == 1:lh2_right+=1
                elif real_val_out[ii] == 2:lh3_right += 1
                elif real_val_out[ii] == 3:lh4_right += 1
                elif real_val_out[ii] == 4:lh5_right += 1
                elif real_val_out[ii] == 5:lh6_right += 1
                elif real_val_out[ii] == 6:lh7_right += 1
            else:
                if real_val_out[ii] == 0:lh1_wrong+=1
                elif real_val_out[ii] == 1:lh2_wrong+=1
                elif real_val_out[ii] == 2:lh3_wrong += 1
                elif real_val_out[ii] == 3:lh4_wrong += 1
                elif real_val_out[ii] == 4:lh5_wrong += 1
                elif real_val_out[ii] == 5:lh6_wrong += 1
                elif real_val_out[ii] == 6:lh7_wrong += 1
        print(i,"correct rate :",((lh1_right)/(lh1_right+lh1_wrong)),((lh2_right)/(lh2_right+lh2_wrong)),((lh3_right)/(lh3_right+lh3_wrong)),((lh4_right)/(lh4_right+lh4_wrong)),((lh5_right)/(lh5_right+lh5_wrong)),((lh6_right)/(lh6_right+lh6_wrong)),((lh7_right)/(lh7_right+lh7_wrong)))
        # print(i,"nums:",((lh1_right+lh1_wrong)),(lh2_right+lh2_wrong),((lh3_right+lh3_wrong)),((lh4_right+lh4_wrong)),(lh5_right+lh5_wrong),((lh6_right+lh6_wrong)),((lh7_right+lh7_wrong)))

        print(i,"epoch's accuracy:",accuracy_val)
        print(i," loss is %f"%loss,"val loss is %f"%loss_val)


        epoch_end_time =time()
        print(i," epoch takes:",(epoch_end_time-epoch_start_time))
        epoch_start_time = epoch_end_time
        if i % 1 == 0 and i != 0:
            saver.save(sess,os.path.join("./model/",'epoch{:06d}.ckpt'.format(i)))
            print("------------model saved")
        # print("-------------Epoch %d is finished"%i)

    summary_writer.close()
    saver.save(sess,"./model/")
    print("optimization finished")

    duration = time() - startTime
    print("train takes:","{:.2f}".format(duration))

    coord.request_stop()#通知線程關閉
    coord.join(threads)#等其他線程關閉這一函數才返回

CNN圖像分類（實際項目，特殊訓練集，95%準確率，數據代碼百度雲）

項目源碼百度雲

tensorboard可視化展示

源代碼

MySQL 核心模塊揭祕 | 18 期 | 鎖在內存里長什麼樣*

使用perf工具生成火焰圖

大齡程序員思考

響應式界面控件DevExtreme * 更強的數據分析和可視化功能

HttpSecurity 是如何組裝過濾器鏈的

數說海南——近6年海南各市縣人口簡單看

長序列中Transformers的高級注意力機制總結

WebStorm 創建 Vue 項目

CNN圖像分類（實際項目，特殊訓練集，95%準確率，數據代碼百度雲）

Mac cv2.VideoCapture報錯（Process finished with exit code 134 (interrupted by signal 6: SIGABRT)

python-f-string(簡潔明瞭的輸出，拒絕冗餘，編程好習慣）

卷積操作參數量與計算量

面試題：棧實現dfs（不能遞歸）

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結