基於非參數核密度估計的行人分割

原創

2020-02-20 23:50

這是模式識別的一個實驗，參考文獻是：

[1] L. Zhao, L.S. Davis, “Iterative figure-ground discrimination,” 17th International Conference on Pattern Recognition (ICPR), vol. 1, pp. 67-70, 2004.

[2] Automatic Pedestrian Segmentation Combining Shape, Puzzle and Appearance，2013

原理：略

算法步驟：

令Ft(x) 和Bt(x) 分別爲像素x在第t(t=0,…,N)次迭代中屬於前景和背景的概率，基於KDE-EM的前景概率估計過程如下：

初始化：

初始的前景概率圖爲一個先驗統計圖PM，即
F0(x)=PM(x) 。
B0(x)=1−F0
先驗統計圖PM爲300幅前景掩碼圖的疊加：

開始迭代，設置迭代次數：

S-步驟：

M-步驟：

計算前景和背景概率：

根據下式更新圖像中所有像素點屬於前景和背景的概率

F t = c F t - 1 (y) \sum x i \in X F t - 1 (x i) \prod j = 1 d k e r j (y j - x i, j)

B t = c B t - 1 (y) \sum x i \in X B t - 1 (x i) \prod j = 1 d k e r j (y j - x i, j)

x——採樣點
y——圖像中所有的點

歸一化：

F = F F + B

B = 1 - F

結果圖：

代碼：

# -*- coding: utf-8 -*-
# Author: XieYi

from PIL import Image
import matplotlib.pyplot as plt
import numpy as np
import math
from sklearn import preprocessing
from skimage import filters
import matplotlib.cm as cm
import time
"""
準備工作，讀入圖片
"""
img = Image.open("p26.bmp")
img = img.convert('L')
img = np.array(img,'f')
img = img/img.max()
m,n = img.shape
imgVector = img.reshape((m*n,1))

# 參數
c = 1.0
sigma = 0.005   # 控制邊界精度，即模糊程度sigma越大越模糊
sampleRate = 0.05

"""
讀入模板
"""
#將前三列置0
mask = Image.open('mask.png')
mask = mask.convert('L')
mask = mask.resize((n,m))
mask = np.array(mask,'f')
mask[:,:3] = 0

# 採樣點,初始化F0和B0
nSamples = int(sampleRate * m * n)
F0 = np.zeros((m*n,1))
F0 = mask.reshape((m*n,1))
F0max = F0.max()
F0 = F0 / F0max
B0 = 1-F0
F, B = F0, B0
#迭代
t0 = time.time()
for i in np.arange(6):
    print str(i+1),"次迭代...."
    #S步
    samples = np.zeros((nSamples,1))
    for j in np.arange(nSamples):
        y = np.random.randint(0, m-1)
        x = np.random.randint(0, n-1)
        samples[j,0] = y * n + x

    #M步
    f = np.zeros((m*n,1))
    b = np.zeros((m*n,1))
    for Xi in np.arange(nSamples):
        posSample = samples[Xi,0]
        valueSample = imgVector[samples[Xi,0],0]
        diffMat = imgVector - np.tile(valueSample,(m*n,1))
        diffMat  = diffMat**2
        expDiff = np.exp(-diffMat / 2.0 / (sigma**2))
        f = f + c * (F[posSample,0]) / math.sqrt(2*math.pi) / sigma * expDiff

    f = f * F
    min_max_scaler = preprocessing.MinMaxScaler()
    f = min_max_scaler.fit_transform(f)

    for Yi in np.arange(nSamples):
        posSample = samples[Yi,0]
        valueSample = imgVector[samples[Yi,0],0]
        diffMat = imgVector - np.tile(valueSample,(m*n,1))
        diffMat  = diffMat**2
        expDiff = np.exp(-diffMat / 2.0 / (sigma**2))
        b = b + c * (B[posSample,0]) / math.sqrt(2*math.pi) / sigma * expDiff
    b = b * B
    b = min_max_scaler.fit_transform(b)

    add = f + b
    f = f / add
    b = 1 - f
    F, B = f, b
    output = F.reshape((m,n))
    plt.imsave(str(i+1), output,  cmap=cm.gray)
    plt.figure(str(i+1))
    plt.imshow(output,cmap=cm.gray)

print "time:",time.time() - t0
thresh = filters.threshold_otsu(output)
dst =(output >= thresh)*1.0
plt.figure("output")
plt.imshow(dst,cmap=cm.gray)

# 直接利用閾值分割
threshImg = filters.threshold_otsu(img)
dstImg =(img >= threshImg)*1.0
plt.figure("Img")
plt.imshow(dstImg,cmap=cm.gray)
plt.show()

Ruff_XY

發佈了34 篇原創文章 · 獲贊 10 · 訪問量 8萬+

私信關注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

基於非參數核密度估計的行人分割

原理：略

算法步驟：

初始化：

開始迭代，設置迭代次數：

S-步驟：

M-步驟：

計算前景和背景概率：

歸一化：

結果圖：

代碼：

使用c#強大的表達式樹實現對象的深克隆之解決循環引用的問題

GPT-4o 引領人機交互新風向，向量數據庫賽道沸騰了

free AI online tools All In One

痞子衡嵌入式：恩智浦i.MX RT1xxx系列MCU啓動那些事（12.A）- uSDHC eMMC啓動時間(RT1170)

基於Ubuntu-22.04安裝K8s-v1.28.2實驗（二）使用kube-vip實現集羣VIP訪問

企業大模型如何成爲自己數據的“百科全書”？

本地SSL證書過期輸入命令在IIS自動生成

.NET週刊【5月第2期 2024-05-12】

基於Ubuntu-22.04安裝K8s-v1.28.2實驗（一）部署K8s

基於Ubuntu-22.04安裝K8s-v1.28.2實驗（三）數據卷掛載NFS（網絡文件系統）

leetcode--344.Reverse String

VTK基礎概念

no override found for vtkpolydatamapper解決方法

《機器學習實戰》學習筆記——第3章決策樹

恢復軟件環境時遇到的關於opencv的幾個錯誤

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結