Pedestrian Segmentation Based on Non-Parametric Kernel Density Estimation

This is a pattern recognition lab experiment. The references are:

[1] L. Zhao, L.S. Davis, “Iterative figure-ground discrimination,” 17th International Conference on Pattern Recognition (ICPR), vol. 1, pp. 67-70, 2004.

[2] "Automatic Pedestrian Segmentation Combining Shape, Puzzle and Appearance," 2013.

Principle: omitted here.

Algorithm steps:

Let $F^t(x)$ and $B^t(x)$ denote the probabilities that pixel $x$ belongs to the foreground and the background, respectively, at iteration $t$ ($t = 0, \dots, N$). The KDE-EM based foreground probability estimation proceeds as follows:

Initialization:

The initial foreground probability map is a prior statistical map $PM$, i.e.

$$F^0(x) = PM(x)$$

$$B^0(x) = 1 - F^0(x)$$

The prior map $PM$ is the superposition of 300 foreground mask images:

[Figure: prior template]
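As a rough sketch of how such a prior can be built, the 300 binary masks can simply be averaged into a probability map. The masks/ directory and the file names below are hypothetical; adapt them to the actual mask set:

import numpy as np
from PIL import Image

# Build the prior map PM by averaging 300 binary foreground masks
# (all masks are assumed to have the same size).
masks = [np.array(Image.open("masks/mask_%03d.png" % k).convert('L'), 'f') / 255.0
         for k in range(300)]
PM = np.mean(masks, axis=0)   # prior foreground probability, values in [0, 1]
F0 = PM                       # initial foreground probability map
B0 = 1.0 - F0                 # initial background probability map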

Start iterating (set the number of iterations):

S-step:

Randomly sample a subset of pixels $X$ from the image (the script below draws 5% of the pixels at random).

M-step:

Compute the foreground and background probabilities:

Update the probability that each pixel in the image belongs to the foreground and the background according to

$$F^t(y) = c\,F^{t-1}(y)\sum_{x_i \in X} F^{t-1}(x_i)\prod_{j=1}^{d}\mathrm{ker}_j\!\left(y_j - x_{i,j}\right)$$

$$B^t(y) = c\,B^{t-1}(y)\sum_{x_i \in X} B^{t-1}(x_i)\prod_{j=1}^{d}\mathrm{ker}_j\!\left(y_j - x_{i,j}\right)$$

where $x_i$ denotes a sampled pixel and $y$ ranges over all pixels in the image.
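The kernel $\mathrm{ker}_j$ is not spelled out above; the script below works on a single grey-level channel ($d = 1$) and uses a one-dimensional Gaussian kernel with bandwidth $\sigma_j$:

$$\mathrm{ker}_j(u) = \frac{1}{\sqrt{2\pi}\,\sigma_j}\exp\!\left(-\frac{u^2}{2\sigma_j^2}\right)$$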

Normalization:

$$F = \frac{F}{F + B}$$

$$B = 1 - F$$
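For reference, here is a compact vectorized sketch of one full iteration (S-step, M-step, normalization) on a flattened grayscale image, assuming the Gaussian kernel above. The helper name kde_em_step is mine, and the sketch omits the min-max rescaling that the full script applies after each accumulation:

import numpy as np

def kde_em_step(img_vec, F, B, n_samples, sigma=0.005, c=1.0):
    """One KDE-EM iteration. img_vec, F, B are 1-D arrays of length m*n."""
    # S-step: draw a random subset of pixel indices.
    idx = np.random.randint(0, img_vec.size, n_samples)
    # M-step: kernel-weighted contributions from the samples to every pixel.
    diff = img_vec[:, None] - img_vec[idx][None, :]        # shape (m*n, n_samples)
    ker = np.exp(-diff**2 / (2 * sigma**2)) / (np.sqrt(2 * np.pi) * sigma)
    f = c * F * (ker @ F[idx])
    b = c * B * (ker @ B[idx])
    # Normalization: F + B = 1 at every pixel.
    F_new = f / (f + b)
    return F_new, 1.0 - F_new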

Result images:

[Figures: original image, and the results after iterations 1 through 5]

Code:

# -*- coding: utf-8 -*-
# Author: XieYi

from PIL import Image
import matplotlib.pyplot as plt
import numpy as np
import math
from sklearn import preprocessing
from skimage import filters
import matplotlib.cm as cm
import time
"""
准备工作,读入图片
"""
img = Image.open("p26.bmp")
img = img.convert('L')
img = np.array(img,'f')
img = img/img.max()
m,n = img.shape
imgVector = img.reshape((m*n,1))

# Parameters
c = 1.0
sigma = 0.005   # kernel bandwidth: controls boundary precision; the larger sigma, the blurrier the result
sampleRate = 0.05

"""
读入模板
"""
#将前三列置0
mask = Image.open('mask.png')
mask = mask.convert('L')
mask = mask.resize((n,m))
mask = np.array(mask,'f')
mask[:,:3] = 0

# Number of sample points; initialize F0 and B0
nSamples = int(sampleRate * m * n)
F0 = mask.reshape((m*n,1))
F0max = F0.max()
F0 = F0 / F0max
B0 = 1-F0
F, B = F0, B0
# Iterate
t0 = time.time()
for i in np.arange(6):
    print("iteration", i + 1, "....")
    # S-step: randomly sample pixel positions
    samples = np.zeros((nSamples,1), dtype=int)
    for j in np.arange(nSamples):
        y = np.random.randint(0, m)
        x = np.random.randint(0, n)
        samples[j,0] = y * n + x

    # M-step: accumulate kernel-weighted foreground contributions from the samples
    f = np.zeros((m*n,1))
    b = np.zeros((m*n,1))
    for Xi in np.arange(nSamples):
        posSample = samples[Xi,0]
        valueSample = imgVector[posSample,0]
        diffMat = imgVector - np.tile(valueSample,(m*n,1))
        diffMat  = diffMat**2
        expDiff = np.exp(-diffMat / 2.0 / (sigma**2))
        f = f + c * (F[posSample,0]) / math.sqrt(2*math.pi) / sigma * expDiff

    f = f * F
    min_max_scaler = preprocessing.MinMaxScaler()
    f = min_max_scaler.fit_transform(f)

    # Same accumulation for the background
    for Yi in np.arange(nSamples):
        posSample = samples[Yi,0]
        valueSample = imgVector[posSample,0]
        diffMat = imgVector - np.tile(valueSample,(m*n,1))
        diffMat  = diffMat**2
        expDiff = np.exp(-diffMat / 2.0 / (sigma**2))
        b = b + c * (B[posSample,0]) / math.sqrt(2*math.pi) / sigma * expDiff
    b = b * B
    b = min_max_scaler.fit_transform(b)

    # Normalize so that F + B = 1 at every pixel, then update F and B
    add = f + b
    f = f / add
    b = 1 - f
    F, B = f, b
    output = F.reshape((m,n))
    plt.imsave(str(i+1) + ".png", output, cmap=cm.gray)
    plt.figure(str(i+1))
    plt.imshow(output, cmap=cm.gray)

print "time:",time.time() - t0
# Binarize the final foreground probability map with Otsu's threshold
thresh = filters.threshold_otsu(output)
dst = (output >= thresh)*1.0
plt.figure("output")
plt.imshow(dst,cmap=cm.gray)

# For comparison: threshold the original image directly with Otsu
threshImg = filters.threshold_otsu(img)
dstImg = (img >= threshImg)*1.0
plt.figure("Img")
plt.imshow(dstImg,cmap=cm.gray)
plt.show()