中国科学院机构知识库网格系统: 基于形状的图像匹配

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

基于形状的图像匹配

文献类型：学位论文


作者	张翠
学位类别	工学硕士
答辩日期	2011-05-20
授予单位	中国科学院研究生院
授予地点	中国科学院自动化研究所
导师	普林特
关键词	形状匹配形状描述子谱分析形状上下文 shape matching shape descriptor spectral analysis shape context
其他题名	Image Matching based on Shape
学位专业	模式识别与智能系统
中文摘要	在人的视觉感知、识别和理解中，形状是图像中目标的基本内在特性，是用于目标识别的重要特征,因此基于形状的图像匹配、目标识别方法研究具有重要意义。针对不同的研究目的，人们已经提出了许多种不同的形状匹配方法，如：k邻接分割（kAS）[11]，金字塔匹配[21]，形状上下文[6–8]，基于内积的形状上下文[22, 23]等。形状的描述及其表达依然是一个开放性问题，其中最困难的问题是缺乏对于形状这一概念的清晰的定义[42]。虽然，形状的轮廓能够通过参数曲线，例如：傅立叶系数等来表示，但是，这些数学方法难以解决我们在描述和比较不同形状时所遇到的问题。例如：谱图理论在图像分割中得到了广泛的应用，但是，在应用到3D图像匹配时却需要对尺度进行归一化[19, 20]；而形状上下文描述子也受到各位学者的推崇，但是却依赖于对目标轮廓的检测。本文的目的在于探索新方法来描述和比较目标的形状，并且应用到形状匹配和识别等方面。因此，本文首先系统地分析了目前较流行的几类形状描述算子，并比较了它们的优缺点。接下来详细介绍了我们基于谱图理论，以及形状上下文描述子的工作。本文的主要工作和贡献可以概括如下： 1. 提出了基于特征向量直方图匹配的形状匹配算法这部分工作主要基于D.Knossow等人基于谱图理论的模糊匹配[19, 20]的工作展开的，由于图的Laplacian矩阵同时包含了形状轮廓的局部和全局信息，我们通过Laplacian矩阵的特征向量直方图来表示2维形状轮廓。但是，谱分析本身存在许多难点，例如，如何有效的进行特征值排序，为了克服上述问题，我们引入子矩阵匹配的方法来对由不同尺度图形的特征谱定义的特征空间匹配。通过设定恰当的距离度量方法来比较两个形状之间的相似性。通过在图像数据库MPEG-7CE-Shape-1上的实验，证明了方法的有效性。 2. 提出了基于特征向量直方图匹配的形状匹配算法这部分工作主要基于D.Knossow等人基于谱图理论的模糊匹配[19, 20]的工作展开的，由于图的Laplacian矩阵同时包含了形状轮廓的局部和全局信息，我们通过Laplacian矩阵的特征向量直方图来表示2维形状轮廓。但是，谱分析本身存在许多难点，例如，如何有效的进行特征值排序，为了克服上述问题，我们引入子矩阵匹配的方法来对由不同尺度图形的特征谱定义的特征空间匹配。通过设定恰当的距离度量方法来比较两个形状之间的相似性。通过在图像数据库MPEG-7CE-Shape-1上的实验，证明了方法的有效性。 2. 提出了基于夹角直方图的形状匹配算法为了避免上一章方法中因计算特征值和特征向量而引起的高计算量，这一章我们基于S.Belongie等人的形状上下文描述子[6–8]，进行了相关的改进。首先我们选取轮廓上的局部曲率最大值作为特征点，形状轮廓的中心作为参照点，计算给定特征点相对其他所有点的夹角，从而每个特征点可以表示为相对其他点的角度直方图。所有特征点的直方图可以作为形状的全局描述子，进行形状相似性比较。最后，我们在图像数据库MPEG-7CE-Shape-1上的实验结果验证了该方法的有效性。
英文摘要	Shape is an intrinsic and discriminant property of object for human visual perception.Therefore,shape is very critical for object recognition and image understanding.To address diﬀerent application cases,a large variety of shape descriptors have been proposed.such as:K-Adjacent Segmentation[11], Spatial Pyramid Matching[21],Shape Context[6–8],Shape Context Based on Inner-Distance[22, 23] and so on. Shape description and representation is an old and still open problem. The intrinsic diﬃculty is the absence of a clear deﬁnition of ”what is a shape ” [42].Though classically, a shape contour can be represented as a parametric curve or fourier coeﬃcients for example, these mathematical approaches are far from being satisfying when dealing with the problem of describing and comparing diﬀerent shapes. The objective of this master is to explore new ways to describe and compare the shape of objects. Applications of this work are concerned with shape match ing and recognition.In the ﬁrst part of the thesis, we introduce and compare several shape descriptors recently proposed in the computer vision literature, analyze their advantages and limitations. In the rest of the thesis, we present and detail the main work of this master. Contributions can be summarized as follows: 1. A shape matching method based on eigenvector histogram matching is proposed. Inspired by recent researches on spectral graph theory for 3D geometry matching [19, 20], we use histograms of eigenvectors of the Laplacian matrix to represent the 2D shape contour. The motivation for doing so is that the Laplacian of a graph deﬁned over the shape contour encompasses both local and global properties. In this process however, we are facing an intrinsic diﬃculty of spectral analysis, ie the appropriate ordering of eigenvalues. In order to overcome this problem and to allow the alignment of the basis of
语种	中文
其他标识符	200828014628060
源URL	[http://ir.ia.ac.cn/handle/173211/7610]
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	张翠. 基于形状的图像匹配[D]. 中国科学院自动化研究所. 中国科学院研究生院. 2011.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。