Metric Learning
Introduction
Definition
Distance metric learning aims to quantify how similar two samples are, which is one of the core problems in pattern recognition. The performance of many machine learning methods, such as the k-nearest neighbor, support vector machine, and radial basis function network classifiers, the k-means clustering algorithm, and various graph-based methods, is largely determined by the choice of similarity measure between samples.
Origin
Proposed by Eric Xing et al. at NIPS 2002.
Advantages
The usual goal of metric learning is to shrink distances between samples of the same class as much as possible while enlarging distances between samples of different classes.
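A minimal numerical sketch of this goal in Python: the linear transform L below is hand-picked purely for illustration (a metric learning method would fit it from labeled data), and it induces the Mahalanobis matrix M = L^T L.

```python
import numpy as np

# Two same-class points differing along axis 0, and a point from
# another class differing along axis 1 (made-up toy data).
x1 = np.array([1.0, 0.0])
x2 = np.array([2.0, 0.0])   # same class as x1
x3 = np.array([1.0, 1.0])   # different class

# Hand-picked transform L (not learned); M = L.T @ L is the metric.
L = np.array([[0.1,  0.0],   # shrink the within-class direction
              [0.0, 10.0]])  # stretch the between-class direction

def dist(a, b, L=None):
    d = a - b
    if L is not None:
        d = L @ d
    return np.sqrt(d @ d)

print(dist(x1, x2), dist(x1, x3))        # Euclidean: 1.0 vs 1.0
print(dist(x1, x2, L), dist(x1, x3, L))  # learned-style metric: 0.1 vs 10.0
```

Under the plain Euclidean metric the two comparisons are tied; under the chosen metric the same-class pair becomes much closer than the different-class pair, which is exactly the behavior metric learning optimizes for.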
Disadvantages
TODO
Application Areas
Face recognition, object recognition, music similarity, human pose estimation, information retrieval, speech recognition, handwriting recognition, and related fields.
Related
- Euclidean distance and Mahalanobis distance (see the sketch after this list)
- Image features: color histograms, GIST, SIFT
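A brief sketch of the two distances, assuming NumPy and SciPy; the toy data and its correlation structure are made up for illustration. With the inverse covariance matrix, Mahalanobis distance accounts for feature scale and correlation; with the identity matrix it reduces to Euclidean distance.

```python
import numpy as np
from scipy.spatial.distance import euclidean, mahalanobis

# Toy data with correlated features (assumed for illustration)
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)) @ np.array([[1.0, 0.8],
                                          [0.0, 0.6]])

VI = np.linalg.inv(np.cov(X.T))  # inverse covariance matrix

u, v = X[0], X[1]
print(euclidean(u, v))           # straight-line distance
print(mahalanobis(u, v, VI))     # distance adjusted for correlation

# With the identity as VI, Mahalanobis equals Euclidean distance.
print(mahalanobis(u, v, np.eye(2)))
```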
Approaches
The surveys "An Overview of Distance Metric Learning" and "Distance Metric Learning: A Comprehensive Survey" can be found in Reference 2.
- Supervised Distance Metric Learning
Methods | Locality | Linearity | Learning Strategies | Code Download
---|---|---|---|---
Probabilistic Global Distance Metric Learning (PGDM) | global | linear | constrained convex programming | by Eric P. Xing
Relevant Components Analysis (RCA) | global | linear | capture global structure; use equivalence constraints | by Aharon Bar-Hillel and Tomer Hertz
Discriminative Component Analysis (DCA) | global | linear | improve RCA by exploring negative constraints | by Steven C.H. Hoi
Local Fisher Discriminant Analysis (LFDA) | local | linear | extend LDA by assigning greater weights to closer connecting examples | by Masashi Sugiyama
Neighborhood Component Analysis (NCA) | local | linear | extend the nearest neighbor classifier toward metric learning | by Charless C. Fowlkes
Large Margin Nearest Neighbor Classifier (LMNN) | local | linear | extend NCA through a maximum margin framework | by Kilian Q. Weinberger
Localized Distance Metric Learning (LDM) | local | linear | optimize local compactness and local separability in a probabilistic framework | by Liu Yang
DistBoost | global | linear | learn distance functions by training binary classifiers with margins in a boosting framework | by Tomer Hertz and Aharon Bar-Hillel (with notes on calling its kernel version)
Active Distance Metric Learning (BAYES+VAR) | global | linear | select example pairs with the greatest uncertainty; posterior estimation with a full Bayesian treatment | by Liu Yang
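Besides the author downloads listed above, NCA also has a maintained implementation in scikit-learn (sklearn.neighbors.NeighborhoodComponentsAnalysis, available in scikit-learn 0.21 and later); a minimal sketch pairing it with a kNN classifier on the iris dataset:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier, NeighborhoodComponentsAnalysis

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Learn a linear transform that improves kNN accuracy (the NCA objective)
nca = NeighborhoodComponentsAnalysis(n_components=2, random_state=42)
nca.fit(X_train, y_train)

# Classify with kNN in the learned space
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(nca.transform(X_train), y_train)
print(knn.score(nca.transform(X_test), y_test))
```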
- Unsupervised Distance Metric Learning
Methods | Locality | Linearity | Learning Strategies | Code Download
---|---|---|---|---
Principal Component Analysis (PCA) | global structure preserved | linear | best preserve the variance of the data | by Deng Cai
Multidimensional Scaling (MDS) | global structure preserved | linear | best preserve inter-point distances in a low-rank embedding | included in the Matlab Toolbox for Dimensionality Reduction
ISOMAP | global structure preserved | nonlinear | preserve geodesic distances | by J. B. Tenenbaum, V. de Silva and J. C. Langford
Laplacian Eigenmaps (LE) | local structure preserved | nonlinear | preserve local neighborhoods | by Mikhail Belkin
Locality Preserving Projections (LPP) | local structure preserved | linear | linear approximation to LE | LPP and Kernel LPP by Deng Cai
Locally Linear Embedding (LLE) | local structure preserved | nonlinear | nonlinearly preserve local neighborhoods | by Sam T. Roweis and Lawrence K. Saul; Hessian LLE in the MANI (Manifold Learning Matlab Demo) by Todd Wittman
Neighborhood Preserving Embedding (NPE) | local structure preserved | linear | linear approximation to LLE | by Deng Cai
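Several of these unsupervised methods also have scikit-learn implementations; a minimal sketch contrasting linear PCA with nonlinear ISOMAP and LLE on a toy manifold (scikit-learn assumed; the S-curve dataset and neighborhood size are chosen only for illustration):

```python
from sklearn.datasets import make_s_curve
from sklearn.decomposition import PCA
from sklearn.manifold import Isomap, LocallyLinearEmbedding

# 3-D S-curve: a nonlinear manifold that linear methods cannot unfold
X, _ = make_s_curve(n_samples=1000, random_state=0)

X_pca = PCA(n_components=2).fit_transform(X)                     # linear, global
X_iso = Isomap(n_neighbors=10, n_components=2).fit_transform(X)  # nonlinear, geodesic
X_lle = LocallyLinearEmbedding(
    n_neighbors=10, n_components=2).fit_transform(X)             # nonlinear, local

print(X_pca.shape, X_iso.shape, X_lle.shape)  # (1000, 2) each
```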
Implementations
Python
- metric-learn
https://pypi.python.org/pypi/metric-learn/
- LMNN
```python
from metric_learn import LMNN
import numpy as np

# Six 3-D samples and their class labels
X = np.array([[0., 0., 1.],
              [0., 0., 2.],
              [1., 0., 0.],
              [2., 0., 0.],
              [2., 2., 2.],
              [2., 5., 4.]])
Y = np.array([1, 1, 2, 2, 0, 0])

# Learn an LMNN metric (k target neighbors per sample) and map the
# data into the learned space
lmnn = LMNN(k=2, learn_rate=1e-6)
lmnn.fit(X, Y, verbose=False)
Y_c = lmnn.transform(X)
```
- output
```text
>>> Y_c
array([[ 0.        , -0.07987306,  0.11081795],
       [ 0.        , -0.15974612,  0.22163591],
       [ 0.07113444,  0.        ,  0.        ],
       [ 0.14226889,  0.        ,  0.        ],
       [ 0.14226889, -0.04460763,  0.06188978],
       [ 0.14226889, -0.03164602,  0.04390651]])
```
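A typical follow-up, sketched here as a continuation of the example above, is to run a nearest-neighbor classifier in the learned space (scikit-learn assumed; the query point is hypothetical):

```python
from sklearn.neighbors import KNeighborsClassifier

# Continues the LMNN example: classify in the learned space.
# With only six training points this is purely illustrative.
knn = KNeighborsClassifier(n_neighbors=1)
knn.fit(Y_c, Y)  # Y_c = lmnn.transform(X) from above

query = np.array([[0., 0., 1.5]])  # hypothetical test point
print(knn.predict(lmnn.transform(query)))  # nearest transformed neighbor's label
```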
Matlab
- DistLearnKit
http://www.cs.cmu.edu/~liuy/distlearn.htm
R
- Supervised Distance Metric Learning
https://github.com/road2stat/sdml
Applications
TODO
References