21世纪初最有影响力的30篇计算机视觉会议论文（转）

选取论文的原则：

（1）会议论文，主要来源于以下会议：CVPR, ICCV, ECCV, BMVC, FG, ICIP, ICPR, WACV, ICASSP, MM, IJCAI, UAI, AAAI…（其实后面的几个会议都是打酱油的，AI类的期刊也木有高引CV文章，事实也是如此，没有收录ICML和NIPS等，见谅）

（2）发表在2000年以后，数据来源于微软学术搜索(http://academic.research.microsoft.com/) ，2012年12月初的检索结果

（3）检索次数大于200，PER>50，微软学术搜索的被检索次数未必精确，但能反应大概趋势，PER指意味着每年检索量，自发表次年算到2012年，PER=Cited/(2012-YEAR)

榜单Top 30如下，欢迎拾遗补缺：

[1] Rapid Object Detection using a Boosted Cascade of Simple Features (PER=299.64)

Paul A. Viola, Michael J. Jones @CVPR , vol. 1, pp. 511-518, 2001

[2] Histograms of Oriented Gradients for Human Detection ( PER=243.43)

Navneet Dalal, Bill Triggs @CVPR , vol. 1, pp. 886-893, 2005

[3] SURF: Speeded-Up Robust Features (PER=175.67)

Herbert Bay, Tinne Tuytelaars, Luc J. Van Gool @ECCV , pp. 404-417, 2006

[4] Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories ( PER=145.5)

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce @CVPR , vol. 2, pp. 2169-2178, 2006

[5] Object Class Recognition by Unsupervised Scale-Invariant Learning ( PER=119)

Robert Fergus, Pietro Perona, Andrew Zisserman @CVPR , vol. 2, pp. 264-271, 2003

[6] Robust Real-Time Face Detection ( PER=99.27)

Paul A. Viola, Michael J. Jones @ ICCV , 2001

[7] A Bayesian hierarchical model for learning natural scene categories (PER=96.71)

Fei-Fei Li, Pietro Perona @CVPR , vol. 2, pp. 524-531, 2005

[8] Scalable Recognition with a Vocabulary Tree (PER=95)

David Nistér, Henrik Stewénius @CVPR , vol. 2, pp. 2161-2168, 2006

[9] Real-Time Tracking of Non-Rigid Objects Using Mean Shift (PER=94.33)

Dorin Comaniciu, Visvanathan Ramesh, Peter Meer @CVPR , vol. 2, pp. 2142-149 vol.2, 2000

[10]Visual Categorization with Bags of Keypoints (PER=93.13)

Gabriella Csurka, Christopher R. Dance, Lixin Fan, etc @ECCV , 2004

[11]Video Google: A Text Retrieval Approach to Object Matching in Videos (PER=87.78)

Josef Sivic, Andrew Zisserman @ ICCV , pp. 1470-1477, 2003

[12]What Energy Functions Can Be Minimized via Graph Cuts? ( PER=84.2)

Vladimir Kolmogorov, Ramin Zabih @ECCV , pp. 65-81, 2002

[13]Overview of the Face Recognition Grand Challenge (PER=82.57)

P. Jonathon Phillips, Patrick J. Flynn, W. Todd Scruggs, etc @CVPR , vol. 1, pp. 947-954, 2005

[14]Robust wide baseline stereo from maximally stable extremal regions ( PER=81)

Jiri Matas, Ondrej Chum, Martin Urban, etc @BMVC , vol. 1, 2002

[15]PCA-SIFT: A More Distinctive Representation for Local Image Descriptors ( PER=79.88)

Yan Ke, Rahul Sukthankar @CVPR , vol. 2, pp. 506-513, 2004

[16]Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in ND Images ( PER=72.36)

Yuri Y. Boykov, Marie-pierre Jolly @ ICCV , pp. 105-112, 2001

[17]An extended set of Haar-like features for rapid object detection (PER=71)

Rainer Lienhart, Jochen Maydt @ICIP , vol. 1, pp. 900-903, 2002

[18]A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics ( PER=68.18)

David R. Martin, Charless Fowlkes, Doron Tal, etc @ ICCV , pp. 416-425, 2001

[19]Detecting Pedestrians Using Patterns of Motion and Appearance (PER=64.89)

Paul A. Viola, Michael J. Jones, Daniel Snow @ ICCV , pp. 734-741, 2003

[20]Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary (PER=60.3)

Pinar Duygulu, Kobus Barnard, João F. G. De Freitas, etc @ECCV , pp. 97-112, 2002

[21]Real-Time Simultaneous Localisation and Mapping with a Single Camera ( PER=58.56)

Andrew J. Davison @ ICCV , pp. 1403-1410, 2003

[22]Recognizing Human Actions: A Local SVM Approach (PER=55)

Christian Schüldt, Ivan Laptev, Barbara Caputo @ICPR , pp. 32-36, 2004

[23]Actions as Space-Time Shapes (PER=54.14)

Moshe Blank, Lena Gorelick, Eli Shechtman, etc @ ICCV , vol. 2, pp. 1395-1402, 2005

[24]A Discriminatively Trained, Multiscale, Deformable Part Model (PER=53.75)

Pedro F. Felzenszwalb, David A. Mcallester, Deva Ramanan @CVPR , pp. 1-8, 2008

[25]Non-parametric Model for Background Subtraction (PER=53.5)

Ahmed M. Elgammal, David Harwood, Larry S. Davis @ECCV , pp. 751-767, 2000

[26]A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms(PER=53)

Steven M. Seitz, Brian Curless, James Diebel, etc @CVPR , vol. 1, pp. 519-528, 2006

[27]Comprehensive Database for Facial Expression Analysis (PER=53)

Takeo Kanade, Yingli Tian, Jeffrey F. Cohn @FG , pp. 46-53, 2000

[28]Learning Realistic Human Actions from Movies ( PER=52.75)

Ivan Laptev, Marcin Marszalek, Cordelia Schmid, etc @CVPR , pp. 1-8, 2008

[29]Object Retrieval with Large Vocabularies and Fast Spatial Matching (PER=51.6)

James Philbin, Ondrej Chum, Michael Isard, etc @CVPR , 2007

[30]Statistical Shape Influence in Geodesic Active Contours ( PER=51.33)

Michael E. Leventon, W. Eric L. Grimson, Olivier D. Faugeras @CVPR , vol. 1, pp. 1316-1323, 2000

简单小结：

总共30篇会议文章，其中CVPR 14篇，ECCV 5篇，ICCV 7篇，BMVC、FG、ICIP、ICPR各1篇。每年发表的论文数如图所示。

我个人比较关注3个方面，

(1)检测Detection（比如基于Harr-like的Viola-Jones的3篇文章，[1][6][19]，还有[17]，基于HOG的[2]，以及在PASCAL VOC上大放光芒的DPM[24]）；

(2)基于Bag of Words (Features)和LDA的图像检索Retrieval和类别学习Categorization（比如基于BoW的[10][11]，之后的[8][29]，以及非常流行的SPM[4]，基于LDA的[7]等）；

(3)特征Features，比如说提到的Harr-like，SURF[3]，HOG[2]，MSER[14]，PCA-SIFT[15]等。

值得注意的是，排在前面的大部分作者，都是CV界的成名人物，非常值得关注他们的其他一些论文。

PS：[12]的作者Ramin Zabih从TPAMI的主编位置上退下，[20]的作者David Forsyth成为TPAMI新的掌舵者，大家有什么看法？

最后额外推荐几篇ML的方法，来自于ICML和NIPS。

[1] Reinforcement learning: An introduction

Richard S. Sutton, Andrew G. Barto @NIPS , vol. 9, no. 5, pp. 1054-1054, 1998

[2] Experiments with a New Boosting Algorithm

Yoav Freund, Robert E. Schapire @ICML , pp. 148-156, 1996

[3] Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

John D. Lafferty, Andrew McCallum, Fernando C. N. Pereira @ICML , pp. 282-289, 2001

[4] Latent Dirichlet Allocation

David M. Blei, Andrew Y. Ng, Michael I. Jordan @NIPS , pp. 601-608, 2001

[5] On Spectral Clustering: Analysis and an algorithm

Andrew Y. Ng, Michael I. Jordan, Yair Weiss @NIPS , pp. 849-856, 2001

[6] Algorithms for Nonnegative Matrix Factorization

Daniel D. Lee, H. Sebastian Seung @NIPS , vol. 13, pp. 556-562, 2000

[7] Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions

Xiaojin Zhu, Zoubin Ghahramani, John D. Lafferty @ICML , pp. 912-919, 2003

21世纪初最有影响力的30篇计算机视觉会议论文（转）

相关文章

21世纪初最有影响力的20篇计算机视觉期刊论文(转)

为什么GPU比CPU更重要

Machine Learning in Python (Scikit-learn)-(转)

Huber robust error function

立体视觉入门

heavy-tailed distribution（重尾分布）

深度恢复

K-D tree