書籍紹介: Visual Object Recognition

Visual Object Recognition (Synthesis Lectures on Artificial Intelligence and Machine Learning)

作者: Kristen Grauman,Bastian Leibe
出版社/メーカー: Morgan & Claypool Publishers
発売日: 2011/02/28
メディア: ペーパーバック
購入: 12人クリック: 182回
この商品を含むブログを見る

最近読んだ本の紹介をしたいと思います。
名前の通り画像認識に関するチュートリアル本で、出版されたのは2011年2月28日です。
著者がKristen GraumanとBastian Leibeの二人なので、これはと思い衝動買いしてしまいましたが、期待通りいい内容でした。

前半と後半で特定物体認識と一般物体認識の話に大きく分かれており、それぞれ基礎から比較的最近の手法まで非常に分かりやすく、コンパクトにまとめてあります。これから画像認識を本格的に学ぼうとする人にとっては持っておいて絶対に損はない本だと思います。また、専門の人にとっても、知識を整理したり、抜けを補完したりする上で有用だと思います。私も、constellation model以降のpart-based手法の流れは知らなかったものも多く、大変勉強になりました。

欲を言えば、最新の手法に関しては少しカバーが甘いのが物足りなかったところです。例えば、bag-of-visual-wordsに関しては古くからある一般的なものしか記述がありません。また、物体検出に関しても、DPMまでの代表的な手法のみ扱っているようです。まあ、ページ数的に仕方ないとは思いますが・・・。

いずれにせよ、教科書としてはいい本で、多分修士の学生さんくらいに一番向いている気がします。それほど分量は多くない（厚さ12mmくらい）ので多分くじけず読み進めることができます（笑）。輪講などでとりあげてもいいかも知れませんね。

参考までに、以下に目次だけ引用します。

1 Introduction
　1.1 Overview
　1.2 Challenges
　1.3 The State of the Art
2 Overview:Recognition of Specific Objects
　2.1 Global Image Representations
　2.2 Local Feature Representations
3 Local Features:Detection and Description
　3.1 Introduction
　3.2 Detection of Interest Points and Regions
　3.3 Local Descriptors
　3.4 Concluding Remarks
4 Matching Local Features
　4.1 Efficient Similarity Search
　4.2 Indexing Features with Visual Vocabularies
　4.3 Concluding Remarks
5 Geometric Verification of Matched Features
　5.1 Estimating Geometric Models
　5.2 Dealing with Outliers
6 Example Systems:Specific-Object Recognition
　6.1 Image Matching
　6.2 Object Recognition
　6.3 Large-Scale Image Retrieval
　6.4 Mobile Visual Search
　6.5 Image Auto-Annotation
　6.6 Concluding Remarks
7 Overview:Recognition of Generic Object Categories
8 Representations for Object Categories
　8.1 Window-based Object Representations
　8.2 Part-based Object Representations
　8.3 Mixed Representations
　8.4 Concluding Remarks
9 Generic Object Detection:Finding and Scoring Candidates
　9.1 Detection via Classification
　9.2 Detection with Part-based Models
10 Learning Generic Object Category Models
　10.1 Data Annotation
　10.2 Learning Window-based Models
　10.3 Learning Part-based Models
11 Example Systems:Generic Object Recognition
　11.1 The Viola-Jones Face Detector
　11.2 The HOG Person Detector
　11.3 Bag-of-Words Image Classification
　11.4 The Implicit Shape Model
　11.5 Deformable Part-based Models
12 Other Considerations and Current Challenges
　12.1 Benchmarks and Datasets
　12.2 Context-based Recognition
　12.3 Multi-Viewpoint and Multi-Aspect Recognition
　12.4 Role of Video
　12.5 Integrated Segmentation and Recognition
　12.6 Supervision Considerations in Object Category Learning
　12.7 Language, Text, and Images
13 Conclusions