Outline
'Big Data' in Computer Vision Map/Reduce and Computer Vision Map/Reduce Image Search Application: Screenshot Retrieval
Output Metadata
G. Shakhnarovich, P. Viola, T. Darrell Fast pose estimation with parameter-sensitive hashing, October 2003.
J. Hays and A. A. Efros, "Scene completion using millions of photographs," in SIGGRAPH '07: ACM SIGGRAPH 2007 papers. New York, NY, USA: ACM, 2007, pp. 4+.
J. Hays and A. A. Efros, "Im2gps: estimating geographic information from a single image," Computer Vision and Pattern Recognition, IEEE Computer Society Conference on, vol. 0, pp. 1-8, 2008.
A. Torralba, R. Fergus, and W. T. Freeman, "80 million tiny images: A large data set for nonparametric object and scene recognition," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 30, no. 11, pp. 19581970, May 2008
C. Liu, J. Yuen, and A. Torralba, "Nonparametric scene parsing: Label transfer via dense scene alignment," Computer Vision and Pattern Recognition, IEEE Computer Society Conference on, vol. 0, pp. 19721979, 2009.
C. Liu, J. Yuen, A. Torralba, J. Sivic, and W. T. Freeman, "Sift flow: Dense correspondence across different scenes," in ECCV '08: Proceedings of the 10th European Conference on Computer Vision. Berlin, Heidelberg: SpringerVerlag, 2008, pp. 28-42.
Outline
'Big Data' in Computer Vision Map/Reduce and Computer Vision Map/Reduce Image Search Application: Screenshot Retrieval
Hadoop+CV: No Reducer
Map
Map
Map
Example Maps Object Detection (e.g., cars, faces) Feature Computation (e.g., SIFT) Sliding Windows (given a region+image)
Map
Map
Map
Reduce
Map: Feature Computation Red: Model Creation Examples Classifiers (e.g., SVM, Bayes) Geometry Problems (e.g., RANSAC, SfM)
Reduce
Map: Fit data to model given parameters (E-Step) Red: Compute new model parameters given data (M-Step) Iterate until stopping conditions are met. Examples Clustering (e.g., K-Means) Mixture Models (e.g., MoG)
Outline
'Big Data' in Computer Vision Map/Reduce and Computer Vision Map/Reduce Image Search Application: Screenshot Retrieval
Image Features (SURF 64D) Remove Dupes (Curr./Prev.) K-Means Clustering (Initial) K-Means Clustering Median Computation Hamming Embedding
Map In: (image_url, image_hash, image_data, image_tags) Map Out: (image_hash, image_url, image_features)
Map In: [image_hash, image_url, image_features] or Map In: [image_hash] (for images already in the DB) Map Out Key: image_hash Map Out Val: image_features Reduce Out: [image_hash, image_feature]
File: cluster_means Map In: [image_hash, image_feature] Map Out Key: cluster_num (nearest cluster) Map Out Val: image_feature (extended by 1 dim to get count) Reduce Out: [cluster_num, cluster_mean]
File: cluster_means Map In: [image_hash, image_feature] Map Out Key: cluster_num (nearest cluster) Map Out Val: image_feature Reduce Out: [cluster_num, cluster_median]
File: cluster_means, cluster_medians Map In: [image_hash, image_feature] Map Out Key: cluster_num (nearest cluster) Map Out Val: hamming_embedding Reduce Out: [cluster_num, hamming_embedding]
Image Features (SURF 64D) For each feature... Find Nearest Cluster Compute hamming embedding (using cluster median) Vote (tf-idf) for DB image if a feature if hamming dist < Thresh
Outline
'Big Data' in Computer Vision Map/Reduce and Computer Vision Map/Reduce Image Search Application: Screenshot Retrieval
Outline
'Big Data' in Computer Vision Map/Reduce and Computer Vision Map/Reduce Image Search Application: Screenshot Retrieval
Conclusion
Vision has 'Big Data' applications Many image search applications Common design patterns for M/R+Vision Hadoop useful image search
References
[1] P. Duygulu, K. Barnard, J. de Freitas, and D. Forsyth, "Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary," in Computer Vision ECCV 2002, ser. Lecture Notes in Computer Science, 2002, ch. 7, pp. 349-354. [2] A. Makadia, V. Pavlovic, and S. Kumar, "A new baseline for image annotation," in ECCV '08: Proceedings of the 10th European Conference on Computer Vision. Berlin, Heidelberg: Springer-Verlag, 2008, pp. 316-329. [3] Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek and Cordelia Schmid, "Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation." ICCV 2009
[4] A. Torralba, R. Fergus, and W. T. Freeman, "80 million tiny images: A large data set for nonparametric object and scene recognition," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 30, no. 11, pp. 1958-1970, May 2008.