Classical Computer Vision Methods
This article provides a complete, concise, mathematically supported explanation of all important classical computer vision techniques. This includes edge detection, feature descriptors, tracking, segmentation, transforms, stereo, motion analysis, and more.
1. Edge, Corner & Keypoint Detectors
1.1 Sobel, Prewitt, Roberts Operators
These detect edges by convolving horizontal and vertical gradient kernels.
1.2 Laplacian of Gaussian (LoG)
Detects edges via second derivatives and zero-crossings.
1.3 Difference of Gaussian (DoG)
1.4 Canny Edge Detector
Involves Gaussian smoothing, gradient computation, non-max suppression, hysteresis thresholding.
1.5 Harris Corner Detector
1.6 Shi–Tomasi Corner Detector
1.7 SUSAN (Smallest Univalue Segment Assimilating Nucleus)
Counts pixels similar to the center; corner occurs when USAN area is small.
1.8 FAST (Features from Accelerated Segment Test)
Checks N pixels on a circle if they are brighter/darker than the center.
1.9 AGAST (Adaptive and Generic Accelerated Segment Test)
FAST improved using adaptive decision trees.
2. Feature Descriptors
2.1 SIFT (Scale-Invariant Feature Transform)
Uses DoG keypoints, orientation histograms, and 128-D descriptors.
2.2 SURF (Speeded Up Robust Features)
Efficient LoG approximation using box filters and integral images.
2.3 BRIEF (Binary Robust Independent Elementary Features)
2.4 ORB (Oriented FAST and Rotated BRIEF)
Combines FAST keypoints with rotation-corrected BRIEF descriptors.
2.5 BRISK (Binary Robust Invariant Scalable Keypoints)
Scale-space sampling and binary intensity comparisons.
2.6 FREAK (Fast Retina Keypoint)
Binary descriptor based on retina-inspired sampling.
2.7 HOG (Histogram of Oriented Gradients)
Histograms of gradient orientations inside cells.
2.8 LBP (Local Binary Patterns)
2.9 Shape Context
Histogram describing relative spatial distribution of points.
2.10 GIST Descriptor
Global scene representation using multi-scale Gabor filters.
3. Feature Matching & Tracking
3.1 RANSAC (Random Sample Consensus)
Fits a robust model by sampling minimal sets and counting inliers.
3.2 Lucas–Kanade Optical Flow
3.3 Horn–Schunck Optical Flow
3.4 KLT (Kanade–Lucas–Tomasi) Tracker
Tracks Shi–Tomasi features using Lucas–Kanade optical flow.
4. Filters & Transform Methods
4.1 Gaussian Filter
4.2 Median Filter
Replaces each pixel with the neighborhood median.
4.3 Bilateral Filter
4.4 Anisotropic Diffusion (Perona–Malik)
4.5 Fourier Transform
4.6 Discrete Cosine Transform (DCT)
Used in JPEG compression.
4.7 Wavelet Transform
Multi-resolution analysis using scalable basis functions.
4.8 Radon Transform
4.9 Hough Transform
Votes in parameter space to detect lines and shapes.
5. Segmentation Methods
5.1 K-means Segmentation
5.2 Graph Cut
5.3 GrabCut
Uses Graph Cut + Gaussian Mixture Models.
5.4 Watershed
Treats gradient magnitude as a topographic map.
5.5 Mean Shift Segmentation
Clusters by shifting data toward local maxima.
5.6 Felzenszwalb–Huttenlocher Algorithm
Graph-based region merging based on internal variation.
5.7 SLIC (Simple Linear Iterative Clustering) Superpixels
Clusters in Lab + xy 5-dimensional space.
5.8 Active Contours (Snakes)
5.9 Level Set Methods
6. Classical Object Detection
6.1 Viola–Jones (Haar Cascade)
Uses Haar features, integral images, AdaBoost, and cascaded classifiers.
6.2 HOG + SVM (Support Vector Machine)
6.3 Deformable Part Models (DPM)
6.4 Template Matching
7. Stereo Vision & 3D
7.1 Block Matching
7.2 Semi-Global Matching (SGM)
Aggregates matching cost across multiple directions.
7.3 Epipolar Geometry
7.4 Essential Matrix
7.5 Triangulation
7.6 Structure from Motion (SfM)
Estimates camera poses and 3D structure from multiple views.
7.7 Bundle Adjustment
7.8 Visual Odometry
Estimates camera motion using sequential feature correspondences.
8. Motion Analysis & Tracking
8.1 Background Subtraction (Mixture of Gaussians - MOG/MOG2)
8.2 Kalman Filter
8.3 Particle Filter
Approximates posterior distribution using weighted particles.
8.4 Mean Shift Tracking
Tracks objects by iteratively shifting kernel windows.
8.5 CAMShift (Continuously Adaptive Mean Shift)
Enhances Mean Shift with adaptive window size.
References
- Gonzalez, R. C. & Woods, R. E. (2018). Digital Image Processing (4th ed.).
- Szeliski, R. (2011). Computer Vision: Algorithms and Applications.
- Forsyth, D. A. & Ponce, J. (2012). Computer Vision: A Modern Approach (2nd ed.).




Comments
Post a Comment