Category: article sharing
-

The origin of YOLO (You Only Look Once)
The You Only Look Once (YOLO) series is popular and versatile in 2D computer vision tasks like object detection, segmentation, or pose estimation. Its good performance and high efficiency allow the YOLO series to become the state-of-the-art real-time model. As shown in the graph below, the latest YOLO11 achieves a mean average precision of 54.7…
-

Article Review: SuperPoint: Self-Supervised Interest Point Detection and Description
The Key point detection and feature extraction are fundamental techniques in many computer vision downstream tasks, like camera calibration, homography estimation, structure-from-motion, and visual-SLAM. The task is to extract and describe the key points in each image. The traditional key point detectors and descriptors like ORB, FAST, and SIFT can achieve good performance in the…
-

FoundationStereo framework explained: 5 key features for zero-shot stereo depth estimation
FoundationStereo by NVIDIA presents a state-of-the-art stereo model excelling in depth estimation without any prior fine-tuning. This article introduces the stereo problem, why is it challenging, how does FoundationStereo solve it, and what are my opinions.