Principle:Ggml org Ggml Detection Post Processing
Appearance
Summary
Converting raw neural network detection outputs into usable bounding boxes with class labels.
Theory
YOLO-style detection post-processing involves multiple steps:
- Sigmoid activation on raw predictions (objectness, class probabilities)
- Anchor-based box decoding: convert grid offsets to absolute coordinates using anchor priors
- Letterbox correction: map coordinates back from padded input space to original image space
- Non-Maximum Suppression (NMS): remove duplicate detections by comparing IoU between boxes
NMS Algorithm
The Non-Maximum Suppression algorithm operates as follows:
- Sort detections by confidence (highest first)
- For each detection, suppress lower-confidence overlapping detections where IoU > threshold
This eliminates duplicate detections of the same object, keeping only the most confident prediction.
Multi-Scale Detection
Combine predictions from different grid resolutions to handle objects of varying sizes:
- 13x13 grid — for large objects
- 26x26 grid — for small objects
Multi-scale detection ensures robust coverage across the full range of object sizes present in an image.
Related
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment