Overview

Key Ideas
- Predicting corners where objects don’t lie is hard for CNNs
- Represents objects by their extreme points (top, bottom, left, right)
- No need to predict embeddings for box computation
- Extreme points are commonly used for annotation
Object detection via extreme point prediction
