Fully Convolutional Networks for Semantic Segmentation, Jonathan Long, Evan Shelhamer, Trevor Darrell, 2015Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)DOI: 10.48550/arXiv.1411.4038 - Foundational paper introducing Fully Convolutional Networks (FCNs), which established mean Intersection over Union (mIoU) as a primary benchmark metric for semantic segmentation.
Microsoft COCO: Common Objects in Context, Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár, 2014European Conference on Computer Vision (ECCV)DOI: 10.48550/arXiv.1405.0312 - Introduces the COCO dataset and its widely used metrics for object detection and instance segmentation, including Average Precision (AP) based on mask IoU thresholds.