[1] |
Alex K,Sutskever I,Hinton G E.ImageNet classification with deep convolutional neural networks [J].Communications of the ACM,2017,60(6):84-90.
|
[2] |
He K M, Zhang X, Ren S, et al.Deep residual learning for image recognition [C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016:770-778.
|
[3] |
Liu W, Anguelov D, Erhan D, et al. SSD:Single shot multibox detector [C]∥Proc of European Conference on Computer Vision,2016:21-37.
|
[4] |
Redmon J, Divvala S, Girshick R,et al.You only look once:Unified,real-time object detection [C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016:779-788.
|
[5] |
Redmon J, Farhadi A. YOLO9000:Better,faster,stronger [C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2017:7263-7271.
|
[6] |
Redmon J, Farhadi A. YOLOv3:An incremental improvement [J].arXiv:1804.02767,2018.
|
[7] |
Bochkovskiy A, Wang C Y,Liao H Y M. YOLOv4:Optimal speed and accuracy of object detection [J].arXiv:2004.10934,2020.
|
[8] |
Lin T Y, Goyal P, Girshick R, et al.Focal loss for dense object detection [C]∥Proc of the IEEE International Confe- rence on Computer Vision,2017:2980-2988.
|
[9] |
Chollet F. Xception:Deep learning with depthwise separable convolutions [C]∥Proc of 2017 IEEE Conference on Computer Vision and Pattern Recognition,2017:1800-1807.
|
[10] |
Howard A G, Zhu M, Chen B, et al.MobileNets:Efficient convolutional neural networks for mobile vision applications [J].arXiv:1704.04861,2017.
|
[11] |
Sandler M, Howard A, Zhu M, et al.MobileNetV2:Inverted residuals and linear bottlenecks [C]∥Proc of 2018 IEEE Conference on Computer Vision and Pattern Recognition,2018:4510-4520.
|
[12] |
Howard A, Sandler M, Chu G,et al.Searching for MobileNetV3 [C]∥Proc of the IEEE International Conference on Computer Vision,2019:1314-1324.
|
[13] |
Zhang X Y, Zhou X Y, Lin M X,et al.ShuffleNet:An extremely efficient convolutional neural network for mobile devices [C]∥Proc of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition,2018:6848-6856.
|
[14] |
Ma N N,Zhang X Y,Zheng H T,et al.ShuffleNet V2:Practical guidelines for efficient CNN architecture design [C]∥Proc of European Conference on Computer Vision,2018:116-131.
|
[15] |
Chen L C,Papandreou G,Kokkinos I,et al.DeepLab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(4):834-848.
|
[16] |
Chen L C,Papandreou G,Schroff F,et al.Rethinking atrous convolution for semantic image segmentation [J].arXiv:1706.05587,2017.
|
[17] |
Dai J,Li Y,He K,et al.R-FCN:Object detection via region-based fully convolutional networks [C]∥Proc of the 30th International Conference on Neural Information Processing Systems,2016:379-387.
|
[18] |
Li Z M,Peng C,Yu G,et al.DetNet:A backbone network for object detection [J].arXiv:1804.06215,2018.
|
[19] |
Luo W,Li Y,Urtasun R,et al.Understanding the effective receptive field in deep convolutional neural networks [C]∥Proc of the 30th International Conference on Neural Information Processing Systems,2016:4898-4906.
|
[20] |
Singh P, Verma V K, Rai P,et al.HetConv:Heterogeneous kernel-based convolutions for deep CNNs [C]∥Proc of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition,2019:4835-4844.
|
[21] |
Li Y H,Chen Y, Wang N, et al.Scale-aware trident networks for object detection [C]∥Proc of the IEEE International Conference on Computer Vision,2019:6054-6063.
|
[22] |
Wang C Y, Liao H Y M, Wu Y H,et al.CSPNet:A new backbone that can enhance learning capability of cnn [C]∥Proc of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops,2020:390-391.
|
[23] |
He K M, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015,37(9):1904-1916.
|