Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (01): 91-101.
• Graphics and Images • Previous Articles Next Articles
JIANG Zhi-peng1,WANG Zi-quan1,ZHANG Yong-sheng1,YU Ying1,CHENG Bin-bin1,ZHAO Long-hai2,ZHANG Meng-wei1
Received:
2023-02-12
Revised:
2023-05-08
Accepted:
2024-01-25
Online:
2024-01-25
Published:
2024-01-15
JIANG Zhi-peng, WANG Zi-quan, ZHANG Yong-sheng, YU Ying, CHENG Bin-bin, ZHAO Long-hai, ZHANG Meng-wei. A vehicle object detection algorithm in UAV video stream based on improved Deformable DETR[J]. Computer Engineering & Science, 2024, 46(01): 91-101.
[1] | Geiger A,Lenz P,Stiller C,et al.Vision meets robotics: The kitti dataset[J].The International Journal of Robotics Research,2013,32(11): 1231-1237. |
[2] | 张继贤,顾海燕,杨懿,等.高分辨率遥感影像智能解译研究进展与趋势[J].遥感学报,2021,25(11):2198-2210. |
Zhang Ji-xian,Gu Hai-yan,Yang Yi,et al.Research progress and trend of high-resolution remote sensing imagery intelligent interpretation[J].National Remote Sensing Bulletin,2021,25(11):2198-2210. | |
[3] | Spadafora W G,Paielli P M,Llewellyn D R,et al.Intelligent transportation system: US7689230[P].2010. |
[4] | Yang S,Luo P,Loy C C,et al.Wider face: A face detection benchmark[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016: 5525-5533. |
[5] | Du D,Qi Y,Yu H,et al.The unmanned aerial vehicle benchmark: Object detection and tracking[C]∥Proc of the European Conference on Computer Vision,2018: 370-386. |
[6] | Hearst M A,Dumais S T,Osuna E,et al.Support vector machines[J].IEEE Intelligent Systems and Their Applications,1998,13(4): 18-28. |
[7] | Dalal N,Triggs B.Histograms of oriented gradients for human detection[C]∥Proc of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2005:886-893. |
[8] | Krizhevsky A,Sutskever I,Hinton G E.Imagenet classification with deep convolutional neural networks[J].Communications of the ACM,2017,60(6): 84-90. |
[9] | Girshick R, Donahue J, Darrell T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2014: 580-587. |
[10] | Girshick R.Fast R-CNN[C]∥Proc of the IEEE International Conference on Computer Vision,2015: 1440-1448. |
[11] | Ren S, He K, Girshick R, et al.Faster R-CNN: Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017,39(6):1137-1149. |
[12] | Purkait P,Zhao C,Zach C.SPP-Net: Deep absolute pose regression with synthetic views[J].arXiv:1712.03452,2017. |
[13] | He K,Gkioxari G,Dollár P,et al.Mask R-CNN[C]∥Proc of the IEEE International Conference on Computer Vision,2017: 2961-2969. |
[14] | Redmon J,Divvala S,Girshick R,et al.You only look once: Unified,real-time object detection[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016: 779-788. |
[15] | Redmon J,Farhadi A.YOLO9000: Better,faster,stronger[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2017: 7263-7271. |
[16] | Redmon J,Farhadi A.YOLOv3: An incremental improvement[J].arXiv:1804.02767,2018. |
[17] | Bochkovskiy A,Wang C Y,Liao H Y M.YOLOv4: Optimal speed and accuracy of object detection[J].arXiv:2004.10934,2020. |
[18] | Li C,Li L,Jiang H,et al.YOLOv6: A single-stage object detection framework for industrial applications[J].arXiv:2209.02976,2022. |
[19] | Wang C Y,Bochkovskiy A,Liao H Y M.YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J].arXiv:2207.02696,2022. |
[20] | Liu W,Anguelov D,Erhan D,et al.SSD: Single shot multibox detector[C]∥Proc of European Conference on Computer Vision,2016: 21-37. |
[21] | Tan M, Le Q.Efficientnet: Rethinking model scaling for convolutional neural networks[C]∥Proc of International Conference on Machine Learning,2019: 6105-6114. |
[22] | Lin T Y,Goyal P,Girshick R,et al.Focal loss for dense object detection[C]∥Proc of the IEEE International Confe- rence on Computer Vision,2017: 2980-2988. |
[23] | Law H,Deng J.CornerNet: Detecting objects as paired keypoints[C]∥Proc of the European Conference on Computer Vision,2018: 734-750. |
[24] | Duan K,Bai S,Xie L,et al.CenterNet: Keypoint triplets for object detection[C]∥Proc of the IEEE/CVF International Conference on Computer Vision,2019: 6569-6578. |
[25] | Dosovitskiy A, Beyer L,Kolesnikov A,et al.An image is worth 16×16 words: Transformers for image recognition at scale[J].arXiv:2010.11929,2020. |
[26] | Carion N,Massa F,Synnaeve G,et al.End-to-end object detection with transformers[C]∥Proc of European Conference on Computer Vision,2020: 213-229. |
[27] | He K,Zhang X,Ren S,et al.Deep residual learning for image recognition[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016: 770-778. |
[28] | Lin T Y,Maire M,Belongie S,et al.Microsoft COCO: Common objects in context[C]∥Proc of European Conference on Computer Vision,2014: 740-755. |
[29] | Battaglia P W,Hamrick J B,Bapst V,et al.Relational inductive biases,deep learning,and graph networks[J].arXiv:1806.01261,2018. |
[30] | Zhu X, Su W,Lu L,et al.Deformable DETR: Deformable transformers for end-to-end object detection[J].arXiv:2010.04159,2020. |
[31] | Dai J,Qi H,Xiong Y,et al.Deformable convolutional networks[C]∥Proc of the IEEE International Conference on Computer Vision,2017: 764-773. |
[32] | Zhu X, Xiong Y,Dai J,et al.Deep feature flow for video re- cognition[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2017: 2349-2358. |
[33] | Dosovitskiy A,Fischer P,Ilg E,et al.FlowNet: Learning optical flow with convolutional networks[C]∥Proc of the IEEE International Conference on Computer Vision,2015: 2758-2766. |
[34] | Zhu X,Wang Y,Dai J,et al.Flow-guided feature aggregation for video object detection[C]∥Proc of the IEEE International Conference on Computer Vision,2017: 408-417. |
[35] | Wu H,Chen Y,Wang N,et al.Sequence level semantics aggregation for video object detection[C]∥Proc of the IEEE/CVF International Conference on Computer Vision,2019: 9217-9225. |
[36] | Chen Y,Cao Y,Hu H,et al.Memory enhanced global-local aggregation for video object detection[C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020: 10337-10346. |
[37] | Shrivastava A,Gupta A,Girshick R.Training region-based object detectors with online hard example mining[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016: 761-769. |
[38] | Chen L C,Papandreou G,Kokkinos I,et al.Deeplab: Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(4): 834-848. |
[39] | Hu J,Shen L,Sun G.Squeeze and excitation networks[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2018: 7132-7141. |
[40] | Du D, Qi Y, Yu H,et al.The unmanned aerial vehicle benchmark: Object detection and tracking[C]∥Proc of the European Conference on Computer Vision,2018: 370-386. |
[41] | Paszke A,Gross S,Massa F,et al.PyTorch: An imperative style,high-performance deep learning library[C]∥Proc of International Conference on Neural Information Processing Systems,2019:8026-8037. |
[42] | Chen K,Wang J,Pang J,et al.MMDetection: Open MMLab detection toolbox and benchmark[J].arXiv:1906.07155,2019. |
[43] | Zhao C,Nuno V. Cascade R-CNN: Delving into high quality object detection[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2018: 6154-6162. |
[44] | Lu X, Li B,Yue Y,et al.Grid-RCNN[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2019:7355-7364. |
[45] | Lin T Y,Goyal P, Girshick R, et al. Focal loss for dense object detection[C]∥Proc of the IEEE International Confe- rence on Computer Vision,2017:2980-2988. |
[46] | Chen Q,Wang Y,Yang T,et al.You only look one-level feature [J].arXiv:2103.09460, 2021. |
[47] | Ge Z,Liu S,Wang F,et al.YOLOX: Exceeding YOLO series in 2021[J].arXiv:2107.08430,2021. |
[1] | ZHAO Yue, XIAO Meng-yan, QIU Bao-jun, LUO Jun, WANG Xiao-qiang, LUO Dao-jun. Software design of acoustic scanning image defect detection based on machine vision [J]. Computer Engineering & Science, 2023, 45(10): 1806-1813. |
[2] | LI Xiao-lin, WANG Fu-gang, ZHANG Peng-fei, ZHANG Lin-yu, . YOLOv5s algorithm optimization based on multi-scale feature extraction [J]. Computer Engineering & Science, 2023, 45(06): 1054-1062. |
[3] | DENG Shan-shan, HUANG Hui, MA Yan. A small object detection algorithm based on improved Faster R-CNN [J]. Computer Engineering & Science, 2023, 45(05): 869-877. |
[4] | HUANG Xing-wei, CHEN Xi, ZHANG Su-fan. A deep learning model based on improved feature pyramid networks for small object detection [J]. Computer Engineering & Science, 2023, 45(04): 734-742. |
[5] | WANG Guan-bo, ZHAO Yi-fan, LI Bo, YANG Jun-dong, DING Hong-wei. Real-time flame detection with improved YOLO v4-tiny [J]. Computer Engineering & Science, 2022, 44(12): 2196-2205. |
[6] | LUO Yue-tong, DUAN Chang, JIANG Pei-feng, ZHUO Bo. An improved industrial defect data augmentation method based on pix2pix [J]. Computer Engineering & Science, 2022, 44(12): 2206-2212. |
[7] | ZHANG Hai-yan, FU Ying-na, DING Gui-jiang, MENG Qing-yan. Towards Anchor-free object detection with diverse receptive fields attention feature refinement network [J]. Computer Engineering & Science, 2022, 44(11): 1995-2002. |
[8] | YU Ming-hao, GAO Jian-ling. LUDet:A lightweight underwater object detector [J]. Computer Engineering & Science, 2022, 44(09): 1638-1645. |
[9] | LI Li-rong, WANG Zi-yan, ZHANG Kai, YANG Di-chun, XIONG Wei, GONG Peng-cheng, . A train bottom parts detection algorithm based on OSE-dResnet neural networks [J]. Computer Engineering & Science, 2022, 44(04): 692-698. |
[10] | LI Jing, HE Qiang, ZHANG Chang-lun, WANG Heng-you, . An indoor people counting model based on global attention [J]. Computer Engineering & Science, 2022, 44(03): 471-478. |
[11] | LIU Hong-jiang, WANG Mao, LIU Li-hua, WU Ji-bing, HUANG Hong-bin. A survey of small object detection based on deep learning [J]. Computer Engineering & Science, 2021, 43(08): 1429-1442. |
[12] | LI Jing-lin, JIANG Jing-fei, DOU Yong, XU Jin-wei, WEN Dong. A redundacy-reduced candidate box accelerator based on soft-non-maximum suppression [J]. Computer Engineering & Science, 2021, 43(04): 586-593. |
[13] | LI Ben-gao, WU Cong-zhong, XU Liang-feng, ZHAN Shu. Object detection based on multi-scale feature fusion and residual attention mechanism [J]. Computer Engineering & Science, 2021, 43(02): 347-353. |
[14] | WU Xiang-ning, HE Peng, DENG Zhong-gang, LI Jia-qi, WANG Wen, CHEN Miao. A deep learning model of small object detection based on attention mechanism [J]. Computer Engineering & Science, 2021, 43(01): 95-104. |
[15] | LI Lan, XI Shu-shu, ZHANG Cai-bao, MA Hong-yang. A surface defect recognition algorithm based on improved SSD model [J]. Computer Engineering & Science, 2020, 42(09): 1608-1615. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
湘公网安备 43010502000083号
湘ICP备10006030号
Copyright © Computer Engineering & Science, All Rights Reserved.
Address:109 Deya Rd,Changsha,hunan(410073) Tel: 0731-87002567 Email: jsjgcykx@vip.163.com
Powered by Beijing Magtech Co., Ltd.