«上一篇/Previous Article|本期目录/Table of Contents|下一篇/Next Article»

j.issn.1672-1292.2019.04.017]
点击复制

基于卷积神经网络的仓储物体检测算法研究

分享到：

南京师范大学学报（工程技术版）[ISSN:1006-6977/CN:61-1281/TN]

卷:: 19卷
期数:: 2019年04期

页码:: 099-105

栏目:: 计算机工程

出版日期:: 2019-12-31

文章信息/Info

Title:: Research on Warehouse Object Detection AlgorithmBased on Convolutional Neural Network

文章编号:: 1672-1292(2019)04-0099-07

作者:: 王飞¹; 陈亮杰²; 王梨²; 王林²; (1.贵州民族大学人文科技学院,贵州贵阳 550025)(2.贵州民族大学数据科学与信息工程学院,贵州贵阳 550025)

Author(s):: Wang Fei¹; Chen Liangjie²; Wang Li²; Wang Lin²; (1.College of Humanities & Sciences of Guizhou Minzu University,Guiyang 550025,China)(2.College of Data Science and Information Engineering,Guizhou Minzu University,Guiyang 550025,China)

关键词:: 卷积神经网络; 仓储环境; 物体检测; DSOD

Keywords:: convolutional neural network; warehouse environment; object detection; deeply supervised object detectors(DSOD)

分类号:: TP391.41

DOI:: 10.3969/j.issn.1672-1292.2019.04.017

文献标志码:: A

摘要:: 针对仓储环境中物体检测公开数据集匮乏的问题,通过摄像机采集真实仓储环境中包含货物、托盘和叉车的大量图像进行标注,创建了一个仓储物体数据集. 同时针对传统物体检测算法在仓储环境中检测准确率较低的问题,将基于卷积神经网络的DSOD应用于仓储环境中,通过在自己创建的仓储物体数据集上从零开始训练DSOD模型,实现了仓储物体的准确性检测. 该算法的mAP达到了93.81%,比Faster R-CNN、SSD分别提高了0.04%、1.44%; 并且模型大小仅有51.3 MB,比Faster R-CNN、SSD分别减小了184.5 MB、43.4 MB. 实验结果表明,该算法获得了较为满意的仓储物体检测效果,其在仓储物体检测领域具有一定的实用价值.

Abstract:: Considering the lack of public datasets for object detection based on the warehouse environment,a large number of images containing cargos,trays and forklifts in real warehouse environment are collected and labeled to build the warehouse object dataset. Meanwhile,aiming at the problem that the traditional object detection algorithm has lower detection accuracy in warehouse environment,the deeply supervised object detectors(DSOD)based on convolutional neural network is applied to the warehouse environment,and the DSOD model is trained from scratch on the self-built warehouse object dataset,and the accuracy detection of the warehouse object is realized. The mean Average Precision(mAP)of this algorithm reaches 93.81%,which is higher than that of Faster R-CNN and SSD by 0.04 and 1.44 points respectively,and the model size of this algorithm is only 51.3 MB,which is lower than that of Faster R-CNN and SSD by 184.5 MB and 43.4 MB respectively. The experimental results show that the algorithm has a relatively satisfying warehouse object detection effect,and it has certain practical values in the field of warehouse object detection.

参考文献/References:

[1] PAPAGEORGIOU C P,OREN M,POGGIO T. A general framework for object detection[C]//Sixth International Conference on Computer Vision. Bombay,India,1998.
[2]LIENHART R,MAYDT J. An extended set of Haar-like features for rapid object detection[C]//International Conference on Image Processing. Rochester,NY,USA,2002.
[3]FREUND Y,SCHAPIRE R E. A desicion-theoretic generalization of on-line learning and an application to boosting[J]. Journal of computer and system sciences,1997,55(1):119-139.
[4]DALAL N,TRIGGS B. Histograms of oriented gradients for human detection[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego,CA,USA,2005.
[5]SUYKENS J A K,VANDEWALLE J. Least squares support vector machine classifiers[J]. Neural processing letters,1999,9(3):293-300.
[6]LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International journal of computer vision,2004,60(2):91-110.
[7]FELZENSZWALB P F,GIRSHICK R B,MCALLESTER D,et al. Object detection with discriminatively trained part based models[J]. IEEE transactions on pattern analysis and machine intelligence,2010,32(9):1627-1645.
[8]GIRSHICK R,DONAHUE J,DARRELL T,et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition. Columbus,OH,USA,2014.
[9]GIRSHICK R. Fast R-CNN[C]//IEEE International Conference on Computer Vision. Santiago,Chile,2015.
[10]REN S,HE K,GIRSHICK R,et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence,2017,39(6):1137-1149.
[11]UIJLINGS J R R,SANDE K E A V D,GEVERS T,et al. Selective search for object recognition[J]. International journal of computer vision,2013,104(2):154-171.
[12]REDMON J,DIVVALA S,GIRSHICK R,et al. You only look once:unified,real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas,NV,USA,2016.
[13]LIU W,ANGUELOV D,ERHAN D,et al. SSD:single shot MultiBox detector[C]//European Conference on Computer Vision. Amsterdam,Netherlands,2016:21-37.
[14]李天剑,黄斌,刘江玉,等. 卷积神经网络物体检测算法在物流仓库中的应用[J]. 计算机工程,2018,44(6):176-181.
LI T J,HUANG B,LIU J Y,et al. Application of convolution neural network object detection algorithm in logistics warehouse[J]. Computer engineering,2018,44(6):176-181.(in Chinese)
[15]SHEN Z,LIU Z,LI J,et al. DSOD:learning deeply supervised object detectors from scratch[C]//IEEE International Conference on Computer Vision. Venice,Italy,2017.
[16]HUANG G,LIU Z,MAATEN L V D,et al. Densely connected convolutional networks[C]//IEEE Conference on Computer Vision and Pattern Recognition. Honolulu,Hawaii,USA,2017.

相似文献/References:

[1]曹金梦,倪蓉蓉,杨彪.面向面部表情识别的双通道卷积神经网络[J].南京师范大学学报(工程技术版),2018,18(03):001.[doi:10.3969/j.issn.1672-1292.2018.03.001]
　Cao Jinmeng,Ni Rongrong,Yang Biao.Binary-Channel Convolutional Neural Network forFacial Expression Recognition[J].Journal of Nanjing Normal University(Engineering and Technology),2018,18(04):001.[doi:10.3969/j.issn.1672-1292.2018.03.001]
[2]陈扬,曾诚,程成,等.一种基于CNN的足迹图像检索与匹配方法[J].南京师范大学学报(工程技术版),2018,18(03):039.[doi:10.3969/j.issn.1672-1292.2018.03.006]
　Chen Yang,Zeng Cheng,Cheng Cheng,et al.A CNN-based Approach to Footprint Image Retrieval and Matching[J].Journal of Nanjing Normal University(Engineering and Technology),2018,18(04):039.[doi:10.3969/j.issn.1672-1292.2018.03.006]
[3]成杰,叶文武,徐寅林.回转库档案实时定位中基于鱼眼镜头图像的处理识别技术[J].南京师范大学学报(工程技术版),2019,19(02):075.[doi:10.3969/j.issn.1672-1292.2019.02.010]
　Cheng Jie,Ye Wenwu,Xu Yinlin.Processing and Recognition Technology Based on Fisheye Lens Image in Real-Time Positioning of Rotary Library Files[J].Journal of Nanjing Normal University(Engineering and Technology),2019,19(04):075.[doi:10.3969/j.issn.1672-1292.2019.02.010]
[4]任媛媛,张显峰,马永建,等.基于卷积神经网络的无人机遥感影像农村建筑物目标检测[J].南京师范大学学报(工程技术版),2019,19(03):029.[doi:10.3969/j.issn.1672-1292.2019.03.005]
　Ren Yuanyuan,Zhang Xianfeng,Ma Yongjian,et al.Target Detection of Rural Buildings in UAV Remote Sensing ImagesBased on Convolutional Neural Network[J].Journal of Nanjing Normal University(Engineering and Technology),2019,19(04):029.[doi:10.3969/j.issn.1672-1292.2019.03.005]
[5]许博鸣,刘晓峰,业巧林,等.基于卷积神经网络面向自然场景建筑物识别技术的移动端应用[J].南京师范大学学报(工程技术版),2019,19(03):037.[doi:10.3969/j.issn.1672-1292.2019.03.006]
　Xu Boming,Liu Xiaofeng,Ye Qiaolin,et al.A Convolutional Neural Network Based on Mobile Application forIdentification of Buildings in Natural Scene[J].Journal of Nanjing Normal University(Engineering and Technology),2019,19(04):037.[doi:10.3969/j.issn.1672-1292.2019.03.006]
[6]梁秦嘉,刘怀,陆飞.基于改进YOLOv3模型的交通视频目标检测算法研究[J].南京师范大学学报(工程技术版),2021,21(02):047.[doi:10.3969/j.issn.1672-1292.2021.02.008]
　Liang Qinjia,Liu Huai,Lu Fei.Traffic Video Target Detection Algorithm Based on Improved YOLOv3[J].Journal of Nanjing Normal University(Engineering and Technology),2021,21(04):047.[doi:10.3969/j.issn.1672-1292.2021.02.008]
[7]梁秦嘉,刘怀,陆飞.基于改进YOLOv3的运动目标分类检测算法研究[J].南京师范大学学报(工程技术版),2021,21(04):027.[doi:10.3969/j.issn.1672-1292.2021.04.005]
　Liang Qinjia,Liu Huai,Lu Fei.Moving Target Classification and Detection AlgorithmBased on Improved YOLOv3[J].Journal of Nanjing Normal University(Engineering and Technology),2021,21(04):027.[doi:10.3969/j.issn.1672-1292.2021.04.005]
[8]尚文倩,曹原.FastGR:一种基于神经协同过滤的群组推荐算法[J].南京师范大学学报(工程技术版),2022,22(02):029.[doi:10.3969/j.issn.1672-1292.2022.02.005]
　Shang Wenqian,Cao Yuan.FastGR:A Group Recommendation Algorithm Based on Neural Collaborative Filtering[J].Journal of Nanjing Normal University(Engineering and Technology),2022,22(04):029.[doi:10.3969/j.issn.1672-1292.2022.02.005]
[9]韩天翊,林荣恒.一种基于决策层融合的多模态情感识别方法[J].南京师范大学学报(工程技术版),2022,22(02):035.[doi:10.3969/j.issn.1672-1292.2022.02.006]
　Han Tianyi,Lin Rongheng.A Multimodal Emotion Recognition Method Based on Decision Level Fusion[J].Journal of Nanjing Normal University(Engineering and Technology),2022,22(04):035.[doi:10.3969/j.issn.1672-1292.2022.02.006]
[10]张宇苏,吴小俊,李辉,等.基于无监督深度学习的红外图像与可见光图像融合算法[J].南京师范大学学报(工程技术版),2023,23(01):001.[doi:10.3969/j.issn.1672-1292.2023.01.001]
　Zhang Yusu,Wu Xiaojun,Li Hui,et al.Infrared Image and Visible Image Fusion Algorithm Based on Unsupervised Deep Learning[J].Journal of Nanjing Normal University(Engineering and Technology),2023,23(04):001.[doi:10.3969/j.issn.1672-1292.2023.01.001]

备注/Memo

备注/Memo:: 收稿日期:2019-07-05.
基金项目:贵州省教育厅创新群体重大研究项目(黔教合KY字[2018]018)、贵州省科技厅重点实验室(黔科合计Z字[2009]4002)、贵州民族大学人文科技学院基金科研项目(18rwjs016).
通讯联系人:王飞,助教,研究方向:图像处理、模式识别. E-mail:wangfei10248@163.com

常用功能

工具/Tools

统计/Statistics

摘要浏览/Viewed2865
全文下载/Downloads2805
评论/Comments

更新日期/Last Update: 2019-12-31