纺织学报 ›› 2023, Vol. 44 ›› Issue (10): 143-148.doi: 10.13475/j.fzxb.20221105601

• 服装工程 • 上一篇    下一篇

基于ClothResNet模型的人体衣物颜色识别

黄玥玥, 陈晓, 王海燕(), 姚海洋   

  1. 陕西科技大学 电子信息与人工智能学院, 陕西 西安 710016
  • 收稿日期:2022-11-21 修回日期:2023-04-08 出版日期:2023-10-15 发布日期:2023-12-07
  • 通讯作者: 王海燕(1965—),男,教授,博士。主要研究方向为信号与信息处理、计算机视觉及人工智能与机器学习。E-mail:hywang@sust.edu.cn
  • 作者简介:黄玥玥(1997—),女,硕士生。主要研究方向为计算机视觉、深度学习与智能服装。
  • 基金资助:
    国家自然科学基金重点项目(62031021)

Human clothing color recognition based on ClothResNet model

HUANG Yueyue, CHEN Xiao, WANG Haiyan(), YAO Haiyang   

  1. College of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an, Shaanxi 710016, China
  • Received:2022-11-21 Revised:2023-04-08 Published:2023-10-15 Online:2023-12-07

摘要:

针对人体衣物属性识别存在衣物分割困难、颜色易扭曲、识别效率和准确率低的问题,提出一种基于ClothResNet模型的高效人体衣物颜色识别方法,模型以ResNet18网络为基础,设计优化的金字塔池化模块以捕获图像中的多层语义信息,融合坐标注意力机制以关注人体衣物轮廓信息,融入空洞卷积以提升网络效率,从而利用多重信息提高人体衣物识别精确率。针对人体衣物属性数据集不足的问题,利用公开数据集活动模板(回归ATR)原始彩色图像构建衣物属性数据集pcaparsing。性能对比实验表明,ClothResNet模型在衣物颜色属性识别中的平均精确率达到94.49%,结果均优于同类方法。该识别方法在中小规模的人体衣物颜色识别系统具有广泛且重要的应用前景。

关键词: 人体衣物颜色识别, 残差网络, 优化的金字塔池化, ClothResNet模型

Abstract:

Objective Color recognition of human clothing has become a topic with widespread interest recent years due to its potential to address various issues such as clothing retail, smart security, and fashion recommendation. An accurate color recognition model for human clothing can greatly enhance user experience and service quality. However, accurately recognizing the color of clothing on a human body in images can be challenging due to the multi-angle and multi-pose nature of the clothing, as well as the influence of the material and texture of the clothing, the color and texture of the human body background, and other factors. Therefore, constructing an efficient and accurate human clothing color recognition model is important and necessary to be solved in order to improve recognition accuracy and stability.

Method To enable color recognition of human clothing, a human clothing attribute dataset called pcaparsing was first created. Then, an end-to-end convolutional neural network model called ClothResNet was constructed, which used ResNet18 as the backbone network. The model also featured an optimized pyramid pooling module responsible for capturing multi-level semantic information and a coordinate attention mechanism with a focus on the contour information of human clothing. Additionally, atrous convolution was used to improve network efficiency. The dataset was split into training and testing data at an 8∶2 ratio, with 80% of the dataset used for training the ClothResNet model and the remaining 20% used for testing its effectiveness. Comparative experiments were conducted between the traditional clothing color recognition methods of K-means and Hog-KNN, the deep learning clothing color recognition method of CNN, and the method proposed in this paper. Ablation experiments were also conducted to demonstrate the effectiveness of selecting ResNet18 as the backbone network, expanding the dataset, and adding each module to the model. Overall, the study aimed to improve the accuracy and efficiency of color recognition for human clothing.

Results This study aimed to recognize the color of human clothing in natural scene images using a convolutional neural network algorithm. The use of proposed network model, ClothResNet, achieved 94.49% accuracy rate in recognizing 12 different colors, demonstrating the feasibility and effectiveness of the deep learning method compared to traditional methods. To evaluate the effectiveness of the ClothResNet model, comparative experiments were conducted with traditional methods, and ablation experiments were designed and carried out. Expanding the dataset significantly improved the evaluation indicators of each network (Tab. 1 and Tab. 2). Furthermore, the addition of the pyramid pooling block and coordinated attention module further enhanced the performance of the model. These ablation experiments demonstrated the effectiveness of the extended dataset and network modules, laying the foundation for future work on automatic color recognition of human clothing. Overall, this study highlights the potential of deep learning methods for accurately and efficiently recognizing clothing colors in natural scene images.

Conclusion In this study, a novel human clothing color recognition model called ClothResNet was proposed, which utilizes ResNet18 as the backbone network and incorporates an improved pyramid pooling module and a coordinate attention mechanism. By combining the strengths of an end-to-end convolutional neural network, this model has led improved recognition of various human clothing colors. Through a series of experiments, we verified the feasibility and effectiveness of our proposed method. This approach provides new ideas for the development of smart clothing, although there is still room for improvement. For instance, there are far more than 12 colors of clothing in reality, so further research is needed to develop methods for recognizing a broader range of colors.

Key words: human clothing color recognition, residual network, optimized pyramid pooling, ClothResNet model

中图分类号: 

  • TS101

图1

样本集示例图片"

图2

数据扩充示例"

图3

ClothResNet 网络结构"

图4

改进的金字塔池化网络结构"

图5

坐标注意力模块"

图6

K-means、Hog-KNN、CNN、ClothResNet识别颜色精确率的对比图"

表1

原始数据集消融实验对比"

模型 精确率/% 平均交并比/% 参数量/MB
ResNet18 65.40 32.50 51.74
ResNet18+PP 68.30 40.30 52.73
ResNet18+OPP 71.67 46.80 62.08
ResNet18+CA 70.24 36.37 53.09
ClothResNet 90.20 73.20 62.15

表2

扩充数据集消融实验对比"

模型 精确率/% 平均交并比/% 参数量/MB
ResNet18 68.90 37.5 51.74
AlexNet 67.60 37.4 233.08
VGG11 62.10 35.4 506.83
ResNet18+PP 71.50 45.2 52.73
ResNet18+OPP 76.90 50.2 62.08
ResNet18+CA 75.35 40.7 53.09
ClothResNet 94.49 76.0 62.15
[1] YANG X, YUAN S, TIAN Y L. Assistive clothing pattern recognition for visually impaired people[J]. IEEE Transactions on Human-Machine Systems, 2014, 44(2): 234-243.
doi: 10.1109/THMS.2014.2302814
[2] HIDAYATI S C, YOU C W, CHENG W H, et al. Learning and recognition of clothing genres from full-body images[J]. IEEE Transactions on Cybernetics, 2017, 48(5): 1647-1659.
doi: 10.1109/TCYB.2017.2712634
[3] 许倩, 陈敏之. 基于深度学习的服装丝缕平衡性评价系统[J]. 纺织学报, 2019, 40(10): 191-195.
XU Qian, CHEN Minzhi. An evaluation system for the balance of clothing threads based on deep learning[J]. Journal of Textile Research, 2019, 40(10): 191-195.
doi: 10.1177/004051757004000213
[4] 江学为, 田润雨, 卢方骁, 等. 基于模拟评分的服装推荐改进算法[J]. 纺织学报, 2021, 42(12): 138-144.
JIANG Xuewei, TIAN Runyu, LU Fangxiao, et al. An improved algorithm for clothing recommendation based on simulated scoring[J]. Journal of Textile Science, 2021, 42(12): 138-144.
[5] CHANG Y H, ZHANG Y Y. Deep learning for clothing style recognition using YOLOv5[J]. Micromachines, 2022. DOI: 10.3390/mi13101678.
[6] 沈建冬, 陈恒. 融合HOG和颜色特征的人体姿态估计新算法[J]. 计算机工程与应用, 2017, 53(21): 190-194.
doi: 10.3778/j.issn.1002-8331.1606-0319
SHEN Jiandong, CHEN Heng. A new algorithm for human pose estimation based on hog and color features[J]. Computer Engineering and Applications, 2017, 53(21): 190-194.
doi: 10.3778/j.issn.1002-8331.1606-0319
[7] 刘正东, 刘以涵, 王首人. 西装识别的深度学习方法[J]. 纺织学报, 2019, 40(4): 158-164.
LIU Zhengdong, LIU Yihan, WANG Shouren. Deep learning method for suit recognition[J]. Journal of Textile Research, 2019, 40(4): 158-164.
doi: 10.1177/004051757004000209
[8] LIKAS A, VLASSIS N, VERBEEK J J. The global K-means clustering algorithm[J]. Pattern Recognition, 2003, 36(2): 451-461.
doi: 10.1016/S0031-3203(02)00060-2
[9] COMANICIU D, MEER P. Mean shift: a robust approach toward feature space analysis[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5): 603-619.
doi: 10.1109/34.1000236
[10] IVANOV A Y, BORZUNOV G I, KOGOS K. Recognition and identification of the clothes in the photo or video using neural networks[C]// 2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus). Petersburg:IEEE, 2018: 1513-1516.
[11] ROY M A. Clothing recognition using deep learning techniques[D]. Bangkok: Asian Institute of Technology, 2019: 1-30.
[12] LIU Z, LUO P, QIU S, et al. Deepfashion: powering robust clothes recognition and retrieval with rich annotations[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 1096-1104.
[13] CHEN Bunyanon C, JIANG J H. Clothing classification with multi-attribute using convolutional neural net-work[C]// Proceedings of International Computer Symposium. Singapore: Springer, 2018: 190-196.
[14] LIANG X, LIU S, SHEN X, et al. Deep human parsing with active template regression[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(12): 2402-2414.
doi: 10.1109/TPAMI.2015.2408360 pmid: 26539846
[1] 李麒阳, 季诚昌, 郗欣甫, 孙以泽. 大尺寸异形结构芯模编织策略及纱线轨迹预测[J]. 纺织学报, 2023, 44(10): 188-195.
[2] 张子凡, 李鹏飞, 王建南, 许建梅. 丝素蛋白载药纳米粒的研究进展[J]. 纺织学报, 2023, 44(10): 205-213.
[3] 姜绍华, 梁帅童, 裴刘军, 张红娟, 王际平. 基于概率密度函数的织物染色侵入动力学分析[J]. 纺织学报, 2023, 44(10): 90-97.
[4] 姚双双, 付少举, 张佩华, 孙秀丽. 再生丝素蛋白/聚乙烯醇共混取向纳米纤维膜的制备与性能[J]. 纺织学报, 2023, 44(09): 11-19.
[5] 罗元泽, 戴梦男, 李蒙, 俞杨销, 王建南. 丝素蛋白基药物载体的应用研究进展[J]. 纺织学报, 2023, 44(09): 213-222.
[6] 汪泽幸, 周衡书, 杨敏, 谭冬宜. 不同循环加载路径下黄麻织物/聚乙烯复合材料的变形特性[J]. 纺织学报, 2023, 44(09): 99-107.
[7] 安雪, 刘太奇, 李言, 赵小龙. 牢固结合的多层纳米纤维复合材料的制备及其过滤性能[J]. 纺织学报, 2023, 44(08): 50-56.
[8] 付晗, 胡峰, 龚杰, 余联庆. 面向织物疵点检测的缺陷重构方法[J]. 纺织学报, 2023, 44(07): 103-109.
[9] 李龙, 张弦, 吴磊. 导电纱线制备方法与应用的研究进展[J]. 纺织学报, 2023, 44(07): 214-221.
[10] 成悦, 左涵, 安琪, 李大伟, 张伟, 付译鋆. 非可吸收倒刺缝合线握持力及其影响因素[J]. 纺织学报, 2023, 44(06): 66-71.
[11] 杨宏脉, 张效栋, 闫宁, 朱琳琳, 李娜娜. 一种高鲁棒性经编机上断纱在线检测算法[J]. 纺织学报, 2023, 44(05): 139-146.
[12] 胡安钟, 王成成, 钟子恒, 张丽平, 付少海. 氮化硼纳米片掺杂型快速响应温致变色织物的制备及其性能[J]. 纺织学报, 2023, 44(05): 164-170.
[13] 谢鹏浩, 李勇, 陈晓川, 汪军. 基于分形的棉条建模及其牵伸过程的仿真[J]. 纺织学报, 2023, 44(04): 55-62.
[14] 刘东炎, 郑成燕, 王晓旭, 钱坤, 张典堂. 超高分子量聚乙烯织物/聚脲柔性复合材料的抗破片侵彻机制[J]. 纺织学报, 2023, 44(03): 79-87.
[15] 姜博宸, 王玥, 王富军, 林婧, 郭爱军, 王璐, 关国平. 一体化机械编织食管覆膜支架的力学性能与编织参数关系[J]. 纺织学报, 2023, 44(03): 88-95.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!