LAPSE:2024.1930
Published Article
LAPSE:2024.1930
An Efficient Multi-Label Classification-Based Municipal Waste Image Identification
August 28, 2024
Sustainable and green waste management has become increasingly crucial due to the rising volume of waste driven by urbanization and population growth. Deep learning models based on image recognition offer potential for advanced waste classification and recycling methods. However, traditional image recognition approaches usually rely on single-label images, neglecting the complexity of real-world waste occurrences. Moreover, there is a scarcity of recognition efforts directed at actual municipal waste data, with most studies confined to laboratory settings. Therefore, we introduce an efficient Query2Label (Q2L) framework, powered by the Vision Transformer (ViT-B/16) as its backbone and complemented by an innovative asymmetric loss function, designed to effectively handle the complexity of multi-label waste image classification. Our experiments on the newly developed municipal waste dataset “Garbage In, Garbage Out”, which includes 25,000 street-level images, each potentially containing up to four types of waste, showcase the Q2L framework’s exceptional ability to identify waste types with an accuracy exceeding 92.36%. Comprehensive ablation experiments, comparing different backbones, loss functions, and models substantiate the efficacy of our approach. Our model achieves superior performance compared to traditional models, with a mean average precision increase of up to 2.39% when utilizing the asymmetric loss function, and switching to ViT-B/16 backbone improves accuracy by 4.75% over ResNet-101.
Record ID
Keywords
asymmetric loss function, multi-label image classification, Query2Label, Vision Transformer, waste management
Subject
Suggested Citation
Wu R, Liu X, Zhang T, Xia J, Li J, Zhu M, Gu G. An Efficient Multi-Label Classification-Based Municipal Waste Image Identification. (2024). LAPSE:2024.1930
Author Affiliations
Wu R: School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
Liu X: School of Intelligent and Information Engineering, Shandong University of Traditional Chinese Medicine, Qingdao 266112, China
Zhang T: School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
Xia J: School of International Education, Beijing University of Chemical Technology, Beijing 100029, China
Li J: School of Information and Electrical Engineering, China Agricultural University, Beijing 100193, China
Zhu M: School of Management, Harbin Institute of Technology, Harbin 150001, China
Gu G: College of Naval Architecture and Ocean Engineering, Naval University of Engineering, Wuhan 430033, China [ORCID]
Liu X: School of Intelligent and Information Engineering, Shandong University of Traditional Chinese Medicine, Qingdao 266112, China
Zhang T: School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
Xia J: School of International Education, Beijing University of Chemical Technology, Beijing 100029, China
Li J: School of Information and Electrical Engineering, China Agricultural University, Beijing 100193, China
Zhu M: School of Management, Harbin Institute of Technology, Harbin 150001, China
Gu G: College of Naval Architecture and Ocean Engineering, Naval University of Engineering, Wuhan 430033, China [ORCID]
Journal Name
Processes
Volume
12
Issue
6
First Page
1075
Year
2024
Publication Date
2024-05-24
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr12061075, Publication Type: Journal Article
Record Map
Published Article
LAPSE:2024.1930
This Record
External Link
https://doi.org/10.3390/pr12061075
Publisher Version
Download
Meta
Record Statistics
Record Views
38
Version History
[v1] (Original Submission)
Aug 28, 2024
Verified by curator on
Aug 28, 2024
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2024.1930
Record Owner
PSE Press
Links to Related Works