LAPSE:2023.35927
Published Article
LAPSE:2023.35927
Deep Learning Based Methods for Molecular Similarity Searching: A Systematic Review
June 7, 2023
In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target query. Although the approaches perform effectively, particularly when dealing with molecules with homogenous active structures, they fall short when dealing with compounds that have heterogeneous structural compounds. In recent times, deep learning methods have been exploited for improving the performance of molecule searching due to their feature extraction power and generalization capabilities. However, despite numerous research studies on deep-learning-based molecular similarity searches, relatively few secondary research was carried out in the area. This research aims to provide a systematic literature review (SLR) on deep-learning-based molecular similarity searches to enable researchers and practitioners to better understand the current trends and issues in the field. The study accesses 875 distinctive papers from the selected journals and conferences, which were published over the last thirteen years (2010−2023). After the full-text eligibility analysis and careful screening of the abstract, 65 studies were selected for our SLR. The review’s findings showed that the multilayer perceptrons (MLPs) and autoencoders (AEs) are the most frequently used deep learning models for molecular similarity searching; next are the models based on convolutional neural networks (CNNs) techniques. The ChEMBL dataset and DrugBank standard dataset are the two datasets that are most frequently used for the evaluation of deep learning methods for molecular similarity searching based on the results. In addition, the results show that the most popular methods for optimizing the performance of molecular similarity searching are new representation approaches and reweighing features techniques, and, for evaluating the efficiency of deep-learning-based molecular similarity searching, the most widely used metrics are the area under the curve (AUC) and precision measures.
Keywords
deep learning, drug design, drug discovery, molecular similarity searching, virtual screening
Suggested Citation
Nasser M, Yusof UK, Salim N. Deep Learning Based Methods for Molecular Similarity Searching: A Systematic Review. (2023). LAPSE:2023.35927
Author Affiliations
Nasser M: School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Penang, Malaysia [ORCID]
Yusof UK: School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Penang, Malaysia [ORCID]
Salim N: UTM Big Data Centre, Ibnu Sina Institute for Scientific and Industrial Research, Universiti Teknologi Malaysia, Johor Bahru 81310, Johor, Malaysia
Journal Name
Processes
Volume
11
Issue
5
First Page
1340
Year
2023
Publication Date
2023-04-26
Published Version
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr11051340, Publication Type: Review
Record Map
Published Article

LAPSE:2023.35927
This Record
External Link

doi:10.3390/pr11051340
Publisher Version
Download
Files
[Download 1v1.pdf] (1.7 MB)
Jun 7, 2023
Main Article
License
CC BY 4.0
Meta
Record Statistics
Record Views
76
Version History
[v1] (Original Submission)
Jun 7, 2023
 
Verified by curator on
Jun 7, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.35927
 
Original Submitter
Calvin Tsay
Links to Related Works
Directly Related to This Work
Publisher Version