LAPSE:2020.1000
Published Article
MPPIF-Net: Identification of Plasmodium Falciparum Parasite Mitochondrial Proteins Using Deep Features with Multilayer Bi-directional LSTM
September 23, 2020
Mitochondrial proteins of Plasmodium falciparum (MPPF) are an important target for anti-malarial drugs, but identifying them through manual experimentation is costly, which in turn prolongs the production of related drugs by pharmaceutical institutions. It is therefore highly desirable for pharmaceutical companies to develop an automated and reliable computational approach that identifies these proteins precisely, so that appropriate drugs can be produced in a timely manner. To this end, several computational techniques have been developed that extract local features from biological sequences using machine learning methods and then apply various classifiers to discriminate the nature of the proteins. Unfortunately, these techniques perform poorly at capturing contextual features from sequence patterns, which yields non-representative classifiers. In this paper, we propose a sequence-based framework that extracts deep, representative features that are trustworthy for identifying Plasmodium mitochondrial proteins. The backbone of the proposed framework is the MPPF identification network (MPPIF-Net), which is based on a convolutional neural network (CNN) with a multilayer bi-directional long short-term memory (MBD-LSTM). MPPIF-Net takes protein sequences as input and passes them through several convolution and pooling layers to extract learned features. These features are then passed to the sequence learning mechanism, MBD-LSTM, which is trained to classify them into their relevant classes. The proposed model is evaluated experimentally on a newly prepared dataset, PF2095, and on two existing benchmark datasets, PF175 and MPD, using the holdout method. It achieved testing accuracies of 97.6%, 97.1%, and 99.5% on the PF2095, PF175, and MPD datasets, respectively, outperforming the state-of-the-art approaches.
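The abstract mentions that protein sequences are fed to the network and that evaluation uses the holdout method. The following is a minimal sketch of those two surrounding steps only, not the authors' implementation: the integer encoding scheme, the padding convention, the 20% test fraction, and the function names `encode_sequence`, `holdout_split`, and `accuracy` are all illustrative assumptions that do not come from the record.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
# Assumption: code 0 is reserved for padding and for unknown residues.
AA_INDEX = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def encode_sequence(seq, max_len):
    """Map a protein sequence to a fixed-length list of integer codes."""
    codes = [AA_INDEX.get(aa, 0) for aa in seq.upper()[:max_len]]
    return codes + [0] * (max_len - len(codes))  # right-pad with zeros


def holdout_split(samples, test_fraction=0.2, seed=0):
    """Shuffle once, then split into disjoint train and test lists."""
    rng = random.Random(seed)
    shuffled = list(samples)
    rng.shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)
```

In such a setup, a model would be fitted on the training part only and `accuracy` would be computed once on the held-out part, which is what a reported "testing accuracy" usually refers to.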
Keywords
bi-directional LSTM, machine learning, mitochondrial protein, Plasmodium falciparum
Suggested Citation
Khan SU, Baik R. MPPIF-Net: Identification of Plasmodium Falciparum Parasite Mitochondrial Proteins Using Deep Features with Multilayer Bi-directional LSTM. (2020). LAPSE:2020.1000
Author Affiliations
Khan SU: Intelligent Media Laboratory, Digital Contents Research Institute, Sejong University, Seoul 143-747, Korea
Baik R: Department of Computer Engineering, Convergence School of ICT, Honam University, #417 Eodeung-daero, Gwangsan-gu, Gwangju 506-090, Korea
Journal Name
Processes
Volume
8
Issue
6
Article Number
E725
Year
2020
Publication Date
2020-06-22
Published Version
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr8060725, Publication Type: Journal Article
External Link

doi:10.3390/pr8060725
Publisher Version
License
CC BY 4.0
Meta
Version History
[v1] (Original Submission)
Sep 23, 2020
 
Verified by curator on
Sep 23, 2020
This Version Number
v1
URL Here
https://psecommunity.org/LAPSE:2020.1000
 
Original Submitter
Calvin Tsay