Authors: Nashwan Alromema 1, *, Hussnain Arshad 2, Sharaf J. Malebary 3, Faisal Binzagr 1, Yaser Daanial Khan 4
Affiliations:
1Department of Computer Science, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
2Department of Artificial Intelligence, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
3Department of Information Technology, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
4Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
This study presents PhageVir, an enhanced computational model developed to predict Phage Virion Proteins (PVPs), which are essential for bacteriophage infection and replication. PhageVir integrates advanced feature selection methods, including the Position Relative Incidence Matrix (PRIM) and the Reverse Position Relative Incidence Matrix (RPRIM), to effectively capture key sequence features and positional dependencies within protein sequences. Several machine learning and deep learning algorithms were employed, including LightGBM, Random Forest, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), and Artificial Neural Network (ANN), to classify PVPs based on sequential data. Model performance was evaluated through independent set testing, self-consistency testing, and cross-validation, using metrics such as accuracy (ACC), specificity (Sp), sensitivity (SN), Z-score, and Matthews correlation coefficient (MCC). The CNN model demonstrated strong performance in cross-validation, achieving an accuracy of 0.833, sensitivity of 0.832, specificity of 0.834, a correlation coefficient of 0.665, an AUC score of 0.927, and a Z-score of 1.37. The results confirm the effectiveness of the proposed computational approach for accurate PVP classification. Beyond its predictive power, PhageVir offers valuable biological insights into phage infection mechanisms, supporting advancements in phage therapy and antibacterial treatments.
Phage virion proteins, Computational model, Feature selection, Deep learning, Phage therapy
https://doi.org/10.21833/ijaas.2025.05.013
Alromema, N., Arshad, H., Malebary, S. J., Binzagr, F., & Khan, Y. D. (2025). PhageVir: An evaluation of computational intelligence models for the precise identification of phage virion proteins. International Journal of Advanced and Applied Sciences, 12(5), 129–147. https://doi.org/10.21833/ijaas.2025.05.013