PhageVir: An evaluation of computational intelligence models for the precise identification of phage virion proteins

Authors: Nashwan Alromema 1, *, Hussnain Arshad 2, Sharaf J. Malebary 3, Faisal Binzagr 1, Yaser Daanial Khan 4

Affiliations:

1Department of Computer Science, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
2Department of Artificial Intelligence, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
3Department of Information Technology, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
4Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan

Abstract

This study presents PhageVir, an enhanced computational model developed to predict Phage Virion Proteins (PVPs), which are essential for bacteriophage infection and replication. PhageVir integrates advanced feature selection methods, including the Position Relative Incidence Matrix (PRIM) and the Reverse Position Relative Incidence Matrix (RPRIM), to effectively capture key sequence features and positional dependencies within protein sequences. Several machine learning and deep learning algorithms were employed, including LightGBM, Random Forest, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), and Artificial Neural Network (ANN), to classify PVPs based on sequential data. Model performance was evaluated through independent set testing, self-consistency testing, and cross-validation, using metrics such as accuracy (ACC), specificity (Sp), sensitivity (SN), Z-score, and Matthews correlation coefficient (MCC). The CNN model demonstrated strong performance in cross-validation, achieving an accuracy of 0.833, sensitivity of 0.832, specificity of 0.834, a correlation coefficient of 0.665, an AUC score of 0.927, and a Z-score of 1.37. The results confirm the effectiveness of the proposed computational approach for accurate PVP classification. Beyond its predictive power, PhageVir offers valuable biological insights into phage infection mechanisms, supporting advancements in phage therapy and antibacterial treatments.

Keywords

Phage virion proteins, Computational model, Feature selection, Deep learning, Phage therapy

Download

📄 Full PDF

DOI

https://doi.org/10.21833/ijaas.2025.05.013

Citation (APA)

Alromema, N., Arshad, H., Malebary, S. J., Binzagr, F., & Khan, Y. D. (2025). PhageVir: An evaluation of computational intelligence models for the precise identification of phage virion proteins. International Journal of Advanced and Applied Sciences, 12(5), 129–147. https://doi.org/10.21833/ijaas.2025.05.013