Abstract
Background: A large number of allergens are derived from plant and animal proteins. A major challenge for researchers is to study the possible allergenic properties of proteins. The aim of this study was in silico analysis and comparison of several physiochemical and structural features of plant- and animal-derived allergen proteins, as well as classifying these proteins based on Chou’s pseudo-amino acid composition (PseAAC) concept combined with bioinformatics algorithms. Methods: The physiochemical properties and secondary structure of plant and animal allergens were studied. The classification of the sequences was done using the PseAAC concept incorporated with the deep learning algorithm. Conserved motifs of plant and animal proteins were discovered using the MEME tool. B-cell and T-cell epitopes of the proteins were predicted in conserved motifs. Allergenicity and amino acid composition of epitopes were also analyzed via bioinformatics servers. Results: In comparison of physiochemical features of animal and plant allergens, extinction coefficient was different significantly. Secondary structure prediction showed more random coiled structure in plant allergen proteins compared with animal proteins. Classification of proteins based on PseAAC achieved 88.24% accuracy. The amino acid composition study of predicted B- and T-cell epitopes revealed more aliphatic index in plant-derived epitopes. Conclusions: The results indicated that bioinformatics-based studies could be useful in comparing plant and animal allergens.