Cite this asShafat Z, Ahmed A, Parvez MK, Islam A, Parveen S (2022) The dark proteome of rodent hepatitis E virus: Analysis of intrinsically disordered regions. Arch Hepat Res 8(1): 005-011. DOI: 10.17352/ahr.000032
Copyright License© 2022 Shafat Z, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Hepatitis E virus (HEV) is the causative agent of Hepatitis E infections across the world. Intrinsically disordered protein regions (IDPRs) or Intrinsically Disordered Protein (IDPs) are regions or proteins that are characterized by a lack of definite structure. These regions or proteins play significant roles in a wide range of biological processes, such as cell cycle regulation, control of signaling pathways, etc. IDPRs or IDPs in proteins are associated with the virus’s pathogenicity and infectivity. The occurrence of intrinsic disorder in the proteome of rat HEV remains to be elucidated, which prompted us to explore its dark proteome. In this study, the unstructured/disordered regions of ORF proteins of rat HEV have been examined. We have analyzed the prevalence of intrinsic disorder by using a set of computational predictors. The intrinsic disorder propensity analysis showed that the ORF proteins consisted of a varying fraction of intrinsic disorder. The ORF3 protein was identified with a maximum propensity for intrinsic disorder while the protein ORF6 showed the least propensity for the intrinsic disorder. Further, the analysis revealed ORF6 as highly structured protein (ORDP); ORF1 and ORF4 as moderately disordered proteins (IDPRs); and ORF3 and ORF5 as highly disordered proteins, categorizing them as ordered protein (ORDP), a protein having Intrinsically Disordered Region (IDPR) and Intrinsically Disordered Proteins (IDP) respectively. Such disordered regions may play several important roles in the pathogenesis and replication of viruses. Collectively, this comprehensive study data from our investigation suggested ORF protein’s role in the regulation and pathogenesis of rat herpesvirus.
Hepatitis E is inflammation of the liver which is caused by the Hepatitis E virus (HEV) . Worldwide, about 20 million HEV infections and 3.3 million symptomatic hepatitis E cases occur annually which results in 44,000 deaths . HEV is of the family Hepeviridae and belongs to the genus Orthohepevirus . The genome of HEV is a single-stranded positive-sense RNA (7.2 kb in length), which is flanked with short 5′ and 3’ non-coding regions (NCR) . The HEV genome comprises three open reading frames (ORFs): ORF1, ORF2, and ORF3. The ORF1 codes for the viral non-structural polyprotein (pORF1), the ORF2 codes for the viral capsid protein (pORF2), and ORF3 codes for the viral pleiotropic protein (pORF3) .
Besides transmission of HEV through the fecal-oral route (in developing nations), It has been suspected that human infections occur mainly due to zoonotic transmission of HEV that occurs, it is suspected that these human infections result from zoonotic transmission of GT 3 of HEV, wherein, wild boars, domestic pigs and deer act as major reservoirs or host organisms . In particular, commensal rodents also act as an additional reservoir for HEV and may play a major in the epidemiology of hepatitis E [7-9]. Though the genome of rodent herpesviruses is similar in organization to human herpesviruses, however, it has been identified in the year 2009 that the genome of rats from Norway had a dissimilar organization in comparison to other herpesviruses strains . The two complete nucleotide sequences were analyzed from Norway in Germany which suggested a completely separate genotype for these rodent herpesviruses . However, it has been predicted through software that these Norway rat herpesviruses consisted of some additional open reading frames, i.e., ORF1, ORF2, ORF3, ORF4, ORF5, and ORF6 . These nucleotide sequences had high divergence to other HEV strains, i.e., HEV G1, HEV G2, HEV G3, HEV G4, and avian HEV strains . It was also identified that, unlike typical HEV genomic organization, the ORFs ORF1 and ORF3 do not overlap in these two rat HEVs strains . Three additional putative ORFs of 280 - 600 nt that overlap with ORFs 1 or 2 were predicted for each rat HEV genome strain . In this context, this study aims to summarize the features of the ORF encoded proteins of this particular rodent herpesvirus (obtained from Norway Germany).
Recent studies have determined the role of different reading frame encoded proteins in HEV regulation by analyzing their intrinsically disordered regions [11-14], as these regions are linked with virus’s infection and pathogenesis [15-17]. However, a direct correlation between the disordered segments of ORF encoded proteins and viral adaptation has not been discovered in Norway strain comprising additional reading frames. Thus, we attempted to delineate the role of these ORF encoded proteins in rat HEV pathogenesis.
The present study analyzed the structurally “unknown” regions (i.e., a fraction of a proteome that has no detectable similarity to any PDB structure) of the rat HEV. This fraction we call the “dark proteome.” The proteins or protein regions that fail to get folded into definite three-dimensional (3D) structures but remain biologically active are termed as intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs), respectively [18-20]. These disordered protein regions exist as extremely active ensembles that are rapidly interconvertible under different physiological conditions [19-21]. Due to the occurrence of the peculiar phenomenon, i.e., binding of several disordered regions to one ligand or vice-versa (one disordered region binds to many partners), the intrinsically disordered regions are utilized in protein-protein interactions [22,23]. The prevalence of the intrinsic disorder in the proteome of rat herpesvirus remains unknown. The current study reports analysis on the disordered side of rat HEV using computational methods to check the occurrence of intrinsically disordered regions in order to shed some light on their disorder-related functions in HEV adaptation.
The protein sequence (Accession ID: GU345043) of rat HEV was obtained from the NCBI (National Center for Biotechnology Information) GenBank database.
The rat HEV proteins three-dimensional (3D) structural models were automatically generated using Phyre2 (Protein Homology/AnalogY Recognition Engine) server (http://www.sbg.bio.ic.ac.uk/~phyre2/html/page.cgi?id=index) and were used for comparative analysis.
9Intrinsically disordered regions (IDRs) of the rat HEV proteome were predicted using the PONDR® (Predictor of Natural Disordered Regions) (www.pondr.com) at its default settings. Multiple predictors such as members of the PONDR® family including PONDR®VLS2, PONDR®VL3, and PONDR® VLXT were exploited to predict the intrinsic disorder predisposition in rat HEVs. This bioinformatics tool predicts the residues or regions which fail in the propensity for an ordered structure formation. The protein residues with predicted scores between 0.2 and 0.5 were considered as flexible, while the residues which had scores, exceeding the 0.5 threshold value, were predicted as intrinsically disordered ones.
The 3D modeled structures of the ORF proteins of rat HEV were generated through the Phyre2 web server as shown in Figure 1A–F.
The predicted percentage of alpha-helix, beta-strand, and disordered residues in the generated 3D rat HEV proteins are summarized in Table 1.
Therefore, the initial structural analysis revealed that all the rat HEV proteins consisted of disordered regions (Figure 1A-F).
The intrinsic disorder propensity analysis of rat HEV proteins was carried out to elucidate their intrinsic disorder properties. The predicted intrinsic disordered residues obtained from three disorder predictors for ORF encoded proteins of rat HEV are mentioned in Table 2. The resulting disorder profiles of the rat HEV proteins are shown in Figure 2A-F.
On the basis of the predicted percentage of intrinsic disorder and the presence of disordered domain, the different ORF proteins of rat HEV were categorized as follows:
ORF1 protein: The intrinsic disorder analysis showed ORF1 protein as a moderately disordered protein, as it consisted of less than 30% (VLXT, VSL2, and VL3) of the disordered residues in its polypeptide chain with two stretches of disordered domains at positions distinct from N- and C-terminals. Thus, it was categorized into IDPRs, i.e., structured proteins with intrinsically disordered segments of proteins possessing both structured unstructured regions (Table 2).
ORF2 protein: The intrinsic disorder analysis showed ORF2 protein as a highly disordered protein, as it consisted of >30% (as predicted by VLXT and VSL2) and moderately disordered as it consisted of less than 30% (as predicted by VL3) of the disordered residues in its polypeptide chain along with the presence of disordered domain. Thus, on combining these assumptions it was categorized into both IDPs, i.e., proteins having a significant fraction of disordered regions, or IDPRs, i.e., structured proteins with intrinsically disordered segments (Table 2).
ORF3 protein: The intrinsic disorder analysis showed ORF3 protein as a highly disordered protein, as it consisted of >30% (VLXT, VSL2, and VL3) of the disordered residues in its polypeptide chain. Thus, it was categorized into IDPs (Table 2).
ORF4 protein: The intrinsic disorder analysis showed ORF4 protein as a moderately disordered protein, as it consisted of less than 30% (VLXT, VSL2, and VL3) of the disordered residues in its polypeptide chain with a stretch of a disordered domain at the N-terminus. Thus, it was categorized into IDPRs, i.e., structured proteins with intrinsically disordered segments (Table 2).
ORF5 protein: The intrinsic disorder analysis showed ORF5 protein as a highly disordered protein, as it consisted of >30% (VLXT, VSL2, and VL3) of the disordered residues in its polypeptide chain along with possession of long disordered domain towards the C-terminus. Thus, it was categorized into IDPs (Table 2).
ORF6 protein: The intrinsic disorder analysis showed ORF6 protein as a structured protein, as it consisted of less than 30% (VLXT, VSL2, and VL3) of the disordered residues in its polypeptide chain without the presence of any disordered domain. Thus, it was categorized into ORDPs, i.e., proteins possessing a significant amount of structure (Table 2).
The intrinsic disorder is linked with the pathogenesis and infection of the viruses [15-17]. To complete the life cycle, viruses require various interactions with the components of the host cells. Beginning from the virus’s attachment, its entry, commandeering the host machinery, synthesis of the viral components, and particle assembly to the last phase, i.e., exiting as new infectious particles from the host cell . All these stages rely heavily on the intrinsic disorder prevalent in viral proteins . The biology of the unstructured regions of the Norway rat HEV, comprising additional reading frames, remains to be explored. Therefore, the present study reports the analysis on the unstructured regions of the ORF encoded proteins of rat herpesvirus to shed novel light on its functionality in HEV regulation.
Analysis of protein structure provides a detailed understanding of its function. In this context, the rat HEV protein structures were examined using a web portal for protein modeling and analysis. A study has suggested that loops/coils are not necessarily disordered, however, protein disorder is only found within loops . Thus, it was revealed that the modeled 3D structures of rat HEV proteins were identified with all three major secondary structure states, i.e., alpha-helix, beta-strand, and loops/coils. Therefore, our initial investigation showed the prevalence of the intrinsic disorder in the rat HEV proteome. The specific role of disordered regions in several nonstructural proteins has been demonstrated to participate in the multiplication and regulatory functions of viruses . For instance, a recent study has shown the involvement of ORF4 protein in the regulation and pathogenesis of HEV due to the presence of a significant fraction of disordered regions . The disordered regions in the ORF1 Y-domain of HEV have also been shown to perform a crucial role in its pathogenesis due to its intrinsic disorder phenomenon . In HDV (hepatitis delta virus), the translation of a delta antigen (a single basic protein) forms the basis of its replication, which is considered as an IDP molecule . Via both experimental and computational studies . The HCV (Hepatitis C virus) interacts with several viral and host proteins required for its replication via its disordered nonstructural NS5A protein domain [29,30]. These protein-protein interactions result in the occurrence of several significant biological functions. Moreover, the PPR (Polyproline region) of nonstructural ORF1 has been associated with the regulation of HEV in addition to its role in replication, due to its characteristic intrinsic disorder property .
The rat HEV proteins were initially categorized on the basis of the overall degree of intrinsic disorder. The categories included structured proteins (0–10%), moderately disordered proteins, and highly disordered proteins (30–100%) [32,33]. Additionally, the ORF proteins were categorized on the basis of the length of disordered domains and an overall fraction of disordered residues. The categories consisted of ordered proteins (ORDPs); intrinsically disordered protein regions (IDPRs) and intrinsically disordered proteins (IDPs) [18,32]. ORDPs are intrinsic disorder protein variants that consist of less than 30% of disordered residues without disordered domain (consecutive disordered residues) at either C- or N-terminus or in positions distinct from the N- and C-terminals. IDPRs are intrinsic disorder protein variants that consist of less than 30% of disordered residues with a disordered domain at either C- or N-terminus or in positions distinct from the N- and C-terminals. IDPs are intrinsic disorder protein variants that consist of more than 30% of disordered residues. On summing up these criteria, our intrinsic disorder propensity analysis revealed ORF3 protein as the most disordered protein and ORF6 protein as the most ordered protein in the rat HEV proteome. Interestingly, we found out that the rat HEV proteome was identified with all the three intrinsic disorder variants, such as ORDP, IDPR, and IDP (Table 3).
Recent investigation on HEV proteome, consisting of three ORF encoded proteins (ORF1, ORF2, and ORF3), was carried out by analyzing its intrinsic disorder . The . In the current study, ORF3 protein possessed the highest fraction of intrinsic disorder suggesting it as an IDP which shows consistency with the previous study suggesting ORF3 as a highly disordered protein (IDP) . Moreover, our result on ORF2 protein substantiates the previous finding which showed ORF2 protein possessed a disordered segment at its N-terminus . Furthermore, our analysis revealed ORF4 as an IDPR which is in line with the previous study that showed the ORF4 obtained from the host rat is an IDPR variant . Taken together our observations, could be interpreted that ORF3 has the highest percentage of a fraction of disordered residues followed by ORF2 and ORF1 has the least fraction of disordered residues. Thus, it is noteworthy to mention that our findings are in agreement with the earlier study which demonstrated that the ORF3 had the highest prevalence of disordered residues followed by ORF2, which had a comparatively lesser fraction of intrinsic disorder, while the ORF1 had the least number of disordered residues in the HEV proteome .
The “IDPR/IDP” is defined as the disordered region in protein or disordered protein. These regions/proteins perform several significant roles in a variety of biological processes, such as control of signaling pathways, cell cycle regulation, etc. [16,22,23]. It has been suggested that IDPRs/IDPs achieve their signaling cascade by binding to their partners with low affinity and high specificity . Thus, the proteins, such as ORF1, ORF2, and ORF4 can play crucial roles in important biological processes as IDPRs. IDP plays a significant role in the recognition, signaling, regulation, and control of Protein-Protein Interaction (PPI) networks . IDPs form essential components of cellular signaling machinery due to their ability to interact differently which results in different consequences . Moreover, they are characterized by enormous flexibility and random conformation (coiled-like). Thus, taken together, these distinctive features enable IDPs to participate in one too many and vice-versa interaction [37-39]. Particularly, like IDP, ORF3 may possibly perform a crucial role in the viral regulation via PPI.
Thus, taken together, our analysis suggests that the disordered regions prevalent in rat HEV proteome, as IDPRs/IDPs, could perform significant and diverse biological roles through PPIs.
The current study provides novel intrinsic disorder analysis on the rat HEV proteome. Our data revealed the occurrence of all intrinsic disorder variants (ORDP, IDPR, and IDP) in the proteome. The ORF3 protein was identified as the most disordered protein and ORF6 protein consisted of the least fraction of intrinsic disorder. The occurrence of the unstructured regions suggested that rat HEV proteins could be engaged in diverse and essential biological functions. Further, complete experimental insights into the disorders of these viral proteins might help in identifying protein functions and the biology of rat HEV.
The authors would like to acknowledge Maulana Azad National Fellowship (MANF), University Grant Commission (UGC), Council of Scientific and Industrial Research (CSIR) (37(1697)17/EMR-II) and Central Council for Research in Unani Medicine (CCRUM), Ministry of Ayurveda, Yoga and Neuropathy, Unani, Siddha and Homeopathy (AYUSH) (F.No.3-63/2019- CCRUM/Tech) supported by the Government of India.
Subscribe to our articles alerts and stay tuned.