Retinoblastoma is the most common type of eye cancer in children. Four categories are used to classify the disease. One classification is based on whether the condition affects one eye or both; these are referred to as unilateral and bilateral, respectively. Based on gene expression analysis, retinoblastomas can be classified into two groups. Group 1 exhibits an invasive tumour pattern together with a variety of retinal cell types. Group 2 exhibits a distinct cone photoreceptor expression profile. RBBP9 is a binding partner of the retinoblastoma susceptibility protein (Rb) and is involved in human cancer pathways. The sequence of RBBP9 contains LxCxE, the Rb-binding motif. Yeast two-hybrid experiments revealed that RBBP9 interacts with Rb. RBBP9 is 21 kD in size, and its crystal structure has been determined. RBBP9 has been implicated in pancreatic cancer, where the protein's serine hydrolase activity is necessary for the development of carcinoma. This activity suppresses Smad2/3 phosphorylation, which in turn inhibits TGF-β antiproliferative signalling. The goals of this study are to: visualise the protein structure in PyMOL; determine whether the protein is hydrophobic or hydrophilic; check for protein conservation across other species using the multiple alignment tool COBALT; identify both chains (A and B) in PyMOL; examine the relationship between 2QS9 and 7OEX (a protein very similar to 2QS9) in PyMOL; perform multiple sequence alignment with Clustal Omega to determine the nature of the protein; plot the Ramachandran plot with Biopython and the SAVES server to visualise the energetically allowed regions; and perform molecular docking of 2QS9 and Topotecan using CB-Dock2.
Retinoblastoma is the most common type of eye cancer in children. The main signs of the illness are leukocoria, strabismus, buphthalmos, advanced intraocular tumour, extraocular tumour involving orbital tissue, and cellulitis. Four categories are used to classify the disease. One classification is based on whether the condition affects one eye or both; these are referred to as unilateral and bilateral, respectively. The other classification, intraocular versus extraocular, depends on whether the disease is confined to the eyeball or extends to structures outside it (1). Retinoblastoma affects 1 in 14,000–20,000 live births worldwide. About 40% of cases are bilateral and 60% are unilateral. An illustration of a healthy eye and an affected eye is shown below. In the affected eye, the cancerous cells originate in the retina. The disorder can be germline or somatic in origin. The causative gene of retinoblastoma is RB1, and most bilateral retinoblastomas are caused by germline pathogenic mutations in this gene. Of unilateral retinoblastomas, 10% to 15% result from germline pathogenic mutations, whereas the remaining 85% to 90% are caused by somatic changes (2).
Gene expression profiling distinguishes two groups of retinoblastomas. Group 1 contains a range of retinal cell types and shows an invasive tumour pattern. Group 2 shows a unique cone photoreceptor expression profile. Examples of unilateral and bilateral retinoblastoma are given below (3).
Unilateral retinoblastoma: one eye shows leukocoria, the white pupillary reflex.
Bilateral retinoblastoma: both eyes show leukocoria.
RBBP9 is a binding partner of the retinoblastoma susceptibility protein (Rb) and is important in human cancer pathways. The RBBP9 sequence contains LxCxE, the Rb-binding motif. RBBP9 was shown to interact with Rb in yeast two-hybrid and co-immunoprecipitation experiments. Substitution of Glutamine for Leucine in this motif blocked the binding of Rb. RBBP9 is 21 kD in size, and its crystal structure has been studied. RBBP9 has been implicated in pancreatic cancer. The serine hydrolase activity of the protein is necessary for the development of pancreatic carcinoma. This activity inhibits TGF-β antiproliferative signalling by suppressing Smad2/3 phosphorylation (4).
The crystal structure of RBBP9 is given below:
Figure 1: RBBP9 interaction with other Rb family proteins (image taken from Sergey M. Vorobiev et al., 2012).
Figure 2: Image taken from Sergey M. Vorobiev et al., 2012.
The drugs used in this study were retrieved from PubChem. PubChem is an indispensable resource for chemistry and the biological sciences. This large database is maintained by the National Center for Biotechnology Information (NCBI), a division of the US National Library of Medicine (NLM). The main goal of PubChem is to provide free access to data on the biological activities of small molecules [5]. The protein structure was obtained from the PDB database. The protein's PDB ID is 6U9N. The Protein Data Bank (PDB) is an essential repository for three-dimensional structural data on biological macromolecules, primarily proteins and nucleic acids. The archived data are freely accessible to researchers worldwide, enabling them to explore the structures of proteins, nucleic acids, and complex assemblies. Scientists use this information for many purposes, including understanding biological function, drug discovery, protein engineering, and molecular modelling [6]. RasMol was then used to study the protein structure, and PyMOL was used to analyse the ligand-protein complex interaction. RasMol is an influential, pioneering molecular visualization program that has contributed significantly to structural biology. It lets users rotate, translate, and zoom into three-dimensional representations of molecules and so examine different angles, surface properties, and structural details, which aids the understanding of molecular interactions, folding patterns, and active sites [7,8].
PyMOL offers a wide range of tools for structural analysis and exploration. It allows users to measure distances, angles, and dihedral angles, align structures, analyse electrostatic potentials, generate molecular surfaces, and visualise molecular dynamics trajectories [9]. BLAST and COBALT were used for sequence similarity and phylogenetic analysis of the protein, respectively. BLAST (Basic Local Alignment Search Tool) is a fundamental and widely used bioinformatics algorithm and software tool. It compares biological sequences, such as DNA, RNA, or protein sequences, against large databases to identify similarities and infer functional, structural, or evolutionary relationships between sequences. BLAST output includes statistical measures of the significance of each match. This information helps researchers assess the likelihood that a match occurred by chance and prioritise the most relevant matches for further investigation [10,11]. COBALT takes a constraint-based approach to multiple sequence alignment. It gathers a set of pairwise constraints derived from various sources, including database searches, sequence similarity data, and user input. These constraints guide the alignment process [12]. CB-Dock2 was used for docking the protein and the drug. CB-Dock2 is a user-friendly web server designed for blind docking, meaning that it predicts binding modes without prior information about binding sites. CB-Dock2 reported a success rate of around 85% for binding pose prediction, where success means a root-mean-square deviation (RMSD) below 2.0 Å [13]. A further tool was used to compute various physicochemical properties and analyse the protein sequence [14]. Protein interactions and gene ontology were then examined using the STRING tool.
STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) is a bioinformatics database and web-based tool that consolidates and predicts protein-protein interactions (PPIs) and functional associations across various organisms.
It combines predicted and known interactions to provide a thorough resource for examining protein interactions and their functional implications (15). Protein quality was assessed using ERRAT and SAVES. ERRAT is a bioinformatics tool that analyses protein structures from their atomic coordinates. It evaluates the statistics of interactions between non-bonded atoms and flags faults or anomalies in the structure. By comparing the distribution of atomic interactions with that of high-resolution structures, ERRAT calculates a quality factor that represents the overall quality of the model. SAVES (Structural Analysis and Verification Server), in turn, is a collection of tools for evaluating the accuracy and reliability of protein structures [16,17].
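The 2.0 Å RMSD criterion quoted for binding pose prediction can be illustrated with a minimal sketch. It assumes the predicted and reference ligand poses list the same atoms in the same order and are already in the same receptor frame, so no superposition is performed. The function names and coordinates are hypothetical and are not CB-Dock2 output.

```python
import math

def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally ordered lists of (x, y, z)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))

def is_successful_pose(predicted, reference, cutoff=2.0):
    """A pose counts as successful when its RMSD to the reference is below cutoff (Å)."""
    return rmsd(predicted, reference) < cutoff

# Hypothetical three-atom ligand, coordinates in Å (illustration only).
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
predicted = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (1.5, 1.5, 1.0)]

print(round(rmsd(predicted, reference), 2), is_successful_pose(predicted, reference))
```

Here every atom is displaced by 1 Å, so the RMSD is 1 Å and the pose falls under the 2.0 Å threshold.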
Figure 3: Representation of the N- and C-termini of 2QS9. The N-terminus is blue and the C-terminus is red.
Figure 4: Presentation of helix, sheet and loop in PyMOL
Figure 5: Multiple alignment results from COBALT on a hydropathy scale, showing the hydrophobic nature of the protein. On the hydropathy scale, red bars are hydrophobic and blue bars are hydrophilic. Red bars outnumber blue bars, which indicates that the protein is hydrophobic.
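The red-versus-blue comparison above can also be expressed numerically. The sketch below assumes the Kyte-Doolittle hydropathy scale, which may differ from the scale COBALT uses for colouring. It counts residues with positive and non-positive hydropathy and computes the mean hydropathy (GRAVY). The example sequence is a hypothetical fragment, not the RBBP9 sequence.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def gravy(sequence):
    """Grand average of hydropathy: mean Kyte-Doolittle value over the sequence."""
    seq = sequence.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)

def hydropathy_counts(sequence):
    """Return (hydrophobic, hydrophilic) residue counts, split at a hydropathy of zero."""
    seq = sequence.upper()
    hydrophobic = sum(1 for aa in seq if KYTE_DOOLITTLE[aa] > 0)
    return hydrophobic, len(seq) - hydrophobic

# Hypothetical fragment used only to show the calculation.
fragment = "MAILVKDE"
print(hydropathy_counts(fragment), round(gravy(fragment), 3))
```

A positive GRAVY value points to an overall hydrophobic sequence and a negative value to a hydrophilic one, which gives a single number to set beside the bar counts.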
Sequence similarity analysis
Description of RBBP9 across species with percentage identity. Here we compare the percentage identity of RBBP9 (Homo sapiens) with that of the same protein in other species. 7OEX is identical to 2QS9.
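Percentage identity between two aligned sequences can be computed as in the sketch below. It assumes the sequences are already aligned to equal length with `-` as the gap character, and it takes the number of gap-free columns as the denominator. Alignment tools differ in the denominator they use, so the value may not match BLAST or COBALT exactly. The aligned strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over the gap-free columns of an alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared

# Hypothetical aligned fragments (illustration only).
human = "MKT-LV"
other = "MKTALI"
print(percent_identity(human, other))
```

Two identical sequences, as reported here for 2QS9 and 7OEX, would give 100% under this definition.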
Figure 6: Active site representation in 2QS9 (Chain A and Chain B)
Figure 7: Protein-ligand interaction with the ligand Topotecan, with an AutoDock Vina score of -8.3
Figure 8: Protein-ligand interaction with the ligand Cytoxan (also known as cyclophosphamide), with an AutoDock Vina score of -4.5
Figure 9: Docking score table of molecular docking interactions of 2QS9 and Topotecan
Figure 10: Docking score table of molecular docking interactions of 2QS9 and Cytoxan
Active Residues where the Protein Ligand interaction was observed (Topotecan)
Chain A: LYS42 ASN43 PRO45 ASP46 PRO47 ILE48 THR49 ARG51 ILE54 PHE58 GLU62 ASN107
Chain B: ARG51 GLU52 SER53 ILE54 LEU56 PRO57 ARG83 THR87 HIS88 GLU106 ARG109 ALA110 SER111 GLY112 THR115 ARG116 PRO117
Active Residues where the Protein Ligand interaction was observed (Cytoxan also known as cyclophosphamide)
Chain A: PRO45 ASP46 PRO47 ILE48 THR49 ARG51 ILE54 PHE58
Chain B: ARG51 GLU52 SER53 LEU56 PRO57 GLU86 ALA110 SER111 GLY112 TYR113 THR115 ARG116
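The two residue lists above can be compared directly to see which contacts the ligands share. The sketch below uses the residues exactly as listed for each chain. The helper names are illustrative.

```python
def parse_residues(text):
    """Split a whitespace-separated residue list such as 'LYS42 ASN43' into a set."""
    return set(text.split())

def residue_number(residue):
    """Numeric part of a label such as 'ARG51' (three-letter name, then number)."""
    return int(residue[3:])

topotecan = {
    "A": parse_residues("LYS42 ASN43 PRO45 ASP46 PRO47 ILE48 THR49 ARG51 ILE54 PHE58 GLU62 ASN107"),
    "B": parse_residues("ARG51 GLU52 SER53 ILE54 LEU56 PRO57 ARG83 THR87 HIS88 GLU106 ARG109 "
                        "ALA110 SER111 GLY112 THR115 ARG116 PRO117"),
}
cytoxan = {
    "A": parse_residues("PRO45 ASP46 PRO47 ILE48 THR49 ARG51 ILE54 PHE58"),
    "B": parse_residues("ARG51 GLU52 SER53 LEU56 PRO57 GLU86 ALA110 SER111 GLY112 TYR113 THR115 ARG116"),
}

# Set operations give the shared and ligand-specific contacts per chain.
shared = {chain: topotecan[chain] & cytoxan[chain] for chain in topotecan}
topotecan_only = {chain: topotecan[chain] - cytoxan[chain] for chain in topotecan}
cytoxan_only = {chain: cytoxan[chain] - topotecan[chain] for chain in topotecan}

for chain in sorted(shared):
    print(chain, "shared:", sorted(shared[chain], key=residue_number))
    print(chain, "Topotecan only:", sorted(topotecan_only[chain], key=residue_number))
    print(chain, "Cytoxan only:", sorted(cytoxan_only[chain], key=residue_number))
```

On chain A every Cytoxan contact is also a Topotecan contact. On chain B the two ligands share ten residues, and GLU86 and TYR113 appear only in the Cytoxan list.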
Figure 11: Molecular docking of Topotecan using CB-Dock2: distribution of hydrophobic areas (green and yellow)
Figure 12: Identification of domains: chain A in yellow and chain B in red, on a white background
Figure 13: ERRAT error values and quality factor for chain A. The error values are below the 95% limit, which implies that the structure is of good enough quality to use for analysis.
ERRAT error values and quality factor for chain B. The error values are below the 95% limit, which implies that the structure is of good enough quality to use for analysis.
Figure 14: Clustal Omega multiple sequence alignment between 2QS9 and 7OEX. The high number of blue residues indicates the acidic nature of the proteins.
Figure 15: Protein change representation in PDBSum with change in position Asp102 variant
Figure 16: Interface statistics between chains A and B of the 2QS9 protein. The interaction between the chains is non-bonded.
Given below is the schematic diagram of interactions between protein chains. Interacting chains are joined by coloured lines, each representing a different type of interaction. The area of each circle is proportional to the surface area of the corresponding protein chain. The extent of the interface region on each chain is represented by the black wedge, whose size indicates the interface surface area.
Figure 17: Protein-Protein interaction residues of 2QS9 between chain A and chain B showing non-bonded contacts.
Residue interactions across interface
Coloured by residue type showing non-bonded contacts (protein-protein analysis in the PDB web server).
Residue analysis:
Positive – Arg51
Negative – Glu62
Neutral – Ser53
Proline – Pro45 and Pro57
Aliphatic – Ile54
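The residue classes above can be assigned programmatically. The sketch below assumes a PDBsum-style grouping of amino acids by type. The exact membership of each class is an assumption and should be checked against the legend of the tool that produced the figure.

```python
# Assumed residue-type grouping (PDBsum-style); verify against the figure legend.
RESIDUE_CLASSES = {
    "positive": {"HIS", "LYS", "ARG"},
    "negative": {"ASP", "GLU"},
    "neutral": {"SER", "THR", "ASN", "GLN"},
    "aliphatic": {"ALA", "VAL", "LEU", "ILE", "MET"},
    "aromatic": {"PHE", "TYR", "TRP"},
    "proline/glycine": {"PRO", "GLY"},
    "cysteine": {"CYS"},
}

def classify(residue):
    """Return the class of a residue label such as 'ARG51' (name is the first 3 letters)."""
    name = residue[:3].upper()
    for label, names in RESIDUE_CLASSES.items():
        if name in names:
            return label
    return "unknown"

# Interface residues named in the residue analysis.
for residue in ["ARG51", "GLU62", "SER53", "PRO45", "PRO57", "ILE54"]:
    print(residue, classify(residue))
```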
Figure 18: Functional domain analysis using InterProScan. The 2QS9 protein has Ser_hydrolase as its representative domain and belongs to the hydrolase_RBBP9/YdeN family.
Figure 19: Gene view histogram of mutations across RBBP9
Figure 20: 3D structure view of RBBP9 from the COSMIC database
Figure 21: The red site in the 3D structure of RBBP9 corresponds to the maximum mutation frequency
Figure 22: Ramachandran Plot of 2QS9 using Biopython.
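A Ramachandran plot places each residue at its backbone dihedral angles: phi, defined by the atoms C(i-1), N(i), CA(i), C(i), and psi, defined by N(i), CA(i), C(i), N(i+1). The sketch below shows how one such dihedral angle is computed from four atom positions using only the standard library. It is a generic illustration rather than the Biopython routine used for the figure, and the coordinates are hypothetical.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four points, in (-180, 180]."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal of the plane through p0, p1, p2
    n2 = _cross(b2, b3)  # normal of the plane through p1, p2, p3
    b2_len = math.sqrt(_dot(b2, b2))
    y = _dot(b2, _cross(n1, n2))
    x = b2_len * _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

# Hypothetical points chosen so that the expected angles are easy to verify.
print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))  # quarter turn
print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1)))  # cis arrangement
```

Applying the same function to consecutive backbone atoms of every residue gives the (phi, psi) pairs that are plotted against the energetically allowed regions.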
Figure 23: Protein contact map of 2QS9 using Biopython. This map shows the interactions between amino acids within the protein. The yellow spots on the map represent amino acids in contact. Spots that lie close to each other indicate strong interactions, whereas scattered spots indicate weak ones.
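A contact map of this kind is usually built by marking every residue pair whose representative atoms lie within a distance cutoff. The sketch below assumes one alpha-carbon coordinate per residue and an 8 Å cutoff. Both choices are assumptions and may differ from the settings used for the figure. The coordinates are hypothetical.

```python
import math

def contact_map(ca_coords, cutoff=8.0):
    """Binary matrix: 1 where two residues' CA atoms lie within cutoff Å, else 0."""
    n = len(ca_coords)
    return [
        [1 if math.dist(ca_coords[i], ca_coords[j]) <= cutoff else 0 for j in range(n)]
        for i in range(n)
    ]

# Hypothetical CA positions in Å for four residues (illustration only).
ca_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (20.0, 0.0, 0.0)]
cmap = contact_map(ca_coords)
for row in cmap:
    print(row)
```

The matrix is symmetric with ones on the diagonal. Contacts near the diagonal come from residues that are close in sequence, while off-diagonal contacts come from residues that are distant in sequence but close in space.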
Figure 24: Phylogenetic tree using Biopython. The tree shows that 2QS9 is similar and related to 7OEX.
The phylogenetic score between 2QS9 and 7OEX, computed using Biopython, is 190.
When applied to retinoblastoma, next-generation sequencing (NGS) offers a thorough understanding of the disease's genetic makeup, facilitating precise diagnosis, molecular classification, and tailored treatment plans. It aids in structural validation, understanding tumour heterogeneity, differentiating between sporadic and inherited cases, and identifying possible therapeutic targets. Consequently, NGS has become a vital tool for improving the care and outcomes of retinoblastoma patients.