Biology:Computational Resource for Drug Discovery

From HandWiki

Computational Resources for Drug Discovery (CRDD) is one of the important silico modules of Open Source for Drug Discovery (OSDD). The CRDD web portal provides computer resources related to drug discovery on a single platform. It provides computational resources for researchers in computer-aided drug design, a discussion forum, and resources to maintain Wikipedia related to drug discovery, predict inhibitors, and predict the ADME-Tox property of molecules One of the major objectives of CRDD is to promote open source software in the field of chemoinformatics and pharmacoinformatics.

Features

Under CRDD, all the resources related to computer-aided drug design have been collected and compiled. These resources are organized and presented on CRDD so users can get resources from a single source.

  • Target identification provides the resources important for searching drug targets with information on genome annotation, proteome annotation, potential targets, and protein structure
  • Virtual screening compiles the resources important for virtual screening as QSAR techniques, docking QSAR, chemoinformatics, and siRNA/miRNA
  • Drug design provides the resources important for designing drug inhibitors/molecules as lead optimization, pharmainformatics, ADMET, and clinical informatics

Community contribution

Under this category platform has been developed where community may contribute in the process of drug discovery.

  • DrugPedia: A Wikipedia for Drug Discovery is a Wiki created for collecting and compiling information related to computer-aided drug design. It is developed under the umbrella of Open Source Drug Discovery (OSDD) project and covers wide range of subjects around drugs like Bioinformatics, Cheminfiormatics, clinical informatics etc.
  • Indipedia: A Wikipedia for India is a Wiki for collecting and compiling drug information related to India. It is intended is to provide comprehensive information about India created for Indians by Indians. It is developed under the umbrella of Open Source Drug Discovery (OSDD) project.
  • The CRDD Forum was launched to discuss the challenge in developing computational resources for drug discovery.

Indigenous development: software and web services

Beside collecting and compiling resources, CRDD members develop new software and web services. All services developed are free for academic use. The following are a few major tools developed at CRDD.[citation needed]

Development of databases

  • HMRBase: It is a manually curated database of Hormones and their Receptors. It is a compilation of sequence data after extensive manual literature search and from publicly available databases. HMRbase can be searched on the basis of a variety of data types. Owing to the high impact of endocrine research in the biomedical sciences, the Hmrbase could become a leading data portal for researchers. The salient features of Hmrbase are hormone-receptor pair-related information, mapping of peptide stretches on the protein sequences of hormones and receptors, Pfam domain annotations, categorical browsing options, online data submission.[1] This database integrated in drugpedia so public can contribute.
  • BIAdb: A Database for Benzylisoquinoline Alkaloids. The Benzylisoquinoline Alkaloid Database is an attempt to gather the scattered information related to the BIA's. Many BIA's show therapeutic properties and can be considered as potent drug candidates. This database will also serve researchers working in the field of synthetic biology, as developing medicinally important alkaloids using synthetic process are one of important challenges. This database integrated in drugpedia so public can contribute.[2]
  • AntigenDB: This database contain more than 500 antigens collected from literature and other immunological resources. These antigens come from 44 important pathogenic species. In AntigenDB, a database entry contains information regarding the sequence, structure, origin, etc. of an antigen with additional information such as B and T-cell epitopes, MHC binding, function, gene-expression and post translational modifications, where available. AntigenDB also provides links to major internal and external databases.[3]
  • PolysacDB: The PolysacDB is dedicated to provide comprehensive information about antigenic polysaccharides of microbial origin (bacterial and fungal), antibodies against them, proposed epitopes, structural detail, proposed functions, assay system, cross-reactivity related information and much more. It is a manually curated database where most of data has been collected from PubMed and PubMed Central literature databases.
  • TumorHope: TumorHope is a manually curated comprehensive database of experimentally characterized tumor homing peptides. These peptides recogninze tumor tissues and tumor associated micro environment, including tumor metastasis.
  • ccPDB: The ccPDB database is designed to provide service to scientific community working in the field of function or structure annoation of proteins. This database of datasets is based on Protein Data Bank (PDB), where all datasets were derived from PDB.[4]
  • OSDDchem: OSDDChem chemical database is an open repository of information on synthesised, semi-synthesized, natural and virtually designed molecules from the OSDD community.
  • CancerDR: CancerDR is a database of 148 anticancer drugs and their effectiveness against around 1000 cancer cell lines. These are target based drugs, CancerDR maintain comprehensive information about these drugs their target gene/protein and cell lines.

Software developed

  • MycoTB: In order to assist scientific community, we extended flexible system concept for building standalone software MycoTB for Windows Users. MycoTB is one of the computer program developed under OSDD/CRDD programme. This software allow user to build their own flexible system on their personal computers to manage and annotate whole proteome of MycoTB.

Resources created

  • CRAG: Computational resources for assembling genomes (CRAG) has been to assist the users in assembling of genomes from short read sequencing (SRS). Following major objective; i) Collection and compilation of computation resources, ii) Brief description of genome assemblers, iii) Maintaining SRS and related data, iv) Service to community to assemble their genomes
  • CRIP: Computational resources for predicting protein–macromolecular interactions (CRIP) developed to provide resources related interaction. This site maintain large number of resources on interaction world of proteins that includes, protein–protein, protein–DNA, protein–ligand, protein–RNA.
  • BioTherapi: Bioinformatics for Therapeutic Peptides and Proteins (BioTherapi) developed for researchers working in the field of protein/peptide therapeutics. At present there is no single platform that provide this kind of information. This site include all the relevant information about the use of Peptides/Proteins in drug and synthesis of new peptides. It also cover problems, in their formulation, synthesis and delivery process
  • HivBio: HIV Bioinformatics (HIVbio) site contains various types of information on Human Immunodeficiency Virus (HIV) life cycle and Infection.
  • GDPbio: GDPbio (Genome based prediction of Diseases and Personal medicines using Bioinformatics) is the project focussed upon providing various resources related to genome analysis particularly for the prediction of disease susceptibility of a particular individual and personalized medicines development, aiming public health improvement.
  • AminoFAT: Functional Annotation Tools for Amino Acids (AminoFAT) server is designed to serve the bioinformatics community. Aim is to develop as many as possible tools to understand function of amino acids in proteins based on protein structure in PDB. The broad knowledge of proteins function would help in the identification of noval drug targets.

Web services for chemoinformatics

First time in the world CRDD team has developed open source platform which allows users to predict inhibitors against novel M. Tuberculosis drug targets and other important properties of drug molecules like ADMET. Following are list of few servers.

  • MetaPred: A webserver for the Prediction of Cytochrome P450 Isoform responsible for Metabolizing a Drug Molecule. MetaPred Server predict metabolizing CYP isoform of a drug molecule/substrate, based on SVM models developed using CDK descriptors. This server will be helpful for researcher working in the field of drug discovery. This study demonstrates that it is possible to develop free web servers in the field of chemoinformatics. This will encourage other researchers to develop web server for public use, which may lead to decrease the cost of discovering new drug molecules.[5]
  • ToxiPred: A server for prediction of aqueous toxicity of small chemical molecules in T. pyriformis.
  • KetoDrug:A web server for binding affinity prediction of ketoxazole derivatives against Fatty Acid Amide Hydrolase (FAAH). It is a user friendly web server for the prediction of binding affinity of small chemical molecules against FAAH.
  • KiDoQ: KiDoQ, a web server has been developed to serve scientific community working in the field of designing inhibitors against Dihydrodipicolinate synthase (DHDPS), a potential drug target enzyme of a unique bacterial DAP/Lysine pathway.[6]
  • GDoQ: GDoQ (Prediction of GLMU inhibitors using QSAR and AutoDock) is an open source platform developed for predicting inhibitors against Mycobacterium tuberculosis (M.Tb) drug target N-acetylglucosamine-1-phosphate uridyltransferase (GLMU) protein. This is a potential drug target involved in bacterial cell wall synthesis. This server uses molecular docking and QSAR strategies to predict inhibitory activity value (IC50) of chemical compounds for GLMU protein.[7]
  • ROCR: The ROCR is an R package for evaluating and visualizing classifier performance . It is a flexible tool for creating ROC graphs, sensitivity/specificity curves, area under curve and precision/recall curve. The parametrization can be visualized by coloring the curve according to cutoff.
  • WebCDK: A web interface for CDK library, it is a web interface for predicting descriptors of chemicals using CDK library.
  • Pharmacokinetics: The Pharmacokinetic data analysis determines the relationship between the dosing regimen and the body's exposure to the drug as measured by the nonlinear concentration time curve. It includes a function, AUC, to calculate area under the curve. It also includes functions for half-life estimation for a biexponential model, and a two phase linear regression

Prediction and analysis of drug targets

  • RNApred: Prediction of RNAbinding proteins from ints amino acid sequence.[8]
  • ProPrint: Prediction of interaction between proteins from their amino acid sequence.[9]
  • DomPrint: Domprint is a domain-domain interaction (DDI) prediction server.
  • MycoPrint: MycoPrint is a web interface for exploration of the interactome of Mycobacterium tuberculosis H37Rv (Mtb) predicted by "Domain Interaction Mapping" (DIM) method.
  • ATPint: A server for predicting ATP interacting residues in proteins.[10]
  • FADpred: Identification of FAD interacting residues in proteins.[11]
  • GTPbinder: Prediction of protein GTP interacting residues.[12]
  • NADbinder: Prediction of NAD binding residues in proteins.[13]
  • PreMier: Designing of Mutants of Antibacterial Peptides.[14]
  • DMAP: DMAP: Designing of Mutants of Antibacterial Peptides
  • icaars:Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains [15]
  • CBtope: Prediction of Conformational B-cell epitope in a sequence from its amino acid sequence.[16]
  • DesiRM: Designing of Complementary and Mismatch siRNAs for Silencing a Gene .[17]
  • GenomeABC: A server for Benchmarking of Genome Assemblers.

References

  1. Rashid, Mamoon; Singla, Deepak; Sharma, Arun; Kumar, Manish; Raghava, Gajendra PS (2009). "Hmrbase: a database of hormones and their receptors". BMC Genomics 10: 307. doi:10.1186/1471-2164-10-307. PMID 19589147. 
  2. Singla, Deepak; Sharma, Arun; Kaur, Jasjit; Panwar, Bharat; Raghava, Gajendra PS (2010). "BIAdb: A curated database of benzylisoquinoline alkaloids". BMC Pharmacology 10: 4. doi:10.1186/1471-2210-10-4. PMID 20205728. 
  3. Ansari, H. R.; Flower, D. R.; Raghava, G. P. S. (2009). "AntigenDB: an immunoinformatics database of pathogen antigens". Nucleic Acids Research 38 (Database issue): D847. doi:10.1093/nar/gkp830. PMID 19820110. 
  4. Nucleic Acids Research, 2011
  5. Mishra, Nitish K; Agarwal, Sandhya; Raghava, Gajendra PS (2010). "Prediction of cytochrome P450 isoform responsible for metabolizing a drug molecule". BMC Pharmacology 10: 8. doi:10.1186/1471-2210-10-8. PMID 20637097. 
  6. Garg, Aarti; Tewari, Rupinder; Raghava, Gajendra PS (2010). "KiDoQ: using docking based energy scores to develop ligand based model for predicting antibacterials". BMC Bioinformatics 11: 125. doi:10.1186/1471-2105-11-125. PMID 20222969. 
  7. Singla, Deepak; Anurag, Meenakshi; dash, Debasis; Raghava, Gajendra PS (2011). "A Web Server for Predicting Inhibitors against Bacterial Target GlmU Protein". BMC Pharmacology 11: 5. doi:10.1186/1471-2210-11-5. 
  8. Kumar, M; Gromiha, MM; Raghava, GP (2010). "SVM based prediction of RNA-binding proteins using binding residues and evolutionary information". Journal of Molecular Recognition 24 (2): 303–13. doi:10.1002/jmr.1061. PMID 20677174. 
  9. Rashid, M. and Raghava, G. P. S. (2010) A simple approach for predicting protein–protein interactions. Current Protein & Peptide Science (In Press).
  10. Chauhan, JS; Mishra, NK; Raghava, GP (2009). "Identification of ATP binding residues of a protein from its primary sequence". BMC Bioinformatics 10: 434. doi:10.1186/1471-2105-10-434. PMID 20021687. 
  11. Mishra, Nitish K.; Raghava, Gajendra P. S. (2010). "Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information". BMC Bioinformatics 11: S48. doi:10.1186/1471-2105-11-S1-S48. PMID 20122222. 
  12. Chauhan, JS; Mishra, NK; Raghava, GP (2010). "Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information". BMC Bioinformatics 11: 301. doi:10.1186/1471-2105-11-301. PMID 20525281. 
  13. Ansari, HR; Raghava, GP (2010). "Identification of NAD interacting residues in proteins". BMC Bioinformatics 11: 160. doi:10.1186/1471-2105-11-160. PMID 20353553. 
  14. Agarwal (2011). "Identification of Mannose Interacting Residues Using Local Composition". PLoS ONE 6: e24039. doi:10.1371/journal.pone.0024039. PMID 21931639. 
  15. Panwar, Bharat; Raghava, Gajendra PS (2010). "Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains". BMC Genomics 11: 507. doi:10.1186/1471-2164-11-507. PMID 20860794. 
  16. Ansari, HR; Raghava, Gajendra PS (2010). "Identification of conformational B-cell Epitopes in an antigen from its primary sequence". Immunome Research 6: 6. doi:10.1186/1745-7580-6-6. PMID 20961417. 
  17. Ahmed, F; Raghava, Gajendra PS (2011). "Designing of Highly Effective Complementary and Mismatch siRNAs for Silencing a Gene". PLoS ONE 6 (8): e23443. doi:10.1371/journal.pone.0023443. PMID 21853133. 

Further reading