Biology:DisGeNET

DisGeNET

Content
Description: Gene Disease Database
Data types captured: Gene-disease associations
Organisms: Homo sapiens

Contact
Research centre: GRIB
Laboratory: IBI Group
Author(s): Laura I. Furlong and Ferran Sanz, Team Leaders
Primary citation: PMID 25877637
Release date: 2010

Access
Website: DisGeNET
Download URL: Downloads
SPARQL endpoint: DisGeNET-RDF

Miscellaneous
Software license: The DisGeNET data is made available under an Open Database License
Software versioning: DisGeNET v5.0

DisGeNET is a discovery platform designed to address a variety of questions concerning the genetic underpinnings of human diseases. It is one of the largest and most comprehensive repositories of human gene-disease associations (GDAs) currently available.[1] It also offers a set of bioinformatic tools that help different kinds of users analyse these data. DisGeNET is maintained by the Integrative Biomedical Informatics (IBI) Group of GRIB (IMIM/UPF), based at the Barcelona Biomedical Research Park (PRBB), Barcelona, Spain.

Scope and access

To gather the different aspects of current knowledge on the genetic basis of human diseases, DisGeNET covers all disease areas (Mendelian, complex and environmental diseases). It integrates more than 400 000 genotype-phenotype relationships from different origins, each annotated with explicit provenance and evidence information, which makes it a knowledge- and evidence-based discovery resource for translational research. DisGeNET is an open access resource that provides a comprehensive knowledge base on disease genes together with tools for their exploitation and analysis. It is available through a Web interface, a Cytoscape plugin,[2] and as linked data for the Semantic Web, and it supports programmatic access to its data. This set of tools allows users to investigate the molecular mechanisms underlying diseases of genetic origin,[3] and is designed to support data exploitation from different perspectives and to meet the needs of different types of users, including bioinformaticians, biologists and healthcare practitioners.

Integrated data

The DisGeNET database integrates over 400 000 associations between more than 17 000 genes and more than 14 000 diseases. These come from expert-curated databases, covering both human and animal-model data, combined with text-mined GDAs extracted from MEDLINE using an NLP-based approach.[4] The highlights of DisGeNET are its data integration, its standardisation and its fine-grained tracking of provenance information. Integration is performed by mapping gene and disease vocabularies and by using the DisGeNET association type ontology. Furthermore, GDAs are organised by type and level of evidence as CURATED, PREDICTED and LITERATURE, and each is scored according to its supporting evidence so that associations can be prioritised and explored more easily.
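The grouping by evidence level and the score-based prioritisation described above can be illustrated with a minimal Python sketch. The record layout, the `prioritise` helper, the gene and disease names, and the score values are all hypothetical placeholders chosen for illustration; they are not DisGeNET's actual schema, data or scoring formula.

```python
from dataclasses import dataclass

# Evidence levels named in the text; everything else here is illustrative.
EVIDENCE_LEVELS = ("CURATED", "PREDICTED", "LITERATURE")


@dataclass(frozen=True)
class GDA:
    gene: str       # hypothetical gene symbol
    disease: str    # hypothetical disease label
    evidence: str   # one of EVIDENCE_LEVELS
    score: float    # made-up score, assumed to lie in [0, 1]


def prioritise(gdas, evidence=None, min_score=0.0):
    """Keep GDAs of the given evidence level and return them best-score first."""
    if evidence is not None and evidence not in EVIDENCE_LEVELS:
        raise ValueError(f"unknown evidence level: {evidence}")
    kept = [
        g for g in gdas
        if (evidence is None or g.evidence == evidence) and g.score >= min_score
    ]
    return sorted(kept, key=lambda g: g.score, reverse=True)


# Placeholder records, not real associations.
records = [
    GDA("GENE_A", "Disease X", "CURATED", 0.80),
    GDA("GENE_B", "Disease X", "LITERATURE", 0.10),
    GDA("GENE_C", "Disease Y", "PREDICTED", 0.35),
    GDA("GENE_A", "Disease Y", "CURATED", 0.45),
]

for g in prioritise(records, evidence="CURATED"):
    print(g.gene, g.disease, g.score)
```

Filtering first and sorting second keeps the two ideas separate: the evidence level answers "how was this association obtained", while the score orders associations within whatever subset the user has selected.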

The DisGeNET Association Type Ontology

To integrate gene-disease association data seamlessly, the DisGeNET team developed the DisGeNET association type ontology. All association types found in the original source databases are represented as ontological classes and are formally structured under a parent GeneDiseaseAssociation class, which applies whenever there is a relationship between the gene or protein and the disease. It is an OWL ontology that is integrated into the Semanticscience Integrated Ontology (SIO), which provides essential types and relations for the rich description of objects, processes and their attributes.[5] The gene-disease association classes can be browsed within SIO.

Cytoscape plugin

The DisGeNET Cytoscape plugin[2] offers a network representation of the gene-disease associations. It represents them as bipartite graphs and additionally provides gene-centric and disease-centric views of the data. Through a variety of built-in functions, it assists the user in interpreting and exploring human complex diseases with respect to their genetic origin. Queries in the plugin can be restricted to (i) the original data source, (ii) the association type, (iii) the disorder class of interest and (iv) specific diseases or genes.
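The relationship between a bipartite gene-disease graph and its gene-centric and disease-centric views can be sketched in plain Python. This is one common way to derive such views (a one-mode projection, linking two genes when they share a disease and vice versa); it is an assumption for illustration and not necessarily how the plugin builds them. The association pairs and the function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical gene-disease pairs; placeholders, not DisGeNET records.
associations = [
    ("GENE_A", "Disease X"),
    ("GENE_B", "Disease X"),
    ("GENE_A", "Disease Y"),
    ("GENE_C", "Disease Y"),
    ("GENE_C", "Disease Z"),
]


def build_bipartite(pairs):
    """Index the bipartite graph from both sides."""
    gene_to_diseases = defaultdict(set)
    disease_to_genes = defaultdict(set)
    for gene, disease in pairs:
        gene_to_diseases[gene].add(disease)
        disease_to_genes[disease].add(gene)
    return gene_to_diseases, disease_to_genes


def project(neighbours):
    """Link two nodes of one kind when they share a node of the other kind.

    `neighbours` maps each shared node to the set of nodes it connects.
    Returns {(node_a, node_b): set of shared nodes}.
    """
    edges = defaultdict(set)
    for shared, nodes in neighbours.items():
        for a, b in combinations(sorted(nodes), 2):
            edges[(a, b)].add(shared)
    return dict(edges)


gene_to_diseases, disease_to_genes = build_bipartite(associations)
gene_view = project(disease_to_genes)     # genes linked by shared diseases
disease_view = project(gene_to_diseases)  # diseases linked by shared genes

print(gene_view)
print(disease_view)
```

Keeping the set of shared nodes on each projected edge preserves the evidence for the link, so a gene-gene edge can still be traced back to the diseases that produced it.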

Linked Data

The information contained in DisGeNET can also be expanded and complemented using Semantic Web technologies and linked to a variety of resources already present in the Linked Open Data cloud. DisGeNET is distributed as RDF and Nanopublication linked datasets. The DisGeNET-RDF linked dataset is an alternative way to access the DisGeNET data, and it opens new opportunities for querying the data and for integrating it with external RDF datasets. The RDF and Nanopublication distributions of DisGeNET were developed in the context of the Open PHACTS project to provide disease-relevant information to its knowledge base on pharmacological data.
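The idea of integrating RDF data with an external dataset can be illustrated with a small sketch that treats both datasets as lists of subject-predicate-object triples and joins them on a shared gene identifier. Every identifier and predicate below (the `ex:` prefix, `ex:refersToGene`, `ex:targets` and so on) is a hypothetical placeholder, not the vocabulary used by DisGeNET-RDF. A real setup would more likely use an RDF library and SPARQL; plain Python is used here only to keep the example dependency-free.

```python
# Hypothetical triples standing in for a gene-disease association dataset.
GDA_TRIPLES = [
    ("ex:gda1", "ex:refersToGene", "ex:gene/GENE_A"),
    ("ex:gda1", "ex:refersToDisease", "ex:disease/X"),
    ("ex:gda2", "ex:refersToGene", "ex:gene/GENE_B"),
    ("ex:gda2", "ex:refersToDisease", "ex:disease/X"),
]

# Hypothetical triples standing in for an external pharmacological dataset.
EXTERNAL_TRIPLES = [
    ("ex:compound/C1", "ex:targets", "ex:gene/GENE_A"),
    ("ex:compound/C2", "ex:targets", "ex:gene/GENE_Z"),
]


def match(triples, s=None, p=None, o=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def compounds_for_disease(disease):
    """Join both datasets: disease -> association -> gene -> compound."""
    merged = GDA_TRIPLES + EXTERNAL_TRIPLES
    results = set()
    for gda, _, _ in match(merged, p="ex:refersToDisease", o=disease):
        for _, _, gene in match(merged, s=gda, p="ex:refersToGene"):
            for compound, _, _ in match(merged, p="ex:targets", o=gene):
                results.add((compound, gene))
    return sorted(results)


print(compounds_for_disease("ex:disease/X"))
```

The join works only because both datasets use the same identifier for the gene, which is the practical point of publishing data as linked data with shared URIs.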

References

  1. Piñero, J.; Queralt-Rosinach, N.; Bravo, A.; Deu-Pons, J.; Bauer-Mehren, A.; Baron, M.; Sanz, F.; Furlong, L. I. (15 April 2015). "DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes". Database 2015: bav028. doi:10.1093/database/bav028. PMID 25877637.
  2. Bauer-Mehren, A.; Rautschka, M.; Sanz, F.; Furlong, L. I. (15 November 2010). "DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks". Bioinformatics 26 (22): 2924–6. doi:10.1093/bioinformatics/btq538. PMID 20861032.
  3. Bauer-Mehren, A.; Bundschus, M.; Rautschka, M.; Mayer, M. A.; Sanz, F.; Furlong, L. I. (14 June 2011). "Gene-disease network analysis reveals functional modules in mendelian, complex and environmental diseases". PLoS ONE 6 (6): e20284. doi:10.1371/journal.pone.0020284. PMID 21695124.
  4. Bravo, À.; Piñero, J.; Queralt-Rosinach, N.; Rautschka, M.; Furlong, L. I. (21 February 2015). "Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research". BMC Bioinformatics 16 (55): 1–39. doi:10.1186/s12859-015-0472-9. PMID 25886734.
  5. Dumontier, Michel; Baker, Christopher JO; Baran, Joachim; Callahan, Alison; Chepelev, Leonid; Cruz-Toledo, José; Del Rio, Nicholas R; Duck, Geraint et al. (2014). "The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery". Journal of Biomedical Semantics 5 (1): 14. doi:10.1186/2041-1480-5-14. PMID 24602174.