Biology:ChEMBL

ChEMBL
Content
Description	Biological database
Data types; captured	Molecules with drug-like properties and biological activity
Contact
Research centre	European Molecular Biology Laboratory
Laboratory	European Bioinformatics Institute
Author(s)	Andrew Leach, Team Leader 2016-Present; John Overington, Team Leader 2008-2015
Primary citation	PMID 21948594
Release date	2009
Access
Website	ChEMBL
Download URL	Downloads
Web service URL	ChEMBL Webservices
Sparql endpoint	ChEMBL EBI-RDF Platform
Miscellaneous
Software license	The ChEMBL data is made available on a Creative Commons Attribution-Share Alike 3.0 Unported Licence
Software versioning	ChEMBL_28

Short description: Chemical database of bioactive molecules also having drug-like properties

ChEMBL or ChEMBLdb is a manually curated chemical database of bioactive molecules with drug inducing properties.^[1] It is maintained by the European Bioinformatics Institute (EBI), of the European Molecular Biology Laboratory (EMBL), based at the Wellcome Trust Genome Campus, Hinxton, UK.

The database, originally known as StARlite, was developed by a biotechnology company called Inpharmatica Ltd. later acquired by Galapagos NV. The data was acquired for EMBL in 2008 with an award from The Wellcome Trust,^[2] resulting in the creation of the ChEMBL chemogenomics group at EMBL-EBI, led by John Overington.^[3]^[4]

Scope and access

The ChEMBL database contains compound bioactivity data against drug targets. Bioactivity is reported in Ki, Kd, IC50, and EC50.^[5] Data can be filtered and analyzed to develop compound screening libraries for lead identification during drug discovery.^[6]

ChEMBL version 2 (ChEMBL_02) was launched in January 2010, including 2.4 million bioassay measurements covering 622,824 compounds, including 24,000 natural products. This was obtained from curating over 34,000 publications across twelve medicinal chemistry journals. ChEMBL's coverage of available bioactivity data has grown to become "the most comprehensive ever seen in a public database.".^[3] In October 2010 ChEMBL version 8 (ChEMBL_08) was launched, with over 2.97 million bioassay measurements covering 636,269 compounds.^[7]

ChEMBL_10 saw the addition of the PubChem confirmatory assays, in order to integrate data that is comparable to the type and class of data contained within ChEMBL.^[8]

ChEMBLdb can be accessed via a web interface or downloaded by File Transfer Protocol. It is formatted in a manner amenable to computerized data mining, and attempts to standardize activities between different publications, to enable comparative analysis.^[1] ChEMBL is also integrated into other large-scale chemistry resources, including PubChem and the ChemSpider system of the Royal Society of Chemistry.

Associated resources

In addition to the database, the ChEMBL group have developed tools and resources for data mining.^[9] These include Kinase SARfari, an integrated chemogenomics workbench focussed on kinases. The system incorporates and links sequence, structure, compounds and screening data.

GPCR SARfari is a similar workbench focused on GPCRs, and ChEMBL-Neglected Tropical Diseases (ChEMBL-NTD) is a repository for Open Access primary screening and medicinal chemistry data directed at endemic tropical diseases of the developing regions of the Africa, Asia, and the Americas. The primary purpose of ChEMBL-NTD is to provide a freely accessible and permanent archive and distribution centre for deposited data.^[3]

July 2012 saw the release of a new malaria data service , sponsored by the Medicines for Malaria Venture (MMV), aimed at researchers around the globe. The data in this service includes compounds from the Malaria Box screening set, as well as the other donated malaria data found in ChEMBL-NTD.

myChEMBL, the ChEMBL virtual machine, was released in October 2013 to allow users to access a complete and free, easy-to-install cheminformatics infrastructure.

In December 2013, the operations of the SureChem patent informatics database were transferred to EMBL-EBI. In a portmanteau, SureChem was renamed SureChEMBL.

2014 saw the introduction of the new resource ADME SARfari - a tool for predicting and comparing cross-species ADME targets.^[10]

References

↑ ^1.0 ^1.1 Gaulton, A (2011). "ChEMBL: a large-scale bioactivity database for drug discovery". Nucleic Acids Research 40 (Database issue): D1100-7. doi:10.1093/nar/gkr777. PMID 21948594.
↑ "Open access drug discovery database launches with half a million compounds | Wellcome". wellcome.ac.uk. 18 January 2010. https://wellcome.ac.uk/press-release/open-access-drug-discovery-database-launches-half-million-compounds.
↑ ^3.0 ^3.1 ^3.2 Bender, A (2010). "Databases: Compound bioactivities go public". Nature Chemical Biology 6 (5): 309. doi:10.1038/nchembio.354.
↑ Overington J (April 2009). "ChEMBL. An interview with John Overington, team leader, chemogenomics at the European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory (EMBL-EBI). Interview by Wendy A. Warr". J. Comput.-Aided Mol. Des. 23 (4): 195–8. doi:10.1007/s10822-009-9260-9. PMID 19194660. Bibcode: 2009JCAMD..23..195W.
↑ Mok, N. Yi; Brenk, Ruth (Oct 24, 2011). "Mining the ChEMBL Database: An Efficient Chemoinformatics Workflow for Assembling an Ion Channel-Focused Screening Library". J. Chem. Inf. Model. 51 (10): 2449–2454. doi:10.1021/ci200260t. PMID 21978256.
↑ Brenk, R; Schinpani, A; James, D; Krasowski, A (Mar 2008). "Lessons learnt from assembling screening libraries for drug discovery for neglected diseases". ChemMedChem 3 (3): 435–44. doi:10.1002/cmdc.200700139. PMID 18064617.
↑ ChEMBL-og (15 November 2010), ChEMBL_08 Released, http://chembl.blogspot.com/2010/11/chembl08-released.html, retrieved 2010-11-15
↑ ChEMBL-og (6 June 2011), ChEMBL_10 Released, http://chembl.blogspot.com/2011/06/chembl-10-released.html, retrieved 2011-06-09
↑ Bellis, L J (2011). "Collation and data-mining of literature bioactivity data for drug discovery.". Biochemical Society Transactions 39 (5): 1365–1370. doi:10.1042/BST0391365. PMID 21936816.
↑ Davies, M (2015). "ADME SARfari: Comparative Genomics of Drug Metabolising Systems.". Bioinformatics 31 (10): 1695–1697. doi:10.1093/bioinformatics/btv010. PMID 25964657.

External links

ChEMBLdb
Kinase SARfari
ChEMBL-Neglected Tropical Disease Archive
GPCR SARfari
The ChEMBL-og Open data and drug discovery blog run by the ChEMBL team.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/ChEMBL. Read more

[Gaultom-1] 1.0 ^1.1 Gaulton, A (2011). "ChEMBL: a large-scale bioactivity database for drug discovery". Nucleic Acids Research 40 (Database issue): D1100-7. doi:10.1093/nar/gkr777. PMID 21948594.

[2] "Open access drug discovery database launches with half a million compounds | Wellcome". wellcome.ac.uk. 18 January 2010. https://wellcome.ac.uk/press-release/open-access-drug-discovery-database-launches-half-million-compounds.

[Bender-3] 3.0 ^3.1 ^3.2 Bender, A (2010). "Databases: Compound bioactivities go public". Nature Chemical Biology 6 (5): 309. doi:10.1038/nchembio.354.

[pmid19194660-4] Overington J (April 2009). "ChEMBL. An interview with John Overington, team leader, chemogenomics at the European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory (EMBL-EBI). Interview by Wendy A. Warr". J. Comput.-Aided Mol. Des. 23 (4): 195–8. doi:10.1007/s10822-009-9260-9. PMID 19194660. Bibcode: 2009JCAMD..23..195W.

[5] Mok, N. Yi; Brenk, Ruth (Oct 24, 2011). "Mining the ChEMBL Database: An Efficient Chemoinformatics Workflow for Assembling an Ion Channel-Focused Screening Library". J. Chem. Inf. Model. 51 (10): 2449–2454. doi:10.1021/ci200260t. PMID 21978256.

[6] Brenk, R; Schinpani, A; James, D; Krasowski, A (Mar 2008). "Lessons learnt from assembling screening libraries for drug discovery for neglected diseases". ChemMedChem 3 (3): 435–44. doi:10.1002/cmdc.200700139. PMID 18064617.

[7] ChEMBL-og (15 November 2010), ChEMBL_08 Released, http://chembl.blogspot.com/2010/11/chembl08-released.html, retrieved 2010-11-15

[8] ChEMBL-og (6 June 2011), ChEMBL_10 Released, http://chembl.blogspot.com/2011/06/chembl-10-released.html, retrieved 2011-06-09

[Bellis-9] Bellis, L J (2011). "Collation and data-mining of literature bioactivity data for drug discovery.". Biochemical Society Transactions 39 (5): 1365–1370. doi:10.1042/BST0391365. PMID 21936816.

[Davies-10] Davies, M (2015). "ADME SARfari: Comparative Genomics of Drug Metabolising Systems.". Bioinformatics 31 (10): 1695–1697. doi:10.1093/bioinformatics/btv010. PMID 25964657.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

Anonymous

Search

Biology:ChEMBL

Namespaces

More

Page actions

Contents

Scope and access

Associated resources

See also

References

External links

Navigation

Navigation

Resources

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Biology:ChEMBL

Scope and access

Associated resources

See also

References

External links

Navigation

Wiki tools

Page tools

Other projects

Categories