We are delighted to announce the release of ChEMBL 34, which includes a full update to drug and clinical candidate drug data. This version of the database, prepared on 28/03/2024 contains: 2,431,025 compounds (of which 2,409,270 have mol files) 3,106,257 compound records (non-unique compounds) 20,772,701 activities 1,644,390 assays 15,598 targets 89,892 documents Data can be downloaded from the ChEMBL FTP site: https://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl_34/ Please see ChEMBL_34 release notes for full details of all changes in this release: https://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl_34/chembl_34_release_notes.txt New Data Sources European Medicines Agency (src_id = 66): European Medicines Agency's data correspond to EMA drugs prior to 20 January 2023 (excluding ...
The Organization of Drug Discovery Data
| | | | | | | |
Comments
You could also think of using descriptors that rely on a PCA analysis of the input data you are providing (actually I could help you with some of these, although in the form of a PP component).
We are mostly interested in bulk sequence properties at the moment (so fractional composition, hydrophobicity, features, etc), so a descriptor that gives a number for an input sequence.
There are loads of other stuff that would be cool to add, antigenicity, secondary structure prediction fractions, etc.
The sort of license issues are related to use of services which are freely available for academics, but there are some restrictions for "commercial use" - for example TMHMM http://www.cbs.dtu.dk/services/TMHMM/ where there is a download version of the software for academic institutes to use. I would guess that this license doesn't really cover the setup of a derivative service, allowing access over web services. This is just one example, not highlighted for any particular reason; but we would need to get permission from a fair number of software providers.
If we do set it up we want two things 1) freely accessible to all without restriction by user type and 2) compliant with the software licenses and wishes of the original authors.
http://nar.oxfordjournals.org/content/34/suppl_2/W32.full
the webserver is located here:
http://jing.cz3.nus.edu.sg/cgi-bin/prof/prof.cgi
However I cannot tell you anything about the performance of this particular descriptor, might serve as a benchmark to your own solution?