Dear users, Since its introduction in 2015, SureChEMBL has been a database focused on chemical annotations. We extract compound structures from patent texts, images, and Molfiles when available, and register them in our database . This chemistry-first approach is even reflected in our name. However, we know that intellectual property documents capture far more than chemistry. This was illustrated by Stefan Senger in 2017 ( 10.1186/s13321-017-0214-2 ), who showed that compound–target interactions can appear years before being mentioned in the scientific literature. Our first step into biomedical annotation A few years ago, we took a first step beyond chemistry by adding annotations for genes/proteins, diseases, and mechanisms of action in the SureChEMBL UI. These were generated by an in-house Natural Language Processing (NLP) model that performed reasonably well for an initial version. Example of biomedical annotation in a pa...
ChEBI 2.0 is here! We’re excited to announce the release of ChEBI 2.0! The next generation of the Chemical Entities of Biological Interest database. If you missed the background story and the details of the redevelopment, you can catch up in our earlier blog post here . What’s Changed: The new ChEBI web interface is live at https://www.ebi.ac.uk/chebi/ Legacy SOAP web services are deprecated — please migrate to our REST APIs New ChEBI 2.0 data products (ontology, TSV, SDF, PostgreSQL dump) are available at https://ftp.ebi.ac.uk/pub/databases/chebi Submitter accounts have been migrated — you’ll need to reset your password before logging into the new submission portal Why ChEBI 2.0? The legacy ChEBI codebase had become increasingly difficult to maintain and update. ChEBI 2.0 has been rebuilt from the ground up with: Simplified PostgreSQL schema Faster search powered by Elasticsearch + RDKit Refreshed public user interface modern infrastructure (Kuberne...