Showing posts from September, 2021

Pathogen data in ChEMBL

Infectious disease is a leading cause of death globally, and bioactivity data against pathogens (fungi, bacteria, viruses, and parasites) is an important category in ChEMBL, especially in light of the ongoing pandemic. ChEMBL version 29 contains over 2 million bioactivity data points against fungal, bacterial or viral targets (covering 460,000 compounds) that are available for pathogen-related research.

How can I find pathogen data?

On the ChEMBL interface, the organism taxonomy is available as a filter that can be applied to bioactivity data. A sunburst visualisation of the organism taxonomy is also provided as an easy starting point for exploring targets according to their taxonomy. In the full database, the organism_classification table holds the underlying data and can be used in bespoke SQL queries. For example, queries may be performed to extract high-level pathogen data, such as all bioactivity data for small molecules screened against bacterial targets (example below), or more specific subsets focus