Skip to main content

UKQSAR Spring 2023 Meeting Announcement



The next UKQSAR meeting will be held at the European Bioinformatic Institute (EBI) on 20th April. This is where the ChEMBL team is also located!

It would be great to welcome you for the occasion and have a chat if you have any questions on our services!

See below for the official announcement. The meeting is free but you have to register.



Join the waiting list to be notified if a place becomes available, or register virtually to join the meeting online.


UK-QSAR Spring 2023 meeting

Thursday 20th April 2023 / 9:00 AM - 17:00 PM / Wellcome Genome Campus, Hinxton, Cambridgeshire 

The Spring UKQSAR & Cheminformatics Group Meeting will be held at the Wellcome Genome Campus, Hinxton, Cambridge, on Thursday 20th April 2022. The meeting is organised jointly by EMBL-EBI and Sosei Heptares, and as always is free. The theme this time is “Learning from data” and for the occasion three of the most relevant databases for our fields will be introduced (PubChem, ChEMBL and the Cambridge Structural Database). The afternoon sessions will be focused on protein structure-based techniques (including use of AlphaFold, and ML for virtual screening) and reaction informatics (including applications of Enamine REAL, and machine learning).



Registration is now open. Please register using the following link: 

Only registered participants will be allowed to attend. 

Registration closes at 23.30 BST on the 10th April. 



The EMBL-EBI building is located on the Wellcome Genome Campus. More details on the campus are available here.

EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK

Google map directions

Please find information on traveling to EMBL-EBI on our website.



Open registration, coffee/tea


Welcome and opening remarks

Andrew R Leach

Session 1: Data for advanced research


PubChem: a wealth of information for your research

Evan Bolton (PubChem)


EMBL-EBI's Chemical Biology Resources for Drug Discovery - what does the future hold?

Barbara Zdrazil (EMBL-EBI)


Learning from the Cambridge Structural Database

Isaac Sugden (CCDC)


Lunch and poster session

Session 2: Learning from data - Protein structure-based techniques


Applications of AlphaFold and Generative Molecular Design for GPCR SBDD

Brian Bender (Sosei Heptares)


Expanding the target coverage and accuracy of virtual screening in the age of machine learning

Marton Vass (Schrodinger)


Break and poster session

Session 3: Learning from data - Reaction informatics


Making virtual REAL: creation and application

Yurii Moroz (ChemSpace)



Lucy van Dijk (Vertex) 


Machine Learning around the Design-Make-Test Cycle

Marwin Segler (Microsoft)


Concluding remarks, poster prize

Chris de Graaf/Noel O'Boyle


  • Evan Bolton, Head Chemistry Program (National Library of Medicine)

PubChem: a wealth of information for your research”

PubChem is a massive resource.  An effective strategy to access what you need requires a little knowledge and imagination.  This talk will give you a broad overview of major ways to access information from PubChem whether you are a beginner or an expert or whether you are a technophobe or a technophile.  In addition, this talk will give you some illustrative examples to help you get a sense of how PubChem can help meet your research needs.


Suggested reading

  • Márton Vass, Principal Applications Scientist (Schrodinger)

 Expanding the target coverage and accuracy of virtual screening in the age of machine learning

Machine learning technologies are now influencing all stages of the drug discovery process. Since the advent of AlphaFold2 it has been an open question whether this or derived methods could provide starting points for structure-based drug discovery techniques. We have investigated the usability of AlphaFold2 models in virtual screening experiments using the unrefined models or their binding sites refined with Schrödinger’s IFD-MD tool. We show that unrefined structures, on average, provide enrichments on par with apo crystal structures, whereas refined structures can provide enrichments close to their holo counterparts. A combination of IFD-MD and FEP technologies can also be used on homology models and AlphaFold2 structures to uncover compound binding modes in novel targets and for structure-based hit expansion and lead optimization.


Suggested reading

  • Yurii Moroz, CEO (ChemSpace)

Making virtual REAL: creation and application.   


Suggested reading


As part of the registration process, you will be offered to submit an abstract. This will be reviewed by the organising committee, who will then confirm whether you will be able to present a poster. If you would like to present a poster, please register as soon as you can to allow us time to review your abstract and you time to create your poster before the day.

Key dates and information:

  • Abstracts can only be submitted to the organising committee at the time of registration

  • Abstracts must NOT EXCEED 300 CHARACTERS, and must be uploaded as .pdf files

  • You will be notified by 5pm on 12TH APRIL if you are successful

  • All posters need to be PRE-PRINTED as A1 size (23.4 x 33.1 inches / 594 x 841 mm)

  • Posters must be brought along ready for 9AM BST on the day of the meeting to allow enough set-up time

  • The event organisers will attach all posters to the boards on the day


We recommend the following hotels. In distance order from the venue, starting with the closest first.

Hinxton Hall Conference Centre (located on campus)

Please note that Hinxton Hall has limited availability for this date so delegates are advised to book as soon as possible.

Red Lion Hinxton (5 minute walk from campus)

HOLIDAY INN EXPRESS CAMBRIDGE-DUXFORD M11, JCT.10 (a short taxi ride away from campus)

For more information on accommodation please see our website here


Popular posts from this blog

ChEMBL 34 is out!

We are delighted to announce the release of ChEMBL 34, which includes a full update to drug and clinical candidate drug data. This version of the database, prepared on 28/03/2024 contains:         2,431,025 compounds (of which 2,409,270 have mol files)         3,106,257 compound records (non-unique compounds)         20,772,701 activities         1,644,390 assays         15,598 targets         89,892 documents Data can be downloaded from the ChEMBL FTP site: Please see ChEMBL_34 release notes for full details of all changes in this release: New Data Sources European Medicines Agency (src_id = 66): European Medicines Agency's data correspond to EMA drugs prior to 20 January 2023 (excluding vaccines). 71 out of the 882 newly added EMA drugs are only authorised by EMA, rather than from other regulatory bodies e.g.

New SureChEMBL announcement

(Generated with DALL-E 3 ∙ 30 October 2023 at 1:48 pm) We have some very exciting news to report: the new SureChEMBL is now available! Hooray! What is SureChEMBL, you may ask. Good question! In our portfolio of chemical biology services, alongside our established database of bioactivity data for drug-like molecules ChEMBL , our dictionary of annotated small molecule entities ChEBI , and our compound cross-referencing system UniChem , we also deliver a database of annotated patents! Almost 10 years ago , EMBL-EBI acquired the SureChem system of chemically annotated patents and made this freely accessible in the public domain as SureChEMBL. Since then, our team has continued to maintain and deliver SureChEMBL. However, this has become increasingly challenging due to the complexities of the underlying codebase. We were awarded a Wellcome Trust grant in 2021 to completely overhaul SureChEMBL, with a new UI, backend infrastructure, and new f

Accessing SureChEMBL data in bulk

It is the peak of the summer (at least in this hemisphere) and many of our readers/users will be on holiday, perhaps on an island enjoying the sea. Luckily, for the rest of us there is still the 'sea' of SureChEMBL data that awaits to be enjoyed and explored for hidden 'treasures' (let me know if I pushed this analogy too far). See here and  here for a reminder of SureChEMBL is and what it does.  This wealth of (big) data can be accessed via the SureChEMBL interface , where users can submit quite sophisticated and granular queries by combining: i) Lucene fields against full-text and bibliographic metadata and ii) advanced structure query features against the annotated compound corpus. Examples of such queries will be the topic of a future post. Once the search results are back, users can browse through and export the chemistry from the patent(s) of interest. In addition to this functionality, we've been receiving user requests for  local (behind the