A week ago, I had the pleasure of presenting SureChEMBL2.0 at the Cambridge Cheminformatics Network Meeting, organised by Andreas Bender and kindly hosted by the Cambridge Crystallographic Data Centre.
It was a great opportunity to introduce one of the latest freely available databases of scientifically annotated patents to a broad scientific audience.
The recording of the talk is now available online, along with the slides.
What did I cover during this 30-minute talk?
-
Why scientists should pay attention to patent data
-
Why patents are challenging to work with
-
What SureChEMBL is and what it does
-
How we identify chemical compounds in patent documents
-
What SureChEMBL 2.0 has recently introduced
-
How we annotate patents for genes/proteins and diseases
-
How we are improving the quality of structures extracted from images
-
What you can download from the SureChEMBL core datasets — and what they contain
-
Examples of queries that SureChEMBL helps answer (and there are many more!)
-
A brief introduction to a new project led by my colleague Idil Ismail, who is developing an approach to extract bioactivity data from patents
If you have any questions, feel free to contact the SureChEMBL help desk at: surechembl-help@ebi.ac.uk
Nicolas
Comments