Skip to main content


Showing posts from September, 2014

The great US patent spike on SureChEMBL

Apparently, there was a huge spike of new granted US patents released by the USPTO a few days ago. The reason? In March 2013, US patent law changed. The ‘first to invent’ became ‘first inventor to file’ for patent protection purposes (see more on this  here ). As a result, a lot of people rushed to submit applications just before the change.  Fast forward 18 months later (last week), a huge spike in USPTO granted patents is observed.  Did SureChEMBL pick that up? See below the cumulative count plot of new patent documents: And the corresponding compound count extracted from these patents: For more information on SureChEMBL, see our previous  posts . George

SureChEMBL Available Now

Followers of the ChEMBL group's activities and this blog will be aware of our involvement in the migration of the previously commercially available SureChem chemistry patent system, to a new, free-for-all system, known as SureChEMBL. Today we are very pleased to announce that the migration process is complete and the SureChEMBL website is now online. SureChEMBL provides the research community with the ability to search the patent literature using Lucene-based keyword queries and, much more importantly, chemistry-based queries. If you are not familiar with SureChEMBL, we recommend you review the content of these earlier blogposts here and here . SureChEMBL is a live system, which is continuously extracting chemical entities from the patent literature. The time it takes for a new chemical in the patent literature to become searchable in the SureChEMBL system is 1-2 days (WO patents can sometimes take a bit longer due to an additional reprocessing step). At time of writi

Papers: Literature text mining and extensions to UniChem

Two new papers from the group have just been published, both in Journal of Chemoinformatics - and of course both Open Access. The first deals with some extensions to UniChem to allow far more flexible searches. The abstract is: UniChem is a low-maintenance, fast and freely available compound identifier mapping service, recently made available on the Internet. Until now, the criterion of molecular equivalence within UniChem has been on the basis of complete identity between Standard InChIs. However, a limitation of this approach is that stereoisomers, isotopes and salts of otherwise identical molecules are not considered as related. Here, we describe how we have exploited the layered structural representation of the Standard InChI to create new functionality within UniChem that integrates these related molecular forms. The service, called ‘Connectivity Search’ allows molecules to be first matched on the basis of complete identity between the connectivity layer of their co

We're hiring! Web developer for NIH Illuminating the Druggable Genome (IDG) project

We got a prize today , so we are happy. What better way to celebrate, than to recruit someone new for the group. We have a position available for a developer to support web service development and integration for the Knowledge Management Centre part of the recently announced NIH Illuminating the Druggable Genome project, see this link for details of the job. Closing deadline for applications is 12th October 2014 .