Previously in the SureChEMBL series, we described how to access SureChEMBL data in bulk , offline and locally. So, you may ask, what is the point in using the SureChEMBL web interface ? Well, how about the unprecedented functionality that allows you to submit very granular queries by combining: i) Lucene fields against full-text and bibliographic metadata and ii) advanced structure query features against the annotated compound corpus - at the same time? Let’s see each one separately first: Lucene-powered keyword searching You may use the main text box for simple keyword-based patent searches, such as ‘Apple’, ‘diabetes’ or even ' chocolate cake ' (the patent corpus as a recipe book is a new use-case here). You will get a lot of results and probably a lot of noise. With Lucene fields, you can slice and dice a query by indicating specific patent sections and bibliographic metadata, such as date/year of filing or publication, assignee, patent classification code,...
The Organization of Drug Discovery Data
| | | | | | | |