For a long time now we have been keen to release a full and freely deployable version of the ChEMBL database with compound search capabilities built in. This has been possible in the past, but complicated by commercial licenses associated with either the databases or the chemical cartridges. There are now a number mature Open Source chemical toolkits available, such as the excellent CDK, and RDKit.
So with that brief bit of background there is now an opportunity for an intern to work in the ChEMBL group on the project for 2-3 months. The idea is will be to setup a process which:
If you are looking for internship this year and have interest in the area of cheminformatics tools and some relevant experience please get in touch (as potential interns, we appreciate you may not have years of industry experience, but we would require you to have previous experience with relational databases and be competent in at least one programming language). Mail us!
So with that brief bit of background there is now an opportunity for an intern to work in the ChEMBL group on the project for 2-3 months. The idea is will be to setup a process which:
- Creates a PostgreSQL version of the ChEMBL database (database required by RDKit).
- Install the RDKit chemical cartridge.
- Migrate this setup to Amazon Web Service public image.
- Migrate existing (or new) ChEMBL interface to run off new database and package this up into AWS image.
- Develop scripts to allow new releases of ChEMBL to be processed and uploaded as a new AWS image.
If you are looking for internship this year and have interest in the area of cheminformatics tools and some relevant experience please get in touch (as potential interns, we appreciate you may not have years of industry experience, but we would require you to have previous experience with relational databases and be competent in at least one programming language). Mail us!
Comments