Joerg has made a nice blog post on the use of Open Source software and data to analyse the occurence of antithrombotics. More specifically he was trying to answer the question Which XRay ligands are closest to the Fontaine et al. structure-activity relationship data for allowing structure-based drug design? Using Blue Obelisk tools and ChemSpider […]
Extending the REST PCA Service
I recently described a REST based service for performing PCA-based visualization of chemical spaces. By visiting a URL of the form http://rguha.ath.cx/~rguha/cicc/rest/chemspace/default/ c1ccccc1,c1ccccc1CC,c1ccccc1CCC,C(=O)C(=O),CC(=O)O one would get a HTML, plain text or JSON page containing the first two principal components for the molecules specified. With this data one can generate a simple 2D plot of the […]
The ONS Challenge & Visualizing Chemical Space
The ONSChallenge has been running for some time now and the simple web query form that tied in the data from Google Docs along with web services from IU has turned out to be pretty handy. With more and more data becoming available, I had done some initial exploratory analysis of the measured solubilities. One […]
The Speedups Keep on Coming
A while back I wrote about some updates I had made to the CDK fingerprinting code to improve performance. Recently Egon and Jonathan Alvarsson (Uppsala) had made even more improvements. Some of them are simple fixes (making a String[] final, using Set rather than List) while others are more significant (efficient caching of paths). In […]
Java Port of VFLib Works and it’s Blazing
Sometime back I described how I was porting the VFLib algorithms to Java, so that we could use it for substructure search, since the current UniversalIsomorphismTester is pretty slow for this task, in general. While I had translated the Ullman algorithm implementation of VFLib and shown that it outperformed the CDK method, it turned out […]