152
Views
37
CrossRef citations to date
0
Altmetric
Original Articles

Internet resources integrating many small-molecule databases1

, &
Pages 1-9 | Received 05 Sep 2007, Accepted 20 Oct 2007, Published online: 04 Dec 2010
 

Abstract

Footnote New data, tools and services recently made available on the web server (http://cactus.nci.nih.gov) of the Computer-Aided Drug Design (CADD) Group, NCI, NIH, developed in the context of chemoinformatics and drug development work, are presented. These tools are designed for searching for structures in very large databases of small molecules. One of them is a web service–the Chemical Structure Lookup Service (CSLS)–for very rapid structure lookup in an aggregated collection of more than 80 databases comprising more than 27 million unique structures at the time of this writing. CSLS contains pointers to the entries in toxicology-related databases, catalogues of commercially available samples, drugs, assay results data sets, and databases in several other categories. CSLS allows the user to find out very rapidly in which one(s) of all these databases a given structure occurs independent of the representation of the input structure, by making use of InChIs as well as new CACTVS hashcode-based identifiers. These latter, calculable, identifiers are designed to take into account tautomerism, different resonance structures drawn for charged species, and presence of additional fragments. They make possible fine-tunable yet rapid compound identification and database overlap analyses in very large compound collections.

1Presented at CMTPI 2007: Computational Methods in Toxicology and Pharmacology Integrating Internet Resources (Moscow, Russia, September 1–5, 2007).

Acknowledgments

The authors wish to thank Wolf-Dietrich Ihlenfeldt (Xemistry GmbH) for his support with the CACTVS chemoinformatics toolkit. This project was funded in whole or in part with federal funds from the National Cancer Institute, National Institutes of Health, under contract N01-CO-12400. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. This Research was supported in part by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research.

Notes

1Presented at CMTPI 2007: Computational Methods in Toxicology and Pharmacology Integrating Internet Resources (Moscow, Russia, September 1–5, 2007).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.