Automatic knowledge extraction from chemical structures: the case of mutagenicity prediction

Pages 365-383 | Received 17 Jun 2012, Accepted 03 Aug 2012, Published online: 28 May 2013
