443
Views
26
CrossRef citations to date
0
Altmetric
Articles

Building and deploying a cyberinfrastructure for the data-driven design of chemical systems and the exploration of chemical space

, , &
Pages 921-929 | Received 09 Jan 2018, Accepted 26 Apr 2018, Published online: 22 May 2018
 

Abstract

The use of modern data science has recently emerged as a promising new path to tackling the complex challenges involved in the creation of next-generation chemistry and materials. However, despite the appeal of this potentially transformative development, the chemistry community has yet to incorporate it as a central tool in every-day work. Our research program is designed to enable and advance this emerging research approach. It is centred around the creation of a software ecosystem that brings together physics-based modelling, high-throughput in silico screening and data analytics (i.e. the use of machine learning and informatics for the validation, mining and modelling of chemical data). This cyberinfrastructure is devised to offer a comprehensive set of data science techniques and tools as well as a general-purpose scope to make it as versatile and widely applicable as possible. It also emphasises user-friendliness to make it accessible to the community at large. It thus provides the means for the large-scale exploration of chemical space and for a better understanding of the hidden mechanisms that determine the properties of complex chemical systems. Such insights can dramatically accelerate, streamline and ultimately transform the way chemical research is conducted. Aside from serving as a production-level tool, our cyberinfrastructure is also designed to facilitate and assess methodological innovation. Both the software and method development work are driven by concrete molecular design problems, which also allow us to assess the efficacy of the overall cyberinfrastructure.

Acknowledgements

Computing time on the high-performance computing clusters ‘Rush’, ‘Alpha’, ‘Beta’ and ‘Gamma’ was provided by the University at Buffalo (UB) Center for Computational Research (CCR). The work presented here is part of MAFA’s, MH’s and YP’s PhD theses. In addition, this review mentions contributions of the following members and former members of the Hachmann group: Dr. Andrew J. Schultz (Research Assistant Professor; deep eutectic solvent and organic photovoltaics applications), Aditya Sonpal (MSc; ChemBDDB, ChemML, deep eutectic solvent and high-refractive index polymer applications), Gaurav Vishwakarma (MSc; ChemML), Po-Han Chen (MSc; ChemML), Vigneshwar Kumaran Sudalayandi Rajeswari (MSc; biodegradable polymer application), Amol Rajendra Mahajan (MSc; liquid organic hydrogen carrier application). Shirish Sivaraj (MSc; ChemBDDB), Noah A. Zydel (BSc; ChemML), Chi Hin Chan (BSc; deep eutectic solvent application), Andrew J. DeRooy (BSc; deep eutectic solvent application), Sykhere A. Brown (BSc; ChemML), Supriya Agrawal (MSc ’17; ChemBDDB), Sai Prasad Ganesh (BSc ’17; high-refractive index polymer application), Mark A. Pitman (BSc ’17; hydrolysis catalyst application), William S. Evangelista II (MEng ’16; ChemHTPS) Yujie Tian (MSc ’16; ChemML, organic semiconductor application), Mikhail Pechagin (BSc ’16; ChemML), Ching-Yen Shih (MSc ’15; ChemML) and Bryan A. Moore (BSc ’15; organic photovoltaics application).

Notes

No potential conflict of interest was reported by the authors.

Additional information

Funding

The work reviewed here has been supported by start-up funds provided through UB, as well as grants by the New York State Center of Excellence in Materials Informatics [1140384-8-75163], [1122381-12-66357], [1114099-1-66357]; the National Science Foundation CBET Energy for Sustainability and CAREER programs [CBET-1511528], [OAC-1751161]; the Army Research Office SBIR program [W15QKN-17-C-0078].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 827.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.