3,047
Views
5
CrossRef citations to date
0
Altmetric
Materials Informatics

Understanding the evolution of a de novo molecule generator via characteristic functional group monitoring

, , , ORCID Icon, , ORCID Icon & show all
Pages 352-360 | Received 02 Mar 2022, Accepted 04 May 2022, Published online: 01 Jun 2022

ABSTRACT

Recently, artificial intelligence (AI)-enabled de novo molecular generators (DNMGs) have automated molecular design based on data-driven or simulation-based property estimates. In some domains like the game of Go where AI surpassed human intelligence, humans are trying to learn from AI about the best strategy of the game. To understand DNMG’s strategy of molecule optimization, we propose an algorithm called characteristic functional group monitoring (CFGM). Given a time series of generated molecules, CFGM monitors statistically enriched functional groups in comparison to the training data. In the task of absorption wavelength maximization of pure organic molecules (consisting of H, C, N, and O), we successfully identified a strategic change from diketone and aniline derivatives to quinone derivatives. In addition, CFGM led us to a hypothesis that 1,2-quinone is an unconventional chromophore, which was verified with chemical synthesis. This study shows the possibility that human experts can learn from DNMGs to expand their ability to discover functional molecules.

GRAPHICAL ABSTRACT

1. Introduction

Recently, de novo molecular generators (DNMGs) that combine machine learning and quantum chemical calculation have been intensively developed [Citation1–6]; these approaches have led to the successful production of molecules as good as or better than those imaged by human under some circumstances [Citation1,Citation7,Citation8]. In the fields where AI surpassed human intelligence such as Shogi and the game of Go, humans are trying to learn from AI to enhance their abilities. In Shogi [Citation9], young players are actively using AI-shogi systems to develop their skills. Shin et al. [Citation10] reported that, after AlphaGo [Citation11], human Go players are more likely to take AI-like decisions. A key question here is if chemists can learn from AI-based systems such as DNMGs and improve their skills in developing functional compounds. Unfortunately, current DNMGs are almost indiscernible and not suitable for skill acquisition by chemists.

In this paper, we develop a method to distill useful knowledge from molecules generated by a DNMG. Since a DNMG continuously learns from an evaluator based on quantum chemical calculation, it gets ‘cleverer’ as time passes. We are interested in strategic evolution of a DNMG from a quasi-random strategy to more informed and targeted one. Given a time series of generated molecules, our method termed characteristic functional group monitoring (CFGM) finds statistically enriched functional groups in a sliding window. For a functional group, its occurrence probability in the generated molecules is compared to that in the training set. If the ratio of probabilities (i.e. odds ratio) is high, we regard that the functional group is focused on by the DNMG. Chemists, in turn, may be able to infer why the functional group works by means of additional experiments and simulations.

Our approach is tested in the task of generating molecules that absorb light of long wavelengths. Applications of such molecules include organic light emitting diodes [Citation12], dyes [Citation13] and solar cells [Citation14]. From the viewpoint of the molecular electronic structure theory based on quantum mechanics [Citation15], light absorption is mediated by the transition among quantised electronic structures in molecules. In principle, the light absorption of a molecule can be tuned by changing the gap between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO). Nonetheless, the relationship between light absorption and molecular structure is not fully understood; for example, it is still challenging to explore the limitation of absorption wavelength of small molecules [Citation16]. Conventionally, photofunctional molecules are often designed using a known chromophore [Citation17], i.e. the part of a molecule responsible for its color. The total number of known organic chromophores is unclear, but a reasonable bottom line would be obtained from the chromophore database by Joung et al. [Citation17] that contains 7,016 chromophores.

Our method was applied to more than 40,000 molecules generated by ChemTS [Citation8], a DNMG that combines reinforcement learning with deep learning, in the task of elongating the absorption wavelength of light, as evaluated by density functional theory (DFT) calculations. In the early stage of generation, diketone and aniline derivatives were highly enriched, but gradually quinone derivatives became more dominant. Among them, derivatives of 1,4-quinon and 1,2-quinon were outstanding. While the former is one of the most well-known chromophores [Citation18], the latter has not been recognized as a chromophore. In fact, the chromophore database [Citation17] contains the former, but not the latter. We validated 1,2-quinon’s function as a potential chromophore by synthesizing a molecule including it and measuring its absorption spectrum. Though more validation experiments would be necessary to establish 1,2-quinon as a new chromophore, our result suggests that CFGM is useful in understanding a DNMG result and can guide chemists to a new pathway of research.

2. Methods

2.1. Characteristic functional group monitoring

A DNMG uses a reward function of molecules that typically depends on quantum chemical calculation. The generated molecules are gradually optimized towards higher reward. CFGM aims to track a set of functional groups that plays main roles in reward optimization. CFGM is applicable to DNMGs that have an unlabelled training set, i.e. a set of molecules without their rewards. Such methods include variational-autoencoder-based molecule generators [Citation6], ReLeaSE [Citation4] and ChemTS [Citation2]. Genetic algorithm-based generators [Citation19] are not covered, because they do not need a training set. Most generators using generative adversarial networks (GANs) [Citation5] require a labelled training set, hence CFGM cannot be applied.

In the time series of generated molecules, we consider a sliding window of a predetermined length (for example, 100 molecules). Given a functional group f, the fraction of in-window molecules including it is described as Pf. In addition, let Ptf denote the fraction in the training set. The odds ratio is described as

PEf=Pf/Ptf.

A high value of PE indicates that the DNMG has learned to use the functional group to raise the reward. CFGM computes the odds ratios for a set of functional groups to identify which ones are enriched at the current time point. By observing the odds ratios across the entire series of generated molecules, the strategic evolution of a DNMG can be investigated.

2.2. ChemTS settings

To elongate the absorption wavelength of pure organic molecules consisting of C, N, O, and H atoms, we used ChemTS, which employs the MCTS [Citation20] algorithm and the recurrent neural network (RNN) [Citation21]. For each generated molecule, its Cartesian coordinates were obtained using the RDKit package [Citation22]. The absorption wavelength was computed using TD-DFT at the B3LYP/3-21 G* level, implemented in the Gaussian 16 package [Citation23]. The lowest 10 states for each molecule were calculated after geometry optimisation. We used the following reward function, r(I), of a generated molecule, I, in the MCTS-based search process:

rI=FIGI,
FI=tanh0.003λIθ2,
GI=tanhSAI4+12.

The reward function consists of the product of a term relating to the absorption wavelength, FI, and a term relating to synthesizability, GI. FItakes values between 0 and 1; the longer the computational wavelength, λI, of molecule I, the larger the value. θ is the comparative criterion, which is set to 400 nm in this study. For a wavelength of 400 nm, the value of FIis 0.5. GI also takes values between 0 and 1 and is calculated from the synthetic accessibility score, SAI, of molecule I, which predicts the difficulty of the synthesis.

To accelerate the MCTS search, we adopted the virtual loss strategy [Citation20] to parallelise the computation. We used the following score (ui) for each child node i in the selection step:

ui=tRivi+vli+CPivp+vlpvi+vli+1.

Here, tRi is the total reward of node i, vi is the total number of visits to node i, i is the total number of virtual visits of i (virtual loss), vli is the RNN probability of node i, and vp and vlp are the total number of visits and the total number of virtual visits of the parent node, respectively. The parameter C controls the exploration–exploitation trade-off, which is set to 2 in this study. The results for C = 1 and C = 4 showed similar tendencies in CFGM. See Section 2 of Supplementary Information for details.

3. Results and discussions

3.1. Molecule generation experiment

We demonstrate how CFGM works with ChemTS in a task of maximising the absorption wavelength. ChemTS was trained with a set of 153,253 SMILES [Citation24] strings that only consisted of H, O, N, and C elements obtained from the ZINC database [Citation25]. It generated 45,321 organic molecules, using 2048 cores over 120 h. See the method section for the reward function and other settings of ChemTS. Before focusing on functional groups, let us investigate the evolution of basic molecular properties. shows the evolution of the absorption wavelength. After generating 10,000 molecules, the average value of the absorption wavelength began to elongate and reached more than 600 nm. The maximum absorption wavelength was over 1,200 nm after generating 40,000 molecules. With an increase in the wavelength, the average HOMO/LUMO gap of the generated molecules monotonically decreases, as shown in . It is known that there is an inversely proportional correlation between the absorption wavelength and the HOMO/LUMO gap. ChemTS also clearly uses this correlation and design molecules whose HOMO/LUMO gap is greater than 1.0 eV. However, the oscillator strength (OS) of the absorption does not grow and saturates around 0.05, which is not different from the average OS of the molecules included in the training data (). After 40,000 molecules are generated, the average value of OS is degraded, in contrast to the elongation of the absorption wavelength. This trend agrees with the intuition and statistical results that the organic molecules that absorb the long-wavelength light with high intensity are rare [Citation26]. Hence, the relationship among electronic structures is reasonable. However, the relationship between the electronic structure and the molecular structure is not.

Figure 1. Evolution of molecular properties in the series of generated molecules. (a) absorption wavelength (nm) to S1 excited state, (b) HOMO/LUMO gap (eV) (c) absorption intensity (oscillator strength; OS), (d) molecular weight (g mol −1), (e) conjugate length, (f) number of aromatic rings. Average values of training and generated molecules at each step are depicted by green broken line and blue solid line, respectively. The shaded area depicts the distribution profiles of generated molecules for each property. A thin shade area represents 5%–95% of the total distribution, while a dense shade area represents 15%–75% of the total distribution in each number of generated molecules.

Figure 1. Evolution of molecular properties in the series of generated molecules. (a) absorption wavelength (nm) to S1 excited state, (b) HOMO/LUMO gap (eV) (c) absorption intensity (oscillator strength; OS), (d) molecular weight (g mol −1), (e) conjugate length, (f) number of aromatic rings. Average values of training and generated molecules at each step are depicted by green broken line and blue solid line, respectively. The shaded area depicts the distribution profiles of generated molecules for each property. A thin shade area represents 5%–95% of the total distribution, while a dense shade area represents 15%–75% of the total distribution in each number of generated molecules.

The design principle of ChemTS for molecules that absorb long-wavelength light is not dependent on the expanding molecular size. As shown in , the average molecular weight gradually increased until 40,000 molecules were generated. However, from the generation of 40,000 molecules, the average molecular weight decreased. This tendency can be observed in the average conjugate length and the number of aromatic rings, as shown in , respectively. Up to the generation of 40,000 molecules, the absorption wavelength increases with increasing conjugation length and the number of aromatic rings (). This means that the traditional strategy of longer conjugation lengths leading to longer absorption wavelengths is working well [Citation27], and ChemTS supports the same strategy. However, a slight deviation has occurred since the generation of 40,000 molecules. This means that ChemTS takes another strategy to elongate the absorption wavelength.

3.2. Strategic evolution

shows the odds ratios (PE) of several functional groups with respect to the entire series of generated molecules. Although ketone derivatives are predominantly generated (~50%), their PE value is not high (0.649). Similarly, the PE of traditional chromophore derivatives (azo, aniline) is not high. Hence, ChemTS does not regard the azo and aniline derivatives as suitable molecules for absorbing long-wavelength light. In contrast, 1,4-quinone shows a high PE of >30, despite the low percentage of generated molecules. Among the generated molecules, 1,2-quinone shows a relatively high odds ratio. This result indicates that ChemTS predicted that 1,2-quinone is an important functional group for long-wavelength absorption. Dyes with anthraquinone and 1,4-quinone structures are well known [Citation18]. However, although 1,2-quinone is considered a cofactor [Citation28] and a building block in heterocyclic synthesis [Citation29], it has received little attention as a chromophore. Among the 1,2-quinone derivatives, the PE of 1,2-naphthoquinone is the highest in , in contrast to 1,4-naphthoquinone, an isomeric derivative of 1,2-naphthoquinone.

Table 1. Functional group enrichment analysis for various functional groups and their percentage of generated molecules and training data. Odds ratio is given as PE.

The evolution of PE for the functional groups shown in indicates that ChemTS changed the strategy from ketones, diketones, and aniline to azo and quinone derivatives. As shown in , ChemTS insisted on ketone, diketone, and aniline derivatives from the initial stage to 20,000 molecule generation. From 20,000, however, PE gradually decreased. Instead, the PE of azo () and quinone () derivatives gradually increased. In particular, the PE of azo, 1,4-quinone, and 1,2-quinone derivatives suddenly increased after the generation of 40,000 molecules. This behaviour corresponds to the elongation of the absorption wavelength, as shown in . The PE values of 1,4-quinone and 1,2-quinone were considerably higher than that of azo. This indicates that 1,4-quinone is important for designing long-wavelength light absorption chromophores. Additionally, ChemTS predicted the potential of 1,2-quinone derivatives after exploring the possibilities of ketone, diketone, and aniline derivatives.

Figure 2. Odds ratio evolution of several functional groups shown in as a function of the number of generated molecules.

Figure 2. Odds ratio evolution of several functional groups shown in Table 1 as a function of the number of generated molecules.

3.3. Experimental validation

Among the 1,2-quinone derivatives, 1,2-naphthoquinone shows a high odds ratio of 78.5, and its evolution behaviour () is similar to that of 1,2-quinone, as shown in . In total, 22 different 1,2-naphthoquinone structures were generated (see Figure S9 of the ESI). Among them, we focused on 1 in , which showed the longest wavelength absorption of these molecules. Compound 1 has a structure consisting of an enol and a carbonyl skeleton attached to the 1,2-naphthoquinone skeleton; time-dependent DFT (TD-DFT) calculations at the APFD/6-31 G* level (computational level is changed for conformation) predicted the appearance of the absorption at approximately 949 nm (). For the actual synthesis, we simplified 1 by trimming the functional groups that would not be as important for their function as chromophores (). First, 2 was obtained by replacing the moiety highlighted by red in 1 with hydrogen. According to our preliminary computation, TD-DFT calculations estimated that the absorption wavelength of 2 was approximately 820 nm, which is shorter than that of 1 by 130 nm. Second, 3 was obtained by replacing the triazole group in 2 with the phenyl group that also has an π conjugate structure because the triazole group is difficult to introduce. TD-DFT calculations estimated the absorption wavelength of 3 to be 757 nm. Although enol skeletons are thermodynamically unfavoured and isomerise to their keto forms, we tried synthesising the enol but failed to isolate the desired compound. Accordingly, novel 4, in which the enol skeleton of 3 was replaced by a simple olefin, was found as a synthesisable model of the target chromophore (1). The absorption wavelength of 4 to its first excited state owing to HOMO–LUMO single electron transition is estimated to be 575 nm, which undergoes a considerable blue shift from that of 1. The nature of excitation to their first excited state is preserved (see Figure S10 of ESI) during the conversion of 1 to 4.

Figure 3. Target molecule inspired by ChemTS. (a) the generated molecule by ChemTS is 1. 2–4 molecules are synthesis models of 1. The absorption wavelength of each molecule is estimated at the APFD/6-31 G* level. Surfaces of HOMO–LUMO orbitals of 4 are drawn at an isodensity value of 0.02. (b) Retro-synthesis of 4.

Figure 3. Target molecule inspired by ChemTS. (a) the generated molecule by ChemTS is 1. 2–4 molecules are synthesis models of 1. The absorption wavelength of each molecule is estimated at the APFD/6-31 G* level. Surfaces of HOMO–LUMO orbitals of 4 are drawn at an isodensity value of 0.02. (b) Retro-synthesis of 4.

The retrosynthetic analysis of 4 is shown in . Compound 4 is prepared from 2-naphthol 5 by oxidation, which is synthesised from naphthol and an olefin through a cross-coupling reaction. As shown in , using a commercially available compound (6) as the starting material, the Suzuki–Miyaura coupling reaction with vinylboronic acid pinacol ester gave 5, which introduced the olefinic moiety into the naphthol skeleton [Citation30]. Compound 5 was oxidised by 2-iodoxybenzoic acid (IBX) to form the target compound (4) as a dark-purple solid with 95% yield [Citation31]. Compound 4 was found to be stable in air and soluble in many common solvents such as CHCl3, CH2Cl2, tetrahydrofuran, acetonitrile, and acetone. The product was characterised by NMR spectroscopy and mass spectrometry (see Figure S1-S4 of ESI).

Figure 4. Synthetic route and UV-vis spectrum of molecule 4. (a) Synthesis process of 4. (b) the black and blue curves correspond to the UV-vis absorption spectra of the solution of 4 and that of 1,2-naphthoquinone in acetonitrile (1 × 10 −5 mol L −1), respectively. The red curve shows the computational absorption spectrum of 4 obtained by TD-DFT calculation at the APFD/6-31 G* level. Photograph of a solution of 4 in acetonitrile (1 × 10 −4 mol L −1) under ambient light is also shown.

Figure 4. Synthetic route and UV-vis spectrum of molecule 4. (a) Synthesis process of 4. (b) the black and blue curves correspond to the UV-vis absorption spectra of the solution of 4 and that of 1,2-naphthoquinone in acetonitrile (1 × 10 −5 mol L −1), respectively. The red curve shows the computational absorption spectrum of 4 obtained by TD-DFT calculation at the APFD/6-31 G* level. Photograph of a solution of 4 in acetonitrile (1 × 10 −4 mol L −1) under ambient light is also shown.

The solution of 4 in acetonitrile (1 × 10−5 mol L−1) exhibited a red-purple color; its UV-vis absorption spectrum showed two peaks at 478 and 550 nm (). The corresponding two peaks of 1,2-naphthoquinone appear at 341 and 391 nm. This result shows that the substituent to 1,2-naphthoquinone in 4 contributes to red shift. The predicted result at the APFD/6-31 G* level agrees well with the experimental UV-vis absorption spectrum, where the former has a broad unimodal peak at 575 nm and the latter has the first peak at 550 nm. Although the 1,2-quinone derivatives are often unstable to air and light irradiation, we confirmed that the UV-vis spectrum of 4 did not change after its recrystallization as shown in Figure S12, hence 4 seems to be stable enough to apply as a dye.

4. Conclusion

We proposed CFGM in this paper and showed that it can elucidate strategic evolution of a DNMG and suggest an unconventional chromophore. In our experiments, ChemTS found light-absorbing molecules only from empirical observations without any knowledge of quantum chemical foundation. Early-day chemists in the 19th century did not have such knowledge either. It is intriguing that the strategy learned by ChemTS is reminiscent of Armstrong’s quinoid theory [Citation32] established in 1887. It states that only the compounds written in a quinoid form are coloured. Obviously, it is too simplistic and wrong in the eyes of current-day chemists. In the research of artificial intelligence, it is a widely asked question if AI ‘thinks’ like humans or how to make AI behave like humans [Citation33]. The observation that ChemTS behaved like a chemist in the 19th century is an encouraging sign for us. In future work, further integration of AI and quantum chemical knowledge may lead to a better system.

Data availability

Chemical formulae of 1,2-naphthoquinone compounds generated by ChemTS, computational validation of the model molecules at a high level, and details of the chemical synthesis and characterisation of products are available in the Supplementary Information. The generated 45321 molecules are listed in the ALW_ChemTS/generated_mols/result.csv file. The file contains following information: generated molecules (SMILES), calculated absorption wavelengths and their oscillator strengths, and basic information such as molecular weight.

Code availability

Our implementation of parallel version of ChemTS for fluorescent molecules is available at https://github.com/tsudalab/ALW_ChemTS.

Supplemental material

Supplemental Material

Download PDF (3.7 MB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/14686996.2022.2075240

Additional information

Funding

This study was partially supported by a project subsidised by the New Energy and Industrial Technology Development Organization (NEDO) and MEXT as Priority Issue on Post-K Computer“(Building Innovative Drug Discovery Infrastructure through Functional Control of Biomolecular Systems) and Program for Promoting Studies on the Supercomputer Fugaku” (MD-driven Precision Medicine), AMED JP20nk0101111, SIP (Technologies for Smart Bio-industry and Agriculture), JST ERATO JPMJER1903, the Core Research for Evolutional Science and Technology (CREST) program of the Japan Science and Technology Agency (JST), Japan, under Grant JPMJCR19J3. The computations in this study were performed in the supercomputer centres at the NIMS and RAIDEN of AIP (RIKEN).

References

  • Sanchez-Lengeling B, Aspuru-Guzik A. Inverse molecular design using machine learning: generative models for matter engineering. Science. 2018;361(6400):360–365.
  • Yang X, Zhang J, Yoshizoe K, et al. ChemTS: an efficient python library for de novo molecular generation. Sci Technol Adv Mater. 2017;18(1):972–976.
  • Yoshikawa N, Terayama K, Sumita M, et al. Population-Based de novo molecule generation, using grammatical evolution. Chem Lett. 2018;47(11):1431–1434. DOI:10.1246/cl.180665
  • Popova M, Isayev O, Tropsha A. Deep reinforcement learning for de novo drug design. Sci Adv. 2018 Jul;4(7):eaap7885.
  • Maziarka Ł, Pocha A, Kaczmarczyk J, et al. Mol-CycleGAN: a generative model for molecular optimization. J Cheminform. 2020;12(1). DOI:10.1186/s13321-019-0404-1.
  • Gómez-Bombarelli R, Wei JN, Duvenaud D, et al. Automatic Chemical design using a data-driven continuous representation of molecules. ACS Cent Sci. 2018;4(2):268–276. DOI:10.1021/acscentsci.7b00572
  • Zhavoronkov A, Ivanenkov YA, Aliper A, et al. Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol. 2019;379:1038–1040.
  • Sumita M, Yang X, Ishihara S, et al. Hunting for organic molecules with artificial intelligence: molecules optimized for desired excitation energies. ACS Cent Sci. 2018;4(9):1126–1133. DOI:10.1021/acscentsci.8b00213
  • Shogi Master Yoshiharu Habu Reveals How Deep Learning Is Transforming the Game. 2019 [cited 2022 Feb 11]. Available from: https://www.ricoh.com/about/empowering-digital-workplaces/articles/ricoh_AI_shogi_game_score_capturing_system
  • Shin M, Kim J, Kim M Human learning from artificial intelligence: evidence from human go players’ decisions after alphaGo. In: Proceedings of the Annual Meeting of the Cognitive Science Society; 2021. p. 1795–1801.
  • Silver D, Schrittwieser J, Simonyan K, et al. Mastering the game of go without human knowledge. Nature. 2017 Oct 18; 550(7676):354–359. DOI:10.1038/nature24270
  • Qin Y, Li G, Qi T, et al. Aromatic imide/amide-based organic small-molecule emitters for organic light-emitting diodes. Mater Chem Front. 2020;4(6):1554–1568.
  • Fabian J, Nakazumi H, Matsuoka M. Near-Infrared absorbing dyes. Chem Rev. 1992;92(6):1197–1226.
  • Hagfeldt A, Boschloo G, Sun L, et al. Dye-Sensitized solar cells. Chem Rev. 2010;110(11):6595–6663.
  • Cohen AJ, Mori-Sánchez P, Yang W. Challenges for density functional theory. Chem Rev. 2011;112(1):289–320.
  • Qian G, Wang ZY. Near-Infrared organic compounds and emerging applications. Chem Asian J. 2010;5(5):1006–1029.
  • Joung JF, Han M, Jeong M, et al. Experimental database of optical properties of organic compounds. Sci Data. 2020;7(1):1–6.
  • Dulo B, Phan K, Githaiga J, et al. Natural quinone dyes: a review on structure, extraction techniques, analysis and application potential. Waste Biomass Valorization. 2021;12(12):6339–6374. DOI: 10.1007/s12649-021-01443-9
  • Jensen JH. A graph-based genetic algorithm and generative model/monte Carlo tree search for the exploration of chemical space. Chem Sci. 2019;10(12):3567–3572.
  • Browne CB, Powley E, Whitehouse D, et al. A survey of Monte Carlo tree search methods. IEEE Trans Comput Intell AI Games. 2012;4(1):1–43. DOI:10.1109/TCIAIG.2012.2186810
  • Yu Y, Si X, Hu C, et al. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019;31(7):1235–1270. DOI:10.1162/neco_a_01199
  • Landrum G. Rdkit: Open-source cheminformatics. 2022. Available from: http://www.rdkit.org
  • Frisch MJ, Trucks GW, Schlegel HB, et al. Gaussian 16 Rev. C.01. Wallingford, CT; 2016.
  • Weininger D. SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules. J Chem Inf Model. 1988;28(1):31–36.
  • Irwin JJ, Sterling T, Mysinger MM, et al. ZINC: a free tool to discover Chemistry for Biology. J Chem Inf Model. 2012;52(7):1757–1768. DOI:10.1021/ci3001277
  • Terayama K, Sumita M, Tamura R, et al. Pushing property limits in materials discovery via boundless objective-free exploration. Chem Sci. 2020;11(23):5959–5968.
  • Guay J, Kasai P, Diaz A, et al. Chain-Length dependence of electrochemical and electronic properties of neutral and oxidized soluble.Alpha alpha.-coupled thiophene oligomers. Chem Mater. 2002;4(5):1097–1105.
  • Mishra AK, Mukhopadhyay A, Moorthy JN. One-Pot multistep synthesis of bipolar carbazolo-phenazines: hydrogen bond control of Diels-Alder cycloaddition and application for fluoride sensing. Tetrahedron. 2017;73(16):2210–2216.
  • Wendlandt AE, Stahl SS. Quinone‐catalyzed selective oxidation of organic molecules. Angewandte Chemie. 2015;54(49):14638–14658.
  • Parasram M, Iaroshenko VO, Gevorgyan V. Endo-Selective Pd-catalyzed silyl methyl heck reaction. J Am Chem Soc. 2014;136(52):17926–17929.
  • Magdziak D, Rodriguez AA, Van De Water RW, et al. Regioselective oxidation of phenols to o-Quinones with o-Iodoxybenzoic acid (IBX). Org Lett. 2002;4(2):285–288.
  • Travis AS. Colour by design before 1890. Text Chem Colorist. 1990;22:23–27.
  • Daube C, Xu T, Zhan J, et al. Grounding deep neural network predictions of human categorization behavior in understandable functional features: the case of face identity. Patterns. 2021;2(10):100348.