References
- Andrieu, C., Doucet, A., & Holenstein, R. (2010). Particle Markov chain Monte Carlo methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 72, 269–342 (with discussion). https://doi.org/https://doi.org/10.1111/(ISSN)1467-9868
- Andrieu, C., Lee, A., & Vihola, M. (2018). Uniform ergodicity of the iterated conditional SMC and geometric ergodicity of particle Gibbs samplers. Bernoulli, 24, 842–872. https://doi.org/https://doi.org/10.3150/15-BEJ785
- Arnold, L. (1974). Stochastic differential equations: Theory and applications. Wiley.
- Bertoli, F., & Bishop, A. N. (2018a). An error analysis in the limit approximation in path integral control. Preprint.
- Bertoli, F., & Bishop, A. N. (2018b). Nonlinear stochastic receding horizon control: Stability, robustness and Monte Carlo methods for control approximation. International Journal of Control, 91, 2387–2402. https://doi.org/https://doi.org/10.1080/00207179.2017.1349340
- Chebotar, Y., Kalakrishnan, M., Yahya, A., Schaal, S., & Levine, S. (2017). Path integral guided policy search. In Proceedings 2017 IEEE international conference on robotics and automation (ICRA) (pp. 3381–3388).IEEE
- Chehrazi, N., Cipriano, L. E., & Enns, E. A. (2019). Dynamics of drug resistance: Optimal control of an infectious disease. Operations Research, 67, 619–650. https://doi.org/https://doi.org/10.1287/opre.2018.1817
- Fahim, A., Touzi, N., & Warin, X. (2011). A probabilistic numerical method for fully nonlinear parabolic PDEs. The Annals of Applied Probability, 21, 1322–1364. https://doi.org/https://doi.org/10.1214/10-AAP723
- Fleming, W. H., & Mitter, S. K. (1982). Optimal control and nonlinear filtering for nondegenerate diffusion processes. Stochastics, 8, 63–77. https://doi.org/https://doi.org/10.1080/17442508208833228
- Fleming, W. H., & Soner, H. M. (2006). Controlled Markov processes and viscosity solutions. Springer.
- Franks, J., Jasra, A., Law, K. J. H., & Vihola, M. (2018). Unbiased inference for discretely observed hidden Markov model diffusions. Arxiv preprint.
- Giles, M. B. (2008). Multilevel Monte Carlo path simulation. Operations Research, 56, 607–617. https://doi.org/https://doi.org/10.1287/opre.1070.0496
- Giles, M. B. (2015). Multilevel Monte Carlo path methods. Acta Numerica, 24, 259–328. https://doi.org/https://doi.org/10.1017/S096249291500001X
- Heinrich, S. (2001). Multilevel Monte Carlo methods. In S. Margenov, J. Wasniewski & P. Yalamov (Eds.), Large-scale scientific computing (pp. 58–67). Springer.
- Heng, J., Bishop, A. N., Deligiannidis, G., & Doucet, A. (2017). Controlled sequential Monte Carlo. Arxiv preprint.
- Heng, J., Houssineau, J., & Jasra, A. (2020). On unbiased estimation of the score function for a class of partially observed diffusion processes. Work in progress.
- Jasra, A., & Doucet, A. (2009). Sequential Monte Carlo for diffusion processes. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 465, 3709–3727. https://doi.org/https://doi.org/10.1098/rspa.2009.0206
- Jasra, A., Kamatani, K., Law, K. J. H., & Zhou, Y. (2017). Multilevel particle filters. SIAM Journal on Numerical Analysis, 55, 3068–3096. https://doi.org/https://doi.org/10.1137/17M1111553
- Jasra, A., Kamatani, K., Law, K. J. H., & Zhou, Y. (2018). Multilevel particle filters. SIAM Journal on Scientific Computing, 40, A887–A902. https://doi.org/https://doi.org/10.1137/17M1112595
- Kappen, H. J. (2005a). Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95, Article ID: 200201. https://doi.org/https://doi.org/10.1103/PhysRevLett.95.200201
- Kappen, H. J. (2005b). Path integrals and symmetry breaking for optimal control theory. Journal of Statistical Mechanics, 25(11), Article ID: 11011.https://doi.org/https://doi.org/10.1088/1742-5468/2005/11/P11011
- Kappen, H. J., Gomez, V., & Opper, M. (2012). Optimal control as a graphical model inference problem. Machine Learning, 87, 159–182. https://doi.org/https://doi.org/10.1007/s10994-012-5278-7
- Kappen, H. J., & Ruiz, H. C. (2016). Adaptive importance sampling for control and inference. Journal of Statistical Physics, 162, 1244–1266. https://doi.org/https://doi.org/10.1007/s10955-016-1446-7
- Krylov, N. V. (1972). Control of a solution of a stochastic integral equation. Theory of Probability & Its Applications, 17, 114–130. https://doi.org/https://doi.org/10.1137/1117009
- Krylov, N. V. (2008). Controlled diffusion processes. Springer.
- Kurtz, T., & Protter, P. (1991). Wong–Zakai corrections, random evolutions and simulation schemes for SDEs. In Stochastic analysis (pp. 58–67). Academic Press.
- Menchón, S. A., & Kappen, H. J. (2018). Learning effective state-feedback controllers through efficient multilevel importance samplers. International Journal of Control, 92, 2776–2783. https://doi.org/https://doi.org/10.1080/00207179.2018.1459857
- Neck, R. (1984). Stochastic control theory and operational research. European Journal of Operational Research, 17, 283–301. https://doi.org/https://doi.org/10.1016/0377-2217(84)90123-1
- Pham, H. (2009). Continuous-time stochastic control and optimization with financial applications. Springer.
- Ruiz, H. C., & Kappen, H. J. (2017). Particle smoothing for hidden diffusion processes: Adaptive path integral smoother. IEEE Transactions on Signal Processing, 65, 3191–3203. https://doi.org/https://doi.org/10.1109/TSP.2017.2686340
- Runggaldier, W. J. (1998). Concepts and methods for discrete and continuous time control under uncertainty. Insurance: Mathematics and Economics, 22(1), 25–39. https://doi.org/https://doi.org/10.1016/S0167-6687(98)00006-7
- Sethi, S. P. (1973). Optimal control of the Vidale–Wolfe advertising model. Operations Research, 21, 998–1013. https://doi.org/https://doi.org/10.1287/opre.21.4.998
- Sethi, S. P. (2019). Optimal control theory: Applications to management science and economics. Springer.
- Theodorou, E. A., Buchli, J., & Schaal, S. (2010). A generalized path integral control approach to reinforcement learning. Journal of Machine Learning Research, 11, 3137–3181. https://www.jmlr.org/papers/volume11/theodorou10a/theodorou10a.pdf
- Theodorou, E. A., & Todorov, E. (2012). Relative entropy and free energy dualities: Connections to path integral and KL control. In Proceedings of the 51st IEEE conference on decision and control (pp. 1466–1473). IEEE.
- Thijssen, S., & Kappen, H. J. (2015). Path integral control and state-dependent feedback. Physical Review E, 91, Article ID: 032104. https://doi.org/https://doi.org/10.1103/PhysRevE.91.032104
- Tornatore, E., Vetro, P., & Buccellato, S. M. (2014). SIVR epidemic model with stochastic perturbation. Neural Computing and Applications, 24, 309–315. https://doi.org/https://doi.org/10.1007/s00521-012-1225-6
- Touzi, N. (2012). Optimal stochastic control, stochastic target problems and backward stochastic differential equations. Springer.
- Witbooi, P. J., Muller, G. E., & Schalkwyk, G. J. V. (2015). Vaccination control in a stochastic SVIR epidemic model. Computational and Mathematical Methods in Medicine, 2015, Article ID: 271654. https://doi.org/https://doi.org/10.1155/2015/271654