References
- Bouakiz, M., Kebir, Y. (1995). Target-level criterion in Markov decision processes. J. Optim. Theory Appl. 86(1):1–15. DOI: https://doi.org/10.1007/BF02193458.
- Fan, K. (1953). Minimax theorems. Proc. Natl. Acad. Sci. USA. 39(1):42–47. DOI: https://doi.org/10.1073/pnas.39.1.42.
- Guo, X., Hernández-Lerma, O. (2009). Continuous-Time Markov Decision Processes, Volume 62 of Stochastic Modelling and Applied Probability. Theory and Applications. Berlin: Springer-Verlag.
- Guo, X., Piunovskiy, A. (2011). Discounted continuous-time Markov decision processes with constraints: Unbounded transition and loss rates. Math. OR. 36(1):105–132. DOI: https://doi.org/10.1287/moor.1100.0477.
- Hernández-Lerma, O., Bernard Lasserre, J. (1999). Further Topics on Discrete-Time Markov Control Processes, Volume 42 of Applications of Mathematics (New York). New York: Springer-Verlag.
- Huang, X., Guo, X. (2020). Nonzero-sum stochastic games with probability criteria. Dyn. Games Appl. 10(2):509–527. DOI: https://doi.org/10.1007/s13235-019-00317-z.
- Huang, X., Guo, X., Peng, J. (2017). A probability criterion for zero-sum stochastic games. J. Dyn. Games. 4(4):369–383. DOI: https://doi.org/10.3934/jdg.2017020.
- Huang, Y., Guo, X., Li, Z. (2013). Minimum risk probability for finite horizon semi-Markov decision processes. J. Math. Anal. Appl. 402(1):378–391. DOI: https://doi.org/10.1016/j.jmaa.2013.01.021.
- Huang, Y., Guo, X., Song, X. (2011). Performance analysis for controlled semi-Markov systems with application to maintenance. J. Optim. Theory Appl. 150(2):395–415. DOI: https://doi.org/10.1007/s10957-011-9813-7.
- Huo, H., Guo, X. (2020). Risk probability minimization problems for continuous-time Markov decision processes on finite horizon. IEEE Trans. Automat. Contr. 65(7):3199–3206. DOI: https://doi.org/10.1109/TAC.2019.2947654.
- Huo, H., Zou, X., Guo, X. (2017). The risk probability criterion for discounted continuous-time Markov decision processes. Discrete Event Dyn. Syst. 27(4):675–699. DOI: https://doi.org/10.1007/s10626-017-0257-6.
- Kira, A., Ueno, T., Fujita, T. (2012). Threshold probability of non-terminal type in finite horizon Markov decision processes. J. Math. Anal. Appl. 386(1):461–472. DOI: https://doi.org/10.1016/j.jmaa.2011.08.006.
- Nowak, A. S. (1985). Measurable selection theorems for minimax stochastic optimization problems. SIAM J. Control Optim. 23(3):466–476. DOI: https://doi.org/10.1137/0323030.
- Sakaguchi, M., Ohtsubo, Y. (2010). Optimal threshold probability and expectation in semi-Markov decision processes. Appl. Math. Comput. 216(10):2947–2958. DOI: https://doi.org/10.1016/j.amc.2010.04.007.
- Sakaguchi, M., Ohtsubo, Y. (2013). Markov decision processes associated with two threshold probability criteria. J. Control Theory Appl. 11(4):548–557. DOI: https://doi.org/10.1007/s11768-013-2194-8.
- White, D. J. (1993). Minimizing a threshold probability in discounted markov decision processes. J. Math. Anal. Appl. 173(2):634–646. DOI: https://doi.org/10.1006/jmaa.1993.1093.
- Wu, C., Lin, Y. (1999). Minimizing risk models in Markov decision processes with policies depending on target values. J. Math. Anal. Appl. 231(1):47–67. DOI: https://doi.org/10.1006/jmaa.1998.6203.