References
- Bera, S. and V. K. Shrivastava. 2020. “Analysis of Various Optimizers on Deep Convolutional Neural Network Model in the Application of Hyperspectral Remote Sensing Image Classification.” International Journal of Remote Sensing 41 (7): 2664–2683. doi:https://doi.org/10.1080/01431161.2019.1694725.
- Cui, Y., D. Wu, and J. Huang. 2020. “Optimize TSK Fuzzy Systems for Classification Problems: Mini-Batch Gradient Descent with Uniform Regularization and Batch Normalization.” IEEE Transactions on Fuzzy Systems, 1-1. doi:https://doi.org/10.1109/TFUZZ.2020.2967282
- Duchi, J., E. Hazan, and Y. Singer. 2011. “Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.” Journal of Machine Learning Research 12: 2121–2159.
- Kingma, D. P., and J. Ba. 2015. “Adam: A Method for Stochastic Optimization.” Computer Science abs/1412.6980.
- Koga, Y., H. Miyazaki, and R. Shibasaki. 2018. “A CNN-Based Method of Vehicle Detection from Aerial Images Using Hard Example Mining.” Remote Sensing 10 (1): 124. doi:https://doi.org/10.3390/rs10010124.
- Li, J., X. Li, and L. Zhao. 2017. “Unmixing of Large-Scale Hyperspectral Data Based on Projected Mini-Batch Gradient Descent.” International Journal of Wavelets Multiresolution & Information Processing 15 (06): 1750059. doi:https://doi.org/10.1142/S021969131750059X.
- Li, W. W. 2021. “Marine Target Detection for SAR Image Based on Deep Learning.“ PhD diss., Shandong University of Science and Technology.
- Louppe, G. and P. Geurts. 2010. “A Zealous Parallel Gradient Descent Algorithm.” http://hdl.handle.net/2268/80780.
- Masood, S., M. N. Doja, and P. Chandra. 2015. “Analysis of Weight Initialization Methods for Gradient Descent with Momentum.” 2015 International Conference on Soft Computing Techniques and Implementations (ICSCTI), 131–136. doi:https://doi.org/10.1109/ICSCTI.2015.7489618.
- Nesterov, Y. E. 1983. “A Method for Solving the Convex Programming Problem with Convergence Rate O (1/k^2).” Dokl.Akad.nauk Sssr 1983: 269.
- Pearlmutter, B. A. 1991. “Gradient Descent: Second-Order Momentum and Saturating Error.” Advances in Neural Information Processing Systems 4: 887–894.
- Polyak, B. T. 1964. “Some Methods of Speeding Up the Convergence of Iteration Methods.” USSR Computational Mathematics and Mathematical Physics 4 (5): 1–17. doi:https://doi.org/10.1016/0041-5553(64)90137-5.
- Postalcıoğlu, S. 2020. “Performance Analysis of Different Optimizers for Deep Learning-Based Image Recognition.” International Journal of Pattern Recognition and Artificial Intelligence 34 (02): 12. doi:https://doi.org/10.1142/S0218001420510039.
- Robbins, H. and S. Monro. 1951. “A Stochastic Approximation Method.” The Annals of Mathematical Statistics 22 (3): 400–407. doi:https://doi.org/10.1214/aoms/1177729586.
- Si, Z., S. Wen, and B. Dong. 2019. “NOMA Codebook Optimization by Batch Gradient Descent.” IEEE Access 7: 117274–117281. doi:https://doi.org/10.1109/ACCESS.2019.2936483.
- Su, W., L. Chen, M. Wu, Zhou, M., Liu, Z., and Cao, W. 2017. “Nesterov Accelerated Gradient Descent-Based Convolution Neural Network with Dropout for Facial Expression Recognition.” In 2017 11th Asian Control Conference (ASCC) , 1063–1068. doi:https://doi.org/10.1109/ASCC.2017.8287318.
- Tieleman, T., and G. Hinton. 2012. “Lecture 6.5-Rmsprop: Divide the Gradient by a Running Average of Its Recent Magnitude.” Coursera: Neural Networks for Machine Learning, Coursera Lecture 6e 4(2): 26–31 .
- Tien, B. D., H. Shahabi, E. Omidvar, Shirzadi, A., Geertsema, M., Clague, J., and Khosravi, K., et al. 2019. “Shallow Landslide Prediction Using a Novel Hybrid Functional Machine Learning Algorithm.” Remote Sensing 11 (8): 931. doi:https://doi.org/10.3390/rs11080931.
- Tu, Q., Y. Rong, and J. Chen. 2020. “Parameter Identification of ARX Models Based on Modified Momentum Gradient Descent Algorithm.” Complexity 2020 (3): 1–11. doi:https://doi.org/10.1155/2020/9537075.
- Zeiler, M. D. 2012. “Adadelta: An Adaptive Learning Rate Method.” Computer Science abs/1212.5701.
- Zhu, X., Q. Meng, and L. Gu. 2018. “Real-Time Image Recognition Using Weighted Spatial Pyramid Networks.” Journal of Real-Time Image Processing 15 (3): 617–629. doi:https://doi.org/10.1007/s11554-017-0743-y.
- Zuo, J., J. Xu, Y. Chen, and Wang, C. 2019. “Downscaling Precipitation in the Data-Scarce Inland River Basin of Northwest China Based on Earth System Data Products.” Atmosphere 10 (10): 613. doi:https://doi.org/10.3390/atmos10100613.