1,122
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Monocular vision guided deep reinforcement learning UAV systems with representation learning perception

&
Article: 2183828 | Received 05 Jun 2022, Accepted 17 Feb 2023, Published online: 08 Mar 2023

Figures & data

Table 1. Comparison of network structures used in 3 different continuous actions space DRL methods.

Figure 1. TD3.

Figure 1. TD3.

Figure 2. Single image model.

Figure 2. Single image model.

Figure 3. Convolutional variational autoencoder.

Figure 3. Convolutional variational autoencoder.

Figure 4. Structure of actor net and critic net in the model.

Figure 4. Structure of actor net and critic net in the model.

Figure 5. Continuous images model.

Figure 5. Continuous images model.

Figure 6. Structure of actor net and critic net in the model with LSTM layer.

Figure 6. Structure of actor net and critic net in the model with LSTM layer.

Figure 7. Short-term memory state construction.

Figure 7. Short-term memory state construction.

Figure 8. Short-term memory model.

Figure 8. Short-term memory model.

Table 2. Environment Settings.

Figure 9. Simulation environment.

Figure 9. Simulation environment.

Table 3. Reward function.

Figure 10. Reward comparison of single image model, continuous image model and short-term memory model during training.

Figure 10. Reward comparison of single image model, continuous image model and short-term memory model during training.

Figure 11. Average moving distance comparison of single image model, continuous image model and short-term memory model during training.

Figure 11. Average moving distance comparison of single image model, continuous image model and short-term memory model during training.

Table 4. MobileNet V2 structure.

Figure 12. Reward comparison of single image model and MobileNet V2 + TD3.

Figure 12. Reward comparison of single image model and MobileNet V2 + TD3.

Figure 13. Average moving distance comparison of single image model and MobileNet V2 + TD3.

Figure 13. Average moving distance comparison of single image model and MobileNet V2 + TD3.

Table 5. Test results of short-term memory model.

Table 6. Test results of continuous images model.

Table 7. Test results of single image model.

Table 8. Test results of Mobile Net + TD3 model.

Table 9. Test results in rearranged environment.

Table 10. Side collision count in 100 flights of 3 models.