459
Views
1
CrossRef citations to date
0
Altmetric
Articles

Robust sequential design for piecewise-stationary multi-armed bandit problem in the presence of outliers

ORCID Icon, , & ORCID Icon
Pages 122-133 | Received 03 Oct 2020, Accepted 10 Mar 2021, Published online: 12 Apr 2021

Figures & data

Figure 1. Expected rewards for arms in the simulation experiment 1.

Figure 1. Expected rewards for arms in the simulation experiment 1.

Figure 2. Expected cumulative regrets for the M-UCB and RCD-UCB with different α values in the simulation experiment 1.

Figure 2. Expected cumulative regrets for the M-UCB and RCD-UCB with different α values in the simulation experiment 1.

Figure 3. Expected rewards for arms in simulation experiment 2.

Figure 3. Expected rewards for arms in simulation experiment 2.

Figure 4. Expected cumulative regrets for M-UCB and RCD-UCB with different α values in the simulation experiment 2.

Figure 4. Expected cumulative regrets for M-UCB and RCD-UCB with different α values in the simulation experiment 2.

Table 1. Average number (AVE) and standard deviation (SD) of detected change points by each algorithm in the simulation experiment 1.

Table 2. Average cumulative regrets R(T) (AVE), standard deviations (SD) and computational time in seconds (TIME) for different algorithms in the simulation experiment 1.

Table 3. Average number (AVE) and standard deviation (SD) of detected change points by each algorithm in the simulation experiment 2.

Table 4. Average cumulative regrets R(T) (AVE), standard deviations (SD) and average computational time in seconds (TIME) for different algorithms in the simulation experiment 2.

Table 5. Average number (AVE) and standard deviation (SD) of detected change points by each algorithm in the simulation experiment 3.

Table 6. Average cumulative regrets R(T) (AVE) and standard deviations (SD) by each algorithm in the simulation experiment 3.

Figure 5. Average rewards for arms in the elongation data analysis.

Figure 5. Average rewards for arms in the elongation data analysis.

Figure 6. Cumulative regrets for different algorithms in the real data analysis.

Figure 6. Cumulative regrets for different algorithms in the real data analysis.

Table 7. Cumulative regrets R(T) for different algorithms in the elongation data analysis.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.