Figure 14 | EPJ Quantum Technology

Figure 14

From: Robustness of quantum reinforcement learning under hardware errors

Q-learning agents trained in the TSP environment with one layer of the circuit depicted in Fig. 2 c) and a custom noise model, using 1000 Monte Carlo trajectories. The labels indicate the custom noise configurations defined in Table 1. Each curve is averaged over five agents, except for the exact curve, which is averaged over ten agents, as in previous figures.
