Optimal quantum state tomography with noisy gates

Quantum state tomography (QST) represents an essential tool for the characterization, verification, and validation (QCVV) of quantum processors. Only for a few idealized scenarios, there are analytic results for the optimal measurement set for QST. E.g., in a setting of non-degenerate measurements, an optimal minimal set of measurement operators for QST has eigenbases which are mutually unbiased. However, in other set-ups, dependent on the rank of the projection operators and the size of the quantum system, the optimal choice of measurements for efficient QST needs to be numerically approximated. We have generalized this problem by introducing the framework of customized efficient QST. Here we extend customized QST and look for the optimal measurement set for QST in the case where some of the quantum gates applied in the measurement process are noisy. To achieve this, we use two distinct noise models: first, the depolarizing channel, and second, over- and under-rotation in single-qubit and to two-qubit gates (for further information, please see Methods). We demonstrate the benefit of using entangling gates for the efficient QST measurement schemes for two qubits at realistic noise levels, by comparing the fidelity of reconstruction of our optimized QST measurement set to the state-of-the-art scheme using only product bases.


Introduction 1.Background
In the past decades, the mounting evidence that quantum algorithms can solve specific tasks with efficiency beyond the capability of a state-of-theart classical computer has led to considerable interest in the field of quantum computing.A major turning point was Shor's algorithm for prime factorization [1].In addition, hard optimization problems are expected to be efficiently solved on a quantum device with potentially enormous consequences for multiple fields.Feynman's proposal to use quantum computers for the efficient simulation of quantum systems for which classical simulation is hard [2]represents another highimpact application.Various physical hardware platforms are being developed for quantum computation [3,4,5,6,7,8].
The increasing size and complexity of quantum devices call for more sophisticated techniques for calibration, certification, and evaluation of their performance.The field of quantum characterization, verification, and validation (QCVV) offers various state-of-the-art protocols and techniques to evaluate the performance of a quantum system.Quantum state tomography (QST) [9], a prominent QCVV technique, allows for the reconstruction of a given quantum state from measurement data.Others include quantum process tomography, randomized benchmarking (RB) [10,11], and gate set tomography [12,13,11,14].
While QST is known as the "gold standard" for the verification of a quantum device [15], as it provides comprehensive information for a given quantum state, its computational costs make it infeasible for a system larger than few qubits.Moreover, full QST can be time-consuming even if performed on small systems, say building blocks of a quantum computer of only one or two qubits.Therefore, the search for efficient measurement schemes for QST is of high practical importance.
Optimal QST measurement schemes are known for specific ideal and noise-free scenarios.For a d-dimensional Hilbert space, the ideal choice is a set of d + 1 measurement operators whose eigenbases are mutually unbiased bases (MUBs) [16].For generalized measurements, using ancillary systems, symmetric, informationally complete positive operator-valued measures (SIC-POVMs) are optimal [17,18].For a situation where one out of N qubits is measured, an optimal quorum [19] consists of projectors on socalled mutually unbiased subspaces.Numerically optimized QST measurement sets consisting of independent rank-1 projection operators [20] and projectors on half-dimensional subspaces in dimension six [21] have been obtained.In the first case, the numerical solution outperforms a set that constitutes projectors from a set of MUBs and in the latter case, the solution approximates mutually unbiased subspaces.
When implementing QST on real systems, one is inevitably confronted with the presence of noise and decoherence during every quantum operation.Despite the importance of QST as an established tool for determining the state of a quantum system, rigorous and systematic research on optimizing QST with noisy gates is lacking.Limited research into optimal measurement schemes for QST in the presence of noise for single-and twoqubit systems exist [22,23,24], however, in these works fine-tuning of noisy entangling gates are not considered.In [24], the measurement set of a photonic 2-qubit quantum system is optimized to be maximally robust to a general measurement error.In [23], the general question of QST under measurement constraints is investigated, with an implementation on a single (photonic) qubit.In [22] a QST on a single qubit is performed with a set of generalized (possibly overcomplete) measurements.

A framework for optimal quantum state tomography in noisy systems
Here, we extend the framework by looking for optimal QST schemes in noisy systems, and modify the QST quality measure defined by Wootters and Fields [16], see Sec. 2.1.They expressed the information about a quantum state obtained by performing measurements on this state as where V 0 is the volume of all possible quantum states.The confidence volume V is defined as the volume of the rectangular parallelepiped which includes the part of the distribution, assumed to be Gaussian, with probability density larger than 1/e times its maximum.For dimension d, V can be expressed as Here, V j is the confidence volume of the (d−1)-dimensional subspace spanned by the projectors P jk on eigenstates k = 1, . . ., d of the measurement j and Q is the geometric quality measure given by the volume of the rectangular parallelepiped spanned by {P jk |j=1, . . ., d+1; k=1, . . ., d−1}.The number of repetitions N j and the probabilities of the measurement outcomes, p jk = Tr(P jk ρ) are related to The averaged information gain reads I = dµ d (ρ)I(ρ), where µ d (ρ) is the Haar measure of the density matrices for a d-dimensional Hilbert space.Importantly, I depends on the choice of the QST measurement set only via Q because ln(Tr[P ρ]) is the same constant for any projector P of the same rank.Thus Q can be used as a quality measure for a QST measurement set.
We consider measurements that are realized by first applying a sequence of quantum gates followed by a measurement in a standard basis, see Fig 1 (a).The quality of the selected QST measurement scheme then depends on the choice of gates and how much the chosen gates are affected by noise.There are multiple noise models for quantum gates, and, for simplicity, we focus on two of those models.Note, however, that our approach is not limited to any noise model.First, we consider a Hamiltonian of the form H = λP λ where we -for now -consider P λ = |λ λ| to be projector of rank 1. Switching this Hamiltonian on for a time t yields the unitary where we have defined φ := λt.To model noisy gate operations, we assume that φ cannot be controlled precisely by the applied pulses but instead follows a Gaussian distribution p N (φ) with φ = φ 0 and a standard deviation of σ = (φ − φ 0 ) 2 , see Fig. 1 (c).We assume that variance scales linearly with φ 0 , σ 2 = 2rφ 0 .Representing a density matrix ρ in the eigenbasis of U (φ) and averaging over the Gaussian, reveals that the off-diagonal elements of dφ p N (φ)U † ρU which involve |λ or λ| are modified by e −rφ 0 e −iφ 0 , i.e. additionally to the desired phase factor, they decay exponentially.Although we consider this model for twoqubit gates, later on, we use the term over-and under-rotation [25,26] for it.Second, we consider the depolarizing channel [27], see Fig. 1 (b).If noise affects any quantum state in the same way, based on the waiting time, independent of whether a quantum gate is applied during this time or not, all the projectors which describe one measurement are modified in the same way.However, the waiting time depends on the overall time of the quantum gate sequence needed for the measurement.
The structure and contents of this article are as follows.We consider a single-qubit system, see Sec. 2.2, assuming that measurements along the z axis of the Bloch sphere and rotations around the z axis are error-free while rotations around an axis in the xy plane are noisy.As in [28], the above two noise models coincide for the single qubit.Then, we turn to a two-qubit system, see Sec. 2.3, with error-free single-qubit gates and entangling two-qubit gates affected by noise.While in general, any non-trivial interaction can lead to an entangling gate, we explicitly consider the Heisenberg interaction, see Sec. 2.3.2, and the Ising interaction, see Sec. 2.3.3.These interactions or interactions which yield equivalent gates are relevant for certain quantum computing platforms, namely, the Heisenberg interaction is present for spin qubits in semiconductor quantum dots [6] and an Ising-equivalent interaction for resonant gates [30,29].Our methods for solving the resulting numerical optimization problem are explained in Sec.3.2 and our results are presented in Secs.2.4 and 2.5.There, we investigate the change in the quality of the optimal measurement schemes for different noise levels as well as the corresponding normalized times for the entangling gates.We perform numerical simulations to demonstrate the benefit of using entangling gates for efficient QST by comparing the fidelity of reconstruction obtained by our measurement schemes to the fidelity of reconstruction using nine product bases and show superiority of using entangling gates for noise level corresponding to average gate fidelities between values around 0.8 and 0.9 depending on the noise model, see Sec. 2.6.At the same time, we quantify the advantage of using a QST measurement set optimized for the specific noise level over the use of the QST scheme optimal for the noiseless case.Importantly, to overcome the limitations imposed by the considered noise models, we validate the advantages of using entangling gates for QST on a real quantum device, running QST on both, an actual quantum device and a simulator emulating it.The results of the experiments on the device confirm that in certain scenarios, a better fidelity of the QST procedure is obtained using entangling gates in comparison to the stateof-the-art product bases.We conclude in Sec. 4.

Summary of contributions
Here, we summarize the significant contributions and assumptions of our paper.Most importantly, we introduce a formal evaluation scheme for the efficiency of QST quorums in noisy settings.Note that a such tool has not been available before, although any experimentally implemented QST deals with noise.
Secondly, we apply our scheme to multiple relevant scenarios, including a single-qubit and a two-qubit system with noisy entangling gates, where two important noise models (depolarizing channel and over-and under-rotation) were implemented.We consider two-qubit gates based on Heisenberg and Ising interaction.To make our investigations relevant for practitioners, we identify realistic noise levels by comparison to recent literature.
Furthermore, we formulate the goal of finding the optimal QST quorum under noise as an optimization problem and derive a mathematically meaningful quality measure for each respective interaction and noise model and perform an extensive exploratory analysis using global optimization approaches for exploration combined with local approaches for exploitation.Based on this analysis, it was that the best solutions have the specific structure of three measurement bases of product states and two bases with highly entangled gates related to the standard set of MUBs (where the entangled states are maximally entangled); we derive an analytical expression for the optimized solutions for the depolarizing channel.
Moreover, using simulations, we compare the fidelity of reconstruction for QST with MUBs and our numerically optimized QST to QST with nine separable bases which represents the current state-of-the-art approach for QST under noise.As an important result, we find a significant advantage in using entangling gates compared to the nine separable bases for noise levels already achieved in experiments.Finally, for validation, we compare the performance of QST under noise using entangling gates and product bases on a real quantum computer.The results confirm that for certain realistic scenarios it is more advantageous to use entangling gates for QST.Additionally, we find a crossover behavior where QST using nine product bases as measurement bases becomes the more advantageous approach with an increasing number of experimental runs of each measurement.

Quantum state tomography with noise
We now investigate the performance of quantum state tomography quorums under the influence of noise.Noise which is independent of the choice of the measurement will only lead to a constant change in I .This can be compensated by an increased number of experimental runs, but the optimal QST measurement set remains the same.We consider the more interesting situation where the noise depends on the choice of measurements.We will analyze how the noise affects the averaged information gain I and incorporate this dependence by modifying the quality measure We consider the case where the measurement j is described by a POVM, {F jk } with k F jk = 1.
In the noise-free case, F jk = P jk where P jk are projection operators of rank l jk with P jk P ji = δ ik P jk and k l jk = d.We look at operators being affected by noise such that Here, the traceless part of the operator is rescaled with a factor of q j ≤ 1.Then the outcomes of the noise-affected measurement j follow a multinomial distribution with probabilities p jk = Tr(F jk ρ) = q j p jk +l jk (1−q j )/d.From p jk we can calculate the probabilities in the noiseless case As in [16], we assume that the multinomial distribution is well approximated by a Gaussian.We then use (3) with rescaled probabilities p jk /q j and the relation (4) to calculate the confidence intervals as Rescaling is necessary as the noise-affected measurement outcome is related to the outcome of a corresponding noise-free measurement not only by a shift in probability but also by a factor of q j In order to include the effect of the noise, described here by the value of q j , we need to compute ln (p jk + l jk (1 − q j )/(q j d)) using the Haar measure µ d (ρ).We will calculate this expression for the cases d = 2 and d = 4, as they are investigated in detail in this paper.For d = 2, we can calculate this expression analytically by integrating over the Bloch sphere in cylindrical coordinates for which we can expand for c ∝ 1 − q j 1 up to linear order in (1 − q j ) yielding where l jk = 1 for d = 2.For d = 4, we compute the average numerically.We obtain random density matrices using the representation ρ = U DU † , see [31], where D is a diagonal matrix with the eigenvalues of ρ on the diagonal obtained as the differences between the elements of {0, r 1 , r 2 , r 3 , 1} where {r 1 , r 2 , r 3 } is an ordered set of three values taken from a uniform distribution over the interval [0, 1].We generate random unitary matrices U using the approach described in [32].We use linear regression to fit the data to get an estimate of the coefficient.We use 10 7 matrices, projectors on the four basis vectors, and 40 data points on the interval q j ∈ [0.9, 1].We obtain for l jk = 1, ln p jk + 1 − q j 4q j ≈ const.+ 1.195(1 − q j ).
(9) We disregard terms of higher than linear order in 1 − q j and set N j = N , then we obtain (10) where s depends on the rank of the projectors involved in the measurements, for d = 2, we have s = 3/2, for non-degenerate measurements in 4D, we have s = 2.39.The q j -dependent part of the averaged information gain I will now be included in a modified quality measure We will use Q N in order to find optimal QST measurement set under noisy conditions.

Single qubit
We consider a qubit and assume that while the standard measurement in the z basis is free of errors, rotations about any axis in the xy plane are error-prone.We assume that the angle of such rotations follows the Gaussian distribution described above, So, if we want to project onto the state cos(θ 0 /2)|0 + sin(θ 0 /2)e iφ |1 , we actually project onto the mixed state We see that the traceless part of the density matrix decays exponentially with increasing 0 < θ 0 < π, i.e. the imperfection of the quantum gate yields a depolarization channel for the qubit.The traceless parts of the density matrix can be written as a three-component real vector, the Bloch vector n, such that ρ = 1/2+n•σ/2 where σ is a vector of Pauli matrices.In the specific situation considered here the length of n depends on the angle θ, In other words, the value of q j defined above is given by exp(−rθ j ) as it is for the depolarizing channel.Numerically one finds that the maximal Q N is three Bloch vectors all with the same θ and the phases can be chosen to be φ 1 = 0, φ 2 = 2π/3, and φ 3 = 4π/3.Then, the quality measure with the exponent s = 3/2, see Sec. 2.1, reads and the optimal angle is θ = arctan( 81r 2 /16 + 2 − 9r/4).

Two qubits with noisy entangling gates
We formalize the question "When does it make sense to include entangling gates in a QST measurement scheme?" for non-degenerate measurements.
Each of the measurements included in a QST quorum is assumed to be carried out by first applying a unitary operation U j where j = 1, . . ., 5 for non-degenerate measurements to the unknown quantum state and then performing a measurement in the standard basis {|00 , |01 , |10 , |11 }.The task is now to find U 1 , . . ., U 5 which yield the highest Q N .

Universal quantum gates with noisy entangler
General unitary operators acting on two qubits can be represented by 5) is a local one-qubit gate applied to qubit k defined by the three real parameters Φ = (φ, ψ, χ), U qk (Φ) = cos(φ)e iψ sin(φ)e iχ − sin(φ)e −iχ cos(φ)e −iψ .(13) The gate U tq (β j ) is a universal two-qubit gate, i.e., together with the local gates any desired U j ∈ SU (4) can be realized.As a parametrization of U tq (β j ), the Hamiltonian where σ α are the Pauli matrices for the qubit k, can be used [33,34,35], The operators H p and U tq are diagonal in the Bell basis for H p and e −iη j ik for U tq .While, any interaction which yields a universal two-qubit gate can be applied to reproduce a desired U tq , the representation in Eq. ( 15) provides a very convenient way to do this for any Hamiltonian which is of the form H p with any fixed values of β j including the Heisenberg (β jx = β jy = β jz ) and the Ising interactions (β jx = β jy = 0), Any U tq can be generated by applying any of these gates three times with appropriate single-qubit gates in between.

Heisenberg interaction
Now, we demonstrate how to compose the universal two-qubit gate for the situation where the entangling gate originates from the Heisenberg exchange interaction (β jx = β jy = β jz ), where λ H is the interaction strength.This situation can be realized by spin qubits in semiconductor quantum dots [6].Following the work by Fan et al. [36], U tq is then defined by three parameters α j = (α j,1 , α j,2 , α j,3 ), which are directly related to the amount of time the Heisenberg interaction is switched on, yielding SWAP α jk gates.We assume that the interaction is switched on and off instantaneously.Thus the α jk can be considered as normalized entangling times with the unit π/λ H , |α j | 1 = k α jk is then the normalized entangling time for the measurement j and additionally, we define α := j |α j | 1 as the normalized overall entangling time applied within the gate sequences for a QST measurement set.The gate U tq is composed as [36] U tq (α j ) =σ (1)  z σ (2)  x SWAP α j,1 σ (1)  z SWAP α j,2 × σ (2)  x SWAP α j,3 . ( In the -now unconventionally sorted -Bell basis 1 π , e iα j,2 π , e iα j,3 π ).( 18) Switching on the Heisenberg interaction creates an entangling gate.In our model, the normalized time of the Heisenberg interaction between two spin qubits determines how much the noise affects the quantum system.Since, aside from the QST optimal measurement scheme, we are interested in the role the entangling gate plays in the optimal solution, we evaluate the normalized times α jk during which the Heisenberg interaction is switched on.

Ising interaction
We construct the universal two-qubit gate using the Ising interaction We note that we can reproduce the representation in Eq. ( 15) by the sequence As for the Heisenberg interaction, we define normalized entangling times for each switching on of the interaction for the gate sequence used for realizing measurement number j, τ I j = k τ I jk , and for all sequences within one QST measurement set, τ I = j τ I j .The unit of these times is again π/λ I , i.e., π divided by interaction strength.Note that, also interactions of the form H ∼ (a x σ (1) x + a y σ yield a gate locally equivalent to the gate provided by the Ising interactions.Those interactions are effectively present when resonant gates are applied [29].
2.4 Results from the exploratory analysis: geometric measure and normalized entangling time

Optimization via multiple runs of local search
One approach to look for an optimal solution is to use parallel searches with many well chosen starting points in order to explore well the space of potential optimal solutions.This approach has been used successfully for similar problems in [20,21].We use Powell's method as a local search with a set of 500 diverse starting points, each at least 0.01 distance from each other using a Jaccard-based distance measure.For more details, see section 3.2.In Fig. 2 we present the quality of a selection of the quorums discovered by the above approach.

Global Optimization via Simulated Annealing
In Fig. 3, results of the global optimization via Simulated Annealing are presented.Ten runs are performed for each noise level.The resulting solutions are contrasted to the optimized solution discovered by Powell's method using the set of MUBs, see Sec. 3.2.5, as starting points for the local search.In principle, there are infinitely many possible sets of MUBs that can be used.For our comparison, we use a set of MUBs that is known to have a shorter entangling times τ I = 1/2 and α = 2 than other known choices.In the following, we refer to this set as the standard set of MUBs.
The results of the exploratory analysis demonstrate that high-quality solutions are discovered with entanglement close to or below that of the standard set of MUBs.Here ζ is the noise parameter for the depolarizing channel with Ising and Heisenberg interactions and r is the noise parameter for the over-and-underrotation channel with the two types of interactions (see Method section for the precise definition).For higher levels of noise, e.g.ζ and r take values around 3%, the solutions discovered by the global optimization approach are close to the ones found by using a local search with the standard MUBs as starting points of the search.This, as well as the consideration that the a full set of MUBs represents an ideal quorum of projectors in the absence of noise, motivated us to use the aforementioned set of MUBs as a starting point of a local search for lower levels of noise and compare the results to the results from the global optimization.The relation that a shorter entangling time corresponds to a higher quality measure for optimized quorums does not hold for over-and under-rotation with Ising interaction due to special invariants, discussed below.

Invariance of the Q N for specific applications of the Ising interaction with over-and underrotation
The results of the exploratory analysis for the Ising interaction with over-and under-rotation errors reveal that there are quorums with the same Q N but different τ I , see Fig. 3 (d).This is a consequence of the Ising interaction leaving specific product bases unchanged.For example, the standard basis {|00 , |01 , |10 , |11 } is not af- The solutions are colored differently, based on the noise level used for each noise model and interaction, where ζ and r are the noise parameters for the depolarizing channel and over-and-underrotation respectively.The points with the red circles represent the results from using Powell's method with the standard set of as a starting points being as good as or better than the best results from the exploratory analysis.Their colors also represent the noise level of the corresponding noise parameter.

Results from using the standard set of MUBs as a starting point of a local search
In Fig.
Figure 4: Results for both noise channels, the over-and under-rotation (a-d) and the depolarizing channel (e-h) and for both interactions, the Heisenberg interaction (blue, red) and the Ising interaction (green, orange): We present the quality measure Q MUB for the quorum formed by a complete set of MUBs with lowest possible entangling time (a,e); the ratio of the quality measures for the numerically optimized quorum, Q opt and Q MUB (b,f); as well as the normalized entangling times α (for Heisenberg interaction) (c,d) and τ I for the Ising interaction (g,h).All these quantities are given as a function of the noise parameters r (a-d) and ζ (e-h).Note that the solid and dashed lines connecting the data points are guidance for the eyes while the dashed lines in (c,d,g,h) represent the (smallest possible) normalized entangling time for the MUBs.The increase in τ I from r = 0.28 to r = 0.3 might be due to the fact that Q N can be invariant under switching on the Ising-interaction for certain states, see main text that the MUBs are ideal at zero noise.In parallel, the normalized entangling times of the numerically optimized quorums decrease.

Fidelity of reconstruction
We simulate QST by randomly generating density matrices (in the same way as described in Sec.2.1) and "measurement outcomes" for a number N rep = 5 × 9 × 512 of runs of measurements.We use the maximum-likelihood method for reconstruction for 10 5 random density matrices and present the averaged results in Fig. 5.
The noise levels considered are up to ζ = 0.25 and r = 0.25.While improvement by the optimization procedure compared to the MUBs increases with increasing ζ and r, the optimization might become imprecise for larger values of ζ and r as the linearization in Eqs. ( 9) are no longer a good approximation at those values.However, we are mainly interested in systems with high-purity gates, and, therefore, do not consider higher-order quality measures.We find that for the depolarizing channel, with Heisenberg interaction up to ζ = 0.08, there is a benefit of using entangled-state bases compared to product-state bases.While for the Ising interaction values of ζ where QST with nine separable bases performs better than QST with MUBs or numerically optimized quorums, have not been considered, we predict the limiting value to be four times as high as for the Heisenberg interaction, because the results for Ising and Heisenberg interaction coincide if ζ for the Ising interaction is four times as high as for the Heisenberg interaction.
For the over-and under-rotation noise model, the fidelity of reconstruction using entangling gates is better than using nine product-state bases for a level of noise around r = 0.2 for both, Ising and Heisenberg interactions.In order to evaluate the benefits of using the entangling gates for performing efficient QST, we need to evaluate how the noise levels in the two models compare to the noise in real state-of-the-art devices and whether for these real-life noise values using entangling gates leads to better performance in comparison to using nine productstate bases.Therefore, we calculate the relationship between average gate fidelity and the noise The QST measurement sets were 9 separable bases (red) which don't depend on the noise as only entngling gates are effected, MUBs (blue), and numerically optimized quorums (green).The error bars indicate the standard deviation of the mean after averaging over 10 5 random density matrices.The total number of measurements runs is parameters of the two models.Using the relationship described in Sec.3.1, the average gate fidelity of the CNOT gate which corresponds to the thresholds for the depolarized channel is 0.83.For CNOT gates with this fidelity or higher, we find that the use of entangling gates for measurement is beneficial.The gate fidelity which corresponds to the threshold r = 0.2 for the overand under-rotation channel is 0.85 or 0.89 for the Heisenberg and for the Ising interaction, respectively.For experimentally achievable noise levels [30], there is therefore already a benefit of using entangling gates when determining optimal QST measurement schemes.

Analytical expression for Heisenberg interaction, depolarizing channel
A set of MUBs is obtained by using the unitaries Entangling gates are used only for U 4 and U 5 with the parameters α 41 = α 43 = α 51 = α 53 = 1/2 and α 42 = α 52 = 0. From our numerical results, we observe that for the optimized quorum only the values of α 41 , α 43 , α 51 , and α 53 change in dependence of ζ.Therefore, we try to reproduce the numerical result analytically by fixing all the parameters of the single-qubit gates included in a quorum according to Eq. ( 21) and α 42 = α 52 = 0 but treat α 41 , α 43 , α 51 , and α 53 as independent parameters.Then the quality measure reads Although the expression is not symmetric under the exchange of the values of α 4k and α 5k (k = 1, 3), the maximum is reached at a point where all parameters coincide, With this expression, we reproduce the numerical results.

Analytical expression for Ising interaction, depolarizing channel
The same set of MUBs obtained above for the Heisenberg interaction can be realized using the Ising interaction, by representing the two-qubit gates within the sequence for U 4 and U 5 in Eq. ( 21) with the parameters β 4y = β 5y = π/4 and β 4x = β 4z = β 5x = β 5z = 0. Again, we observe within the numerics that with increasing values of ζ all parameters but β 4y and β 5y remain the same.Thus, we consider the quorum as described above treating β 4y and β 5y as free parameters.We obtain for the quality measure and find the maximum at which indeed yields the same Q N as the numerically optimized solution for each ζ.

Performance of QST with Noisy Gates on Real Quantum Device
Given that the noise models considered above require stringent limiting assumptions on the behavior of the noise rather than using noise characteristics derived from a real quantum computer, we decided to examine our approach using an actual noisy quantum device.However, running our optimized measurement bases on the publicly available quantum devices is challenging, due to the fact that a SWAP α gate with α being a parameter is not available as a native gate, CNOT beign the only native two-qubit gate.Reproducing SWAP α would require a gate sequence with a fixed number of CNOTs, i.e., α would not determine the time of the two-qubit interaction being switched on.Therefore, for the two-qubit noisy system, which was the focus of investigation in this paper, we ran comparative QST using -as measurement bases -only a full set of MUBs and Pauli product bases.Using Pauli bases is currently the state of the art.In order to evaluate how the two approaches using different measurement bases are affected specifically by the noisy gates, we corrected the readout error in both cases before the QST was performed.We ran QST using both measurement bases on a classical simulation of the IBM Manila device, as well as on the actual IBM Manila quantum device.It is important to note that the simulation, which uses averaged data, may be more representative than the results run on the actual device, which may differ from calibration to calibration.We ran both QST approaches for different total numbers of allowed repetitions, N tot , i.e. the total number of shots using the qiskit terminology, and evaluated the results.The results, averaged over 660 randomly selected pure initial states, are shown in Fig. 6 (a) for the simulated Manila quantum device, and in Fig. 6 (b) for the actual quantum device.For the simulated data, for a fidelity of reconstruction of 97%, the MUBs outperform Pauli bases as measurement bases and require a lower number of repetitions N tot .The crossover happens at a total number of shots of around N tot = 2 × 10 3 .It is important to note that for further increasing N tot the fidelity saturates and the gain from performing extra shots is very small, even if the Pauli bases are more advantageous.In the case of the quantum processor, the MUBs outperform Pauli bases up to a fidelity of reconstruction close to 92%, achieved for around N tot = 500; for larger N tot the Pauli bases are the more advantageous approach to use.Note that the averaged infidelity of reconstruction saturates for large numbers of measurement repetitions due to systematic errors.Systematic errors in the CNOT gates are most likely to yield the QST with MUBs having a higher saturation value for the infidelity.This explains the crossover behavior: for lower values of N tot where the infidelity is not dominated by systematic errors, our analysis that MUBs outperform QST with Pauli bases is confirmed; when the systematic errors dominates, Pauli bases are better as they do not suffer from imperfect CNOT gates included in measurement scheme.It should be mentioned that these are results achieved using one calibration of the quantum device, and not an averaged performance over several calibrations, and thus we consider the results of the simulator as more representative.
Clearly, for scenarios where the total number of shots is limited, such as when one needs to test initialization of a quantum device and thus a large number of random initial density matrices, the QST with MUBs is the more beneficial approach.An additional advantage of the QST with MUBs as measurement bases comes from the fact that fewer different circuits are involved, and currently the loading of new circuits is one of the main bottlenecks of using the IBM quantum devices.The advantage of having a smaller number of different measurements (different circuits) is increasing for increasing number of qubits, scaling as a ratio of (2/3) n where n is the number of qubits.Therefore, there is an exponential advantage of MUBs over Pauli product bases regarding We observe that the MUBs outperform the Pauli bases for a low number of shots, while the Pauli bases perform better for a higher number of shots possibly due to not including any two-qubit gate within the measurement scheme.Insets: Log-log plots of the same averaged infidelities.
the number of different measurements.

Discussion of Results
We performed an extensive analysis of the search space of quorums for QST, using global and local parallel explorations.However, the best solutions we were able to discover are the ones that are obtained by a local search method using a full set of MUBs as a starting point.Additionally, these optimized solutions differ from the standard set of MUBs only by different entangling times.Using the derived quality measures, for realistic noise levels, we find that the optimized solutions are better than the MUBs.For realistic depolarizing and over-and under-rotation noise models using the Heisenberg interaction, there is a small improvement in the fidelity of the reconstruction results for the optimized quorum compared to the standard set of MUBs.For the Ising interaction (depolarized and over-and under-rotation), there is no improvement over using the MUBs.The state-of-the-art approach for efficient QST under noise is to use nine separable bases.Our results demonstrate that this is unnecessary.Namely, the standard set of MUBs performs significantly better in state-of-the-art existing systems.

Parametrization of non-degenerate measurements
A quantum gate according to Eq. ( 12) is given by 15 real parameters.For a QST measurement set we need five different quantum gates.Therefore, we have overall 75 parameters.We consider noise caused by the entangling gates while we assume that the single-qubit gates are error-free and that they can be performed instantly.

QST for two qubits with noisy entangling gates, over-and under-rotation
For the noise of the entangling gate, we include over-and under-rotation where the parameters α j,m follow a Gaussian distribution while the entangling gate is switched on, focusing for now on the Heisenberg interaction.The noise-affected operation can be denoted as the desired unitary operation followed by the linear positive map in the eigenbasis of (26) with γ j = e −rα j π .We can express this map by eight Kraus operators for m, k, l ∈ {0, 1}.From the Kraus operators, the average gate fidelity can be directly computed [37] F Heisenberg ou = 4+γ 1 +γ 2 +γ 3 +γ 1 γ 2 +γ 1 γ 3 +γ 2 γ 3 10 .
(28) For a CNOT gate realized by α 1 = α 3 = 1/2 and α 2 = 0, we obtain (29) This allows us to compare the noise parameter r from our model to average gate fidelities of existing implementations of qubit systems.
For the Ising interaction, we obtain, in the Bell basis, a modified map for the effect of the noise ) with γ j = e −2r|β j | .This map is represented by the four Kraus operators for k, l ∈ {0, 1}.The averaged gate fidelity is given by For the CNOT gate where β z = π/4, we obtain Note that the noise does not affect all states of a measurement basis in the same way as the gate does not entangle each product input state.This means our considerations from Sec. 2.1 need to be adjusted.While a measurement j in the presence of noise is still described by a POVM, {F j1 , F j2 , F j3 , F j4 }, Eq. ( 4) does not hold anymore.However, we can find projectors P jk with k = 1, 2, 3, 4 such that F jk = q jk (P jk − l jk 1/d) + l jk 1/d, where q jk now explicitly depends on k, and can be extracted from the F jk by Note that the rank-1 projectors P jk do not necessarily form an orthogonal basis.However, when we select three from each of the five measurements, the volume Q spanned by those 15 projectors does not depend on the selection of the three out of four basis states.The noise-affected quality measure is then given by obtaining the exponent from Eq. (9).

QST for two qubits with noisy entangling gates -depolarizing channel
Depolarization leads to the exponential decay of all of the components of the resulting density matrices which are not proportional to the identity matrix, This map is expressed by the Kraus operators for k, l ∈ {0, x, y, z} but (k, l) = (0, 0) and 1. ( Using again the formalism from [37] this yields an average gate fidelity of The probability to leave the state unchanged by the depolarization is given by ) for the Heisenberg and the Ising interaction respectively, ζ is a measure for how strongly the noise affects the quantum system.For the CNOT gate we then obtain the average gate fidelities and In order to obtain a rough idea of what range the noise level might be in a realistic scenario, we make the strong assumption that the depolarizing channel would describe the noise in a system correctly.Then, we could extract ζ from experimental estimations of the average gate fidelity, e.g. the value of F Huang = 0.98 as found for a CNOT-equivalent gate in [30], ζ Huang = 0.034.Note that the over-and-under-rotation picture as we consider it here for the Ising interaction cannot directly be related to the results in [30] as the two-qubit interaction there comes along with a single-qubit rotations.
The considerations from Sec. 2.1 can be applied in a straightforward manner for the depolarizing channel.

Comparison to entanglement-free QST
We compare a QST quorum including entangling gates to QST without entanglement.However, there is no informationally complete set of five measurement operators whose eigenbases include only product states.A standard procedure with separable basis states only is a set of nine measurements given by all combinations of measuring in the Pauli x, y, and z bases for the first and the second qubit [38,39,40].We simulate quantum measurements and compute the fidelity of reconstruction for the nine measurements without entanglement and the entanglement-including quorums.

Exploratory Analysis
The problem of finding the optimal quorum of projection operators under noise is a non-convex continuous optimization problem with the derivatives of the function not easily obtained, and with multiple local maxima.From our previous work [20,21], we know that using a local optimizer (Powell's method) with well-chosen starting points performs very well.Here, we use the Powell's derivative free method started in parallel with multiple sufficiently diverse starting points in order to improve the exploration of the search space.In addition, we used a global optimization approach.Based on the results of the exploratory analysis and the theoretical considerations in 3.2.5, to discover the optimized quorums for each noise level, we used a local search approach, with starting point the standard set of MUBs.

Local search: Powell's method
Initially, Powell's method for local search with 500 well chosen starting points was used.The points were chosen at random, but with the requirement to meet a diversity threshold, where the diversity was evaluated using the the angles formed by the traceless parts of the projection operators.For detail see below.The diversity measure was based on the Jaccard distance.The distance between two quorums was considered to be the normalized minimal Jaccard distance that each projector from a quorum Q 1 forms with the projectors from the quorum Q 2 based on the angles formed by the traceless parts of the projection operators.The chosen diversity measure threshold used for each of the noise models and interactions is different, based on the distribution of distances between two randomly chosen quorums (the threshold was chosen to be the meanone standard deviation).

Diversity measure for the local search
In the absence of noise, the quality of a quorum is uniquely determined by the pairwise dot products of the traceless part Q jk = P jk − 1/4 of the projection operator P jk projecting on the eigenstate k of the measurement operator j, treating the Q jk as vectors in a vector space with the dot product Tr(Q jk Q j k ).Thus, a diversity measure based on these dot products of two quorums is meaningful.Here, we are interested in small levels of noise of up to ζ = 0.03 and r = 0.03, for which we know that the effect of the noise levels on the quality measure Q N is small, and distance measure based only on dot products of the Q jk is still valid.
Then, each quorum is a set of sets, which are the dot products that each Q jk forms with the other Q j k in the quorum.As a distance measure between two quorums we use the Jaccard distance [41].

Global Optimization: Simulated Annealing
As a part of the exploratory analysis, we performed global optimization using Simulated Annealing with appropriately selected parameters, in combination with a local search approach (Powell's method).This combination is commonly used with success when solving an optimization problem with a complicated landscape: the global optimizer is used for exploration of the search space and locating promising areas, while the local optimizer is then used for exploitation, i.e. refinement and reaching the closest locally optimal point.

Using the standard set of MUBs as a starting point of the local search
For zero noise, MUBs are known to be the ideal choice for an QST quorum and the noise penalizes switching on entangling gates.Thus one can expect for small noise the optimal solutions to be close to the set of MUBs with minimal entangling times.Indeed, many of the best solutions obtained during the exploratory analysis have entanglement, which corresponds to two bases with entangled states.There is a known set of MUBs constructed via the approach presented in [42], known as the standard set of MUBs, where two bases are with maximally entangled states.Its parametrization using the Heisenberg interactions is given in Eqs.(21).
Considering the above, we use the standard set of MUBs as starting points for Powell's method to find the best optimized solutions for low noise level.

Running QST on IBM Manila
We used the interface provided by qiskit to execute QST on the IBM Manila quantum processor as well as the corresponding simulator.Using Pauli bases is the default and readily implemented for the state tomography function in qiskit.We extended the libraries of qiskit to also perform QST based on MUBs in order to compare the two approaches.Using the class "Com-pleteMeasFitter" [43], we corrected the readout error in both cases prior to reconstruction of the QST being performed.660 initial states were randomly selected as pure two-qubit states by first normalizing random Gaussian states in R 4 and then adding three relative phases randomly chosen from a uniform distribution on the interval [0, 2π).

Conclusions and Overview
To summarize, we investigated the optimal QST measurement schemes under the influence of noise.We extended Wootters and Fields' [16] quality measure for a QST measurement quorum to the case of noise-affected measurements and optimized QST measurement sets for a singleand two-qubit system under noise.For a single qubit, we considered noise which increases with the polar angle of the Bloch sphere and perfect azimuthal rotations.For two qubits, we limited the discussion to perfect single-qubit gates and noisy two-qubit gates, generated either by Heisenberg interaction or by Ising interaction.We solved the problem of finding an optimal quorum for quantum state tomography under these noise models by using an extensive number of well-suited numerical techniques.
For two qubits, the results depend on the interaction and on the noise model.For practically relevant noise levels, a minor improvement over using MUBs is present for the Heisenberg overand under-rotation and depolarized noise models.Apart from this, the set of MUBs performs sufficiently well as an quorum for QST for realistic noise levels.
In some cases, we extracted analytical expressions for the optimized quorum from the numerical results, namely for the single-qubit case and for two qubits with a depolarizing channel.In the two-qubit case, only the entangling gate times change as the noise level is varied.
Importantly, for simulated QST based on the noise models described above we find an improvement of QST with MUBs and numerically optimized QST measurement sets compared to QST with separable bases.While we did not include state preparation and measurement (SPAM) errors, their influence can be mitigated [44].
To confirm our findings and alleviate potential limitations of our noise models, we compared the performance of QST using entangling gates with QST using nine separable bases on a real quantum device.We investigated for which scenarios the use of entangled gates is advantageous in comparison with the use of the nine product bases as measurement bases.
Naturally, future research would consider models with noisy two-qubit and single-qubit gates, using system-specific parameters.

Figure 1 :
Figure 1: (a): Noisy measurement M realized by a noisy unitary operation Ũ followed by a noiseless measurement in the standard basis.(b,c): Noise models illustrated here for a single qubit on the Bloch sphere.(b) The depolarizing channel shrinks the Bloch vector.(c) Overand under-rotation describes errors during a quantum gate where the rotation angle fluctuates according to a Gaussian distribution.The example shows an intended π/2-rotation about an axis in the xy-plane.

Figure 2 :
Figure2:The quality and entanglement time of selected solutions (sets of quorums) discovered by using Powell's method with 500 random diverse starting points, for the Heisenberg (a,b) and the Ising interaction (c,d) with noise described by the depolarizing channel (a,c) or overand under-rotation (b,d).Only the 10 best solutions are visualized for each noise level.The points with the red circles represent the optimized quorums resulting from using Powell's method with standard MUBs as starting points being as good as or better than the best results from the exploratory analysis.Here r and ζ are the noise parameters for the over-and-under rotation noise model and for the depolarizing channel, respectively.

Figure 3 :
Figure 3: Selected high-performing quorums of projection operators, discovered by optimizing via simulated annealing for exploration and Powell's local search method for exploitation for the Heisenberg (a,b) and the Ising interaction (c,d) with noise described by the depolarizing channel (a,c) or over-and under-rotation (b,d).The solutions are colored differently, based on the noise level used for each noise model and interaction, where ζ and r are the noise parameters for the depolarizing channel and over-and-underrotation respectively.The points with the red circles represent the results from using Powell's method with the standard set of as a starting points being as good as or better than the best results from the exploratory analysis.Their colors also represent the noise level of the corresponding noise parameter.

3 Figure 5 :
Figure 5: Infidelity of reconstruction in dependence of the noise parameters ζ and r for the Heisenberg interaction (a,b) and the Ising interaction (c,d) under depolarizing noise (a,c) and over-and under-rotation (b,d).The QST measurement sets were 9 separable bases (red) which don't depend on the noise as only entngling gates are effected, MUBs (blue), and numerically optimized quorums (green).The error bars indicate the standard deviation of the mean after averaging over 10 5 random density matrices.The total number of measurements runs is N tot = 5 × 9 × 512.

Figure 6 :
Figure6: Averaged infidelity of reconstruction for 660 pure initial states for QST with nine Pauli product bases (blue) and five MUBs (green) for the IBM Manila simulator (a) and the IBM Manila quantum processor (b).We observe that the MUBs outperform the Pauli bases for a low number of shots, while the Pauli bases perform better for a higher number of shots possibly due to not including any two-qubit gate within the measurement scheme.Insets: Log-log plots of the same averaged infidelities.
4, we present Q N for the MUB quorums in dependence of the noise parameters r and ζ, N monotonously decrease with increasing r and ζ.The improvement by the numerical optimization compared to the MUBs increases with an increasing noise level which is due to the fact