Quantum algorithm for linear systems of equations (HHL09): Step 2 - What is $|\Psi_0\rangle$?

This is a sequel to Quantum algorithm for linear systems of equations (HHL09): Step 1 - Confusion regarding the usage of phase estimation algorithm and Quantum algorithm for linear systems of equations (HHL09): Step 1 - Number of qubits needed.

In the paper Quantum algorithm for linear systems of equations (Harrow, Hassidim and Lloyd, 2009), what is written up to the portion

"The next step is to decompose $|b\rangle$ in the eigenvector basis, using phase estimation [5-7]. Denote by $|u_j\rangle$ the eigenvectors of $A$ (or equivalently, of $e^{iAt}$), and by $\lambda_j$ the corresponding eigenvalues."

on page $2$ makes some sense to me (the confusions up to that point have been addressed in the previous posts linked above). However, the next portion, i.e. the $R(\lambda^{-1})$ rotation, seems a bit cryptic.

"Let
$|\Psi_0\rangle := \sqrt{\frac{2}{T}}\sum_{\tau =0}^{T-1} \sin \frac{\pi(\tau+\frac{1}{2})}{T}|\tau\rangle$
for some $T$. The coefficients of $|\Psi_0\rangle$ are chosen (following [5-7]) to minimize a certain quadratic loss function which appears in our error analysis (see [13] for details).

Next, we apply the conditional Hamiltonian evolution $\sum_{\tau = 0}^{T-1}|\tau\rangle \langle \tau|^{C}\otimes e^{iA\tau t_0/T}$ on $|\Psi_0\rangle^{C}\otimes |b\rangle$, where $t_0 = \mathcal{O}(\kappa/\epsilon)$."

Questions:

1. What exactly is $|\Psi_0\rangle$? What do $T$ and $\tau$ stand for? I have no idea where this gigantic expression

$\sqrt{\frac{2}{T}}\sum_{\tau =0}^{T-1} \sin \frac{\pi(\tau+\frac{1}{2})}{T}|\tau\rangle$

suddenly comes from, or what its use is.

2. After the phase estimation step, the state of our system is apparently:

$\left(\sum_{j=1}^{j=N}\beta_j|u_j\rangle\otimes |\tilde\lambda_j\rangle\right)\otimes |0\rangle_{\text{ancilla}}$

This surely cannot be written as

$\left(\sum_{j=1}^{j=N}\beta_j|u_j\rangle\right)\otimes \left(\sum_{j=1}^{j=N}|\tilde\lambda_j\rangle\right)\otimes |0\rangle_{\text{ancilla}}$

i.e.

$|b\rangle\otimes \left(\sum_{j=1}^{j=N}|\tilde\lambda_j\rangle\right)\otimes |0\rangle_{\text{ancilla}}$

So it is clear that $|b\rangle$ is not available separately in the second register. I therefore have no idea how they prepare a state like $|\Psi_0\rangle^{C}\otimes |b\rangle$ in the first place! Also, what does the $C$ in the superscript of $|\Psi_0\rangle^{C}$ denote?

3. Where does the expression $\sum_{\tau = 0}^{T-1}|\tau\rangle \langle \tau|^{C}\otimes e^{iA\tau t_0/T}$ suddenly appear from? What is the use of simulating it? And what is $\kappa$ in $\mathcal{O}(\kappa/\epsilon)$?

algorithm hhl-algorithm hamiltonian-simulation

— Sanchayan Dutta

Answers:

1. Definitions

The names and symbols used in this answer follow those defined in Quantum linear systems algorithms: a primer (Dervovic, Herbster, Mountney, Severini, Usher and Wossnig, 2018). A reminder is given below.

1.1 Register names

Register names are defined in Figure 5 of Quantum linear systems algorithms: a primer (Dervovic, Herbster, Mountney, Severini, Usher and Wossnig, 2018) (reproduced below):

$S$ (1 qubit) is the ancilla register, used to check whether the output is valid.
$C$ ( $n$ qubits) is the clock register, i.e. the register used to estimate the eigenvalues of the Hamiltonian with quantum phase estimation (QPE).
$I$ ( $m$ qubits) is the register that stores the right-hand side $b$ of the equation $Ax = b$. It stores $x$, the solution of the equation, when $S$ is measured to be $\left|1\right>$ at the end of the algorithm.

2. About $\left|\Psi_0\right>$:

What exactly is $\left|\Psi_0\right>$?

$\left|\Psi_0\right>$ is one possible initial state of the clock register $C$.

What do $T$ and $\tau$ stand for?

$T$ stands for a large positive integer. $T$ should be as large as possible, because the expression of $\left|\Psi_0\right>$ minimises a given error asymptotically, for $T$ growing to infinity. In the expression of $\left|\Psi_0\right>$, $T$ will be $2^n$, the number of possible states of the quantum clock $C$.

$\tau$ is just the summation index.
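To make the roles of $T$ and $\tau$ concrete, here is a minimal NumPy sketch that builds the amplitudes of $\left|\Psi_0\right>$ for a clock register of $n$ qubits and checks that the state is normalised. The function name `psi0` and the choice $n = 3$ are only illustrative assumptions, not something taken from the paper.

```python
import numpy as np

def psi0(n_clock_qubits: int) -> np.ndarray:
    """Amplitudes of |Psi_0> on a clock register of n qubits (T = 2**n)."""
    T = 2 ** n_clock_qubits
    tau = np.arange(T)                       # summation index tau = 0, ..., T-1
    return np.sqrt(2.0 / T) * np.sin(np.pi * (tau + 0.5) / T)

amps = psi0(3)                               # illustrative choice: T = 8 clock states
print(amps.shape)
print(np.sum(amps ** 2))                     # squared norm, 1 up to rounding
```

The amplitudes are small at both ends of the register and largest in the middle, which is the main difference from the uniform superposition used in textbook phase estimation.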
Why such a gigantic expression for $\left|\Psi_0\right>$?

See DaftWullie's post for a detailed explanation.

Following the citations in Quantum algorithm for linear systems of equations (Harrow, Hassidim and Lloyd, 2009 v3), we end up with:
1. The previous version of the same paper, Quantum algorithm for linear systems of equations (Harrow, Hassidim and Lloyd, 2009 v2). The authors revised the paper twice (there are 3 versions of the original HHL paper), and version 3 does not include all the information provided in the previous versions. In v2 (section A.3, starting at page 17), the authors provide a detailed analysis of the error with this special initial state.
2. Optimal Quantum Clocks (Buzek, Derka and Massar, 1998), where the expression of $\left|\Psi_0\right>$ is given as $\left|\Psi_{opt}\right>$. I do not have the knowledge to fully understand this part, but it seems that this expression is "optimal" in some sense.

3. Preparation of $\left|\Psi_0\right>$:

As said in the previous part, $\left|\Psi_0\right>$ is an initial state. They do not prepare $\left|\Psi_0\right>$ after the phase estimation procedure. The ordering of the sentences in the paper is not ideal. The phase estimation procedure they use in the paper differs slightly from the "classic" phase estimation algorithm represented in the quantum circuit linked in part 1, which is why they explain it in detail.

Their phase estimation algorithm is:

1. Prepare the $\left|\Psi_0\right>$ state in the register $C$.
2. Apply the conditional Hamiltonian evolution to the registers $C$ and $I$ (which are in the state $\left|\Psi_0\right>\otimes \left|b\right>$).
3. Apply the quantum Fourier transform to the resulting state.
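The three steps can be sketched as a small state-vector simulation in NumPy. This is only an illustration under assumptions of mine: the $2\times 2$ matrix `A`, the vector `b`, $T = 16$ and the value of `t0` are toy choices (picked so that the eigenvalues $1$ and $2$ fall exactly on clock values $4$ and $8$), the helper `evolve` is hypothetical, and the Fourier transform is taken with kernel $e^{-2\pi i k\tau/T}/\sqrt{T}$.

```python
import numpy as np

def evolve(A, t):
    """Return exp(i*A*t) for a Hermitian matrix A via its eigendecomposition."""
    lam, U = np.linalg.eigh(A)
    return (U * np.exp(1j * lam * t)) @ U.conj().T

# Toy inputs (assumptions chosen so the eigenvalues land exactly on clock bins).
A = np.array([[1.5, 0.5], [0.5, 1.5]])       # eigenvalues 1 and 2
b = np.array([1.0, 0.0])                     # |b>, an equal mix of both eigenvectors
T = 16                                       # T = 2**n with n = 4 clock qubits
t0 = 2 * np.pi * 4                           # lambda*t0/(2*pi) = 4 and 8

# Step 1: prepare |Psi_0> in the clock register C.
tau = np.arange(T)
psi0 = np.sqrt(2.0 / T) * np.sin(np.pi * (tau + 0.5) / T)

# Step 2: conditional Hamiltonian evolution on C and I.
# Row k of `state` holds psi0[k] * exp(i*A*k*t0/T) |b>.
state = np.array([psi0[k] * evolve(A, k * t0 / T) @ b for k in range(T)])

# Step 3: Fourier transform on the clock register (axis 0).
state = np.fft.fft(state, axis=0, norm="ortho")

# Probability of reading each clock value, summed over register I.
clock_probs = np.sum(np.abs(state) ** 2, axis=1)
peaks = sorted(int(k) for k in np.argsort(clock_probs)[-2:])
print(peaks)
```

With these toy values the clock distribution concentrates on the two bins that encode the eigenvalues, while the register $I$ stays entangled with the clock rather than factoring out as a separate $\left|b\right>$.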

Finally, the $C$ in $\left| \Psi_0 \right>^C$ means that the state $\left| \Psi_0 \right>$ is stored in the register $C$. This is a short and convenient notation to keep track of the registers used.

4. Hamiltonian simulation:

First of all, $\kappa$ is the condition number (see the Wikipedia page on "condition number") of the matrix $A$.
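For a Hermitian matrix, the condition number is the ratio between the largest and the smallest eigenvalue in absolute value. A minimal NumPy sketch, using a hypothetical $2\times 2$ example matrix of my own choosing:

```python
import numpy as np

A = np.array([[1.5, 0.5], [0.5, 1.5]])       # hypothetical Hermitian example, eigenvalues 1 and 2

# For a Hermitian matrix, kappa = max|lambda| / min|lambda|.
eigenvalues = np.linalg.eigvalsh(A)
kappa = np.max(np.abs(eigenvalues)) / np.min(np.abs(eigenvalues))

print(kappa)
print(np.linalg.cond(A))                     # NumPy's 2-norm condition number, same value here
```

A larger $\kappa$ means some eigenvalues are close to $0$ relative to the others, which is why $t_0$ has to grow with $\kappa$.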

$\sum_{\tau = 0}^{T-1}|\tau\rangle \langle \tau|^{C}\otimes e^{iA\tau t_0/T}$ is the mathematical representation of a quantum gate.

The first part in the sum, $|\tau\rangle \langle \tau|^{C}$, is a control part. It means that the operation will be controlled by the state of the first quantum register (the register $C$, as the superscript tells us).

The second part is the "Hamiltonian simulation" gate, i.e. a quantum gate that will apply the unitary matrix given by $e^{iA\tau t_0/T}$ to the second register (the register $I$ that is in the initial state $\left|b\right>$ ).

The whole sum is the mathematical representation of the controlled-U operation in the quantum circuit of "1. Definitions", with $U = e^{iA\tau t_0/T}$ .
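As a rough illustration, the operator can be written out as an explicit matrix for a tiny example and checked to be unitary. The matrix `A`, the values `T=4` and `t0=1.0`, and the helper names `evolve` and `conditional_evolution` are all assumptions made for this sketch; a real implementation would use a Hamiltonian simulation circuit instead of a dense matrix.

```python
import numpy as np

def evolve(A, t):
    """Return exp(i*A*t) for a Hermitian matrix A via its eigendecomposition."""
    lam, U = np.linalg.eigh(A)
    return (U * np.exp(1j * lam * t)) @ U.conj().T

def conditional_evolution(A, T, t0):
    """Matrix of sum_tau |tau><tau|^C (x) exp(i*A*tau*t0/T), clock register first."""
    dim = A.shape[0]
    gate = np.zeros((T * dim, T * dim), dtype=complex)
    for tau in range(T):
        projector = np.zeros((T, T))
        projector[tau, tau] = 1.0            # |tau><tau| on the clock register C
        gate += np.kron(projector, evolve(A, tau * t0 / T))
    return gate

A = np.array([[1.5, 0.5], [0.5, 1.5]])       # toy Hermitian matrix (assumption)
gate = conditional_evolution(A, T=4, t0=1.0)
print(gate.shape)
print(np.allclose(gate.conj().T @ gate, np.eye(8)))   # unitarity check
```

The result is block diagonal: when the clock register holds $|\tau\rangle$, the block $e^{iA\tau t_0/T}$ acts on the register $I$.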

— Nelimee

$\newcommand{\bra}[1]{\left\langle#1\right|}\newcommand{\ket}[1]{\left|#1\right\rangle}\newcommand{\proj}[1]{|#1\rangle\langle#1|}\newcommand{\half}{\frac12}$ In answer to your first question, I wrote myself some notes some time ago about my understanding of how it worked. The notation is probably a bit different (I've tried to bring it more into line, but it's easy to miss bits), but attempts to explain that choice of the state $|\Psi_0\rangle$ . There also seem to be some factors of $\frac12$ floating around in places.

When we first study phase estimation, we're usually thinking about it in respect to use in some particular algorithm, such as Shor's algorithm. This has a specific goal: getting the best $t$ -bit approximation to the eigenvalue. You either do, or you don't, and the description of phase estimation is specifically tuned to give as high a success probability as possible.

In HHL, we are trying to produce some state $\ket{\phi}=\sum_j\frac{\beta_j}{\lambda_j}\ket{\lambda_j},$ where $\ket{b}=\sum_j\beta_j\ket{\lambda_j}$, making use of phase estimation. The accuracy of the approximation of this will depend far more critically on an accurate estimation of the eigenvalues that are close to 0 rather than those that are far from 0. An obvious step, therefore, is to attempt to modify the phase estimation protocol so that rather than using 'bins' of fixed width $2\pi/T$ for approximating the phases of $e^{-iAt}$ ($T=2^t$ and $t$ is the number of qubits in the phase estimation register), we might rather specify a set of $\phi_y$ for $y\in\{0,1\}^t$ to act as the centre of each bin so that we can have vastly increased accuracy close to 0 phase. More generally, you might specify a trade-off function for how tolerant you might be of errors as a function of the phase $\phi$. The precise nature of this function can then be tuned to a given application, and the particular figure of merit which you will use to determine success. In the case of Shor's algorithm, our figure of merit was simply this binning protocol -- we were successful if the answer was in the correct bin, and unsuccessful outside it. This is not going to be the case in HHL, whose success is more reasonably captured by a continuous measure such as the fidelity. So, for the general case, we shall designate a cost function $C(\phi,\phi')$ which specifies a penalty for answers $\phi'$ if the true phase is $\phi$.

Recall that the standard phase estimation protocol worked by producing an input state that was the uniform superposition of all basis states $\ket{x}$ for $x\in\{0,1\}^t$ . This state was used to control the sequential application of multiple controlled- $U$ gates, which are followed up by an inverse Fourier transform. Imagine we could replace the input state with some other state

$\ket{\Psi_0}=\sum_{x\in\{0,1\}^t}\alpha_x\ket{x},$

and then the rest of the protocol could work as before. For now, we will ignore the question of how hard it is to produce the new state $\ket{\Psi_0}$, as we are just trying to convey the basic concept. Starting from this state, the use of the controlled-$U$ gates (targeting an eigenvector of $U$ with eigenvalue $e^{i\phi}$) produces the state

$\sum_{x\in\{0,1\}^t}\alpha_xe^{i\phi x}\ket{x}.$

Applying the inverse Fourier transform yields

$\frac{1}{\sqrt{T}}\sum_{x,y\in\{0,1\}^t}e^{ix\left(\phi-\frac{2\pi y}{T}\right)}\alpha_x\ket{y}.$

The probability of getting an answer $y$ (i.e. $\phi'=2\pi y/T$) is

$\frac{1}{T}\left|\sum_{x\in\{0,1\}^t}e^{ix\left(\phi-\frac{2\pi y}{T}\right)}\alpha_x\right|^2$

so the expected value of the cost function, assuming a random distribution of the $\phi$, is

$\bar C=\frac{1}{2\pi T}\int_0^{2\pi}d\phi\sum_{y\in\{0,1\}^t}\left|\sum_{x\in\{0,1\}^t}e^{ix\left(\phi-\frac{2\pi y}{T}\right)}\alpha_x\right|^2C(\phi,2\pi y/T),$

and our task is to select the amplitudes $\alpha_x$ that minimise this for any specific realisation of $C(\phi,\phi')$. If we make the simplifying assumption that $C(\phi,\phi')$ is only a function of $\phi-\phi'$, then we can make a change of variable in the integration to give

$\bar C=\frac{1}{2\pi}\int_0^{2\pi}d\phi\left|\sum_{x\in\{0,1\}^t}e^{ix\phi}\alpha_x\right|^2C(\phi).$

As we noted, the most useful measure is likely to be a fidelity measure. Consider we have a state $\ket{+}$ and we wish to implement the unitary $U_\phi=\proj{0}+e^{i\phi}\proj{1}$, but instead we implement $U_{\phi'}=\proj{0}+e^{i\phi'}\proj{1}$. The fidelity measures how well this achieves the desired task,

$F=\left|\bra{+}U_{\phi'}^\dagger U_\phi\ket{+}\right|^2=\cos^2\left(\frac{\phi-\phi'}{2}\right),$

so we take

$C(\phi-\phi')=\sin^2\left(\frac{\phi-\phi'}{2}\right),$

since in the ideal case $F=1$, so the error, which is what we want to minimise, can be taken as $1-F$. This will certainly be the correct function for evaluating any $U^t$, but for the more general task of modifying the amplitudes, not just the phases, the effects of inaccuracies propagate through the protocol in a less trivial manner, so it is difficult to prove optimality, although the function $C(\phi-\phi')$ will already provide some improvement over the uniform superposition of states. Proceeding with this form, we have

$\bar C=\frac{1}{2\pi}\int_0^{2\pi}d\phi\left|\sum_{x\in\{0,1\}^t}e^{ix\phi}\alpha_x\right|^2\sin^2\left(\half\phi\right).$

Using $\sin^2\left(\half\phi\right)=\half-\frac14\left(e^{i\phi}+e^{-i\phi}\right)$, the integral over $\phi$ can now be performed, so we want to minimise the function

$\half\sum_{x,y=0}^{T-1}\alpha_x\alpha_y^\star(\delta_{x,y}-\half\delta_{x,y-1}-\half\delta_{x,y+1}).$

This can be succinctly expressed as $\min\bra{\Psi_0}H\ket{\Psi_0}$ where

$H=\half\sum_{x,y=0}^{T-1}(\delta_{x,y}-\half\delta_{x,y-1}-\half\delta_{x,y+1})\ket{x}\bra{y}.$

The optimal choice of $\ket{\Psi_0}$ is the minimum eigenvector of the matrix $H$,

$\alpha_x=\sqrt{\frac{2}{T+1}}\sin\left(\frac{(x+1)\pi}{T+1}\right),$

and $\bar C$ is the minimum eigenvalue

$\bar C=\half-\half\cos\left(\frac{\pi}{T+1}\right).$

Crucially, for large $T$, $\bar C$ scales as $1/T^2$ rather than the $1/T$ that we would have got from the uniform coupling choice $\alpha_x=1/\sqrt{T}$. This yields a significant benefit for the error analysis.
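The claims about the minimum eigenvector and the scaling of $\bar C$ can be checked numerically. The following is a minimal NumPy sketch; the function names `cost_matrix` and `sine_amplitudes` and the sampled values of $T$ are my own illustrative choices.

```python
import numpy as np

def cost_matrix(T):
    """H = (1/2) * (identity - (1/2) * nearest-neighbour couplings), size T x T."""
    off_diagonal = np.eye(T, k=1) + np.eye(T, k=-1)
    return 0.5 * (np.eye(T) - 0.5 * off_diagonal)

def sine_amplitudes(T):
    """alpha_x = sqrt(2/(T+1)) * sin((x+1)*pi/(T+1)) for x = 0, ..., T-1."""
    x = np.arange(T)
    return np.sqrt(2.0 / (T + 1)) * np.sin((x + 1) * np.pi / (T + 1))

for T in (8, 64, 512):
    H = cost_matrix(T)
    eigenvalues, eigenvectors = np.linalg.eigh(H)
    ground = eigenvectors[:, 0]                      # eigh sorts eigenvalues ascending
    overlap = abs(ground @ sine_amplitudes(T))       # 1 if equal up to a global sign
    uniform = np.full(T, 1 / np.sqrt(T))
    # Columns: T, overlap, optimal cost (~1/T^2), uniform cost (= 1/(2T)).
    print(T, overlap, eigenvalues[0], uniform @ H @ uniform)
```

Each time $T$ grows by a factor of $8$, the optimal cost should drop by roughly a factor of $64$, while the uniform cost drops only by a factor of $8$.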

If you want to get the same $|\Psi_0\rangle$ as reported in the HHL paper, I believe you have to add the terms $-\frac14\left(\ket{0}\bra{T-1}+\ket{T-1}\bra{0}\right)$ to the Hamiltonian. I have no justification for doing so, however, but this is probably my failing.

— DaftWullie