829
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Solving multiple windowed STFT phase retrieval problems in phase and amplitude respectively

&
Pages 688-707 | Received 20 Apr 2022, Accepted 15 Aug 2022, Published online: 26 Sep 2022

ABSTRACT

We study the Phase Retrieval (PR) problem under the phaseless short-time Fourier transform (STFT) measurements. This paper proposes a novel algorithm named PAR to solve the STFT PR problem in phase and amplitude respectively with a milder retrieval condition compared with the original methods. First, a symmetric undirected graph of signals is proposed for the computation of the relative phase. Then the retrieval conditions of STFT PR problem are discussed for a single window case and some weaker retrieval conditions are proposed compared with the LS method. We also discuss STFT PR problem in multiple windows and establish retrieval theorems without restrictions of sliding step-size L. We give some numerical results of the PAR algorithm.

MSC:

1. Introduction and problem equation

Phase Retrieval(PR) is a classical problem which considers recovering a signal from the magnitude of the measurements. It can be applied in X-ray crystallography [Citation1,Citation2], diffraction imaging system [Citation3], phase measurement in astronomy [Citation4,Citation5] and optics [Citation6,Citation7]. When the phase retrieval measuring vectors are chosen as Short Time Fourier Transform(STFT) measurements [Citation6], it is an STFT PR problem. This problem serves as the model for ptychography, in which one or multiple moving probes are used to sense multiple diffraction measurements [Citation8,Citation9] and ultra-short laser pulse measurement techniques [Citation10,Citation11]. Different number of probes and laser pulses corresponds single or multiple STFT PR problem [Citation12,Citation13].

This paper considers the STFT phase retrieval, which recovering a signal from its STFT magnitude. Set x=(x(0),x(1),,x(N1))TCN. The STFT can be interpreted as the Fourier transform of an N-dimensional signal xCN multiplied by series of sliding windows grRN with N-periodic extension: (1) Xr(m,k)=n=0N1x(n)gr(Lmn)ei2πkn/N,(1) with 0mN/L1, 0kN1 and 1rR. m represents the sliding order of the window gr and L represents the sliding step-size. STFT PR considers the problem of recovering x from |Xr(m,k)|. When R = 1, the problem is a single STFT phase retrieval problem, while R>1 is a multiple STFT phase retrieval problem [Citation14,Citation15].

To solve STFT PR, not only the algorithm but also retrieval conditions need to be considered. It means the conditions under which the STFT magnitude uniquely identifies signals (up to a global phase) [Citation16] under the algorithm. Researchers find that a suitable restriction of measurements including windows' number, type, length and step size enforce retrieval uniqueness.

For single STFT PR problem, different algorithms have various restrictions of windows. Denote the length nonzero window g by W. When the measurement is with adjacent windows (L = 1), SDP approach [Citation17] has been extended to STFT PR [Citation18] and has uniqueness guarantees with computational complexity O(N3). When the sliding step-size L = 1 and WN+12, a Least Squares (LS) approach [Citation19] obtains the stable solution of STFT PR. In addition, a robust method proposed in [Citation20] uses phase polarization to solve STFT PR under a sliding window with length W = N, which is too restrictive in many applications.

As for multiple PR,  [Citation14] discusses some sufficient and necessary conditions of multi-windows STFT PR and an algorithm based on the graph was proposed to recover the signal. But this retrieval condition is quite special (WN2) and can only be satisfied in strict conditions. Greedy Angular Synchronization [Citation21–23] is proposed to compute relative phase by autocorrelation. While it assumes that all the autocorrelation. x(n1)x(n2)¯ can be obtained for all |n1n2|<δ where δ is a given length, it may not easy to satisfy this condition in STFT PR problem.

No matter for single or multiple windows STFT PR, those algorithms have special retrieval restrictions of measurements, which restrict the applications. A better algorithm should consider the retrieval conditions and be suitable for broader applications. In this paper, an algorithm named PAR, which compute phase and amplitude respectively (PAR), is proposed with a milder retrieval restrictions of measurements for both single and multiple STFT PR. First, a graph is established based on STFT measurements and the retrieval conditions of PAR are transformed to the graph connectivity. Based on the analysis of the graph connectivity, some milder retrieval conditions are proposed for different types of windows, which has a broader field of application. Error estimation and experiments are also given to verify the effectiveness of the PAR algorithm.

Throughout the paper, we use the following notation. Bold-face small and capital letters denote vectors and matrices respectively. gcd(a,b,c) denotes the greatest common factor between a, b and c. F denotes 1D discrete Fourier transform (DFT) and F1 denotes 1D discrete inverse Fourier transform (IDFT). FxCN is defined as (2) Fx(k)=n=0N1x(n)ei2πkn/N,0kN1,(2) and F1xCN is defined as (3) F1x(n)=k=0N11Nx(k)ei2πkn/N,0nN1.(3) The paper is organized as follows. In Section 1, we introduce the STFT PR problem and related works. Section 2 discusses the PR problem in the single-window and multi-window respectively. Section 3 gives the reconstruction algorithm and error estimation. Section 4 compares the numerical results of PAR and LS method. Section 5 is the summary of this paper.

2. Analysis of phase retrieval conditions

Define |Xr(m)|2=(|Xr(m,0)|2,,|Xr(m,N1)|2). Instead of recovering x from |Xr(m)|2, this paper considers the acquired data Yr by taking IDFT of |Xr(m)|2 for 0mN/L1, which is defined as (4) Yr(m,l)=1Nk=0N1|Xr(m,k)|2ei2πkl/N=n=0N1x(n)x(n+l)¯gr(mLn)gr(mLnl)¯,(4) with l=0,1,,N1, 0mN/L1 [Citation19]. Therefore, recovering x from Xr is equivalent to recover it from Yr. To recover the signal x uniquely, the injectivity property of xY should be guaranteed. Actually, there are classes of signals that have the same measurements after injection. For arbitrary θC, x and xeiθ produce the same intensity measurements. Thus we consider x and xeiθ as in the same class [x]CM/, where ∼ is the equivalence relation of being identical up to a global phase factor. Then the distance between two vectors in CN is defined as (5) d(z,x)=d([z],[x])=minϕ[0,2π)zxeiϕ2.(5) When d(z,x)=0, x and z are equivalent up to a global phase shift.

2.1. Single adjacent window case (R = 1, L = 1)

To find the relation between success of recovery and windows' parameters, we first discuss the STFT PR problem with a single window g. Denote x~(n,l)=x(nmodN)x((n+l)modN)¯ and g~(n,l)=g(nmodN)g((nl)modN)¯. Then Equation (Equation4) can be represented by (6) Y(m,l)=n=0N1x~(n,l)g~(mLn,l).(6) Define y(l)={Y(m,l)}m=0N/L1, x(l)={x~(n,l)}n=0N1, and matrix G(l)RN/L×N, where the (m,n)th entry of G(l) is equal to g~(mLn,l). Then Equation (Equation6) can be written as (7) y(l)=G(l)x(l).(7) Due to L = 1, G(l) is a circulant square matrix which can be diagonalized by G(l)=FΣ(l)F, where Σ(l) is a diagonal matrix whose entries are given by the DFT of the first column of G(l). F is the DFT matrix and F represents the conjugate transpose matrix of F [Citation24]. If the matrices G(l) are invertible for several 0lN1, then we can compute x(l) by (8) x(l)=(G(l))1y(l).(8) To analyse the invertibility of G(l), the definition of ‘non-vanishing’ is given as follows:

Definition 2.1

xCN is non-vanishing means x(k)0 for all k=0,1,,N1.

Denote g~(l)=(g~(0,l),g~(1,l),,g~(N1,l)). G(l) is invertible if and only if Fg~(l) is non-vanishing since G(l)=FΣlF. Based on this, LS algorithm proposed in [Citation19] can be used to solve the PR problem when Fg~(l) is non-vanishing for l=0,1,,N1. Nevertheless, Theorem 3.3 in [Citation19] shows that if Fg~(l) is non-vanishing only for l = 0, 1, signal can still be recovered. It inspires us to consider recovering the signal under milder and wider conditions.

Suppose signal x is non-vanishing. In fact, for any n1 and n2, since x and xeiθ are in the same class of [x]CM/, we do not need to compute all the phase, but the relative phase [Citation6]: (9) ρn1n2=(x(n1)/|x(n1)|)(x(n2)/|x(n2)|)=x(n1)x(n2)¯|x(n1)x(n2)¯|.(9) To compute the signal phase, we define an undirected graph for STFT phase retrieval: (10) G(x,g,L):=(V(x),E(L)),(10) with a set of nodes: V(x):={0nN1|x(n)0} and a set of edges: E(L):={(n,n)V(x)×V(x)|ρnn can be computed directly by (8) and (9)}. Relative phase information can be transferred through the relationship between the phase of n1,n2,n3: (11) ρn1n3=ρn1n2ρn2n3.(11) If we assign an arbitrary nonzero vertex x(n0) to have nonzero phase ρ0=x(n0)/|x(n0)|, then all the phase can be computed due to phase propagation equation (Equation11) if graph G is connected. In this paper, we can use the same strategy as [Citation23] to choose the initial phase ρ0. Therefore, the graph connectivity decides the retrieval uniqueness of phase.

Lemma 2.1

If there exists non-vanishing Fg~(l0), then ρn,n+l0 can be computed by the phase of x(l) for any nV(x).

Proof.

According to Equations (Equation8) and (Equation9), ρn,n+l0 can be computed when G(l0) is invertible, which is equivalent to Fg~(l0) is non-vanishing.

Since the STFT of signal is periodic, the connectivity of graph G is related to l and N. We have the following results:

Lemma 2.2

Given a graph G with N vertices, suppose there exists k such that any two vertices n1 and n2 are connected if and only if (n1n2)modNk, then graph G is connected if and only if gcd(k,N)=1.

Proof.

Graph connectivity can be judged by the adjacency matrix A{0,1}N×N. If all the elements in S=I+A+A2++AN are non-zero, graph is connected. The adjacency matrix of G is in this form (12) A=12k+1N00 10000 01000110 001 000 100,(12) (13) Al+1=12k+1+lgcd(k,N)N00 10000 01000110 001 000 100.(13) It shows that diag(A,k)=1RN and diag(Al,k+lgcd(k,N))=1RN for 0lN. Since S=I+A+A2++AN, then all elements in S are non-zero is equivalent to that diag(Al,k+lgcd(k,N)) for 0lN covers all the position in matrix A, which means that gcd(k,N)=1.

A generalized result of Lemma 2.2 is given by Lemma 2.3.

Lemma 2.3

Given a graph G with N vertices, suppose any two vertices n1 and n2 are connected if (n1n2)modN=ki for i=1,2,,m. G is connected if and only if gcd(k1,k2,,km,N)=1.

Proof.

The lemma can be inferred by S defined in Lemma 2.2. Denote the adjacency matrix is A, then diag(A,ki)=1RN and diag(Al,ki+lgcd(ki,N))=1RN for 0lN and 1im. Only when gcd(k1,k2,,km,N)=1, all elements in S are non-zero.

Lemmas 2.2 and 2.3 give the relation between k and N. Combining with Lemma 2.1, we get two theorems as follows:

Theorem 2.4

Assume there exists l0, such that gcd(l0,N)=1 and Fg~(l0) is non-vanishing, then all phases can be recovered.

Theorem 2.5

Assume there exists l1,l2,,lm such that gcd(l1,l2,,lm,N)=1, and Fg~(li) is non-vanishing for i=1,2,,m, then the phase can be recovered.

Proof.

Since Fg~(l0) is non-vanishing, x(l0)=(G(l0))1y(l0) can be computed, which means the relative phase between x(n) and x((nl0)modN) can be obtained. According to Lemma 2.2 and 2.3, all phases can be computed.

Theorem 2.4 and 2.5 give two phase recovery conditions. Then we turn to consider how to compute the amplitude. As is shown in Theorem 3.1 in [Citation19] and Algorithm 1 in [Citation23], the amplitude can be computed when Fg~(0) is non-vanishing. However, if Fg~(0) is vanishing, could signal x still be recovered? Here we make an extension of proposition III.4 in [Citation19].

Theorem 2.6

If there exists l0,l1 such that l1l0=M where M=gcd(l1,l0), and Fg~(l0), Fg~(l1) are both non-vanishing, then the amplitude of the signal x can be recovered by (14) |x(l0)|2=k=0l0/M|x(Mk)x(Mk+l0)¯|k=0l0/M1|x(Mk)x(Mk+l1)¯|.(14)

Proof.

Without loss of generality, since l1=l0+M, then (15) k=0l0/M|x(Mk)x(Mk+l0)¯|k=0l0/M1|x(Mk)x(Mk+l1)¯|=|x(0)x(l0)¯x(M)x(M+l0)¯x(l0)x(2l0)¯|x(0)x(l1)¯x(M)x(M+l1)¯x(l0M)x(l0M+l1)¯|=|x(0)x(l0)¯x(M)x(M+l0)¯x(l0)x(2l0)¯||x(0)x(M+l0)¯x(M)x(M+l1)¯x(l0M)x(2l0)¯|=|x(l0)x(l0)¯|=|x(l0)|2.(15) We could compute the amplitude of x(l0) and other vertices amplitude can be computed by the same equation.

Remark 2.1

Since Equation (2.6) involves division, the signal should be non-vanishing and not too small. Here is an example. If Fg~(1) and Fg~(0) are both non-vanishing, then (16) |x(1)|2=|x(0)x(1)¯||x(1)x(2)¯||x(0)x(2)¯|.(16) Other vertices can also be computed by this equation.

Corollary 2.7

If there exists l0 such that Fg~(l0) and Fg~(l0+1) are non-vanishing, then the signal can be recovered.

Proof.

Since gcd(l0,l0+1)=1 for any integer l0, then all phases can be computed according to Theorem 2.4. And it also ensures the amplitude can be computed by Theorem 2.6. Therefore, the signal can be recovered.

Remark 2.2

Corollary 2.7 gives a sufficient condition for the recovery of x. Theorem III.4 proposed in  [Citation19] is a special case of Corollary 2.7.

Now we can prove that when W<N/2, our method can also recover the phase, which expands the scope of W compared with LS model.

Corollary 2.8

If gcd(W,N)=1 and W<N/2, then the signal phase can be recovered.

Proof.

Consider the matrix GWRN×N where the (m,n)th entry of GW is given by g~(mn,W). Since W<N/2 and gcd(N,W)=1, there must exist unique n0 such that g~(n0,W)0. Therefore, we have (17) Y(m,W)=n=0N1x~(n,W)g~(mn,W)=x~(mn0,W)g~(n0,W),m=0,1,,N1.(17) Since g~(n0,W)0, (18) x~(mn0,W)={g~(n0,W)}1Y(m,W).(18) Then ρmn0,mn0+W can be computed as ρmn0,mn0+W=x~(mn0,W)|x~(mn0,W)|={g~(n0,W)|g~(n0,W)|}1×Y(m,W)|Y(m,W)|,m=0,1,,N1. Since m is arbitrary, for any n1 and n2, ρn1n2 can be computed if n1n2=WmodN. Therefore, there exists k = W so that ρn,n+k can be computed for 0nN1. Together with gcd(W,N)=1, the signal phase can be recovered according to Lemma 2.2 where k = W.

By now, the discussion is based on L = 1. We now extend the results to L>1. In fact, since G(l) defined in (Equation7) is not invertible, ρn,n+l cannot be computed by (Equation8) for 0nN1. The problem is in that condition the graph G defined in (Equation10) cannot be connected if there is only one l satisfy conditions of Theorem 2.4 when L>1. Therefore, we discuss multiple windows STFT to solve the conditions that L1.

2.2. Multiple discrete windows case (R>1,L1)

Multiple windows STFT PR provides more measurements compared with single window STFT PR. Therefore, the graph connectivity of G may be easier to satisfy.

Now we start to figure out which condition should be satisfied to ensure graph G is connected in STFT PR problem. Choose a window gr such that Wr<N/2. When l=Wr, there exists unique nr such that g~r(nr,Wr)0, then Yr(m,Wr)=n=0N1x~(n,Wr)g~r(mLn,Wr)=x~(mLn0,W)g~r(nr,Wr). Therefore, (19) ρmLnr,mLnr+Wr=x~(mLnr,Wr)|x~(mLnr,Wr)|={g~r(nr,Wr)|g~r(nr,Wr)|}1×Yr(m,Wr)|Yr(m,Wr)|,m=0,1,2,,N/L1.(19) Note that only when n=mLnr with 0mN/L1, ρn,n+Wr can be computed by Equation (Equation19). Therefore, single window STFT measurements cannot satisfy the connection of graph G(V(x),E(L)) when L>1. If we want to compute all the relative phase in the V(x), multiple windows {gr}r=1R are needed to compute the relative phase. As an instance shown in Figure , when N = 6, L = 2, W1=1, W2=2, the graph G is connected.

Figure 1. An example of connected graph G(V(x),E(L)) with two sliding windows: W1=1, W2=2.

Figure 1. An example of connected graph G(V(x),E(L)) with two sliding windows: W1=1, W2=2.

In this example, all the relative phase can be computed through (Equation19). It shows that when L>1, the relative phase still can be obtained. Given two vertices n1 and n2, we want to figure out in which condition ρn1,n2 can be computed. Based on Theorem 2.4, we get some conclusions in multiple windows. Based on Equation (Equation7) and the solution identification theorem in linear equations, we get

Lemma 2.9

Denote the G(l) of gr in Equation (Equation7) as Gr(l), r=1,2,R. Let G(l)=[G1(l)TG2(l)TGR(l)T]T. Then x(l0) can be computed by y(l0)=Gl0x(l0) if rank(G(l0))=N.

Based on Lemma 2.9, Theorem 2.10 gives an extension of Theorem 3.1 in [Citation19] and Lemma 2.1.

Theorem 2.10

Let {gr}r=1R be a family of sliding windows with the same sliding step-size L, where L be a separation parameter with N/LZ. Define Bm(l)RR×L, where Bm(l)(r,j)=Fg~r(l)(m+jN/L). ρn,n+l can be computed if Bm(l) has rank L for all 0mN/L1.

Proof.

Since Yr(m,l)=1Nk=0N1|Xr(m,k)|2ei2πkl/N with 0mN/L1 and 1rR. Denote x~(l)=(x~(0,l),x~(1,l),,x~(N1,l)), according to Fourier convolution property, we have Yr(m,l)=n=0N1x~(n,l)g~r(mLn,l)=F1{Fx~(l)Fg~r(l)}(mL)=1Nk=0N1Fx~(l)(k)Fg~r(l)(k)e2πimkL/N. Denote Yr(l)=(Yr(0,l),Yr(1,l),,Yr(N/L1,l)). Since Bm(l)=(Fg~r(l)(m+jN/L))1rR,0jL1 and Fg~r(l)(k)=1Nk=0N1g~r(l)(k)ei2πkl/N. Then according to Fourier transform property, for 0mN/L1 and 1rR, (20) FYr(l)(m)=m=0N/L1Yr(m,l)e2πimmL/N=m=0N/L1(1Nk=0N1Fx~(l)(k)Fg~r(l)(k)e2πimkL/N)e2πimmL/N=1Lk=0N1m=0N/L1LNFx~(l)(k)Fg~r(l)(k)e2πim(km)L/N=1Lj=0L1Fx~(l)(m+jN/L)Fg~r(l)(m+jN/L).(20) Based on the equation of Lemma III.17 in [Citation14], we get Fx~(l)(m+jN/L)=Lj=0L1m=0N/L1bm(j,j)e2πimmL/N(r=1RFg~r(l)(m+jN/L)¯Yr(m,l)) where (bm(j,j))0j,jL1=(Bm(l)HBm(l))1 for all 0 mN/L1 and 0jL1.

Thus together with x=F1Fx, (21) x~(n,l)=F1{Fx~(l)}(n)=LNj,j=0L1m,m=0N/L1e2πi(m(mLn)/Njn/L)bm(j,j)(r=1RFg~r(l)(m+jN/L)¯Yr(m,l))(21) Then ρn,n+l can be computed by the phase of x~(n,l). Since (Bm(l)HBm(l)) is invertible for all m, l, which means rank(Bm(l))=L, then we finished the proof.

Remark 2.3

Now consider some special cases of the condition given by Theorem  2.10.

  1. When L = 1, rank(Bm(l))=L is equivalent to rank(r=1R|Fg~r(m,l)|2)=1 for all 0mN1, which means there exists r0, so that Fg~r0(n,l)0, which is consistent with Theorem 3.1 in [Citation19]. If R = 1, which means in single window case, Fg~(n,l) is non-vanishing.

  2. When L = N, rank(Bm(l))=L is equivalent to rank({g~r(n,l)}1rR,0nN1)=N, which is consistent with Corollary 2.9.

Based on Theorem 2.10, we can get theorems as follows:

Theorem 2.11

Suppose there exists l1,l2,,lt so that (l1,l2,,lt,N)=1, and Bm(li) has rank L for i=1,2,,t, then the signal phase can be recovered.

Theorem 2.12

If there exists l0,l1 such that l1l0=(l1,l0), Bm(l0) and Bm(l1) have rank L, then the amplitude of the signal can be recovered.

Proof.

According to Theorem 2.10, if Bm(l) has rank L, ρn,n+l can be computed by Equation (Equation21). Thus Theorem 2.11 and 2.12 can be inferred based on Theorems 2.5 and 2.6.

3. Reconstruction and error estimation

Theorems 2.11 and 2.12 extend retrieval conditions from a single window g to multiple windows {gr}r=1R, and the sliding size L = 1 to L>1. Based on these analyses, we propose the following reconstruction algorithm (PAR).

Here we discuss the computation complexity of PAR algorithm. To finish calculating Yr(n,l) and F{g~r(n,l)}, we can use fast Fourier transforms with O(NlogN). Computing the rank of Bm(li) needs O(LRN) operations, while computing the highest common factor of no more than N number needs O(NlogN) by division algorithm in Theorem 2.11. Assume the number of l meeting conditions in Theorem 2.11 is k, then judging the condition of Theorem 2.12 needs at most O(kN). According to Equations (Equation21) and (Equation14), computing relative phase and amplitude needs O(N) respectively. Thus the total runtime complexity of PAR algorithm is O(NlogN) in general.

Based on our phase retrieval procedure, we now present the following guarantee of stable performance. Suppose the measurements are corrupted by random noise ϵ(m,k) with level |ϵ| according to Equation (Equation4), then the problem become recovering the signal x from |Xr(m,k)|2+ϵr(m,k). For a window family {gr}r=1R with period N extension, we define g=[g1,g2,,gR], B(l)=[B0(l),B1(l),,BN/L1(l)], g2=(Σr=1RΣn=0N1|gr(n)|2)1/2,B(l)1=Σm=0N/L1Σj,j=0L1|bm,l(j,j)|.

Theorem 3.1

Let C=minnV(x)|x(n)|2, A=maxnV(x)|x(n)|2. Denote B=maxlLB(l)1, T=(2l0M+1){AC}(l0M+1), where L, l0 and M are defined in Equation (Equation14) and Algorithm 1. Let xϵ=(xϵ(0),,xϵ(N1))T be the approximation obtained by Algorithm 1 from noisy measurements {|Xr|2+ϵr}r=1R, |ϵ|=maxr,m,k|ϵr(m,k)|, then there exists βR such that for all n=0,1,,N1, (22) |xϵ(n)|xϵ(n)|e2πiβx(n)|x(n)||2NLBg22|ϵ|C,(22) and (23) ||xϵ(n)|2|x(n)|2|TLBg22|ϵ|.(23)

Proof.

For any l0,1,,N1, r=1R|Fg~r(l)|r=1Rk=0N1|g~r(k,l)|=r=1Rk=0N1|gr(k)(gr(kl))¯|g22. The last inequality comes from Cauchy–Schwarz inequality. Denote Yr,ϵ(m,l) is the IDFT of {|Xr|2+ϵr}r=1R in (Equation4). Combine with Equation (Equation21), then we get (24) |xϵ(n)xϵ(n+l)¯x~(n,l)|=LNj,j=0L1m,m=0N/L1e2πi(m(mLn)/Njn/L)bm(j,j)(r=1RFg~r(l)(m+jN/L)¯(Yr,ϵ(m,l)Yr(m,l)))LNL2{NL}2B(l)1g221N|ϵ|=LB(l)1g22|ϵ|LBg22|ϵ|.(24) According to the error transfer formula: If N¯=f(x1¯,x2¯,x3¯), then error estimation (25) ΔN=|fx1|Δx1+|fx2|Δx2+|fx3|Δx3,(25) where ΔN,Δx1,Δx2,Δx3 represent the error bound of each variable. (26) ||xϵ(n)|2|x(n)|2||k=0l0/M|xϵ(Mk)xϵ(Mk+l0)¯|k=0l0/M1|xϵ(Mk)xϵ(Mk+l1)¯|k=0l0/M|x(Mk)x(Mk+l0)¯|k=0l0/M1|x(Mk)x(Mk+l1)¯||(2l0M+1){AC}(l0M+1)LBg22|ϵ|=TLBg22|ϵ|.(26) Then according the triangle inequality, we get that (27) |xϵ(n)xϵ(n+l)¯|xϵ(n)xϵ(n+l)¯|x~(n,l)|x~(n,l)|||xϵ(n)xϵ(n+l)¯x~(n,l)|+||xϵ(n)xϵ(n+l)¯||x~(n,l)|||x~(n,l)|2LBg22|ϵ|C.(27) Since the relative phase propagation in graph G, we have the estimation equation (Equation22).

Let p,pϵCN be the vectors of phases of the entries of xϵ and x. ° represents Hadamard product. The reconstruction error (28) d(x,xϵ)=minϕeiϕxxϵ2x|p|x|pϵ2+|x|pϵ|xϵ|pϵ2=n=1N|x(n)|2|xϵ(n)|xϵ(n)|x(n)|x(n)||2+n=1Nx(n)||xϵ(n)2x22NLBg22|ϵ|C+NTLBg22|ϵ|.(28) Theorem 3.1 ensures that when error ε is in low level and the minimal of x(n) is not small, the error between retrieval phase and exact phase can be controlled. In addition, the norm of g and B influences the upper bound of error. Therefore, the number of windows does not need to be too large but should meet the retrieval conditions. It means that if we choose some suitable windows, the signal can be recovered well by PAR.

4. Numerical results

We now show the results of PAR algorithm in different types of signals and sliding windows. Here we choose three different signals, Gaussian complex signal, chirp signal and real temperature signal. A Gauss complex signal xCN with N = 101, which is defined as (29) x(n)N(μ,1)+iN(0,1),n=0,,N1.(29) Chirp signal is a typical non-stationary signal in communication, sonar, radar. Figure  shows a chirp signal with length N = 101 and average value is 0. Earth abnormal temperature signal is a real signal describing the temperature variation in Figure .

Figure 2. Chirp signal.

Figure 2. Chirp signal.

Figure 3. Earth abnormal temperature.

Figure 3. Earth abnormal temperature.

In addition, we choose some different typical sliding windows in STFT including rectangle windows, triangle windows and hamming windows. These windows have different main lobes and side lobes, which are suitable for different situations. All the sliding windows satisfy the retrieval condition in PAR method. These simulations display the relation between relative error and sliding window as well as signal-noise ratio (SNR), where the relative error between estimation xϵ and real signal x is defined as Re(xϵ,x)=d(xϵ,x)x2. We reproduce our procedure in https://github.com/zhouxianchen/PAR-algorithm with python.

4.1. Relative error and length of window

Since LS method in [Citation19] requires WN+12, we first discuss the relationship between relative error and sliding window with two signals. Here we choose parameter μ=10 and SNR=60 in (Equation29). Figure  and Figure  show PAR method has less than 0.025 relative error without considering the length of window W in three types of windows. And two figures show that PAR method has no restriction with length of window and can be used in wider range compared with the LS method. Relative error in chirp signal and temperature signal is relatively smaller than that in Gauss complex signal, which might be because that it is no relative phase error in the reconstruction procedure with real signal (Figure ).

Figure 4. Relative error in recovering Gaussian complex signal by different windows using PAR.

Figure 4. Relative error in recovering Gaussian complex signal by different windows using PAR.

Figure 5. Relative error in recovering chirp signal by different windows using PAR.

Figure 5. Relative error in recovering chirp signal by different windows using PAR.

Figure 6. Relative error in recovering temperature signal by different windows using PAR.

Figure 6. Relative error in recovering temperature signal by different windows using PAR.

4.2. Relative error and SNR

Here we discuss PAR performance in different SNR. We choose the length of window W = 7 and μ=10 in (Equation29). Figure  shows that PAR can retrieve signal when SNR40. When SNR40, different windows have different performance in the same level SNR. The triangle window has better performance compared with rectangle and hamming windows. We infer it is due to that B and g2 in (Equation22) are influenced by the type of windows, which lead to different error estimation (Figures and ).

Figure 7. Relative error in recovering noisy Gaussian signal using PAR (W = 7).

Figure 7. Relative error in recovering noisy Gaussian signal using PAR (W = 7).

Figure 8. Relative error in recovering noisy chirp signal using PAR (W = 7).

Figure 8. Relative error in recovering noisy chirp signal using PAR (W = 7).

Figure 9. Relative error in recovering temperature signal using PAR (W = 7).

Figure 9. Relative error in recovering temperature signal using PAR (W = 7).

4.3. Relative error and μ

To verify our error estimation, we discuss the relationship between relative error and μ in (Equation29) using a Gauss complex signal. Here we choose a Hamming window STFT. Figure  shows that when SNR40, μ20 can retrieve signal correctly. And when minnV(x)|x(n)|2 is larger, PAR method has robuster retrieval performance. It verifies our error estimation formula (Equation22) and inspires us when we deal with some data in reality, we can use a signal translation to enlarge that minnV(x)|x(n)|2 (Figure ).

Figure 10. Relative error in recovering noisy Gaussian signal using PAR in different μ.

Figure 10. Relative error in recovering noisy Gaussian signal using PAR in different μ.

Figure 11. Relative error in recovering Gauss complex signal with Hamming window using PAR and LS method (SNR=100).

Figure 11. Relative error in recovering Gauss complex signal with Hamming window using PAR and LS method (SNR=100).

4.4. Relative error between LS and PAR algorithm

In this section, we display the performance of PAR algorithm and the classical LS [Citation19] algorithm under different length of window. PAR method has good performance when the length of W is small compared with the LS method. Therefore, PAR method has less restriction of the window length than the LS method.

As is shown in the experiments, no matter for Gaussian complex, chirp or real temperature signal, our method obtains better performance than the LS method. However, note that the retrieval conditions of PAR based on the assumption that the signal is non-vanishing and graph connectivity based on windows. Some signals and windows do not satisfy these assumptions, and our method is not suitable for those situations.

5. Conclusion and further work

Previous research proposed STFT PR methods in relatively strict retrieval conditions. This paper proposes a PAR reconstruction algorithm with a milder retrieval conditions, which solved the PR by computing phase and amplitude respectively. Compared with the previous method, PAR reconstruction Algorithm 1 has a milder constraint of windows and can be used to solve the STFT phase retrieval problems both in single window and multiple windows cases, when prior knowledge is known including that the noise is low and the minimal magnitude of nonzero components of original signal is enough large. Numerical results show that when noise is in low level, PAR has better performance and a translation shift of initial signal can lead to a robuster result. In addition, since the computation cost is O(NlogN) when l is selected in Algorithm 1, it can also be used in initialization of other non-convex method.

Disclosure statement

There are no relevant financial or non-financial competing interests to report. And no potential competing interest was reported by the authors.

Additional information

Funding

This work was supported in part by The National Key Basic Research Program (Grant No. 2020YFA0713504) and National Natural Science foundation (China): 61977065,11971489.

References

  • Harrison RW. Phase problem in crystallography. J Opt Soc Amer A. 1993;10(5):1046–1055.
  • Miao J, Ishikawa T, Shen Q, et al. Extending x-ray crystallography to allow the imaging of noncrystalline materials, cells, and single protein complexes. Annu Rev Phys Chem. 2008;59(1):387–410.
  • Bunk O, Diaz A, Pfeiffer F, et al. Diffractive imaging for periodic samples: retrieving one-dimensional concentration profiles across microfluidic channels. Acta Crystallogr. 2007;63(4):306–314.
  • Fienup C, Dainty J. Phase retrieval and image reconstruction for astronomy. Image Recovery: Theor Appl. 1987;231:275.
  • Stepanova IE, Gudkova TV, Salnikov AM, et al. A new approach to analytical modeling of Mars's magnetic field. Appl Math Sci Eng. 2022;30(1):41–60.
  • Walther A. The question of phase retrieval in optics. Optica Acta Int J Opt. 1963;10(1):41–49.
  • Geng Y, Wen X, Tan J, et al. Noise-robust phase retrieval by optics path modulation with adaptive feedback. Opt Commun. 2022;515:128199.
  • Fienup JR, Guizarsicairos M. Phase retrieval with transverse translation diversity: a nonlinear optimization approach. Opt Express. 2008;16(10):7264–7278.
  • Maiden AM, Humphry MJ, Zhang F, et al. Superresolution imaging via ptychography. J Opt Soc Am A Opt Image Sci Vis. 2011;28(4):604–612.
  • Trebino R, Guang Z, Zhu P, et al. The measurement of ultrashort laser pulses. 2018 2nd URSI Atlantic Radio Science Meeting (AT-RASC); IEEE; 2018. p. 1–3.
  • Kane DJ. Principal components generalized projections: a review [invited]. J Opt Soc Am B. 2008;25(6):A120–A132.
  • Lin W, Zhang R, Xu X, et al. Phaseless signal recovery from triple-window short-time Fourier measurements. Int J Phys Conf Ser. 2020;1617:012081. IOP Publishing.
  • Grohs P, Liehr L, Rathmair M. Multi-window STFT phase retrieval: lattice uniqueness. Preprint 2022. Available from: arXiv:2207.10620.
  • Li L, Cheng C, Han D, et al. Phase retrieval from multiple-window short-time Fourier measurements. IEEE Signal Process Lett. 2017;24(4):372–376.
  • Guo Y, Wang A, Wang W. Multi-source phase retrieval from multi-channel phaseless STFT measurements. Signal Process. 2018;144:36–40.
  • Grohs P, Koppensteiner S, Rathmair M. Phase retrieval: uniqueness and stability. SIAM Rev. 2020;62(2):301–350.
  • Jaganathan K, Oymak S, Hassibi B. Sparse phase retrieval: uniqueness guarantees and recovery algorithms. IEEE J Sel Top Signal Process. 2017;10(4):770–781.
  • Sun DL, Smith Iii JO. Estimating a signal from a magnitude spectrogram via convex optimization. arXiv. 2012;1209.2076. DOI:10.48550/arXiv.1209.2076
  • Bendory T, Eldar YC. Non-convex phase retrieval from STFT measurements. IEEE Trans Inform Theor. 2018;64(99):1–1.
  • Pfander GE, Salanevich P. Robust phase retrieval algorithm for time-frequency structured measurements. SIAM J Imaging Sci. 2019;12(2):736–761.
  • Alexeev B, Bandeira AS, Fickus M, et al. Phase retrieval with polarization. SIAM J Imaging Sci. 2014;7(1):35–66.
  • Iwen MA, Viswanathan A, Wang Y. Fast local correlation measurements. SIAM J Imaging Sci. 2016;9(4):1655–1688.
  • Iwen MA, Preskitt B, Saab R, et al. Phase retrieval from local measurements: improved robustness via eigenvector-based angular synchronization. Appl Comput Harmon Anal. 2020;48:415–444
  • Bendory T, Eldar YC. Phase retrieval from STFT measurements via non-convex optimization. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2017. p. 4770–4774.