Solving multiple windowed STFT phase retrieval problems in phase and amplitude respectively

ABSTRACT

We study the Phase Retrieval (PR) problem under the phaseless short-time Fourier transform (STFT) measurements. This paper proposes a novel algorithm named PAR to solve the STFT PR problem in phase and amplitude respectively with a milder retrieval condition compared with the original methods. First, a symmetric undirected graph of signals is proposed for the computation of the relative phase. Then the retrieval conditions of STFT PR problem are discussed for a single window case and some weaker retrieval conditions are proposed compared with the LS method. We also discuss STFT PR problem in multiple windows and establish retrieval theorems without restrictions of sliding step-size L. We give some numerical results of the PAR algorithm.

KEYWORDS:

MSC:

94A12

1. Introduction and problem equation

Phase retrieval (PR) is the classical problem of recovering a signal from the magnitudes of its measurements. It arises in X-ray crystallography [Citation1,Citation2], diffraction imaging [Citation3], phase measurement in astronomy [Citation4,Citation5] and optics [Citation6,Citation7]. When the measurement vectors are chosen as short-time Fourier transform (STFT) measurements [Citation6], the problem is an STFT PR problem. It serves as the model for ptychography, in which one or multiple moving probes are used to sense multiple diffraction measurements [Citation8,Citation9], and for ultra-short laser pulse measurement techniques [Citation10,Citation11]. The number of probes or laser pulses determines whether the problem is a single or a multiple STFT PR problem [Citation12,Citation13].

This paper considers STFT phase retrieval, which recovers a signal from its STFT magnitude. Set $x = (x (0), x (1), \dots, x (N - 1))^{T} \in C^{N}$. The STFT can be interpreted as the Fourier transform of an N-dimensional signal $x \in C^{N}$ multiplied by a series of sliding windows $g_{r} \in R^{N}$ with N-periodic extension: $X_{r} (m, k) = \sum_{n = 0}^{N - 1} x (n) g_{r} (L m - n) e^{- i 2 π k n / N},$ (1) with $0 \leq m \leq ⌈ N / L ⌉ - 1$, $0 \leq k \leq N - 1$ and $1 \leq r \leq R$. Here m is the sliding index of the window $g_{r}$ and L is the sliding step-size. STFT PR is the problem of recovering $x$ from $| X_{r} (m, k) |$. When R = 1, the problem is a single STFT phase retrieval problem, while R > 1 gives a multiple STFT phase retrieval problem [Citation14,Citation15].
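As an illustration of definition (1), the following is a minimal NumPy sketch, not the authors' implementation. The function name `stft` and the example window are hypothetical; the window is extended N-periodically through modular indexing, and NumPy's `fft` is used because its sign convention matches the exponent in (1).

```python
import numpy as np

def stft(x, g, L):
    """Evaluate X(m, k) = sum_n x(n) g(L*m - n) exp(-2j*pi*k*n/N) for one window g.

    x : complex signal of length N
    g : real window of length N (zero outside its support), extended N-periodically
    L : sliding step-size
    Returns an array of shape (ceil(N / L), N).
    """
    N = len(x)
    n = np.arange(N)
    num_shifts = -(-N // L)                # ceil(N / L)
    X = np.empty((num_shifts, N), dtype=complex)
    for m in range(num_shifts):
        window = g[(L * m - n) % N]        # g(L*m - n) with N-periodic extension
        X[m] = np.fft.fft(x * window)      # DFT over n, same sign as in (1)
    return X

# Hypothetical example: N = 8, step L = 2, window supported on 3 samples.
rng = np.random.default_rng(0)
x = rng.standard_normal(8) + 1j * rng.standard_normal(8)
g = np.zeros(8)
g[:3] = [1.0, 2.0, 0.5]
magnitudes = np.abs(stft(x, g, L=2))       # the phaseless data |X(m, k)|
```

Only `magnitudes` would be available to a retrieval algorithm; the complex values are computed here purely to make the definition concrete.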

Solving STFT PR requires not only an algorithm but also retrieval conditions, that is, conditions under which the STFT magnitude uniquely identifies the signal (up to a global phase) [Citation16] for that algorithm. Researchers have found that suitable restrictions on the measurements, including the number, type, length and step size of the windows, enforce retrieval uniqueness.

For the single STFT PR problem, different algorithms impose different restrictions on the window. Denote the length of the nonzero window $g$ by W. When adjacent windows are used (L = 1), the SDP approach [Citation17] has been extended to STFT PR [Citation18] and has uniqueness guarantees with computational complexity $O (N^{3})$. When the sliding step-size is L = 1 and $W \geq ⌈ \frac{N + 1}{2} ⌉$, a least squares (LS) approach [Citation19] obtains a stable solution of STFT PR. In addition, a robust method proposed in [Citation20] uses phase polarization to solve STFT PR under a sliding window of length W = N, which is too restrictive in many applications.

As for multiple STFT PR, [Citation14] discusses some sufficient and necessary conditions for multi-window STFT PR and proposes a graph-based algorithm to recover the signal. However, the retrieval condition is quite special ( $W \leq ⌈ \frac{N}{2} ⌉$ ) and can only be satisfied under strict conditions. Greedy Angular Synchronization [Citation21–23] has been proposed to compute the relative phase from autocorrelations. It assumes, however, that the autocorrelations $x (n_{1}) \bar{x (n_{2})}$ can be obtained for all $| n_{1} - n_{2} | < δ$, where δ is a given length, and this condition may not be easy to satisfy in the STFT PR problem.

For both single and multiple window STFT PR, these algorithms impose special retrieval restrictions on the measurements, which limits their applications. A better algorithm should work under milder retrieval conditions and be suitable for broader applications. In this paper, we propose an algorithm named PAR, which computes the phase and amplitude respectively, with milder retrieval restrictions on the measurements for both single and multiple STFT PR. First, a graph is established based on the STFT measurements, and the retrieval conditions of PAR are translated into the connectivity of this graph. Based on the analysis of the graph connectivity, some milder retrieval conditions are proposed for different types of windows, which broadens the field of application. Error estimation and experiments are also given to verify the effectiveness of the PAR algorithm.

Throughout the paper, we use the following notation. Bold-face small and capital letters denote vectors and matrices respectively. $gcd (a, b, c)$ denotes the greatest common divisor of a, b and c. $F$ denotes the 1D discrete Fourier transform (DFT) and $F^{- 1}$ denotes the 1D inverse discrete Fourier transform (IDFT). $F x \in C^{N}$ is defined as $F x (k) = \sum_{n = 0}^{N - 1} x (n) e^{- i 2 π k n / N}, 0 \leq k \leq N - 1,$ (2) and $F^{- 1} x \in C^{N}$ is defined as $F^{- 1} x (n) = \sum_{k = 0}^{N - 1} \frac{1}{N} x (k) e^{i 2 π k n / N}, 0 \leq n \leq N - 1.$ (3)

The paper is organized as follows. In Section 1, we introduce the STFT PR problem and related works. Section 2 discusses the PR problem in the single-window and multi-window cases respectively. Section 3 gives the reconstruction algorithm and error estimation. Section 4 compares the numerical results of PAR and the LS method. Section 5 is the summary of this paper.

2. Analysis of phase retrieval conditions

Define $| X_{r}^{(m)} |^{2} = (| X_{r} (m, 0) |^{2}, \dots, | X_{r} (m, N - 1) |^{2})$. Instead of recovering $x$ directly from $| X_{r}^{(m)} |^{2}$, this paper works with the data $Y_{r}$ obtained by taking the IDFT of $| X_{r}^{(m)} |^{2}$ for $0 \leq m \leq ⌈ N / L ⌉ - 1$, which is defined as $Y_{r} (m, l) = \frac{1}{N} \sum_{k = 0}^{N - 1} | X_{r} (m, k) |^{2} e^{i 2 π k l / N} = \sum_{n = 0}^{N - 1} x (n) \bar{x (n + l)} g_{r} (m L - n) \bar{g_{r} (m L - n - l)},$ (4) with $l = 0, 1, \dots, N - 1$ and $0 \leq m \leq ⌈ N / L ⌉ - 1$ [Citation19]. Therefore, recovering $x$ from $| X_{r} |$ is equivalent to recovering it from $Y_{r}$. To recover the signal $x$ uniquely, the injectivity of the map $x ⟶ Y$ should be guaranteed. In fact, there are classes of signals that produce the same measurements. For arbitrary $θ \in R$, $x$ and $x e^{i θ}$ produce the same intensity measurements. Thus we regard $x$ and $x e^{i θ}$ as members of the same class $[x] \in C^{N} / \sim$, where ∼ is the equivalence relation of being identical up to a global phase factor. The distance between two vectors in $C^{N}$ is then defined as $d (z, x) = d ([z], [x]) = min_{ϕ \in [0, 2 π)} ‖ z - x e^{i ϕ} ‖_{2} .$ (5) When $d (z, x) = 0$, $x$ and $z$ are equivalent up to a global phase shift.
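The relation between the inverse DFT of $| X_{r} (m, k) |^{2}$ and the lag products in (4) can be checked numerically. The sketch below is only an illustration with hypothetical function names. One bookkeeping detail is an observation about this sketch rather than a statement from the paper: with NumPy's `ifft` sign convention, the sum written on the right of (4) for lag l appears at index (-l) mod N of the inverse FFT, which equals the complex conjugate of the entry at index l because $| X_{r} |^{2}$ is real.

```python
import numpy as np

def lag_data_from_magnitudes(X_abs2):
    """Row-wise inverse DFT over k of the squared STFT magnitudes |X(m, k)|^2."""
    return np.fft.ifft(X_abs2, axis=1)

def lag_sum_direct(x, g, L, m, l):
    """sum_n x(n) conj(x(n+l)) g(mL-n) conj(g(mL-n-l)), all indices taken mod N."""
    N = len(x)
    n = np.arange(N)
    return np.sum(x * np.conj(x[(n + l) % N])
                  * g[(m * L - n) % N] * np.conj(g[(m * L - n - l) % N]))

# Hypothetical test data: N = 8, L = 2, window supported on 3 samples.
rng = np.random.default_rng(1)
N, L = 8, 2
x = rng.standard_normal(N) + 1j * rng.standard_normal(N)
g = np.zeros(N)
g[:3] = [1.0, 0.7, 0.3]
n = np.arange(N)
X = np.array([np.fft.fft(x * g[(L * m - n) % N]) for m in range(N // L)])
Y = lag_data_from_magnitudes(np.abs(X) ** 2)

# Largest mismatch between the direct lag sum and the inverse FFT at index (-l) mod N.
max_gap = max(abs(Y[m, (-l) % N] - lag_sum_direct(x, g, L, m, l))
              for m in range(N // L) for l in range(N))
```

In this sketch `max_gap` is at the level of floating-point rounding, so the phaseless data do carry the lag products $x (n) \bar{x (n + l)}$ weighted by the window products.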

2.1. Single adjacent window case (R = 1, L = 1)

To find the relation between the success of recovery and the window parameters, we first discuss the STFT PR problem with a single window $g$. Denote $\tilde{x} (n, l) = x (n \mod N) \bar{x ((n + l) \mod N)}$ and $\tilde{g} (n, l) = g (n \mod N) \bar{g ((n - l) \mod N)}$. Then Equation (4) can be represented by $Y (m, l) = \sum_{n = 0}^{N - 1} \tilde{x} (n, l) \tilde{g} (m L - n, l) .$ (6) Define $y^{(l)} = \{Y (m, l)\}_{m = 0}^{⌈ N / L ⌉ - 1}$, $x^{(l)} = \{\tilde{x} (n, l)\}_{n = 0}^{N - 1}$, and the matrix $G^{(l)} \in R^{⌈ N / L ⌉ \times N}$, where the $(m, n)$th entry of $G^{(l)}$ is equal to $\tilde{g} (m L - n, l)$. Then Equation (6) can be written as $y^{(l)} = G^{(l)} x^{(l)} .$ (7) Since L = 1, $G^{(l)}$ is a square circulant matrix, which can be diagonalized as $G^{(l)} = F^{†} Σ^{(l)} F$, where $Σ^{(l)}$ is a diagonal matrix whose entries are given by the DFT of the first column of $G^{(l)}$, $F$ is the DFT matrix and $F^{†}$ is the conjugate transpose of $F$ [Citation24]. If the matrices $G^{(l)}$ are invertible for several $0 \leq l \leq N - 1$, then we can compute $x^{(l)}$ by $x^{(l)} = (G^{(l)})^{- 1} y^{(l)} .$ (8) To analyse the invertibility of $G^{(l)}$, the definition of ‘non-vanishing’ is given as follows:

Definition 2.1

$x \in C^{N}$ is non-vanishing means $x (k) \neq 0$ for all $k = 0, 1, \dots, N - 1$ .

Denote ${\tilde{g}}^{(l)} = (\tilde{g} (0, l), \tilde{g} (1, l), \dots, \tilde{g} (N - 1, l))$. Since $G^{(l)} = F^{†} Σ^{(l)} F$, the matrix $G^{(l)}$ is invertible if and only if $F {\tilde{g}}^{(l)}$ is non-vanishing. Based on this, the LS algorithm proposed in [Citation19] can be used to solve the PR problem when $F {\tilde{g}}^{(l)}$ is non-vanishing for $l = 0, 1, \dots, N - 1$. Nevertheless, Theorem 3.3 in [Citation19] shows that if $F {\tilde{g}}^{(l)}$ is non-vanishing only for l = 0, 1, the signal can still be recovered. This inspires us to consider recovering the signal under milder and wider conditions.
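For L = 1, solving (7) amounts to a circular deconvolution, which can be sketched with FFTs as follows. This is an illustrative sketch under the assumption of noise-free data; the function name `solve_lag` and the tolerance `tol` are hypothetical and not taken from the paper.

```python
import numpy as np

def solve_lag(y_l, g_tilde_l, tol=1e-12):
    """Recover x_tilde(., l) from y(m) = sum_n x_tilde(n, l) g_tilde(m - n, l), L = 1.

    The circulant system is diagonalised by the DFT, so the solve is a
    pointwise division in the Fourier domain.
    """
    G_hat = np.fft.fft(g_tilde_l)          # eigenvalues of the circulant matrix G^(l)
    if np.min(np.abs(G_hat)) <= tol:       # F g_tilde^(l) has to be non-vanishing
        raise ValueError("F g_tilde^(l) has a (numerically) zero entry; G^(l) is singular")
    return np.fft.ifft(np.fft.fft(y_l) / G_hat)

# Hypothetical example: N = 7, lag l = 1, window g = (2, 1, 0, ..., 0).
N, l = 7, 1
n = np.arange(N)
g = np.zeros(N)
g[:2] = [2.0, 1.0]
g_tilde = g * g[(n - l) % N]               # g_tilde(n, l) = g(n) conj(g(n - l)), g real
rng = np.random.default_rng(2)
x = rng.standard_normal(N) + 1j * rng.standard_normal(N)
x_tilde = x * np.conj(x[(n + l) % N])
y = np.array([sum(x_tilde[k] * g_tilde[(m - k) % N] for k in range(N)) for m in range(N)])
x_tilde_rec = solve_lag(y, g_tilde)
```

For this window, `g_tilde` has a single nonzero entry, so its DFT has constant modulus and the division is well conditioned; a constant `g_tilde` would instead trigger the error, because its DFT vanishes at every nonzero frequency.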

Suppose the signal $x$ is non-vanishing. Since $x$ and $x e^{i θ}$ are in the same class $[x] \in C^{N} / \sim$, we do not need to compute all the absolute phases, but only the relative phase [Citation6] for any $n_{1}$ and $n_{2}$: $ρ_{n_{1} n_{2}} = \frac{(x (n_{1}) / | x (n_{1}) |)}{(x (n_{2}) / | x (n_{2}) |)} = \frac{x (n_{1}) \bar{x (n_{2})}}{| x (n_{1}) \bar{x (n_{2})} |} .$ (9) To compute the signal phase, we define an undirected graph for STFT phase retrieval: $G (x, g, L) := (V (x), E (L)),$ (10) with the set of nodes $V (x) := \{0 \leq n \leq N - 1 \mid x (n) \neq 0\}$ and the set of edges $E (L) := \{(n, n^{'}) \in V (x) \times V (x) \mid ρ_{n n^{'}} \text{ can be computed directly by (8) and (9)}\} .$ Relative phase information can be transferred through the relationship between the phases at $n_{1}, n_{2}, n_{3}$: $ρ_{n_{1} n_{3}} = ρ_{n_{1} n_{2}} ρ_{n_{2} n_{3}} .$ (11) If we assign to an arbitrary vertex $n_{0} \in V (x)$ the phase $ρ_{0} = x (n_{0}) / | x (n_{0}) |$, then all the phases can be computed by the phase propagation equation (11), provided the graph G is connected. In this paper, we can use the same strategy as [Citation23] to choose the initial phase $ρ_{0}$. Therefore, the graph connectivity decides the retrieval uniqueness of the phase.
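The propagation rule (11) can be sketched as a breadth-first traversal of the graph. The code below is a minimal illustration, not the PAR algorithm itself: the function name, the dictionary layout for the relative phases and the choice of root are all assumptions made for the example.

```python
import numpy as np
from collections import deque

def propagate_phases(N, rel_phase, root=0, root_phase=1.0 + 0.0j):
    """Spread phases over the graph from known relative phases.

    rel_phase maps an edge (n1, n2) to rho_{n1 n2} = phase(x(n1)) / phase(x(n2)).
    Returns a list of unit-modulus phases; vertices not reachable from the
    root keep the value None.
    """
    adj = {v: [] for v in range(N)}
    for (a, b), rho in rel_phase.items():
        adj[a].append((b, np.conj(rho)))   # phase(b) = phase(a) * conj(rho_ab)
        adj[b].append((a, rho))            # phase(a) = phase(b) * rho_ab
    phase = [None] * N
    phase[root] = root_phase
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v, factor in adj[u]:
            if phase[v] is None:
                phase[v] = phase[u] * factor
                queue.append(v)
    return phase

# Hypothetical example: N = 5, relative phases known for lag 2 only (gcd(2, 5) = 1).
rng = np.random.default_rng(3)
x = rng.standard_normal(5) + 1j * rng.standard_normal(5)
unit = x / np.abs(x)
edges = {(v, (v + 2) % 5): unit[v] * np.conj(unit[(v + 2) % 5]) for v in range(5)}
recovered = propagate_phases(5, edges, root=0, root_phase=unit[0])
```

Fixing `root_phase` to the true phase of $x (0)$ is only for comparison in the example; any unit-modulus value gives the same signal up to a global phase.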

Lemma 2.1

If there exists $l_{0}$ such that $F {\tilde{g}}^{(l_{0})}$ is non-vanishing, then $ρ_{n, n + l_{0}}$ can be computed from the phase of $x^{(l_{0})}$ for any $n \in V (x)$.

Proof.

According to Equations (8) and (9), $ρ_{n, n + l_{0}}$ can be computed when $G^{(l_{0})}$ is invertible, which is equivalent to $F {\tilde{g}}^{(l_{0})}$ being non-vanishing.

Since the STFT of signal is periodic, the connectivity of graph G is related to l and N. We have the following results:

Lemma 2.2

Given a graph G with N vertices, suppose there exists k such that any two vertices $n_{1}$ and $n_{2}$ are connected if and only if $(n_{1} - n_{2}) \mod N = k$. Then the graph G is connected if and only if $gcd (k, N) = 1$.

Proof.

Graph connectivity can be judged from the adjacency matrix $A \in \{0, 1\}^{N \times N}$: if all the elements of $S = I + A + A^{2} + \dots + A^{N}$ are non-zero, the graph is connected. With the columns indexed by $1, 2, \dots, N$, the adjacency matrix of G has the cyclic form $A = \begin{pmatrix} 0 & \cdots & 0 & 1 & 0 & \cdots & 0 \\ 0 & \cdots & 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & & & \ddots & \vdots \\ 0 & \cdots & 0 & 0 & 0 & \cdots & 1 \\ 1 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \ddots & & & & & \vdots \\ 0 & \cdots & 1 & 0 & 0 & \cdots & 0 \end{pmatrix},$ (12) where the 1 in the first row lies in column $k + 1$ and each subsequent row is the cyclic shift of the previous one. The power $A^{l + 1}$ has the same cyclic form, $A^{l + 1} = \begin{pmatrix} 0 & \cdots & 0 & 1 & 0 & \cdots & 0 \\ 0 & \cdots & 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & & & \ddots & \vdots \\ 0 & \cdots & 0 & 0 & 0 & \cdots & 1 \\ 1 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \ddots & & & & & \vdots \\ 0 & \cdots & 1 & 0 & 0 & \cdots & 0 \end{pmatrix},$ (13) with the 1 in the first row now in column $k + 1 + l \cdot gcd (k, N)$. This shows that $diag (A, k) = 1 \in R^{N}$ and $diag (A^{l}, k + l \cdot gcd (k, N)) = 1 \in R^{N}$ for $0 \leq l \leq N$. Since $S = I + A + A^{2} + \dots + A^{N}$, all elements of S being non-zero is equivalent to the diagonals $diag (A^{l}, k + l \cdot gcd (k, N))$, $0 \leq l \leq N$, covering every position of the matrix, which means that $gcd (k, N) = 1.$

A generalized result of Lemma 2.2 is given by Lemma 2.3.

Lemma 2.3

Given a graph G with N vertices, suppose any two vertices $n_{1}$ and $n_{2}$ are connected if $(n_{1} - n_{2}) \mod N = k_{i}$ for $i = 1, 2, \dots, m$ . G is connected if and only if $gcd (k_{1}, k_{2}, \dots, k_{m}, N) = 1$ .

Proof.

The lemma follows from the matrix S defined in the proof of Lemma 2.2. Denote the adjacency matrix by A; then $diag (A, k_{i}) = 1 \in R^{N}$ and $diag (A^{l}, k_{i} + l \cdot gcd (k_{i}, N)) = 1 \in R^{N}$ for $0 \leq l \leq N$ and $1 \leq i \leq m$. All elements of S are non-zero only when $gcd (k_{1}, k_{2}, \dots, k_{m}, N) = 1$.
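The gcd criterion of Lemmas 2.2 and 2.3 can be checked against a direct graph search for small N. The sketch below uses hypothetical function names and assumes that the edges are exactly the pairs whose index difference equals one of the given lags modulo N.

```python
from functools import reduce
from math import gcd

def lag_graph_connected(N, lags):
    """Depth-first search on the undirected graph with edges n -- (n + k) mod N, k in lags."""
    seen = {0}
    stack = [0]
    while stack:
        u = stack.pop()
        for k in lags:
            for v in ((u + k) % N, (u - k) % N):
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
    return len(seen) == N

def gcd_criterion(N, lags):
    """True when gcd(k_1, ..., k_m, N) = 1."""
    return reduce(gcd, lags, N) == 1

# Exhaustive comparison for small sizes with one or two lags.
agree = all(
    lag_graph_connected(N, lags) == gcd_criterion(N, lags)
    for N in range(2, 13)
    for a in range(1, N)
    for lags in ([a], [a, (2 * a) % N or 1], [a, N - 1])
)
```

For instance, with N = 6 the single lags 2 and 3 each leave the graph disconnected, while the pair of lags (2, 3) connects it, in line with $gcd (2, 3, 6) = 1$.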

Lemmas 2.2 and 2.3 give the relation between k and N. Combining with Lemma 2.1, we get two theorems as follows:

Theorem 2.4

Assume there exists $l_{0}$ , such that $gcd (l_{0}, N) = 1$ and $F {\tilde{g}}^{(l_{0})}$ is non-vanishing, then all phases can be recovered.

Theorem 2.5

Assume there exists $l_{1}, l_{2}, \dots, l_{m}$ such that $gcd (l_{1}, l_{2}, \dots, l_{m}, N) = 1$ , and $F {\tilde{g}}^{(l_{i})}$ is non-vanishing for $i = 1, 2, \dots, m$ , then the phase can be recovered.

Proof.

Since $F {\tilde{g}}^{(l_{0})}$ is non-vanishing, $x^{(l_{0})} = (G^{(l_{0})})^{- 1} y^{(l_{0})}$ can be computed, which means the relative phase between $x (n)$ and $x ((n - l_{0}) \mod N)$ can be obtained. The same holds for each $l_{i}$ in Theorem 2.5. According to Lemmas 2.2 and 2.3, all phases can be computed.

Theorems 2.4 and 2.5 give two phase recovery conditions. We now turn to the computation of the amplitude. As shown in Theorem 3.1 in [Citation19] and Algorithm 1 in [Citation23], the amplitude can be computed when $F {\tilde{g}}^{(0)}$ is non-vanishing. However, if $F {\tilde{g}}^{(0)}$ has a vanishing entry, can the signal $x$ still be recovered? Here we extend Proposition III.4 in [Citation19].

Theorem 2.6

If there exist $l_{0}, l_{1}$ such that $l_{1} - l_{0} = M$, where $M = gcd (l_{1}, l_{0})$, and $F {\tilde{g}}^{(l_{0})}$, $F {\tilde{g}}^{(l_{1})}$ are both non-vanishing, then the amplitude of the signal $x$ can be recovered by $| x (l_{0}) |^{2} = \frac{\prod_{k = 0}^{l_{0} / M} | x (M k) \bar{x (M k + l_{0})} |}{\prod_{k = 0}^{l_{0} / M - 1} | x (M k) \bar{x (M k + l_{1})} |} .$ (14)

Proof.

Since $l_{1} = l_{0} + M$, we have $\begin{aligned} \frac{\prod_{k = 0}^{l_{0} / M} | x (M k) \bar{x (M k + l_{0})} |}{\prod_{k = 0}^{l_{0} / M - 1} | x (M k) \bar{x (M k + l_{1})} |} & = \frac{| x (0) \bar{x (l_{0})} x (M) \bar{x (M + l_{0})} \dots x (l_{0}) \bar{x (2 l_{0})} |}{| x (0) \bar{x (l_{1})} x (M) \bar{x (M + l_{1})} \dots x (l_{0} - M) \bar{x (l_{0} - M + l_{1})} |} \\ & = \frac{| x (0) \bar{x (l_{0})} x (M) \bar{x (M + l_{0})} \dots x (l_{0}) \bar{x (2 l_{0})} |}{| x (0) \bar{x (M + l_{0})} x (M) \bar{x (M + l_{1})} \dots x (l_{0} - M) \bar{x (2 l_{0})} |} \\ & = | x (l_{0}) \bar{x (l_{0})} | = | x (l_{0}) |^{2} . \end{aligned}$ (15) This gives the amplitude of $x (l_{0})$; the amplitudes at the other vertices can be computed by the same equation.

Remark 2.1

Since Equation (14) involves division, the signal should be non-vanishing and its entries should not be too small. Here is an example with $l_{0} = 1$, $l_{1} = 2$ and M = 1. If $F {\tilde{g}}^{(1)}$ and $F {\tilde{g}}^{(2)}$ are both non-vanishing, then $| x (1) |^{2} = \frac{| x (0) \bar{x (1)} | | x (1) \bar{x (2)} |}{| x (0) \bar{x (2)} |} .$ (16) The amplitudes at the other vertices can also be computed by this equation.
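The telescoping quotient in (14) can be evaluated directly once the moduli of the lag products are available. The following sketch uses a hypothetical helper `amplitude_sq` and a hypothetical callback `lag_abs(n, l)` that returns $| x (n) \bar{x (n + l)} |$; in the example the callback is built from a known signal only to check the formula.

```python
import numpy as np
from math import gcd

def amplitude_sq(lag_abs, l0, l1):
    """Evaluate the right-hand side of (14), which equals |x(l0)|^2.

    lag_abs(n, l) must return |x(n) conj(x(n + l))| with indices mod N.
    """
    M = gcd(l0, l1)
    if l1 - l0 != M:
        raise ValueError("Theorem 2.6 requires l1 - l0 = gcd(l1, l0)")
    q = l0 // M
    numerator = np.prod([lag_abs(M * k, l0) for k in range(q + 1)])
    denominator = np.prod([lag_abs(M * k, l1) for k in range(q)])   # empty product is 1
    return numerator / denominator

# Hypothetical check on a non-vanishing signal of length N = 9.
rng = np.random.default_rng(4)
N = 9
x = (1.0 + rng.random(N)) * np.exp(2j * np.pi * rng.random(N))      # moduli in [1, 2)
lag_abs = lambda n, l: abs(x[n % N] * np.conj(x[(n + l) % N]))
estimates = {(l0, l1): amplitude_sq(lag_abs, l0, l1) for l0, l1 in [(1, 2), (2, 3), (4, 6)]}
```

Each entry of `estimates` agrees with $| x (l_{0}) |^{2}$ for the corresponding pair; keeping the moduli away from zero in the example reflects the warning above about division by small values.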

Corollary 2.7

If there exists $l_{0}$ such that $F {\tilde{g}}^{(l_{0})}$ and $F {\tilde{g}}^{(l_{0} + 1)}$ are non-vanishing, then the signal can be recovered.

Proof.

Since $gcd (l_{0}, l_{0} + 1) = 1$ for any integer $l_{0}$, we have $gcd (l_{0}, l_{0} + 1, N) = 1$, so all phases can be computed according to Theorem 2.5. The same pair also satisfies the condition of Theorem 2.6 with M = 1, so the amplitude can be computed. Therefore, the signal can be recovered.

Remark 2.2

Corollary 2.7 gives a sufficient condition for the recovery of $x$ . Theorem III.4 proposed in [Citation19] is a special case of Corollary 2.7.

Now we can prove that when $W < N / 2$ , our method can also recover the phase, which expands the scope of W compared with LS model.

Corollary 2.8

If $gcd (W, N) = 1$ and $W < N / 2$ , then the signal phase can be recovered.

Proof.

Consider the matrix $G^{W} \in R^{N \times N}$ whose $(m, n)$th entry is given by $\tilde{g} (m - n, W)$. Since $W < N / 2$ and $gcd (N, W) = 1$, there must exist a unique $n_{0}$ such that $\tilde{g} (n_{0}, W) \neq 0$. Therefore, we have $Y (m, W) = \sum_{n = 0}^{N - 1} \tilde{x} (n, W) \tilde{g} (m - n, W) = \tilde{x} (m - n_{0}, W) \tilde{g} (n_{0}, W), m = 0, 1, \dots, N - 1.$ (17) Since $\tilde{g} (n_{0}, W) \neq 0$, $\tilde{x} (m - n_{0}, W) = {\tilde{g} (n_{0}, W)}^{- 1} Y (m, W) .$ (18) Then $ρ_{m - n_{0}, m - n_{0} + W}$ can be computed as $\begin{aligned} ρ_{m - n_{0}, m - n_{0} + W} & = \frac{\tilde{x} (m - n_{0}, W)}{| \tilde{x} (m - n_{0}, W) |} = \left( \frac{\tilde{g} (n_{0}, W)}{| \tilde{g} (n_{0}, W) |} \right)^{- 1} \times \frac{Y (m, W)}{| Y (m, W) |}, \\ m & = 0, 1, \dots, N - 1. \end{aligned}$ Since m is arbitrary, for any $n_{1}$ and $n_{2}$, $ρ_{n_{1} n_{2}}$ can be computed if $n_{1} - n_{2} = W \mod N$. Therefore, there exists k = W such that $ρ_{n, n + k}$ can be computed for $0 \leq n \leq N - 1$. Together with $gcd (W, N) = 1$, the signal phase can be recovered according to Lemma 2.2 with k = W.

So far, the discussion has been based on L = 1. We now extend the results to L > 1. In that case $G^{(l)}$ defined in (7) is not invertible, so $ρ_{n, n + l}$ cannot be computed by (8) for all $0 \leq n \leq N - 1$. The difficulty is that the graph G defined in (10) cannot be connected when L > 1 if only one l satisfies the conditions of Theorem 2.4. Therefore, we consider multiple-window STFT in order to handle the general case $L \geq 1$.

2.2. Multiple discrete windows case ( $R > 1, L \geq 1$ )

Multiple windows STFT PR provides more measurements compared with single window STFT PR. Therefore, the graph connectivity of G may be easier to satisfy.

We now determine which conditions ensure that the graph G is connected in the STFT PR problem. Choose a window $g_{r}$ such that $W_{r} < N / 2$. When $l = W_{r}$, there exists a unique $n_{r}$ such that ${\tilde{g}}_{r} (n_{r}, W_{r}) \neq 0$, and then $Y_{r} (m, W_{r}) = \sum_{n = 0}^{N - 1} \tilde{x} (n, W_{r}) {\tilde{g}}_{r} (m L - n, W_{r}) = \tilde{x} (m L - n_{r}, W_{r}) {\tilde{g}}_{r} (n_{r}, W_{r}) .$ Therefore, $\begin{aligned} ρ_{m L - n_{r}, m L - n_{r} + W_{r}} & = \frac{\tilde{x} (m L - n_{r}, W_{r})}{| \tilde{x} (m L - n_{r}, W_{r}) |} = \left( \frac{{\tilde{g}}_{r} (n_{r}, W_{r})}{| {\tilde{g}}_{r} (n_{r}, W_{r}) |} \right)^{- 1} \times \frac{Y_{r} (m, W_{r})}{| Y_{r} (m, W_{r}) |}, \\ m & = 0, 1, 2, \dots, N / L - 1. \end{aligned}$ (19) Note that $ρ_{n, n + W_{r}}$ can be computed by Equation (19) only when $n = m L - n_{r}$ with $0 \leq m \leq ⌈ N / L ⌉ - 1$. Therefore, single-window STFT measurements cannot make the graph $G (V (x), E (L))$ connected when L > 1. To compute all the relative phases in $V (x)$, multiple windows $\{g_{r}\}_{r = 1}^{R}$ are needed. For instance, as shown in Figure 1, when N = 6, L = 2, $W_{1} = 1$, $W_{2} = 2$, the graph G is connected.

Figure 1. An example of connected graph $G (V (x), E (L))$ with two sliding windows: $W_{1} = 1$ , $W_{2} = 2$ .
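The connectivity claimed for this example can be reproduced with a small union-find sketch. It is an illustration under two assumptions that are not taken from the paper: every index belongs to $V (x)$ (the signal is non-vanishing), and the offsets $n_{r}$, which depend on the actual windows, are passed in as a hypothetical parameter `offsets` defaulting to zero.

```python
def stft_phase_graph_connected(N, L, lags, offsets=None):
    """Connectivity of the graph with edges (m*L - n_r, m*L - n_r + W_r) mod N.

    lags    : window lengths W_r, one per window
    offsets : the indices n_r (hypothetical input; defaults to n_r = 0)
    """
    if offsets is None:
        offsets = [0] * len(lags)
    parent = list(range(N))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]   # path halving
            a = parent[a]
        return a

    for W, n_r in zip(lags, offsets):
        for m in range(-(-N // L)):         # m = 0, ..., ceil(N / L) - 1
            a = (m * L - n_r) % N
            b = (a + W) % N
            parent[find(a)] = find(b)       # merge the two components
    return len({find(v) for v in range(N)}) == 1

both_windows = stft_phase_graph_connected(6, 2, [1, 2])
only_first = stft_phase_graph_connected(6, 2, [1])
only_second = stft_phase_graph_connected(6, 2, [2])
```

With N = 6 and L = 2, either window alone leaves the graph disconnected, whereas the two windows together connect it, and in this sketch that holds for every choice of the two offsets.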

In this example, all the relative phases can be computed through (19). This shows that the relative phase can still be obtained when L > 1. Given two vertices $n_{1}$ and $n_{2}$, we want to determine under which conditions $ρ_{n_{1}, n_{2}}$ can be computed. Based on Theorem 2.4, we obtain some conclusions for multiple windows. Based on Equation (7) and the solvability theorem for linear systems, we get

Lemma 2.9

Denote the matrix $G^{(l)}$ of the window $g_{r}$ in Equation (7) by $G_{r}^{(l)}$, $r = 1, 2, \dots, R$. Let $G^{(l)} = [{G_{1}^{(l)}}^{T} {G_{2}^{(l)}}^{T} \dots {G_{R}^{(l)}}^{T}]^{T}$. Then $x^{(l_{0})}$ can be computed from $y^{(l_{0})} = G^{(l_{0})} x^{(l_{0})}$ if $rank (G^{(l_{0})}) = N$.

Based on Lemma 2.9, Theorem 2.10 gives an extension of Theorem 3.1 in [Citation19] and Lemma 2.1.

Theorem 2.10

Let $\{g_{r}\}_{r = 1}^{R}$ be a family of sliding windows with the same sliding step-size L, where L is a separation parameter with $N / L \in Z$. Define $B_{m}^{(l)} \in R^{R \times L}$ by $B_{m}^{(l)} (r, j) = F {\tilde{g}}_{r}^{(l)} (m + j N / L)$. Then $ρ_{n, n + l}$ can be computed if $B_{m}^{(l)}$ has rank L for all $0 \leq m \leq N / L - 1$.

Proof.

Recall that $Y_{r} (m, l) = \frac{1}{N} \sum_{k = 0}^{N - 1} | X_{r} (m, k) |^{2} e^{i 2 π k l / N}$ with $0 \leq m \leq N / L - 1$ and $1 \leq r \leq R$. Denote ${\tilde{x}}^{(l)} = (\tilde{x} (0, l), \tilde{x} (1, l), \dots, \tilde{x} (N - 1, l))$. According to the convolution property of the Fourier transform, we have $\begin{aligned} Y_{r} (m, l) & = \sum_{n = 0}^{N - 1} \tilde{x} (n, l) {\tilde{g}}_{r} (m L - n, l) = F^{- 1} \{F {\tilde{x}}^{(l)} \cdot F {\tilde{g}}_{r}^{(l)}\} (m L) \\ & = \frac{1}{N} \sum_{k = 0}^{N - 1} F {\tilde{x}}^{(l)} (k) F {\tilde{g}}_{r}^{(l)} (k) e^{2 π i m k L / N} . \end{aligned}$ Denote $Y_{r} (l) = (Y_{r} (0, l), Y_{r} (1, l), \dots, Y_{r} (N / L - 1, l))$. Recall that $B_{m}^{(l)} = (F {\tilde{g}}_{r}^{(l)} (m + j N / L))_{1 \leq r \leq R, 0 \leq j \leq L - 1}$, where, consistently with (2), $F {\tilde{g}}_{r}^{(l)} (k) = \sum_{n = 0}^{N - 1} {\tilde{g}}_{r} (n, l) e^{- i 2 π k n / N}$. Then, by the properties of the Fourier transform, for $0 \leq m \leq N / L - 1$ and $1 \leq r \leq R$, $\begin{aligned} {F Y_{r}}^{(l)} (m) & = \sum_{m^{'} = 0}^{N / L - 1} Y_{r} (m^{'}, l) e^{- 2 π i m m^{'} L / N} \\ & = \sum_{m^{'} = 0}^{N / L - 1} \left( \frac{1}{N} \sum_{k = 0}^{N - 1} F {\tilde{x}}^{(l)} (k) F {\tilde{g}}_{r}^{(l)} (k) e^{2 π i m^{'} k L / N} \right) e^{- 2 π i m m^{'} L / N} \\ & = \frac{1}{L} \sum_{k = 0}^{N - 1} \sum_{m^{'} = 0}^{N / L - 1} \frac{L}{N} F {\tilde{x}}^{(l)} (k) F {\tilde{g}}_{r}^{(l)} (k) e^{2 π i m^{'} (k - m) L / N} \\ & = \frac{1}{L} \sum_{j^{'} = 0}^{L - 1} F {\tilde{x}}^{(l)} (m + j^{'} N / L) F {\tilde{g}}_{r}^{(l)} (m + j^{'} N / L) . \end{aligned}$ (20) The last step holds because the inner sum over $m^{'}$ equals $N / L$ when $k - m$ is a multiple of $N / L$ and vanishes otherwise. Based on the equation of Lemma III.17 in [Citation14], we get $F {\tilde{x}}^{(l)} (m + j N / L) = L \sum_{j^{'} = 0}^{L - 1} \sum_{m^{'} = 0}^{N / L - 1} b_{m} (j, j^{'}) e^{- 2 π i m m^{'} L / N} \left( \sum_{r = 1}^{R} \bar{F {\tilde{g}}_{r}^{(l)} (m + j^{'} N / L)} Y_{r} (m^{'}, l) \right)$ where $(b_{m} (j, j^{'}))_{0 \leq j, j^{'} \leq L - 1} = ({B_{m}^{(l)}}^{H} B_{m}^{(l)})^{- 1}$ for all $0 \leq m \leq N / L - 1$ and $0 \leq j \leq L - 1$.

Thus, together with $x = F^{- 1} F x$, $\begin{aligned} \tilde{x} (n, l) & = F^{- 1} \{F {\tilde{x}}^{(l)}\} (n) \\ & = \frac{L}{N} \sum_{j^{'}, j = 0}^{L - 1} \sum_{m, m^{'} = 0}^{N / L - 1} e^{- 2 π i (m (m^{'} L - n) / N - j n / L)} b_{m} (j, j^{'}) \left( \sum_{r = 1}^{R} \bar{F {\tilde{g}}_{r}^{(l)} (m + j^{'} N / L)} Y_{r} (m^{'}, l) \right) . \end{aligned}$ (21) Then $ρ_{n, n + l}$ can be computed from the phase of $\tilde{x} (n, l)$. Formula (21) requires ${B_{m}^{(l)}}^{H} B_{m}^{(l)}$ to be invertible for all m, which holds exactly when $rank (B_{m}^{(l)}) = L$. This completes the proof.
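The rank condition of Theorem 2.10 is straightforward to test numerically for a given window family. The sketch below is an illustration only: the function name `rank_condition`, the tolerance and the example windows are hypothetical, and real-valued windows are assumed as in the problem setup.

```python
import numpy as np

def rank_condition(windows, l, L, tol=1e-10):
    """Return True if B_m^(l) has rank L for every m = 0, ..., N/L - 1.

    windows : list of R real windows of length N
    B_m^(l)(r, j) = (F g_tilde_r^(l))(m + j*N/L), with
    g_tilde_r(n, l) = g_r(n) conj(g_r(n - l)) and indices mod N.
    """
    N = len(windows[0])
    if N % L != 0:
        raise ValueError("Theorem 2.10 assumes that N / L is an integer")
    n = np.arange(N)
    G_hat = np.array([np.fft.fft(g * np.conj(g[(n - l) % N])) for g in windows])  # R x N
    step = N // L
    for m in range(step):
        B = G_hat[:, m + step * np.arange(L)]          # R x L block for this m
        if np.linalg.matrix_rank(B, tol=tol) < L:
            return False
    return True

# Hypothetical windows for N = 8 and lag l = 1.
g1 = np.zeros(8)
g1[0:2] = 1.0      # g_tilde_1(., 1) is a unit impulse at n = 1
g2 = np.zeros(8)
g2[1:3] = 1.0      # g_tilde_2(., 1) is a unit impulse at n = 2
single_L1 = rank_condition([g1], l=1, L=1)
single_L2 = rank_condition([g1], l=1, L=2)
pair_L2 = rank_condition([g1, g2], l=1, L=2)
```

A single window can satisfy the condition for L = 1 but never for L = 2, since a 1 x 2 block has rank at most 1; the two shifted windows restore full rank in this example.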

Remark 2.3

Now consider some special cases of the condition given by Theorem 2.10.

When L = 1, $rank (B_{m}^{(l)}) = L$ is equivalent to $rank (\sum_{r = 1}^{R} | F {\tilde{g}}_{r} (m, l) |^{2}) = 1$ for all $0 \leq m \leq N - 1$, which means that for each m there exists $r_{0}$ such that $F {\tilde{g}}_{r_{0}} (m, l) \neq 0$. This is consistent with Theorem 3.1 in [Citation19]. If R = 1, that is, in the single window case, the condition says that $F \tilde{g} (n, l)$ is non-vanishing.
When L = N, $rank (B_{m}^{(l)}) = L$ is equivalent to $rank (\{{\tilde{g}}_{r} (n, l)\}_{1 \leq r \leq R, 0 \leq n \leq N - 1}) = N$, which is consistent with Lemma 2.9.

Based on Theorem 2.10, we can get theorems as follows:

Theorem 2.11

Suppose there exist $l_{1}, l_{2}, \dots, l_{t}$ such that $gcd (l_{1}, l_{2}, \dots, l_{t}, N) = 1$ and $B_{m}^{(l_{i})}$ has rank L for $i = 1, 2, \dots, t$. Then the signal phase can be recovered.

Theorem 2.12

If there exist $l_{0}, l_{1}$ such that $l_{1} - l_{0} = gcd (l_{1}, l_{0})$, and $B_{m}^{(l_{0})}$ and $B_{m}^{(l_{1})}$ have rank L, then the amplitude of the signal can be recovered.

Proof.

According to Theorem 2.10, if $B_{m}^{(l)}$ has rank L, $ρ_{n, n + l}$ can be computed by Equation (21). Thus Theorems 2.11 and 2.12 follow from Theorems 2.5 and 2.6.

3. Reconstruction and error estimation

Theorems 2.11 and 2.12 extend retrieval conditions from a single window $g$ to multiple windows ${g_{r}}_{r = 1}^{R}$ , and the sliding size L = 1 to L>1. Based on these analyses, we propose the following reconstruction algorithm (PAR).

Here we discuss the computational complexity of the PAR algorithm. Computing $Y_{r} (n, l)$ and $F {\tilde{g}}_{r} (n, l)$ can be done with fast Fourier transforms in $O (N \log N)$ operations. Computing the rank of $B_{m}^{(l_{i})}$ needs $O (L R N)$ operations, while computing the greatest common divisor of no more than N numbers, as required in Theorem 2.11, needs $O (N \log N)$ operations with the division algorithm. Assume the number of l meeting the conditions in Theorem 2.11 is k; then checking the condition of Theorem 2.12 needs at most $O (k N)$ operations. According to Equations (21) and (14), computing the relative phase and the amplitude needs $O (N)$ operations each. Thus the total runtime complexity of the PAR algorithm is $O (N \log N)$ in general.

Based on our phase retrieval procedure, we now present a guarantee of stable performance. Suppose the measurements used in Equation (4) are corrupted by random noise $ϵ_{r} (m, k)$ with level $| ϵ |$; then the problem becomes recovering the signal $x$ from $| X_{r} (m, k) |^{2} + ϵ_{r} (m, k)$. For a window family $\{g_{r}\}_{r = 1}^{R}$ with period-N extension, we define $g = [g_{1}, g_{2}, \dots, g_{R}]$, $B^{(l)} = [B_{0}^{(l)}, B_{1}^{(l)}, \dots, B_{N / L - 1}^{(l)}]$, $\begin{aligned} ‖ g ‖_{2} & = \left( \sum_{r = 1}^{R} \sum_{n = 0}^{N - 1} | g_{r} (n) |^{2} \right)^{1 / 2}, \\ ‖ B^{(l)} ‖_{1} & = \sum_{m = 0}^{N / L - 1} \sum_{j, j^{'} = 0}^{L - 1} | b_{m, l} (j, j^{'}) | . \end{aligned}$

Theorem 3.1

Let $C = min_{n \in V (x)} | x (n) |^{2}$, $A = max_{n \in V (x)} | x (n) |^{2}$. Denote $‖ B ‖ = max_{l \in L} ‖ B^{(l)} ‖_{1}$, $T = (2 \frac{l_{0}}{M} + 1) \left( \frac{A}{C} \right)^{(\frac{l_{0}}{M} + 1)}$, where $L$, $l_{0}$ and M are defined in Equation (14) and Algorithm 1. Let $x_{ϵ} = (x_{ϵ} (0), \dots, x_{ϵ} (N - 1))^{T}$ be the approximation obtained by Algorithm 1 from the noisy measurements $\{| X_{r} |^{2} + ϵ_{r}\}_{r = 1}^{R}$, with $| ϵ | = max_{r, m, k} | ϵ_{r} (m, k) |$. Then there exists $β \in R$ such that for all $n = 0, 1, \dots, N - 1$, $| \frac{x_{ϵ} (n)}{| x_{ϵ} (n) |} - e^{2 π i β} \frac{x (n)}{| x (n) |} | \leq \frac{2 N L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ |}{C},$ (22) and $| | x_{ϵ} (n) |^{2} - | x (n) |^{2} | \leq T L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ | .$ (23)

Proof.

For any $l \in \{0, 1, \dots, N - 1\}$, $\sum_{r = 1}^{R} | F {\tilde{g}}_{r}^{(l)} | \leq \sum_{r = 1}^{R} \sum_{k = 0}^{N - 1} | {\tilde{g}}_{r} (k, l) | = \sum_{r = 1}^{R} \sum_{k = 0}^{N - 1} | g_{r} (k) \bar{g_{r} (k - l)} | \leq ‖ g ‖_{2}^{2} .$ The last inequality comes from the Cauchy–Schwarz inequality. Denote by $Y_{r, ϵ} (m^{'}, l)$ the IDFT of $\{| X_{r} |^{2} + ϵ_{r}\}_{r = 1}^{R}$ as in (4). Combining this with Equation (21), we get $\begin{aligned} | x_{ϵ} (n) \bar{x_{ϵ} (n + l)} - \tilde{x} (n, l) | & = \Big| \frac{L}{N} \sum_{j^{'}, j = 0}^{L - 1} \sum_{m, m^{'} = 0}^{N / L - 1} e^{- 2 π i (m (m^{'} L - n) / N - j n / L)} b_{m} (j, j^{'}) \\ & \quad \cdot \left( \sum_{r = 1}^{R} \bar{F {\tilde{g}}_{r}^{(l)} (m + j^{'} N / L)} (Y_{r, ϵ} (m^{'}, l) - Y_{r} (m^{'}, l)) \right) \Big| \\ & \leq \frac{L}{N} \cdot L^{2} \cdot \left( \frac{N}{L} \right)^{2} ‖ B^{(l)} ‖_{1} ‖ g ‖_{2}^{2} \frac{1}{N} | ϵ | = L ‖ B^{(l)} ‖_{1} ‖ g ‖_{2}^{2} | ϵ | \\ & \leq L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ | . \end{aligned}$ (24) We use the error transfer formula: if $\bar{N} = f (\bar{x_{1}}, \bar{x_{2}}, \bar{x_{3}})$, then the error satisfies $Δ N = | \frac{\partial f}{\partial x_{1}} | Δ x_{1} + | \frac{\partial f}{\partial x_{2}} | Δ x_{2} + | \frac{\partial f}{\partial x_{3}} | Δ x_{3},$ (25) where $Δ N, Δ x_{1}, Δ x_{2}, Δ x_{3}$ represent the error bounds of the respective variables. Applying it to (14) gives $\begin{aligned} | | x_{ϵ} (n) |^{2} - | x (n) |^{2} | & \leq \Big| \frac{\prod_{k = 0}^{l_{0} / M} | x_{ϵ} (M k) \bar{x_{ϵ} (M k + l_{0})} |}{\prod_{k = 0}^{l_{0} / M - 1} | x_{ϵ} (M k) \bar{x_{ϵ} (M k + l_{1})} |} - \frac{\prod_{k = 0}^{l_{0} / M} | x (M k) \bar{x (M k + l_{0})} |}{\prod_{k = 0}^{l_{0} / M - 1} | x (M k) \bar{x (M k + l_{1})} |} \Big| \\ & \leq (2 \frac{l_{0}}{M} + 1) \left( \frac{A}{C} \right)^{(\frac{l_{0}}{M} + 1)} L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ | = T L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ | . \end{aligned}$ (26) Then, according to the triangle inequality, we get $\begin{aligned} \Big| \frac{x_{ϵ} (n) \bar{x_{ϵ} (n + l)}}{| x_{ϵ} (n) \bar{x_{ϵ} (n + l)} |} - \frac{\tilde{x} (n, l)}{| \tilde{x} (n, l) |} \Big| & \leq \frac{| x_{ϵ} (n) \bar{x_{ϵ} (n + l)} - \tilde{x} (n, l) | + | | x_{ϵ} (n) \bar{x_{ϵ} (n + l)} | - | \tilde{x} (n, l) | |}{| \tilde{x} (n, l) |} \\ & \leq \frac{2 L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ |}{C} . \end{aligned}$ (27) Since the relative phase propagates through the graph G, we obtain the estimate (22).

Let $p, p_{ϵ} \in \mathbb{C}^{N}$ be the vectors of phases of the entries of $x$ and $x_{ϵ}$ , respectively, and let $\circ$ denote the Hadamard product. The reconstruction error satisfies
$\begin{aligned} d (x, x_{ϵ}) & = \min_{ϕ} ‖ e^{i ϕ} x - x_{ϵ} ‖_{2} \\ & \leq ‖ | x | \circ p - | x | \circ p_{ϵ} ‖_{2} + ‖ | x | \circ p_{ϵ} - | x_{ϵ} | \circ p_{ϵ} ‖_{2} \\ & = \sqrt{\sum_{n = 0}^{N - 1} | x (n) |^{2} \left| \frac{x_{ϵ} (n)}{| x_{ϵ} (n) |} - \frac{x (n)}{| x (n) |} \right|^{2}} + \sqrt{\sum_{n = 0}^{N - 1} \left| | x (n) | - | x_{ϵ} (n) | \right|^{2}} \\ & \leq ‖ x ‖_{2} \frac{2 N L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ |}{C} + \sqrt{N T L ‖ B ‖ ‖ g ‖_{2}^{2} | ϵ |} . \end{aligned}$ (28)

Theorem 3.1 ensures that when the noise level $| ϵ |$ is low and the minimum of $| x (n) |$ over $V (x)$ is not small, the error between the retrieved phase and the exact phase can be controlled. In addition, the norms of $g$ and $B$ influence the upper bound of the error. Therefore, the number of windows does not need to be large, but the windows should meet the retrieval conditions. This means that if suitable windows are chosen, the signal can be recovered well by PAR.
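To get a feel for how these bounds scale, the right-hand sides of (22), (23) and (28) can be evaluated numerically. The sketch below is illustrative only: the function name `par_error_bounds` and its argument names are hypothetical, and the values of $‖ B ‖$ , $‖ g ‖_{2}$ , $l_{0}$ and $M$ must be supplied by the caller because they depend on the chosen windows and on Algorithm 1.

```python
import math


def par_error_bounds(N, L, B_norm, g_norm, eps, C, A, l0, M, x_norm):
    """Evaluate the right-hand sides of Equations (22), (23) and (28).

    All arguments are plain numbers chosen by the caller (hypothetical names):
    B_norm = ||B||, g_norm = ||g||_2, eps = |epsilon|, x_norm = ||x||_2.
    """
    # Common factor L * ||B|| * ||g||_2^2 * |eps| shared by all three bounds.
    base = L * B_norm * g_norm ** 2 * eps
    # Equation (22): bound on the phase error of each entry.
    phase_bound = 2 * N * base / C
    # T as defined in the statement of the theorem.
    T = (2 * l0 / M + 1) * (A / C) ** (l0 / M + 1)
    # Equation (23): bound on the squared-magnitude error of each entry.
    amplitude_bound = T * base
    # Equation (28): bound on the reconstruction error d(x, x_eps).
    total_bound = x_norm * phase_bound + math.sqrt(N * amplitude_bound)
    return phase_bound, amplitude_bound, total_bound


if __name__ == "__main__":
    # Toy values, not taken from the paper's experiments.
    print(par_error_bounds(N=101, L=1, B_norm=1.0, g_norm=1.0, eps=1e-6,
                           C=50.0, A=150.0, l0=2, M=1, x_norm=100.0))
```

Doubling $C$ while keeping $A / C$ fixed halves the phase bound, which is the dependence on the minimal magnitude discussed above.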

4. Numerical results

We now show the results of the PAR algorithm for different types of signals and sliding windows. We choose three signals: a Gaussian complex signal, a chirp signal and a real temperature signal. The Gaussian complex signal $x \in \mathbb{C}^{N}$ with N = 101 is defined as
$x (n) \sim N (μ, 1) + i N (0, 1), \quad n = 0, \dots, N - 1.$ (29)
A chirp signal is a typical non-stationary signal in communication, sonar and radar. Figure 2 shows a chirp signal with length N = 101 and zero mean. The Earth abnormal temperature signal in Figure 3 is a real signal describing temperature variation.
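The two synthetic test signals can be generated along the following lines. This is a minimal sketch, not the code from the authors' repository: the function names are hypothetical, and the chirp's frequency sweep (`f0`, `f1`) is an assumption, since the text only specifies the length N = 101 and the zero mean.

```python
import numpy as np


def gaussian_complex_signal(N=101, mu=10.0, rng=None):
    """Draw x(n) ~ N(mu, 1) + i N(0, 1) for n = 0, ..., N - 1, as in Equation (29)."""
    rng = np.random.default_rng() if rng is None else rng
    return rng.normal(mu, 1.0, N) + 1j * rng.normal(0.0, 1.0, N)


def chirp_signal(N=101, f0=0.0, f1=0.25):
    """Real linear chirp with the mean removed.

    f0 and f1 (start and end frequency in cycles per sample) are assumed
    values; the paper does not state the sweep parameters.
    """
    n = np.arange(N)
    # Phase of a linear chirp: the frequency grows linearly from f0 to f1.
    phase = 2 * np.pi * (f0 * n + 0.5 * (f1 - f0) * n ** 2 / N)
    s = np.cos(phase)
    return s - s.mean()  # enforce zero average value


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = gaussian_complex_signal(mu=10.0, rng=rng)
    c = chirp_signal()
    print(x.shape, c.shape, abs(c.mean()))
```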

Figure 2. Chirp signal.

Figure 3. Earth abnormal temperature.

In addition, we choose several typical sliding windows for the STFT: rectangle, triangle and Hamming windows. These windows have different main lobes and side lobes, which suit different situations. All the sliding windows satisfy the retrieval condition of the PAR method. The simulations show how the relative error depends on the sliding window and on the signal-to-noise ratio (SNR), where the relative error between the estimate $x_{ϵ}$ and the true signal $x$ is defined as $\mathrm{Re} (x_{ϵ}, x) = \frac{d (x_{ϵ}, x)}{‖ x ‖_{2}} .$ The Python code for reproducing these experiments is available at https://github.com/zhouxianchen/PAR-algorithm.
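The windows and the relative error can be sketched as follows. The helper names `make_window` and `relative_error` are hypothetical and are not taken from the repository above; the zero-padding of the window to length N and the exact triangle shape (strictly positive at both ends) are assumptions. The minimisation over the global phase in $d (x_{ϵ}, x)$ has a closed form: the optimal $ϕ$ is the argument of the inner product $\sum_{n} \bar{x (n)} x_{ϵ} (n)$ .

```python
import numpy as np


def make_window(kind, W, N):
    """Return a length-N window whose first W samples are nonzero."""
    n = np.arange(W)
    if kind == "rectangle":
        w = np.ones(W)
    elif kind == "triangle":
        # Triangle that stays strictly positive at both ends (assumed shape).
        w = 1.0 - np.abs(n - (W - 1) / 2) / ((W + 1) / 2)
    elif kind == "hamming":
        w = np.hamming(W)
    else:
        raise ValueError(f"unknown window type: {kind}")
    g = np.zeros(N)
    g[:W] = w
    return g


def relative_error(x_est, x):
    """d(x_est, x) / ||x||_2 with d minimised over a global phase."""
    # np.vdot conjugates its first argument: sum(conj(x) * x_est).
    phi = np.angle(np.vdot(x, x_est))
    d = np.linalg.norm(np.exp(1j * phi) * x - x_est)
    return d / np.linalg.norm(x)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(10, 1, 101) + 1j * rng.normal(0, 1, 101)
    # A pure global phase shift should give a relative error near zero.
    print(relative_error(np.exp(0.7j) * x, x))
    print(make_window("hamming", 7, 101)[:8])
```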

4.1. Relative error and length of window

Since the LS method in [Citation19] requires $W \geq ⌈ \frac{N + 1}{2} ⌉$ (that is, $W \geq 51$ for N = 101), we first discuss the relationship between the relative error and the length of the sliding window. Here we choose the parameter $μ = 10$ in Equation (29) and $\mathrm{SNR} = 60$ . Figures 4 and 5 show that the PAR method has a relative error below 0.025 regardless of the window length W for all three types of windows. The figures also show that the PAR method places no restriction on the window length and can be used over a wider range than the LS method. The relative error for the chirp signal and the temperature signal is smaller than that for the Gaussian complex signal, which may be because no relative phase error arises when reconstructing a real signal (Figure 6).

Figure 4. Relative error in recovering Gaussian complex signal by different windows using PAR.

Figure 5. Relative error in recovering chirp signal by different windows using PAR.

Figure 6. Relative error in recovering temperature signal by different windows using PAR.

4.2. Relative error and SNR

Here we discuss the performance of PAR at different SNR levels. We choose the window length W = 7 and $μ = 10$ in Equation (29). Figure 7 shows that PAR can retrieve the signal when $\mathrm{SNR} \geq 40$ . When $\mathrm{SNR} \leq 40$ , different windows perform differently at the same SNR level: the triangle window performs better than the rectangle and Hamming windows. We infer that this is because $‖ B ‖$ and $‖ g ‖_{2}$ in Equation (22) depend on the type of window, which leads to different error estimates (Figures 8 and 9).
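Noisy measurements at a prescribed SNR can be simulated as below. The text does not state how the SNR is defined, so this sketch assumes the common convention $\mathrm{SNR} = 10 \log_{10} (P_{signal} / P_{noise})$ in dB with additive white Gaussian noise on the intensity measurements $| X_{r} |^{2}$ ; the function name `add_noise` is hypothetical.

```python
import numpy as np


def add_noise(measurements, snr_db, rng=None):
    """Add white Gaussian noise so that the result has the requested SNR in dB.

    Assumed convention: SNR = 10 * log10(mean(measurements**2) / noise variance).
    """
    rng = np.random.default_rng() if rng is None else rng
    signal_power = np.mean(np.abs(measurements) ** 2)
    noise_power = signal_power / 10 ** (snr_db / 10)
    noise = rng.normal(0.0, np.sqrt(noise_power), measurements.shape)
    return measurements + noise


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.random((101, 101))  # stand-in for |X_r(m, k)|^2
    noisy = add_noise(clean, snr_db=40, rng=rng)
    # Largest absolute perturbation, playing the role of |eps| in the theorem.
    print(np.max(np.abs(noisy - clean)))
```

The largest absolute perturbation printed at the end corresponds to $| ϵ |$ in Theorem 3.1, so a higher SNR directly tightens the bounds (22) and (23).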

Figure 7. Relative error in recovering noisy Gaussian signal using PAR (W = 7).

Figure 8. Relative error in recovering noisy chirp signal using PAR (W = 7).

Figure 9. Relative error in recovering temperature signal using PAR (W = 7).

4.3. Relative error and μ

To verify our error estimation, we discuss the relationship between the relative error and μ in Equation (29) using a Gaussian complex signal. Here we choose a Hamming window for the STFT. Figure 10 shows that when $\mathrm{SNR} \geq 40$ , the signal is retrieved correctly for $μ \geq 20$ . Moreover, when $\min_{n \in V (x)} | x (n) |^{2}$ is larger, the PAR method is more robust. This verifies our error estimate in Equation (22) and suggests that, when dealing with real data, we can translate the signal to enlarge $\min_{n \in V (x)} | x (n) |^{2}$ .
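The role of μ can be illustrated by computing $C = \min_{n} | x (n) |^{2}$ for signals drawn from Equation (29) with different means. This is a small sketch with a hypothetical helper name and arbitrary seeds; it only shows how $C$ , and hence the factor $1 / C$ in the phase bound (22), changes with μ, and it does not run the PAR algorithm itself.

```python
import numpy as np


def min_squared_magnitude(mu, N=101, seed=0):
    """Return C = min_n |x(n)|^2 for one draw of the signal in Equation (29)."""
    rng = np.random.default_rng(seed)
    x = rng.normal(mu, 1.0, N) + 1j * rng.normal(0.0, 1.0, N)
    return np.min(np.abs(x) ** 2)


if __name__ == "__main__":
    for mu in (0, 5, 10, 20):
        C = min_squared_magnitude(mu)
        # The phase bound (22) is proportional to 1 / C.
        print(f"mu = {mu:2d}: C = {C:.3f}, 1/C = {1 / C:.4f}")
```

Shifting the real part by a constant moves every sample away from the origin, so $C$ grows roughly like $μ^{2}$ for large μ and the phase bound shrinks accordingly.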

Figure 10. Relative error in recovering noisy Gaussian signal using PAR in different μ.

Figure 11. Relative error in recovering Gauss complex signal with Hamming window using PAR and LS method (SNR=100).

4.4. Relative error between LS and PAR algorithm

In this section, we compare the performance of the PAR algorithm and the classical LS algorithm [Citation19] for different window lengths (Figure 11). The PAR method performs well even when the window length W is small, in contrast to the LS method. Therefore, the PAR method imposes fewer restrictions on the window length than the LS method.

As the experiments show, for the Gaussian complex, chirp and real temperature signals alike, our method performs better than the LS method. Note, however, that the retrieval conditions of PAR rest on the assumptions that the signal is non-vanishing and that the graph induced by the windows is connected. Some signals and windows do not satisfy these assumptions, and our method is not suitable in those situations.

5. Conclusion and further work

Previous research proposed STFT PR methods under relatively strict retrieval conditions. This paper proposes the PAR reconstruction algorithm, which has milder retrieval conditions and solves the PR problem by computing the phase and the amplitude separately. Compared with previous methods, the PAR reconstruction algorithm (Algorithm 1) places a milder constraint on the windows and can solve STFT phase retrieval problems in both the single-window and multiple-window cases, provided that the noise is low and the minimal magnitude of the nonzero components of the original signal is large enough. Numerical results show that PAR performs better when the noise level is low and that a translation of the initial signal can lead to a more robust result. In addition, since the computational cost is $O (N \log N)$ when $l$ is selected in Algorithm 1, PAR can also be used to initialize other non-convex methods.

Disclosure statement

The authors report no relevant financial or non-financial competing interests.

Additional information

Funding

This work was supported in part by the National Key Basic Research Program (Grant No. 2020YFA0713504) and the National Natural Science Foundation of China (Grant Nos. 61977065 and 11971489).

References

Harrison RW. Phase problem in crystallography. J Opt Soc Amer A. 1993;10(5):1046–1055.
Miao J, Ishikawa T, Shen Q, et al. Extending x-ray crystallography to allow the imaging of noncrystalline materials, cells, and single protein complexes. Annu Rev Phys Chem. 2008;59(1):387–410.
Bunk O, Diaz A, Pfeiffer F, et al. Diffractive imaging for periodic samples: retrieving one-dimensional concentration profiles across microfluidic channels. Acta Crystallogr. 2007;63(4):306–314.
Fienup C, Dainty J. Phase retrieval and image reconstruction for astronomy. Image Recovery: Theor Appl. 1987;231:275.
Stepanova IE, Gudkova TV, Salnikov AM, et al. A new approach to analytical modeling of Mars's magnetic field. Appl Math Sci Eng. 2022;30(1):41–60.
Walther A. The question of phase retrieval in optics. Optica Acta Int J Opt. 1963;10(1):41–49.
Geng Y, Wen X, Tan J, et al. Noise-robust phase retrieval by optics path modulation with adaptive feedback. Opt Commun. 2022;515:128199.
Fienup JR, Guizar-Sicairos M. Phase retrieval with transverse translation diversity: a nonlinear optimization approach. Opt Express. 2008;16(10):7264–7278.
Maiden AM, Humphry MJ, Zhang F, et al. Superresolution imaging via ptychography. J Opt Soc Am A Opt Image Sci Vis. 2011;28(4):604–612.
Trebino R, Guang Z, Zhu P, et al. The measurement of ultrashort laser pulses. 2018 2nd URSI Atlantic Radio Science Meeting (AT-RASC); IEEE; 2018. p. 1–3.
Kane DJ. Principal components generalized projections: a review [invited]. J Opt Soc Am B. 2008;25(6):A120–A132.
Lin W, Zhang R, Xu X, et al. Phaseless signal recovery from triple-window short-time Fourier measurements. Int J Phys Conf Ser. 2020;1617:012081. IOP Publishing.
Grohs P, Liehr L, Rathmair M. Multi-window STFT phase retrieval: lattice uniqueness. Preprint 2022. Available from: arXiv:2207.10620.
Li L, Cheng C, Han D, et al. Phase retrieval from multiple-window short-time Fourier measurements. IEEE Signal Process Lett. 2017;24(4):372–376.
Guo Y, Wang A, Wang W. Multi-source phase retrieval from multi-channel phaseless STFT measurements. Signal Process. 2018;144:36–40.
Grohs P, Koppensteiner S, Rathmair M. Phase retrieval: uniqueness and stability. SIAM Rev. 2020;62(2):301–350.
Jaganathan K, Oymak S, Hassibi B. Sparse phase retrieval: uniqueness guarantees and recovery algorithms. IEEE J Sel Top Signal Process. 2017;10(4):770–781.
Sun DL, Smith JO III. Estimating a signal from a magnitude spectrogram via convex optimization. arXiv. 2012;1209.2076. DOI:10.48550/arXiv.1209.2076
Bendory T, Eldar YC. Non-convex phase retrieval from STFT measurements. IEEE Trans Inform Theor. 2018;64(99):1–1.
Pfander GE, Salanevich P. Robust phase retrieval algorithm for time-frequency structured measurements. SIAM J Imaging Sci. 2019;12(2):736–761.
Alexeev B, Bandeira AS, Fickus M, et al. Phase retrieval with polarization. SIAM J Imaging Sci. 2014;7(1):35–66.
Iwen MA, Viswanathan A, Wang Y. Fast local correlation measurements. SIAM J Imaging Sci. 2016;9(4):1655–1688.
Iwen MA, Preskitt B, Saab R, et al. Phase retrieval from local measurements: improved robustness via eigenvector-based angular synchronization. Appl Comput Harmon Anal. 2020;48:415–444.
Bendory T, Eldar YC. Phase retrieval from STFT measurements via non-convex optimization. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2017. p. 4770–4774.
