Full article: Recovery from Power Sums

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We study the problem of recovering a collection of n numbers from the evaluation of m power sums. This yields a system of polynomial equations, which can be underconstrained (m < n), square (m = n), or overconstrained (m > n). Fibers and images of power sum maps are explored in all three regimes, and in settings that range from complex and projective to real and positive. This involves surprising deviations from the Bézout bound, and the recovery of vectors from length measurements by p-norms.

Keywords:

1 Introduction

This article offers a case study in solving systems of polynomial equations. Our model setting reflects applications of nonlinear algebra in engineering, notably in signal processing [Citation15], sparse recovery [Citation7], and low rank recovery [Citation8]. Suppose there is a secret list of complex numbers $z_{1}, z_{2}, \dots, z_{n}$ . Our task is to find them. Measurements are made by evaluating the m power sums $c_{j} = \sum_{i = 1}^{n} z_{i}^{a_{j}}$ , where $A = {a_{1}, a_{2}, \dots, a_{m}}$ is a set of m distinct positive integers. Our aim is to recover the multiset $z = {z_{1}, \dots, z_{n}}$ from the vector $c = (c_{1}, \dots, c_{m})$ .

To model this problem, for any given pair $(n, A)$ , we consider the polynomial map(1) $ϕ_{A, C} : C^{n} \to C^{m}, where ϕ_{j} = x_{1}^{a_{j}} + x_{2}^{a_{j}} + \dots + x_{n}^{a_{j}} for j = 1, 2, \dots, m .$ (1)

We are interested in the image and the fibers of the map $ϕ_{A, C}$ . The study of these complex algebraic varieties addresses the following questions: Is recovery possible? Is recovery unique? This problem is especially interesting when $z_{1}, z_{2}, \dots, z_{n}$ are real, or even positive. Hence, we also study the maps $ϕ_{A, R}$ and $ϕ_{A, \geq 0}$ that are obtained by restricting $ϕ_{A, C}$ to $R^{n}$ and $R_{\geq 0}^{n}$ , respectively. For any of these, we study the following system of m equations in n unknowns:(2) $ϕ_{A, •} (x) = c .$ (2)

There are three different regimes. If m > n then (2) is overconstrained and has no solution, unless $c = ϕ_{A, C} (z)$ for some $z \in C^{n}$ , and we anticipate unique recovery of ${z_{1}, \dots, z_{n}}$ . If m = n then (2) is expected to have finitely many solutions, at most the Bézout number $a_{1} a_{2} \dots a_{n}$ . If m < n then the solutions to (2) form a variety of expected dimension n – m.

Example 1

(n = 3). We illustrate the three regimes. Consider the multiset $z = {6, 8, 13}$ . We first allow m = 4 measurements, with $A = {2, 5, 7, 8}$ . Then the system (2) equals(3) $\begin{matrix} x_{1}^{2} + x_{2}^{2} + x_{3}^{2} & = & 269, & x_{1}^{5} + x_{2}^{5} + x_{3}^{5} & = & 411837, \\ x_{1}^{7} + x_{2}^{7} + x_{3}^{7} & = & 65125605, & x_{1}^{8} + x_{2}^{8} + x_{3}^{8} & = & 834187553. \end{matrix}$ (3)

For the lexicographic term order with $x_{1} > x_{2} > x_{3}$ , we compute the reduced Gröbner basis ${x_{1} + x_{2} + x_{3} - 27, x_{2}^{2} + x_{2} x_{3} + x_{3}^{2} - 27 (x_{2} + x_{3}) + 230, (x_{3} - 6) (x_{3} - 8) (x_{3} - 13)} .$

This is a 0-dimensional radical ideal, having six zeros, so $z = {6, 8, 13}$ is recovered uniquely.

We next take m = 3 with $A = {2, 5, 7}$ . Here, we solve the first three equations in (3). This square system has 66 complex solutions, four less than the Bézout number $70 = 2 \times 5 \times 7$ . Finally, we allow only m = 2 measurements, with $A = {2, 5}$ . The first two equations in (3) define a curve of degree 10 = 2 × 5 in $C^{3}$ . Its closure in $P^{3}$ is a singular curve of genus 14.

Remark 2.

In applications, noise in the data is a concern. This makes our problem interesting even for $A = {1, 2, \dots, m}$ . However, from the perspectives of algebraic geometry and exact computations, the dense case is solved. The power sums reveal the elementary symmetric functions, via Newton’s identities. Our recovery problem amounts to finding the roots of a polynomial of degree n in one variable. For related work see [Citation1]. The recent article [Citation15] studies a more general version of the dense problem: The authors consider the composition γ of the map $ϕ_{A}$ with a linear map $T : C^{m} \to C^{n}$ where $m \leq n$ and $A = {1, \dots, m}$ . Their Theorem 1 states that, if T is generic, then $γ^{- 1} (γ (x))$ is finite for every $x \in C^{m}$ .

In this paper, $A$ is any set of m distinct positive integers. Our presentation is organized as follows. In Section 2 we show that, for $m \leq n$ , the fiber of $ϕ_{A, C}$ above a generic point in $C^{m}$ has the expected dimension n – m. For m > n we expect the recovery of complex multisets from power sums to be unique when $\gcd (a_{1}, \dots, a_{m}) = 1$ . This is stated in Conjecture 6. In Section 3, we study the case m = n. We propose a formula for the number of solutions of (2). This number is generally less than the Bézout number $a_{1} a_{2} \dots a_{n}$ . For instance, in Example 1, the drop is from 70 to 66. We shall explain this. This issue is closely related to the question, put forth in [Citation3], for which sets $A$ the power sums form a regular sequence. We present supporting evidence for the conjectures made in [Citation3] and we offer generalizations.

In Section 4 we turn to the images of the power sum maps $ϕ_{A, C}, ϕ_{A, R}$ , and $ϕ_{A, \geq 0}$ . The image of $ϕ_{A, C}$ is constructible and has the expected dimension $min (m, n)$ , but it is generally not closed in $C^{m}$ . In the overconstrained case (m > n), we study the degree and equations of the closure of the image. For instance, the image of $ϕ_{A, C} : C^{3} \to C^{4}$ in Example 1 is defined by a polynomial of degree 45 with 304 terms. The image of the real map $ϕ_{A, R}$ is semialgebraic in $R^{m}$ . It is closed if some a_i is even. Moreover, the orthant $R_{\geq 0}^{n}$ is mapped to a closed subset of $R_{\geq 0}^{m}$ . It is a challenging task to find a semi-algebraic description of the image. We take first steps by exploring its algebraic boundary. Delineating the real image involves the ramification locus in $C^{n}$ and its image in $C^{m}$ , which is the branch locus of $ϕ_{A, C}$ .

In Section 5, we examine our problem over the positive real numbers. Here, the recovery from power sums is equivalent to recovery from length measurements by various p-norms. This enables a better understanding of the map $ϕ_{A, \geq 0}$ . We prove that recovery is unique in the square case n = m, see Proposition 24. The image of $ϕ_{A, \geq 0}$ is expressed as a compact subset in the probability simplex $Δ_{m - 1}$ . Theorem 27 characterizes the structure of this set.

2 Fibers

Consider the map $ϕ = ϕ_{A, C}$ from $C^{n}$ to $C^{m}$ whose coordinates are $ϕ_{j} = \sum_{i = 1}^{n} x_{i}^{a_{j}}$ . In this section we examine the fibers of $ϕ$ and we show that they have the expected dimension. We conclude with a discussion concerning the uniqueness of recovery in the case $m = n + 1$ .

Given a point $c = (c_{1}, \dots, c_{m})$ in $C^{m}$ , the defining ideal of the fiber $ϕ^{- 1} (c)$ equals $I_{c} = 〈 ϕ_{1} (x) - c_{1}, \dots, ϕ_{m} (x) - c_{m} 〉 \subset C [x_{1}, \dots, x_{n}] .$

Our recovery problem amounts to computing the variety $V (I_{c}) = ϕ^{- 1} (c)$ defined by I_c in $C^{n}$ .

Proposition 3.

Assume $m \leq n$ . Then the following hold:

The map $ϕ$ is dominant, i.e., the image of $ϕ$ is dense in $C^{m}$ .
For generic c, the ideal I_c is radical, and its variety $V (I_{c})$ has dimension n – m.

Proof.

The fiber of $ϕ$ above a point c is the variety $V (I_{c}) \subset C^{n}$ . By [14, Lemma 054Z], the fibers of $ϕ$ are generically reduced. This implies that the ideal I_c is radical for all points c outside a proper closed subset of $C^{m}$ . The Jacobian of the map $ϕ$ is the m × n matrix(4) $I = {(\begin{matrix} \frac{\partial ϕ_{j}}{\partial x_{i}} \end{matrix})}_{\overset{1 \leq i \leq n}{1 \leq j \leq m}} = {(\begin{matrix} a_{j} x_{i}^{a_{j} - 1} \end{matrix})}_{\overset{1 \leq i \leq n}{1 \leq j \leq m}} .$ (4)

Up to multiplication by a positive integer, each m × m minor of this matrix is the product of a Vandermonde determinant and a Schur polynomial; see (8). In particular, none of these minors of $I$ is identically zero. Thus, the Jacobian matrix $I$ has rank m over the field $C (x_{1}, \dots, x_{m})$ . By [9, I.11.4], this implies that the polynomials $ϕ_{1}, \dots, ϕ_{m}$ are algebraically independent over $C$ . From this we conclude that the associated ring homomorphism $ϕ^{*} : C [y_{1}, \dots, y_{m}] \to C [x_{1}, \dots, x_{n}], y_{i} \mapsto ϕ_{i} (x)$ is injective. Hence, our map $ϕ$ is dominant, by [14, Lemma 0CC1]. The statement in (i) that the image is dense refers either to the Zariski topology or to the classical topology. Both have the same closure in this situation, by [10, Corollary 4.20].

It now follows from [11, Theorem 9.9 (b)] that, for all points c outside a proper Zariski closed subset of $C^{m}$ , the fiber $ϕ^{- 1} (c)$ has dimension n – m. This finishes the proof. □

The condition that the point c is generic is crucial in Proposition 3. The following example shows that the fiber dimension can jump up for special points $c \in C^{m}$ .

Example 4

(n = 3). Let m = 3 and $A = {3, 5, 7}$ . The generic fiber of the map $ϕ$ consists of 60 points in $C^{3}$ . Interestingly, that number would increase to 66 if the number 3 in our set $A$ were replaced with the number 2, as we saw in Example 1. Now, consider the fiber over $c = (0, 0, 0)$ . It is defined by the homogeneous ideal $I_{0} = 〈 x_{1}^{a} + x_{2}^{a} + x_{3}^{a} : a \in A 〉$ . This ideal defines three lines of multiplicity three, with an embedded point at the origin. The radical of this ideal equals $〈 x_{1} + x_{2}, x_{3} 〉 \cap 〈 x_{1} + x_{3}, x_{2} 〉 \cap 〈 x_{2} + x_{3}, x_{1} 〉$ .

Let us assume m > n, so we are in the overconstrained case. The following statement is derived from the m = n case in Proposition 3, namely by adding additional constraints:

Corollary 5.

For m > n, the fiber of $ϕ$ above a generic point in $C^{m}$ is empty. The closure of the image of $ϕ$ is an irreducible variety of dimension n in $C^{m}$ . The same holds over $R$ .

Describing the image of $ϕ$ will be our topic in Sections 4 and 5. A generic point c in that image can be created easily, namely by setting $c = ϕ (z)$ where $z = (z_{1}, \dots, z_{n})$ is any generic point in $C^{n}$ . We are interested in the fiber $ϕ^{- 1} (c)$ over such a point c. By construction, that fiber is nonempty: it contains all $n!$ points that are obtained from z by permuting coordinates. For the remainder of this section, assume $\gcd (a_{1}, \dots, a_{m}) = 1$ . Then we conjecture that there are no other points in that fiber. This would mean that the set ${z_{1}, \dots, z_{n}}$ can be recovered uniquely from any m of its power sums, provided $m \geq n + 1$ .

Conjecture 6. The recovery of a set of n complex numbers from n + 1 power sums with coprime powers is unique. To be precise, for $m = n + 1$ , the map $ϕ$ is generically injective. This means that, for generic points $z \in C^{n}$ , the fiber $ϕ^{- 1} (ϕ (z))$ coincides with the set of $n!$ coordinate permutations of z.

We are also interested in the following more general conjecture. Let $τ = (τ_{1}, \dots, τ_{n})$ be in $R_{> 0}^{n}$ and consider the map $ψ : C^{n} \to C^{m}$ , where $ψ_{j} = \sum_{i = 1}^{n} τ_{i} x_{i}^{a_{j}}$ . Let $Stab (τ)$ be the subgroup of the symmetric group S_n consisting of all coordinate permutations that fix τ.

Conjecture 7. For generic points $z \in C^{n}$ , the fiber $ψ^{- 1} (ψ (z))$ is precisely the set of all coordinate permutations of z. The cardinality of this set is equal to $| Stab (τ) |$ .

By computing Gröbner bases, we confirmed Conjectures 6 and 7 for a range of small cases: Conjecture 6 holds for n = 4 and $\sum_{a \in A} a \leq 52$ , and for n = 5 and $\sum_{a \in A} a \leq 49$ . Conjecture 7 holds for $(n, m) = (2, 3)$ and $\sum_{a \in A} a \leq 100$ , for $(n, m) = (3, 4)$ and $\sum_{a \in A} a \leq 69$ , and for $(n, m) = (4, 5)$ and $\sum_{a \in A} a \leq 49$ . In all cases we took random integers $1 \leq τ_{i} \leq 100$ .

3 Square Systems

We here fix n = m, so we study the square case. By Proposition 3, our system (2) has finitely many solutions in $C^{n}$ . Our aim is to find their number. We study this for n = 2 (Proposition 10) and n = 3 (Conjecture 14). This generalizes a conjecture of Conca, Krattenthaler and Watanabe [3, Conjecture 2.10]. We conclude with a discussion of the general case $n \geq 4$ .

Our point of departure is a result which links Proposition 3 with Bézout’s Theorem.

Proposition 8.

For general measurements $c \in C^{n}$ , the square system (2) has finitely many complex solutions $x \in C^{n}$ . The number of these solutions is bounded above by $a_{1} a_{2} \dots a_{n}$ .

We now define the homogenized system (HS) to be the system (2), where c_j is replaced by $c_{j} x_{0}^{a_{j}}$ . Note that (HS) has its solutions in $P^{n}$ . What we are interested in for our recovery problem are the solutions that do not lie in the hyperplane at infinity ${x_{0} = 0}$ . Next, we define the system at infinity (SI) to be (2) with c = 0. The solutions of (SI) are in $P^{n - 1}$ . The cone over that projective scheme is the zero fiber of the map $ϕ$ . We will use the notations (HS) and (SI) both for the systems of equations and the projective schemes defined by them.

Remark 9.

The scheme (HS) is in general not the projective closure of the affine part defined by (2), as it can contain higher-dimensional components. For example, set $n = m = 4$ , and let $A$ consist of four odd coprime integers. The variety in $C^{4}$ defined by the system (2) is zero-dimensional by Lemma 3. However, the scheme (HS) is not zero-dimensional in $P^{4}$ , since it contains the lines defined by $x_{i} = - x_{j}, x_{k} = - x_{l}, x_{0} = 0$ for ${i, j, k, l} = {1, 2, 3, 4}$ .

The solutions of (HS) that lie in the hyperplane ${x_{0} = 0}$ are precisely the solutions to (SI). However, the multiplicities are different. If the variety (SI) in $P^{n - 1}$ is finite, then the number of solutions to (2) in $C^{n}$ equals $a_{1} a_{2} \dots a_{n}$ minus the total length of (HS) along (SI). For n = 2, this observation fully determines the number of solutions to (2) in terms of $A$ .

Proposition 10

(n = 2). Assume $a_{1} < a_{2}$ . For generic $(c_{1}, c_{2}) \in C^{2}$ , the number of common solutions in $C^{2}$ to the equations $x_{1}^{a_{1}} + x_{2}^{a_{1}} = c_{1}$ and $x_{1}^{a_{2}} + x_{2}^{a_{2}} = c_{2}$ equals $a_{1} (a_{2} - gcd (a_{1}, a_{2}))$ if both $a_{1} / gcd (a_{1}, a_{2})$ and $a_{2} / gcd (a_{1}, a_{2})$ are odd. It equals the Bézout number $a_{1} a_{2}$ otherwise.

Proof.

First assume $gcd (a_{1}, a_{2}) = 1$ . The binary forms $x_{1}^{a_{1}} + x_{2}^{a_{1}}$ and $x_{1}^{a_{2}} + x_{2}^{a_{2}}$ are relatively prime, unless both a₁ and a₂ are odd, so $x_{1} + x_{2}$ divides both forms. In the former case, (SI) has no solutions, so the number of solutions to (2) equals the Bézout number $a_{1} a_{2}$ . If a₁ and a₂ are odd, then (SI) $= {x_{1} + x_{2} = 0}$ defines the point $(1 : - 1)$ on the line $P^{1}$ , corresponding to the point $P = (1 : - 1 : 0)$ of the scheme (HS). The multiplicity of (HS) at P can be computed locally in the chart ${x_{2} \neq 0}$ by setting $x_{2} = - 1$ . It is the multiplicity at the point (0, 1) of the affine scheme in $C^{2}$ defined by the ideal $I = 〈 x_{1}^{a_{1}} - x_{0}^{a_{1}} - 1, x_{1}^{a_{2}} - x_{0}^{a_{2}} - 1 〉$ .

Write $m_{P'} = 〈 x_{0}, x_{1} - 1 〉$ for the maximal ideal of $P' = (0, 1)$ in the local ring $O_{P'}$ of the curve $V (x_{1}^{a_{1}} - x_{0}^{a_{1}} - 1) \subset C^{2}$ . In $O_{P'}$ we have $x_{1} - 1 = \frac{x_{1}^{a_{1}} - 1}{u} = \frac{x_{0}^{a_{1}}}{u}$ , where u is a unit. In fact, u is a certain product of cyclotomic polynomials in x₁. Therefore, x₀ is a uniformizer, i.e., $m_{P'} = 〈 x_{0} 〉$ , and $x_{1} - 1$ is contained in $m_{P'}^{a_{1}} ∖ m_{P'}^{a_{1} + 1}$ . From this we conclude $x_{1}^{a_{2}} - x_{0}^{a_{2}} - 1 = ({((x_{1} - 1) + 1)}^{a_{2}} - x_{0}^{a_{2}} - 1) = \sum_{i = 1}^{a_{2}} (\begin{matrix} a_{2} \\ i \end{matrix}) {(x_{1} - 1)}^{i} - x_{0}^{a_{2}} \in m_{P'}^{a_{1}} ∖ m_{P'}^{a_{1} + 1} .$

Hence, $x_{1}^{a_{2}} - x_{0}^{a_{2}} - 1$ vanishes to order a₁ at $P'$ . We conclude that the multiplicity of (HS) in P is a₁. Therefore, the system (2) has $a_{1} (a_{2} - 1) = a_{1} (a_{2} - \gcd (a_{1}, a_{2}))$ solutions in $C^{2}$ .

Finally, suppose that a₁ and a₂ are not relatively prime, and set $g = gcd (a_{1}, a_{2})$ . We replace x₁, x₂ by $x_{1}^{g}, x_{2}^{g}$ , and we apply our previous analysis to the two equations(5) ${(x_{1}^{g})}^{a_{1} / g} + {(x_{2}^{g})}^{a_{1} / g} = c_{1} and {(x_{1}^{g})}^{a_{2} / g} + {(x_{2}^{g})}^{a_{2} / g} = c_{2} .$ (5)

The system (5) has solutions at infinity if and only if $a_{1} / g$ and $a_{2} / g$ are both odd. In that case, we have (SI) $= {x_{1}^{g} + x_{2}^{g} = 0}$ , which defines the g points ${(ζ^{i} : 1)}_{i = 1, \dots, g}$ in $P^{1}$ , where ζ is a primitive gth root of –1. Each of the corresponding points in (HS) has multiplicity a₁. This can be computed analogously to what we did for P in the argument above. □

We turn to n = 3, and we assume $\gcd (a_{1}, a_{2}, a_{3}) = 1$ . Our problem is now much harder. It is unknown when (SI) has any solutions in $P^{2}$ . No solutions means that the power sums $ϕ_{1}, ϕ_{2}, ϕ_{3}$ form a regular sequence. Conca, Krattenthaler and Watanabe [3, Conjecture 2.10] suggest that this holds if and only if $a_{1} a_{2} a_{3} \equiv 0 mod 6$ ; we call this the CKW conjecture. They prove the “only if” part in [3, Lemma 2.8]. Another proof for this part is given by the next lemma. Set $A_{p} = {a_{1} mod p, a_{2} mod p, a_{3} mod p}$ for p = 2, 3. Thus, $A_{2} \subseteq {0, 1}$ and $A_{3} \subseteq {0, 1, 2}$ . We assumed $A_{p} = {0}$ for p = 2, 3. Let ζ be a primitive cube root of unity.

Lemma 11.

The points $(1 : - 1 : 0), (1 : 0 : - 1),$ and $(0 : 1 : - 1)$ are in (SI) if and only if $0 \notin A_{2}$ , and the points $(1 : ζ : ζ^{2})$ and $(1 : ζ^{2} : ζ)$ are in (SI) if and only if $0 \notin A_{3}$ .

Proof.

If n is a prime, ξ is a primitive nth root of unity, and a is a multiple of n, then the power sum $x_{1}^{a} + x_{2}^{a} + \dots + x_{n}^{a}$ does not vanish at $(1, ξ, \dots, ξ^{n - 1})$ , but rather it evaluates to n. We obtain the assertion by specializing to n = 2 and n = 3. □

The CKW conjecture states that (SI) has no solutions when $0 \in A_{2} \cap A_{3}$ . It is shown in [Citation3, Theorem 2.11] that this holds if ${1, n} \subset A$ with $2 \leq n \leq 7$ , or if ${2, 3} \subset A$ . The proof rests on the expression of power sums in terms of elementary symmetric polynomials.

In what follows we present conjectures that imply the CKW conjecture. We begin with a converse to Lemma 11. Theorems 13 and 15 verify all conjectures for some new cases.

Conjecture 12. We have $(SI) \subseteq {(1 : - 1 : 0), (1 : 0 : - 1), (0 : 1 : - 1), (1 : ζ : ζ^{2}), (1 : ζ^{2} : ζ)}$ .

This generalizes [3, Conjecture 2.10] since the five possibilities for points on (SI) do not occur if $0 \in A_{2} \cap A_{3}$ . We show some new cases of the conjecture using computational tools.

Theorem 13.

Conjecture 12 holds for all $a_{1} < a_{2} < a_{3}$ with $a_{1} + a_{2} + a_{3} \leq 300$ .

Proof.

Let $P' = (α : β : γ)$ be a point on (SI), corresponding to $P = (α : β : γ : 0)$ on (HS). After permuting coordinates, we may assume $α \neq 0$ . Then $P'$ is in the affine chart $C^{2}$ of $P^{2}$ given by $x = x_{2} / x_{1}$ and $y = x_{3} / x_{1}$ . The restriction of (SI) to that plane $C^{2}$ is defined by(6) $x^{a_{1}} + y^{a_{1}} + 1 = x^{a_{2}} + y^{a_{2}} + 1 = x^{a_{3}} + y^{a_{3}} + 1 = 0.$ (6)

Conjecture 12 states that the number of solutions to the system (6) is 0, 2 or 4, as follows:

Table

Display Table

We verified the counts in the second column for all $a_{1} < a_{2} < a_{3}$ with $a_{1} + a_{2} + a_{3} \leq 300$ . We did this using the Gröbner basis implementation in the computer algebra system magma. The same would be doable with other tools for bivariate equations. □

Conjecture 14. For n = 3 and $gcd (a_{1}, a_{2}, a_{3}) = 1$ , the following holds for the system (2):

If $0 \in A_{3}$ , then we have $# Solutions = {\begin{matrix} a_{1} a_{2} a_{3} & if A_{2} = {1, 0}; \\ a_{1} a_{2} a_{3} - 3 a_{1} a_{2} & if A_{2} = {1} . \end{matrix}$

If $A_{3} = {1}$ or {2}, then we have $# Solutions = {\begin{matrix} a_{1} a_{2} a_{3} - 4 a_{1} & if A_{2} = {1, 0}; \\ a_{1} a_{2} a_{3} - 4 a_{1} - 3 a_{1} a_{2} & if A_{2} = {1} . \end{matrix}$

If $A_{3} = {1, 2}$ , then we have $# Solutions = {\begin{matrix} a_{1} a_{2} a_{3} - 2 i_{A} & if A_{2} = {1, 0}; \\ a_{1} a_{2} a_{3} - 2 i_{A} - 3 a_{1} a_{2} & if A_{2} = {1} . \end{matrix}$

Here $i_{A}$ is the index of nilpotency of the zero-divisor x₀ in the homogeneous system (HS).

At present we do not have a simple formula for the number $i_{A}$ in all cases. Computationally, it can be found from the homogeneous ideal $I = 〈 f_{1}, f_{2}, f_{3} 〉$ that is generated by $f_{j} = x_{1}^{a_{j}} + x_{2}^{a_{j}} + x_{3}^{a_{j}} - c_{j} x_{0}^{a_{j}}$ for j = 1, 2, 3. Using ideal quotients, the definition is as follows: $(I : x_{0}^{i_{A} - 1}) ⊊ (I : x_{0}^{i_{A}}) = (I : x_{0}^{i_{A} + 1}) .$

From our computations it seems that $i_{A}$ is always either a₁ or a₂ or $2 a_{1}$ .

Theorem 15.

Conjecture 14 holds for all $a_{1} < a_{2} < a_{3}$ with $a_{3} \leq 20$ or $a_{1} + a_{2} + a_{3} \leq 40$ .

Proof.

Our approach is to compute the multiplicity in (HS) for each point that is known (by Theorem 13) to lie in (SI). We conjecture that these multiplicities are as follows:

If $A_{2} = {1}$ , then the point $(1 : - 1 : 0 : 0)$ has multiplicity $a_{1} a_{2}$ in (HS);
if $A_{3} = {1, 2}$ , then the point $(1 : ζ : ζ^{2} : 0)$ has multiplicity $i_{A}$ in (HS);
if $| A_{3} | = 1$ , then $2 i_{A} = 2 a_{1}$ and this is the multiplicity of $(1 : ζ : ζ^{2} : 0)$ in (HS).

These claims imply Conjecture 14, by our previous analysis. Indeed, if $0 \in A_{2} \cap A_{3}$ , then (SI) is empty and the number of solutions to (2) is the Bézout number $a_{1} a_{2} a_{3}$ . Otherwise, we need to subtract the multiplicities above, according to the various cases. Here the number in (i) is multiplied by 3 since the S₃-orbit of $(1 : - 1 : 0 : 0)$ has three points, and the numbers in (ii) and (iii) are multiplied by 2 since the S₃-orbit of $(1 : ζ : ζ^{2} : 0)$ has two points.

For our computations, we fix $P \in {(1 : - 1 : 0 : 0), (1 : ζ : ζ^{2} : 0)}$ , we focus on the affine chart $C^{3} = {x_{1} = 1}$ , and we consider the ideal $I = 〈 f_{1}, f_{2}, f_{3} 〉$ in the local ring $O_{P, C^{3}}$ . The quotient $V = O_{P, A^{3}} / I$ is a vector space over $C$ , and its dimension is the multiplicity of (HS) at P. We computed this dimension for all values of $a_{1}, a_{2}, a_{3}$ in the stated range, and we verified that (i), (ii), and (iii) are satisfied. This was done using Gröbner bases in magma. □

Extending Conjecture 14 to $n \geq 4$ seems out of reach at the moment, for two reasons. First of all, the conditions on $A$ for (SI) to have no solutions are less simple. For n = 4 with $gcd (a_{1}, a_{2}, a_{3}, a_{4}) = 1$ , Conca, Krattenthaler and Watanabe [3, Conjecture 2.15] state three conditions on $A$ under which (SI) has no solutions. They show that all three conditions are necessary. We verified their conjecture using Gröbner bases in magma for $a_{1} + a_{2} + a_{3} + a_{4} \leq 100$ . Secondly, in the event that (SI) does have solutions, it is not at all obvious what these should be. In general, they are not given only by points whose coordinates are roots of unity, as was the case for n = 3. This happens already for n = 4 as the following example shows:

Example 16.

Set n = 4 and $A = {2, 4, 9, 10} .$ The system (2) has 576 solutions which is 144 less than the Bézout number 720. The scheme (SI) in $P^{3}$ which is defined by the ideal $〈 x_{1}^{2} + x_{2}^{2} + x_{3}^{2} + x_{4}^{2}, x_{1}^{4} + x_{2}^{4} + x_{3}^{4} + x_{4}^{4}, x_{1}^{9} + x_{2}^{9} + x_{3}^{9} + x_{4}^{9}, x_{1}^{10} + x_{2}^{10} + x_{3}^{10} + x_{4}^{10} 〉$ contains 72 distinct points. The minimal polynomial of each of the coordinates of the points in (SI) has degree 36. Every root of this polynomial occurs in each coordinate in exactly two points.

4 Images

We now study the images of the power sum maps $ϕ_{A, C}$ , $ϕ_{A, R}$ , and $ϕ_{A, \geq 0}$ . The recovery problem (2) has a solution if and only if the measurement vector c lies in that image. We know from Chevalley’s Theorem [10, Theorem 4.19] that $im (ϕ_{A, C})$ is a constructible subset of $C^{m}$ . Over the real numbers, the Tarski-Seidenberg Theorem [10, Theorem 4.17] tells us that $im (ϕ_{A, R})$ is a semialgebraic subset of $R^{m}$ and $im (ϕ_{A, \geq 0})$ is a semialgebraic subset of $R_{\geq 0}^{m}$ . It follows from Proposition 3 that, for each of these images, the dimension equals $min (n, m)$ .

We first examine whether the images are closed. We use the classical topology on $R^{m}$ or $C^{m}$ . This makes sense not just over $R$ , but also over $C$ , since the Zariski closure of the image of any complex polynomial map coincides with its classical closure [10, Corollary 4.20].

Proposition 17.

The constructible set $im (ϕ_{A, C})$ is generally not closed in $C^{m}$ . The semialgebraic set $im (ϕ_{A, R})$ is closed in $R^{m}$ when $0 \in A_{2}$ , but it is generally not closed otherwise. Finally, the semialgebraic set $im (ϕ_{A, \geq 0})$ is always closed in $R_{\geq 0}^{m}$ .

Proof.

Let $m = n = 2, a_{1} = 1$ and $a_{2} \geq 3$ odd. Fix any $c = (c_{1}, c_{2}) \in C^{2}$ , with $c_{1} = 0$ and $c_{2} = 0$ . Then (2) has no complex solution because $ϕ_{1} (x)$ divides $ϕ_{2} (x)$ . Thus, c does not belong to the image of $ϕ_{A, C}$ or $ϕ_{A, R}$ . However, c is in the closure of $im (ϕ_{A, C})$ because that closure is $C^{2}$ by Proposition 3 (i). The same counterexample works over the real numbers. Let us now set $c_{1} = ϵ$ for $ϵ > 0$ very small and solve $ϕ_{1} (x) = ϵ$ by setting $x_{2} = ϵ - x_{1}$ . This substitution in $ϕ_{2} (x) = c_{2}$ gives a polynomial equation in one variable x₁ of odd degree a₂. Such an equation always has a real solution $x_{1} (ϵ)$ . The image of the point $(x_{1} (ϵ), ϵ - x_{1} (ϵ))$ converges to c in $R^{2}$ as $ϵ \to 0$ from which we conclude that c lies in the closure of $im (ϕ_{A, R})$ .

We are left with the cases where the image is closed. First suppose $0 \in A_{2}$ . Then there is an even element in $A$ , say a_i. Let $c \in R^{m}$ be in the closure of $im (ϕ_{A, R})$ . There exists a sequence ${x^{(l)}}_{l \geq 0}$ of points in $R^{n}$ such that $ϕ_{A, R} (x^{(l)})$ converges to c as $l \to \infty$ . Since a_i is even, we have $ϕ_{i} (x^{(l)}) = | | x^{(l)} | |_{a_{i}}^{a_{i}}$ , which converges to $c_{i} \geq 0$ as $l \to \infty$ . Hence, the sequence ${x^{(l)}}_{l \geq 0}$ is bounded in the norm $| | \cdot | |_{a_{i}}$ . There is a subsequence that converges to some point $x^{(\infty)}$ in $R^{n}$ . Since the power sum map $ϕ_{A, R}$ is continuous, the image of $x^{(\infty)}$ is equal to c. Therefore, $c \in im (ϕ_{A, R})$ , and we conclude that the real image is closed. Finally, take $A$ arbitrary and consider the nonnegative power map $ϕ_{A, \geq 0}$ . Let $c \in R_{\geq 0}^{m}$ be in the closure of $im (ϕ_{A, \geq 0})$ and let ${x^{(l)}}_{l \geq 0}$ be a sequence of nonnegative points whose images $ϕ_{A, \geq 0} (x^{(l)})$ converge to c as $l \to \infty$ . Now, for any index i, the norm $| | x^{(l)} | |_{a_{i}} = ϕ_{i} {(x^{(l)})}^{1 / a_{i}}$ is bounded, so there exists a convergent subsequence of ${x^{(l)}}_{l \geq 0}$ . Let $x^{(\infty)} \in R_{\geq 0}^{n}$ be its limit. Again, by the continuity of the power map, we have $ϕ_{A, \geq 0} (x^{(\infty)}) = c$ . From this we conclude that $im (ϕ_{A, \geq 0})$ is closed. □

Example 18.

Set $n = m = 2$ and $A = {1, 3}$ . The image of $ϕ_{A, R}$ is the nonclosed set ${(0, 0)} \cup {c \in R^{2} : (c_{1} < 0 and c_{1}^{3} \geq 4 c_{2}) or (c_{1} > 0 and c_{1}^{3} \leq 4 c_{2})} .$

On the other hand, the image of the map restricted to the nonnegative orthant is closed:(7) $im (ϕ_{A, \geq 0}) = {c \in R_{\geq 0}^{2} : c_{2} \leq c_{1}^{3} \leq 4 c_{2}} .$ (7)

In Section 5 we generalize this description of the image of $ϕ_{A, \geq 0}$ to other power sum maps.

We next examine our images through the lens of algebraic geometry. Let $c_{1}, \dots, c_{m}$ be variables with $degree (c_{i}) = a_{i}$ . These are coordinates on the weighted projective space $W P^{m - 1}$ with weights given by $A$ . We regard $ϕ = ϕ_{A, C}$ as a rational map from $P^{n - 1}$ to $W P^{m - 1}$ . The following features of the image will be characterized in Theorem 21: (i) For $m = n + 1$ , the closure of the image $im (ϕ)$ is an irreducible hypersurface in $W P^{m - 1}$ . We give a formula for its degree, which is the weighted degree of its defining polynomial in the unknowns $c_{1}, \dots, c_{m}$ . (ii) For $m \leq n$ , we describe the positive branch locus of the map $ϕ$ . This is a hypersurface in $C^{m}$ . By reasoning as in the proof of [8, Theorem 3.13], this hypersurface represents the algebraic boundary of the image of $ϕ_{A, \geq 0}$ .

To study the branch locus of $ϕ$ , we start with the ramification locus $R$ . This consists of points in $C^{n}$ where $ϕ$ is not smooth [2, Section 2.2, Proposition 8]. Set $ϕ_{j} = \sum_{i = 1}^{n} x_{i}^{a_{j}}$ and $μ = \min {n, m}$ . Let $I \subset C [x_{1}, \dots, x_{n}]$ be the ideal generated by the $μ \times μ$ minors of the Jacobian matrix $I$ as in (4); this is an ideal of height $\leq | m - n | + 1$ . Its variety $R = V (I)$ is the set of points where $I$ has rank less than μ. Each maximal minor of $I$ , up to multiplication with a positive integer, has the form(8) ${(x_{i_{1}} x_{i_{2}} \dots x_{i_{μ}})}^{a_{i_{1}} - 1} \cdot \prod_{1 \leq j < k \leq μ} (x_{i_{j}} - x_{i_{k}}) \cdot S (x_{i_{1}}, x_{i_{2}}, \dots, x_{i_{μ}}),$ (8) for some $1 \leq i_{1} < \dots < i_{μ} \leq n$ . The last factor is a Schur polynomial.

In the square case m = n, the variety $R$ is a reducible hypersurface in $C^{n}$ , given by the vanishing of one polynomial (8). Write $g = gcd (a_{1} - 1, \dots, a_{m} - 1)$ . By [4, Theorem 3.1], the Schur polynomial S is either constant, which happens when $(a_{i} - 1) / g = i - 1$ for $1 \leq i \leq n$ , or it is irreducible. Let $R'$ be the closure in $C^{n}$ of $R ∖ (\cup_{i \neq j} V (x_{i} - x_{j}) \cup V (x_{k}))$ . Thus, $R'$ is the nontrivial component in the ramification locus. Our discussion implies the following:

Proposition 19.

Assume m = n. The ramification variety $R'$ is either empty, in which case we have $(a_{i} - 1) / g = i - 1$ for $1 \leq i \leq n$ , or it is an irreducible hypersurface of degree $\sum_{i = 1}^{n} (a_{i} - 1) - (\begin{matrix} n \\ 2 \end{matrix}) - n (a_{1} - 1) .$

Example 20.

For $A = {3, 6, 7}$ and n = 3, the ideal I is principal. Its generator factors as $x_{1}^{2} x_{2}^{2} x_{3}^{2} (x_{1} - x_{2}) (x_{1} - x_{3}) (x_{2} - x_{3}) (x_{1}^{2} x_{2}^{2} + x_{1}^{2} x_{2} x_{3} + x_{1} x_{2}^{2} x_{3} + x_{1}^{2} x_{3}^{2} + x_{1} x_{2} x_{3}^{2} + x_{2}^{2} x_{3}^{2}) .$

The variety $R'$ is the quartic surface in $C^{3}$ defined by the Schur polynomial in the last factor.

In the overdetermined regime (m > n), there are partial results by Fröberg and Shapiro [Citation5], who study the closure of $R ∖ (\cup_{i \neq j} V (x_{i} - x_{j}))$ in $C^{n}$ which we denote by $R ″$ . Notably, it remains an open problem to find the dimension of $R ″$ . Assuming $a_{1} = g = 1$ , the first interesting case is n = 3, m = 5. Proving the dimension of $R ″$ to be the expected one is equivalent to showing that three complete homogeneous polynomials form a regular sequence. This brings us back to Conca, Krattenthaler and Watanabe [3, Conjecture 2.17].

Now assume $m \leq n$ . Then $R$ contains all linear spaces defined by $n - m + 1$ independent equations of the form x_i = x_j or x_k = 0. We call these positive ramification components. This name is justified as follows. The Schur polynomial S in (8) has positive coefficients, and therefore cannot vanish at nonzero points in the nonnegative orthant $R_{\geq 0}^{n}$ . Hence, only these components contribute to the ramification locus of the positive map $ϕ_{A, \geq 0}$ . A positive branch hypersurface is any irreducible hypersurface in weighted projective space in $W P^{m - 1}$ that is the closure of the image of a positive ramification component under the power sum map $ϕ$ .

Theorem 21.

The following hypersurfaces in $W P^{m - 1}$ are relevant for the image of our map.

If $m = n + 1$ , then the image of $ϕ$ is an irreducible hypersurface in $W P^{m - 1}$ whose weighted degree is at most $(a_{1} a_{2} \dots a_{m}) / (m - 1)!$ . If this ratio is an integer, then this bound can be attained. Specifically, if m = 3 and $a_{1} a_{2} a_{3}$ is even, then it can be attained.
If $m \leq n$ , then the weighted degree of any positive branch hypersurface of $ϕ$ is at most the Bézout number $a_{1} a_{2} \dots a_{m}$ .

Proof.

Let $m \leq n$ . The restriction of $ϕ$ to any positive ramification component is a rational map from $P^{m - 2}$ to $W P^{m - 1}$ . After renaming the x_i if needed, we can write its coordinates as(9) $\sum_{i = 1}^{m - 1} τ_{i} x_{i}^{a_{j}} for j = 1, \dots, m,$ (9) where $τ_{1}, \dots, τ_{m - 1}$ are positive integers. Let $H_{τ}$ denote the image of this map in $W P^{m - 1}$ . This also covers the case $m = n + 1$ in (i) since $ϕ$ has coordinates as in (9) with $τ_{1} = \dots = τ_{m - 1} = 1$ . Hence, all hypersurfaces in (i) and (ii) have the form $H_{τ}$ . Our aim is to compute their degrees.

Fix positive integers $τ_{1}, \dots, τ_{m - 1}$ and set $z_{j} = c_{j}^{1 / a_{j}}$ . Consider the projective space $P^{2 m - 2}$ with coordinates $x_{1}, \dots, x_{m - 1}, z_{1}, \dots, z_{m}$ . Let Z denote the variety in $P^{2 m - 2}$ defined by the homogeneous polynomials $\sum_{i = 1}^{m - 1} τ_{i} x_{i}^{a_{j}} - z_{j}^{a_{j}}$ , for $j = 1, \dots, m$ . By the same reasoning as in Proposition 3, this variety is irreducible and it is a complete intersection of degree $a_{1} a_{2} \dots a_{m}$ .

We consider the image of Z under the coordinate projection(10) $π : P^{2 m - 2} - - \to P^{m - 1}, (x_{1} : \dots : x_{m - 1} : z_{1} : \dots : z_{m}) \mapsto (z_{1} : \dots : z_{m}) .$ (10)

The closure of $π (Z)$ is essentially the hypersurface $H_{τ}$ we care about, but it lives in $P^{m - 1}$ . Its degree in $P^{m - 1}$ with coordinates $(z_{1} : \dots : z_{m})$ coincides with the degree of $H_{τ}$ in $W P^{m - 1}$ with coordinates $(c_{1} : \dots : c_{m})$ . Indeed, these two hypersurfaces have the same defining polynomial, up to the substitution $c_{j} = z_{j}^{a_{j}}$ . The Refined Bézout Theorem [6, 12.3] implies(11) $\deg (π (Z)) \leq \frac{\deg (Z)}{\deg (π |_{Z})} = \frac{a_{1} a_{2} \dots a_{m}}{\deg (π |_{Z})},$ (11) where equality holds if π has no base locus. This immediately proves (ii).

We proceed with proving (i). The degree of $π |_{Z}$ is the size of its generic fiber. This equals the size of the generic fiber of the map given by (10). Conjecture 7 states that the size of the generic fiber of π is the size of the stabilizer of $τ = (τ_{1}, \dots, τ_{m - 1})$ in the symmetric group $S_{m - 1}$ . In particular, it would follow that the generic fiber is a single point if and only if the τ_i are all distinct, and it consists of $(m - 1)!$ points if and only if the τ_i are identical. We do not know yet whether this conjecture holds. But, in any case, the number $| Stab (τ) |$ furnishes a lower bound for the size of a generic fiber and thus for $\deg (π)$ .

Since the hypersurface $im (ϕ)$ in (i) equals $H_{τ}$ for $τ_{1} = \dots = τ_{m - 1} = 1$ Z $τ_{1} = \dots = τ_{m - 1} = 1$ , we conclude from (11) that its weighted degree is at most $(a_{1} a_{2} \dots a_{m}) / (m - 1)!$ . Equality can only hold when the base locus of π on the variety $V (ϕ_{j} (x) - z_{j}^{a_{j}} : j = 1, \dots, m)$ is empty. This happens precisely when the system at infinity (SI) is the empty set. A necessary condition for this to happen is that $(m - 1)!$ divides the Bézout number $a_{1} a_{2} \dots a_{m}$ . One checks that this is also sufficient when m = 3: we saw in Proposition 10, that (SI) is empty when $a_{1} a_{2}$ is even. □

Example 22

(m = 3). Suppose $gcd (a_{1}, a_{2}, a_{3}) = 1$ and $B = a_{1} a_{2} a_{3}$ is even. If n = 2, then the image of $ϕ$ is a curve of expected degree $B / 2$ in $W P^{2}$ . If $n \geq 3$ , then every positive branch curve has expected degree $B / 2$ or B. For instance, if n = 4, then the ramification component ${x_{1} = 0, x_{3} = x_{4}}$ should give a branch curve of degree B, while ${x_{1} = x_{2}, x_{3} = x_{4}}$ should give a branch curve of degree $B / 2$ . We shall see pictures of such curves in the next section.

Example 23

(m = 4). If n = 3 and 6 divides $B = a_{1} a_{2} a_{3} a_{4}$ , then we expect the image of $ϕ$ to have weighted degree $B / 6$ . This would follow from the conjectures in Sections 2 and 3. The positive branch surfaces for $n \geq 4$ should have degrees $B / 6, B / 2$ or B. If 6 does not divide B, then the weighted degrees of the image and branch surfaces in $W P^{3}$ are determined by the base loci. This takes us back to Conjecture 14. To be very explicit, let $A = {2, 5, 7, 8}$ as in Example 1. Here $B / 6 = 560 / 6 = 93.333 \dots$ . The image of our map $ϕ : P^{2} - - \to W P^{3}$ is defined by a homogeneous polynomial of weighted degree 90 with 304 terms, namely $9 c_{1}^{45} - 1050 c_{1}^{41} c_{4} - 3724 c_{1}^{40} c_{2}^{2} + 22400 c_{1}^{39} c_{2} c_{3} - 31000 c_{1}^{38} c_{3}^{2} + \dots - 1966899200 c_{1} c_{4}^{11} + 1258815488 c_{2}^{2} c_{4}^{10} .$

By contrast, consider $A = {2, 5, 7, 9}$ . Now, $B / 6 = 105$ is an integer, and this equals the weighted degree of the image surface. Its defining polynomial has 388 terms, and it looks like $59049 c_{1}^{35} c_{2}^{7} - 459270 c_{1}^{34} c_{2}^{6} c_{3} - 59049 c_{1}^{35} c_{3}^{5} + 255150 c_{1}^{33} c_{2}^{6} c_{4} + \dots + 6350400 c_{2} c_{3}^{4} c_{4}^{8} - 324000 c_{3}^{6} c_{4}^{7} .$

5 Recovery from p-norms

Focusing on the positive region, we now investigate the properties of the map $ϕ_{A, \geq 0}$ . The key fact to be used throughout is that the power sum of degree p represents the p-norm:(12) $| | x | |_{p} = {(\sum_{i = 1}^{n} x_{i}^{p})}^{ 1 / p} for all x = (x_{1}, \dots, x_{n}) \in R_{\geq 0}^{n} .$ (12)

Hence, our recovery problem for nonnegative vectors $x \in R_{\geq 0}^{n}$ is equivalent to recovery of x from values of the p-norms $| | \cdot | |_{p}$ , where p runs over a prespecified set $A$ of positive integers. We are interested in existence and uniqueness of vectors with given p-norms for $p \in A$ .

Let us begin with the basic identifiability question: How many different p-norms are needed to reconstruct a vector in $R_{\geq 0}^{n}$ from their values up to permuting the n coordinates? Conjecture 6 together with (12) would imply that n + 1 different norms suffice. On the other hand, it follows from Proposition 3 that at least n different p-norms are necessary. But are these n measurements already sufficient? We start by showing that this is indeed the case.

Proposition 24.

For m = n, recovery from p-norms is always unique. Given any set $A$ of n positive integers, the map $ϕ_{A, \geq 0} : R_{\geq 0}^{n} \to R_{\geq 0}^{n}$ is injective up to permuting coordinates.

Proof.

Write $ϕ = ϕ_{A, \geq 0}$ . We proceed by induction on n. For n = 1, the map $ϕ$ is obviously injective as it is strictly increasing. Thus, we have unique recovery for n = 1. Let us now prove the statement for arbitrary n. Our argument is based on the calculus fact that a differentiable function from a real interval to $R$ is injective if its derivative has constant sign.

Consider the cone of decreasing vectors, $C = {x \in R^{n} : x_{1} > x_{2} > \dots > x_{n} \geq 0}$ . Let X₁, X₂ be two arbitrary distinct points from this cone. Our claim states that they map to two different points under the map $ϕ$ , i.e., that $ϕ$ is injective on C. Let L be the line segment from X₁ to X₂ and consider the restriction $ϕ |_{L} : R \to R^{n}$ of $ϕ$ to L that is now a function in one variable. Its derivative is given by the product of the Jacobian matrix of $ϕ$ , which we denote by $I_{ϕ}$ , evaluated at L, and the vector $(X_{2} - X_{1})$ . First notice that if $X_{1, n} = X_{2, n} = 0$ , then we are in the case where the induction hypothesis applies. Let us now w.l.o.g. assume $X_{2, n} > 0$ . Then $I_{ϕ}$ is an n × n matrix whose determinant is of the form (8).

The Schur polynomial S does not vanish on $L∖ {X_{1}}$ , and neither do the linear factors. Hence, the coordinates of the vector $I_{ϕ} \cdot (X_{2} - X_{1})$ do not vanish at any point on $L∖ {X_{1}}$ . Each coordinate is a function of constant sign on the whole segment L. This shows that $ϕ$ is injective on the line L. As X₁ and X₂ were chosen to be arbitrary points, we conclude that $ϕ$ is injective on the whole cone C. For a much more general version of this argument, we refer to the equivalence of conditions (inj) and (jac) in [12, Theorem 1.4]. □

The proof above is not algorithmic. It does not tell us how to invert $ϕ$ . Our current method of choice for recovery is solving the equations using numerical algebraic geometry.

The next goal is to characterize the semialgebraic set $im (ϕ_{A, \geq 0})$ inside the nonnegative orthant $R_{\geq 0}^{m}$ . Starting with m = 2, we first present a generalization of the formula (7).

Proposition 25.

Set $m = 2 \leq n$ and $A = {a_{1} < a_{2}}$ . Then the nonnegative image equals (13) $im (ϕ_{A, \geq 0}) = {c \in R_{\geq 0}^{2} : c_{2}^{a_{1}} \leq c_{1}^{a_{2}} \leq n^{a_{2} - a_{1}} c_{2}^{a_{1}}} .$ (13)

Proof.

At any point $x \in R_{\geq 0}^{n}$ , our map evaluates the norms $| | x | |_{a_{j}} = ϕ_{j} {(x)}^{1 / a_{j}}$ for i = 1, 2. The first norm is larger than or equal to the second one: $| | x | |_{a_{1}} \geq | | x | |_{a_{2}}$ . They agree at coordinate points. Their ratio is maximal at $e = (1, 1, \dots, 1)$ . This gives the inequalities $1 \leq \frac{| | x | |_{a_{1}}}{| | x | |_{a_{2}}} \leq \frac{| | e | |_{a_{1}}}{| | e | |_{a_{2}}} = \frac{n^{1 / a_{1}}}{n^{1 / a_{2}}} .$

All values in this range are obtained by some point $x \in R_{\geq 0}^{n}$ . We now raise both sides to the power $a_{1} a_{2}$ and thereafter we clear denominators. This gives the inequalities in (13). □

The proof of Proposition 25 suggests that the study of the nonnegative image $im (ϕ_{A, \geq 0})$ can be simplified by replacing the power sum map by the normalized map into the simplex(14) $ψ_{A} : R_{\geq 0}^{n} - - \to Δ_{m - 1} : x \mapsto \frac{1}{\sum_{j = 1}^{m} | | x | |_{a_{j}}} \cdot (| | x | |_{a_{1}}, | | x | |_{a_{2}}, \dots, | | x | |_{a_{m}}) .$ (14)

Here, $Δ_{m - 1} = {u \in R_{\geq 0}^{m} : u_{1} + u_{2} + \dots + u_{m} = 1}$ is the standard probability simplex. If we know the image of this map then that of the power sum map can be recovered as follows:(15) $im (ϕ_{A, \geq 0}) = {c \in R_{\geq 0}^{m} : \frac{1}{\sum_{j = 1}^{m} c_{j}^{1 / a_{j}}} (c_{1}^{1 / a_{1}}, c_{2}^{1 / a_{2}}, \dots, c_{m}^{1 / a_{m}}) \in im (ψ_{A})} .$ (15)

We next consider the case m = 3. For every $n \geq 3$ , the image is a nonconvex region in the triangle $Δ_{2}$ . These regions get larger as n increases. We illustrate this for an example.

Example 26.

Set $A = {2, 3, 4}$ . For $n \geq 3$ , the image of the norm map $ψ_{A}$ into the triangle $Δ_{2}$ is an n-gon with curvy boundary edges that lies inside the subtriangle ${c_{1} > c_{2} > c_{3}}$ . The edges and diagonals of this n-gon are the following $(\begin{matrix} n \\ 2 \end{matrix})$ curvy segments for $1 \leq i < j \leq n$ : $B_{i j} = ψ_{A} ({x \in R_{\geq 0}^{n} : x_{1} = \dots = x_{i} \geq x_{i + 1} = \dots = x_{j}, and x_{k} = 0 for k > j}) .$

The Zariski closure of B_ij is an irreducible curve. There are $⌊ n / 2 ⌋$ distinct sets B_ij with i = j. For $i \neq j$ , the Zariski closures of B_ij and $B_{j - i, j}$ are the same. Hence, we obtain $\frac{(\begin{matrix} n \\ 2 \end{matrix}) - ⌊ \frac{n}{2} ⌋}{2} + ⌊ \frac{n}{2} ⌋ = ⌊ \frac{n^{2}}{4} ⌋$ distinct complex algebraic curves as Zariski closures of these curvy segments.

For n = 3, there are two distinct branch curves: one curve of degree 12, given by the segment B₁₂, and one of degree 24, given by the two segments B₁₃ and B₂₃. For n = 6, shows the curvy hexagon $im (ψ_{A})$ . Its 15 curvy segments form nine distinct branch curves, six of degree 24 and three of degree 12. The latter are given by $B_{12}, B_{24}, B_{36}$ . The curvy segment B₁₂ is red in both pictures. For n = 2, we have $B_{12} = im (ψ_{A})$ . For $n \geq 3$ , the curvy segment B₁₂ is one of the n boundary edges of $im (ψ_{A})$ . The Zariski closure of the curvy segment B₁₂ is the branch curve ${c_{1}^{12} - 4 c_{1}^{6} c_{2}^{6} - 4 c_{2}^{12} + 12 c_{1}^{2} c_{2}^{6} c_{3}^{4} - 3 c_{1}^{4} c_{2}^{8} - 2 c_{3}^{12} = 0}$ .

Fig. 1 The image of the norm map $ψ_{A}$ for $n = 6, m = 3$ is a curvy hexagon in a triangle. The color coding on the left shows the progression of images for $n = 3, 4, 5, 6$ . The color coding on the right shows the algebraic degrees 12 (red) and 24 (blue) of the curvy segments.

We now state a theorem which generalizes our observations in Example 26 to $m \geq 4$ . We fix $n \geq m$ and $A = {a_{1} < \dots < a_{m}}$ as before. For $1 \leq l \leq m$ and any ordered set $ν = (ν_{1}, \dots, ν_{l}) \in (\begin{matrix} [n] \\ l \end{matrix})$ , let $R_{ν}$ denote the set of vectors $x \in R_{\geq 0}^{n}$ that satisfy $x_{i} = x_{i + 1}$ if $i < ν_{1}$ or $ν_{r} \leq i < ν_{r + 1}$ for some r, and x_i = 0 for all $i > ν_{l}$ , and $x_{i} \geq x_{i + 1}$ otherwise. Its image $B_{ν} = ψ_{A} (R_{ν})$ is a semialgebraic subset of dimension $l - 1$ in $Δ_{m - 1}$ . Proposition 25 tells us that $B_{ν}$ is a curvy simplex with vertices $B_{ν_{1}}, \dots, B_{ν_{l}}$ . We define the type of ν to be the multiset ${ν_{1}, ν_{2} - ν_{1}, ν_{3} - ν_{2}, \dots, ν_{l} - ν_{l - 1}}$ . We can view $τ = type (ν)$ as a partition with precisely $l$ parts of an integer between $l$ and n. Let $T_{n, l}$ denote the set of such partitions τ. We use the notation $Stab (τ)$ from Conjecture 7. In analogy to the proof of Theorem 21, we denote by $H_{τ}$ the image in the simplex $Δ_{m - 1}$ of a positive ramification component of type τ.

Theorem 27.

Assume $m \leq n$ . The norm map $ψ_{A}$ in (14) has the following properties:

The image of $ψ_{A}$ in $Δ_{m - 1}$ is the union of the curvy $(m - 1)$ -simplices $B_{ν}$ where $ν \in (\begin{matrix} [n] \\ m \end{matrix})$ . The curvy facets of these curvy simplices are $B_{μ}$ where $μ \in (\begin{matrix} [n] \\ m - 1 \end{matrix})$ . Some of these curvy $(m - 2)$ -simplices form the boundary of the semialgebraic set $im (ψ_{A})$ .
Two curvy $(m - 2)$ -simplices $B_{μ}$ and $B_{μ'}$ have the same Zariski closure if $type (μ) = type (μ')$ . Thus, the irreducible branch hypersurfaces $H_{τ}$ are indexed by $τ \in T_{n, m - 1}$ .

Proof.

For $l \in {1, \dots, m}$ and $ν \in (\begin{matrix} [n] \\ l \end{matrix})$ , the set $R_{ν}$ is a convex polyhedral cone, spanned by linearly independent vectors in a linear subspace of dimension $l \leq m$ in $R^{n}$ . By Proposition 24, the map $ϕ_{A, \geq 0}$ is injective on $R_{ν}$ . Therefore, by the transformation in (15), the map $ψ_{A}$ is injective on $R_{ν}$ up to scaling. This means that the image $B_{ν} = ψ_{A} (R_{ν})$ is a curvy simplex of dimension $l - 1$ inside the probability simplex $Δ_{m - 1}$ . We also conclude that the boundary of $im (ψ_{A})$ equals the union of the images $B_{ν} = ψ_{A} (R_{ν})$ , where ν runs over a certain subset of $(\begin{matrix} [n] \\ m - 1 \end{matrix})$ . These specify the algebraic boundary of $im (ϕ_{A, \geq 0})$ . This proves (i).

To see that part (ii) holds, we write the restriction of $ϕ_{A, \geq 0}$ to the cone $R_{μ}$ as a polynomial function in only $l$ distinct variables x_i. The jth coordinate of that restriction has the form $\sum_{i = 1}^{l} τ_{i} x_{i}^{a_{j}}$ , where $τ = type (μ)$ . Different cones $R_{μ}$ of the same type τ are distinguished only by the orderings of the parameters $x_{1}, x_{2}, \dots, x_{l}$ . However, they have the same linear span in $R^{n}$ . Hence, after we drop the distinguishing inequalities x_i > x_j, the maps are the same. In particular, their images $B_{μ}$ have the same Zariski closures $H_{τ}$ in the simplex $Δ_{m - 1}$ . □

Example 26 illustrates Theorem 27 for $m = 3$ , where $| T_{n, 2} | = ⌊ n^{2} / 4 ⌋$ and $| Stab (τ) | \in {1, 2}$ . We found it more challenging to understand the geometry of our image in higher dimensions.

Example 28

( $n = 8, m = 4$ ). The image of $ψ_{A}$ in the tetrahedron is a curvy 3-polytope. It is partitioned by $56 = (\begin{matrix} 8 \\ 3 \end{matrix})$ curvy triangles $B_{ν}$ . Their types τ identify 16 clusters: two singletons, ten triples, and four of size six. These determine $16 = | T_{8, 3} |$ branch surfaces $H_{τ}$ .

Based on computational experiments, we believe that, for all pairs $m \leq n$ and all exponents $A$ , the image of $ψ_{A}$ has the combinatorial structure of the cyclic polytope of dimension m – 1 with n vertices. In particular, the boundary is formed by the curvy $(m - 2)$ -simplices $B_{μ}$ where μ runs over all sequences that satisfy Gale’s Evenness Condition [16, Theorem 0.7]. This predicts that the boundary in Example 28 is subdivided into 12 curvy triangles $B_{μ}$ , namely those indexed by $μ \in {123, 128, 134, 145, 156, 167, 178, 238, 348, 458, 568, 678}$ . Our belief is supported by related results for the moment curve, where $A = {1, 2, \dots, m}$ , due to Bik, Czapliński and Wageringel [Citation1]. Their figures show curvy cyclic polytopes in dimension 3.

The theory of triangulations of cyclic polytopes [Citation13] now suggests an approach to unique recovery even when m < n. Each triangulation consists of a certain subset of $(\begin{matrix} [n] \\ m \end{matrix})$ . If our belief is correct, then this should induce a curvy triangulation of $im (ψ_{A})$ . A general point c in the image is contained in a unique simplex $B_{ν}$ of the triangulation. There is a unique z in the locus $R_{ν}$ with $ψ_{A} (z) = c$ . The assignment $c \mapsto z$ serves as a method for unique recovery.

We conclude with a natural generalization of the problem discussed in this section. Let $K = {K_{1}, \dots, K_{m}}$ be a set of centrally symmetric convex bodies in $R^{n}$ . Each of these defines a norm $| | \cdot | |_{K_{i}}$ on $R^{n}$ . The unit ball for that norm is the convex body K_i. Consider the map (16) $ψ_{K} : R_{\geq 0}^{n} - - \to Δ_{m - 1} : x \mapsto \frac{1}{\sum_{j = 1}^{m} | | x | |_{K_{j}}} \cdot (| | x | |_{K_{1}}, | | x | |_{K_{2}}, \dots, | | x | |_{K_{m}}) .$ (16)

Problem 29. Study the image and the fibers of the map $ψ_{K}$ . Identify the branch loci of $ψ_{K}$ .

Supplemental material

uexm_a_2061650_sm0384.zip

Download Zip (61.8 KB)

Additional information

Funding

Hana Melánová was supported by Supported by START-Prize Y-966 of the Austrian Science Fund.

References

Bik, A., Czapliński, A., Wageringel, M. (2021). Semi-algebraic properties of Minkowski sums of a twisted cubic segment. Collect. Math. 72: 87–107. doi:10.1007/s13348-020-00281-7
Web of Science ®Google Scholar
Bosch, S., Lütkebohmert, W., Raynaud, M. (1990). Néron Models. Ergebnisse der Mathematik und ihrer Grenzgebiete, Vol. 21. Berlin: Springer-Verlag.
Google Scholar
Conca, A., Krattenthaler, C., Watanabe, J. (2009). Regular sequences of symmetric polynomials. Rend. Semin. Mat. Univ. Padova 121: 179–199. doi:10.4171/RSMUP/121-11
Web of Science ®Google Scholar
Dvornicich, R., Zannier, U. (2009). Newton functions generating symmetric fields and irreducibility of Schur polynomials. Adv. Math. 222: 1982–2003.
Web of Science ®Google Scholar
Fröberg, R., Shapiro, B. (2016). On Vandermonde varieties. Math. Scand. 199: 73–91. doi:10.7146/math.scand.a-24185
Google Scholar
Fulton, W. (1984). Intersection Theory. Ergebnisse der Mathematik und ihrer Grenzgebiete (3), Vol. 2. Berlin: Springer-Verlag.
Google Scholar
Josz, C., Lasserre, J. B., Mourrain, B. (2019). Sparse polynomial interpolation: sparse recovery, super-resolution, or Prony? Adv. Comput. Math. 45: 1401–1437. doi:10.1007/s10444-019-09672-2
Web of Science ®Google Scholar
Kahle, T., Kubjas, K., Kummer, M. (2017). The geometry of rank-one tensor completion. SIAM J. Appl. Algebra Geom. 1: 200–221. doi:10.1137/16M1074102
Web of Science ®Google Scholar
Lefschetz, S. (1953). Algebraic Geometry. Princeton: Princeton University Press.
Google Scholar
Michałek, M., Sturmfels, B. (2021). Invitation to Nonlinear Algebra. Graduate Studies in Mathematics, Vol. 211. Providence, RI: American Mathematical Society.
Google Scholar
Milne, J. S. (2017). Algebraic Geometry (v6.02). Available at www.jmilne.org/math/.
Google Scholar
Müller, S., Feliu, E., Regensburger, G., Conradi, C., Shiu, A., Dickenstein, A. (2016). Sign conditions for injectivity of generalized polynomial maps with applications to chemical reaction networks and real algebraic geometry. Found. Comput. Math. 16: 69–97. doi:10.1007/s10208-014-9239-3
Web of Science ®Google Scholar
Rambau, J. (1997). Triangulations of cyclic polytopes and higher Bruhat orders. Mathematika 44: 162–194. doi:10.1112/S0025579300012055
Web of Science ®Google Scholar
The Stacks project authors: The Stacks project. (2021). https://stacks.math.columbia.edu.
Google Scholar
Tsakiris, M., Peng, L., Conca, A., Kneip, L., Shi, Y., Choi, H. (2020). An algebraic-geometric approach for linear regression without correspondences. IEEE Trans. Inform. Theory 66: 5130–5144. doi:10.1109/TIT.2020.2977166
Web of Science ®Google Scholar
Ziegler, G. (1995). Lectures on Polytopes. Graduate Texts in Mathematics, Vol. 152. New York: Springer-Verlag.
Google Scholar

Recovery from Power Sums

Abstract

1 Introduction

2 Fibers

3 Square Systems

4 Images

5 Recovery from p-norms

uexm_a_2061650_sm0384.zip

References

Information for

Open access

Opportunities

Help and information

Recovery from Power Sums

Abstract

1 Introduction

2 Fibers

3 Square Systems

4 Images

5 Recovery from p-norms

uexm_a_2061650_sm0384.zip

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date