Supernullers

---- UNDER CONSTRUCTION ----

Summary

Nulling interferometers rely on destructive interferences between subapertures beams to achieve high level of starlight suppression, often quantified by the null depth order. A previous study showed that, with an optimal beam nuller design, there is a well defined relationship between the number of subapertures N and a minumum null order γ achievable, with γ=2 for N=2,3 and γ=4 for N=4,5,6. While in randomly chosen 2-D interferometer geometry, this relationship is a fundamental limit on the maximum null depth achievable, I show that for some specific geometries, this limit can be exceeded, leading to interferometer designs able to reach very deep nulls with a small number of apertures. The subaperture positions in such interferometers, which are refered to as supernullers, obey simple geometrical rules, which are both necessary and sufficient conditions to exceeding the null depth limit to which non-supernullers are constrained.

1. Introduction

Direct imaging of exoplanets is scientifically extremely valuable, as physical separation of planet light from the much brighter central starlight is required to acquired high quality spectra of the planet. The combination of high star-to-planet contrast and the small angular separation between the two objects makes this challenging and requires unconventional optical imaging systems, such as coronagraphs or nulling interferometers.

Interferometers are an attractive approach to this challenge, as they are well-suited for high angular resolution, and their beam combiner can be designed to efficiently cancel the central starlight. The non-zero stellar angular diameter must be taken into account for the design of a nulling interferometer, and requires one or more outputs of the interferometer to have maintain a small transmission over the size of the stellar disk. The interferometer's ability to cancel starlight is best quantified by the null depth order γ which describes how transmission evolves as a function of angular distance θ to the optical axis for small angles by approximating it a θ^γ. In most cases, a null order γ=2 is insufficient. For example, in order to keep the starlight level below the planet light level for the observation of a Sun-Earth system analog in the visible (contrast = 1e10, stellar diameter = 1% of planet-to-star separation) requires γ=6 if the Earth is placed at 1 λ/B, and γ=8 if the Earth is placed at 8 λ/B. Deep nulls also offer resilience against pointing errors and reduce exozodiacal light contribution, as the inner part of the exozodiacal cloud is likely to be the brightest.

While the nulling interferometer concept was first proposed with two telescopes and a θ² null, more powerful designs have since been proposed, offering deeper nulls but requiring more subapertures [refs]. Rouan [ref] proposed a technique to achieve arbitrararly deep nulls, provided that a sufficiently large number of telescopes is available. Despite the large number of nulling interferometer designs previously suggested, no clear understanding of how to optimally design nulling interferometers has emerged. A previous study [ref] in fact shows that there exists a minimum null depth order which is only a function of the number of aperture, which we denote γ_min(N), and that for any 2-D interferometer geometry with N apertures, a beam combiner can be designed to produce at least one output with a θ^γ_min(N) transmission near the optical axis.

2. Impact of interferometer geometry on achievable null depth

2.1. Minimum achievable null depth vs. Maximum achievable null depth

In a previous study , the optimal design of a nulling beam combiner was discussed. A key finding was the existence of a minimum achievable null depth γ_min(N), which is only a function of the number N of apertures in the interferometer: any 2-D inteferometer with N apertures can produce at least one interferometer output with a θ^γ_min(N) transmission near the optical axis.

As described in detail in the optimal beam combiner study, the optimal null depth in a nulling interferometer can be derived from the Taylor expansion of the complex amplitude at the entrance of the interferometer around the optical axis. For an aperture of the interferometer at position (x,y), the complex amplitude response to a point source at angular position (α,β) is:
e^i(αx+βy) = 1 + iαx + iβy - (1/2)(α²x² + 2αβxy + β²y²) - (i/6)(α³x³ - 3α²βx²y - 3αβ²xy² - β³y³) + .... (2.1)

e^i(αx+βy) = 1 + iαx + iβy - (1/2)(α²x² + 2αβxy + β²y²) - (i/6)(α³x³ - 3α²βx²y - 3αβ²xy² - β³y³) + ....	(2.1)

To each point on the sky corresponds a N-coefficient vector V(α,β) of complex amplitudes representing the input of the interferometer across the N apertures. The coefficients of this vector are:
V__k(α,β) = r_k (1 + x_k (i α) + y_k (i β) - x_k² (α²/2) + x_ky_k (αβ) + y_k² (β²/2) - (i/6)(α³x_k³ - 3α²βx_k²y - 3αβ²x_ky_k² - β³y_k³) + ... ) (2.2)
Where r_k is the radius of aperture k (or the square root of its collecting area, if not circular). A stellar disk of finite diameter is approximated as a series of NBpt mutually incoherent point sources covering the stellar disk, and the corresponding interferometer input is then represented by NBpt vectors. The optimal nulling beam combiner design is obtained by matching the terms of this expansion with the interferometer outputs. Thus, in the general 2-D case, the brightest output of the nulling interferometer should coherently mix inputs with coefficients r_k, concentrating most of the starlight. The next two outputs will remove from the expansion the first order terms in x and y, thus producing second order nulls in intensity (square of first order nulls in amplitude). Provided that the number of apertures is sufficiently large to access these modes, the next 3 modes will remove the terms in x², y² and xy, and will therefore be fourth-order nulls in intensity. This process is continued until the number of modes reaches the number of apertures in the array, and therefore produces a well-defined relationship between the number of apertures N and the achiveable null depth γ_min(N).

V__k(α,β) = r_k (1 + x_k (i α) + y_k (i β) - x_k² (α²/2) + x_ky_k (αβ) + y_k² (β²/2) - (i/6)(α³x_k³ - 3α²βx_k²y - 3αβ²x_ky_k² - β³y_k³) + ... )	(2.2)

2.2. Maximum null depth, and condition for achieving a supernull

A matrix notation of the N-aperture interferometer geometry is defined in this section, with

             | 1 x0 y0 x0^2 x0x0 y0^2 ...|
             | 1 x1 y1 x1^2 x1x1 y1^2 ...|
Mgeom(N,L) = | ..........................|
             | 1 xk yk xk^2 xkxk yk^2 ...|
             | ..........................|

with k=0...N-1, and with L equal to the number of column of the matrix. This matrix contains only the positions of the subapertures, not their sizes. With randomly chosen aperture positions, the rank of this matrix for L=N (square matrix) is expected to be equal to N: no linear dependency exists between its rows.

To achieve a supernull, the interferometer geometry must be such that terms in the Tailor expansion given in equation (2.2) are linearly dependent. Mathematically, this is equivalent to the rank of the matrix M_geom(N,N) being strictly less than N. This allows a mode of the interferometer beam combiner to simultaneously suppress more than a single term of the Taylor expansion, therefore potentially reaching deeper null depth than possible if no linear combination exists between the columns of M_geom(N,N). Additional orders of the Taylor expansion can then be reached, as can be represented by M_geom(N,L) with L>N.

The maximum null depth of the interferometer is:
γ_max(N) = γ_min(L) (2.3)
with L chosen such that the rank of M_geom(N,L) is equal to N:
rk(Mgeom(N,L)) = N (2.4)

γ_max(N) = γ_min(L)	(2.3)

rk(Mgeom(N,L)) = N	(2.4)

Equation (2.3) and (2.4) show that the maximum achievable null depth for a given interferometer geometry is determined by counting the number of terms in the Taylor expansion given equation (2.2) that can be cancelled by the first N-1 outputs of the interferometer, and measuring the order of the remaining Nth term. If linear dependencies exist in the first columns of matrix M_geom(N,N), then additional column(s) can be added to the matrix until the rank of M_geom(N,L) is equal to N, corresponding to a beam combiner with N outputs able to match the first L terms of the Taylor expansion. The maximum null depth is then given by the order of the last column of M_geom(N,L).

3. Types of supernuller geometries

3.1. Linear arrays

The simplest supernuller geometry is a linear array, for which the (x,y) coordinates of the subapertures follow the relationship ax+by=c, or, equivalently y=0 with the proper coordinate choice. In this case, the only columns of M_geom which are linearly independent are powers of the x coordinate. Enforcing rk(M_geom(N,L)) = N leads to the highest order vector of M_geom being x^N-1. The achieved null is therefore in θ^2(L-1), which is considerably deeper than can be achieved in the general 2-D geometry case.

3.2. Circular arrays

In a circular array, the apertures coordinate (x,y) follow a relationship of the type x²+y²=r². This relationship introduces linear dependencies in the columns of matrix M_geom through the following relationships:
y² = x² - r
x y² = x³ - rx
y³ = x²y - ry
x² y² = x⁴ - rx²
xy³ = x³y - rx
...
(3.1)

y² = x² - r x y² = x³ - rx y³ = x²y - ry x² y² = x⁴ - rx² xy³ = x³y - rx ...	(3.1)

As can be seen from these relationships, any x^k1y^k2 with k2>1 can be written as a linear combination of x^k1'y^k2' terms with k2'=0 or 1. For each fixed value of (k1+k2), there is therefore only two linearly independent columns in the matrix. To obtain rk(M_geom(N,L))=N, the highest order terms in the matrix must be k1+k2=floor(N/2). This analysis therefore predicts that the achievable null order for any circular array is:
γ_circ(N) = 2 floor(N/2) (3.2)
For example, any circular array with N=8 should offer a θ⁸ null, while a random 2-D geometry with the same number of apertures is limited to a θ⁶ null.

γ_circ(N) = 2 floor(N/2)	(3.2)

Fig 3.1: Optimal beam combiner solution for a 8-aperture regular octogon array geometry, computed here for a 0.001 λ/B stellar radius.
[eps]

3.3. Other geometries

Other relationships between the x and y coordinates of the subapertures can produce linear dependencies between the columns of M_geom. Several of them are listed in table 1, and the analysis presented in sections 3.1 and 3.2 can be generalized to these other geometries. Figure 3.2 shows, for the geometries lists in table 1, how null depth order improves as the number of apertures N increases. Linear arrays offer the best null depth for a fixed number of aperture. Cicular geometries (circle, ellipse, arc) and hyperbola geometry offer better null depth than random 2-D geometries for N=6 and N>7. Any point on this figure above the thick curve indicating the general 2-D case is by definition a supernuller.

TABLE 1: Examples of supernuller geometries
Geometry	(x,y) relationship	Subapertures
line	ax+by=c	N>2
circle	x²+y²=r²	N=6, N>7
ellipse	x²+ay²=r²	N=6, N>7
arc	(x-a)²+(y-b)²=r²	N=6, N>7
parabola	x = ay²	N=6, N>7
hyperbola	xy = a	N=6, N>7
cubic	x = a y³	N=10, N>12
cubic	xy²=a	N=10, N>12

Fig 3.2: Null depth order as a function of number of subapertures for different array geometries.
[eps]

For example, if apertures are on an hyperbola following the relationship xy=a, then any x^k1y^k2 with k1>1 and k2>1 can be written as a linear combination of terms in the form x^k1'y^k2' with k1'=0 or k2'=0. Just as for the circle array geometry, for each fixed value of (k1+k2), there is therefore only two linearly independent columns in the matrix. The null depth achievable with any hyperbola array geometry is therefore:
γ_hyperb(N) = 2 floor(N/2) (3.3)
Figure 3.3. illustrates this finding by showing a 6-aperture hyperbola array geometry achieving a θ⁸ null. This null depth is not achievable in the general 2-D case, so the interferometer qualifies as a supernuller.

γ_hyperb(N) = 2 floor(N/2)	(3.3)

Fig 3.3: Optimal beam combiner solution for a 6-aperture hyperbola array geometry, computed here for a 0.001 λ/B stellar radius.
[eps]

Conclusions

A previous study showed the existence of a minimum achievable null depth, and showed that linear arrays arrays can achieve considerably deeper nulls than non-linear arrays with the same number of apertures. The study presented in this paper completes this work by quantifying the maximum null depth achievable as a function of array geometry. A key finding is that several simple 2-D geometries (circle, parabola, hyperbola) offer deeper nulls than possible with a general 2-D geometry. These supernullers can offer particularly effective solutions for high contrast imaging of faint exoplanets around partially resovled bright stars. Incidentally, the circle (or arc) geometry, in addition to being a family of supernullers, allows easy beam combination without requiring large delay lines if light is mixed in a central hub.

The design of a nulling interferometer must take into account many parameters in addition to null depth, such as sensitivity to background light, (u,v) plane coverage, tolerance to noise sources, and array efficiency. The work presented in this study is therefore not intended to directly lead to sensible nulling interferometer designs, but it is expected that nulling interferometer geometries should tend to be supernullers if null depth is a performance driver, and if no other strong constraint forces the subapertures positions.