Li Ge, "Constructing the scattering matrix for optical microcavities as a nonlocal boundary value problem," Photonics Res. 5, B20 (2017)
Copy Citation Text
We develop a numerical scheme to construct the scattering (S) matrix for optical microcavities, including the special cases with parity-time and other non-Hermitian symmetries. This scheme incorporates the explicit form of a nonlocal boundary condition, with the incident light represented by an inhomogeneous term. This approach resolves the artifact of a discontinuous normal derivative typically found in the R-matrix method. In addition, we show that, by excluding the aforementioned inhomogeneous term, the non-Hermitian Hamiltonian in our approach also determines the Periels–Kapur states, and it constitutes an alternative approach to derive the standard R-matrix result in this basis. Therefore, our scheme provides a convenient framework to explore the benefits of both approaches. We illustrate this boundary value problem using 1D and 2D scalar Helmholtz equations. The eigenvalues and poles of the S matrix calculated using our approach show good agreement with results obtained by other means.
Driven by advances in nanofabrication capabilities and their applications to integrated optics, understanding resonances and wave transport in optical microcavities [1,2] has been one of the most energized subjects in modern optics. These compact optical structures also offer a unique opportunity to study non-Hermitian phenomena [3] and wave chaos [4,5] in a well-controlled manner. To probe these properties of optical microcavities, one approach resorts to the scattering (S) matrix formalism [6], which was an essential tool in the understanding of resonances in nuclear physics [7,8], particle physics [9], and quantum field theory [10], which also played a crucial role in the study of wave transport in various fields, including condensed matter systems [11], optics [12], and microwave networks [13].
In essence, the S matrix, denoted by an energy- or frequency-dependent , connects a set of incoming channels to their corresponding outgoing channels , both defined outside the scattering potential. Therefore, it takes the openness of the system into account, and the conservation of optical flux in the absence of gain and loss is manifested by the unitarity of [i.e., ]. When is analytically continued into the complex- plane, its poles (i.e., where its eigenvalues approach infinity) correspond to the resonances of the system, whose wave functions only connect to the outgoing channels, now also evaluated at complex frequencies [14].
As already pointed out by Wigner and Eisenbud’s early work in nuclear physics [8], the calculation of the matrix can be understood as a nonlocal boundary value problem (BVP), which was derived using an orthogonal basis of the system and explicitly contains the real-valued frequencies of this basis. More specifically, this orthogonal basis was defined with vanishing normal derivatives at the boundary of the system, and, as a result, the expansion of an arbitrary state using a finite number of these basis functions has a discontinuous normal derivative in general as an artifact [8,15]. Alternatively, the expansion can be carried out using quasinormal modes [16] (i.e., the Gamow states [14]) or the Periels–Kapur states [17,18], both defined with purely outgoing boundary conditions. These approaches, however, do not remove the artifact in the normal derivative, due to the lack of incoming flux that is inherent in the scattering process. We note that in literature the modal expansion approach, regardless of the specific basis, is referred to as the -matrix method [15] in general.
Sign up for Photonics Research TOC. Get the latest issue of Photonics Research delivered right to you!Sign up now
Although the consequence of the aforementioned artifact may be insignificant with the introduction of the Bloch operator [19] and a large number of basis functions [20], having an approach at one’s disposal that addresses this conceptual problem may prove valuable in some cases. In addition, it is often favorable in optical systems to adopt a self-contained approach, i.e., without requiring a priori knowledge or assigning phenomenological parameters to a large set of basis functions. These two goals can be met in principle by using either time-dependent numerical simulations of the Maxwell’s equations [21,22] or boundary integral equations [23–25]. However, the former do not offer much physical insight on the properties of the matrix, and the latter require that the Green’s function is known inside the optical microcavity and hence do not apply, for example, when the refractive index varies smoothly due to a localized thermal source [26] or a modulated optical gain and loss profile [27]. We also note that, while boundary integral equations can be regarded as a nonlocal boundary condition (and related to the Ewald–Oseen extinction theorem [23]), they typically involve solving for the wave function and its normal derivative simultaneously at the boundary [24].
To overcome these limitations, we propose a finite-difference approach in this paper that solves for the steady state of the Maxwell’s equations inside a last scattering surface (LSS), outside of which the scattered flux does not reenter other parts of the system. As we will show in Section 1, accomplishing the two aforementioned goals in 1D is effortless because the boundary conditions are local and involve only the wave function and its spatial derivative at either end of the system. In higher dimensions, however, a local boundary condition in the form that holds for all ’s on the LSS does not exist in general. Instead, we use a nonlocal boundary condition in its finite difference form, resulting in (1) a self-contained scheme to construct the matrix without using a modal expansion that (2) resolves the artifact in the normal derivative at the LSS. In addition, this approach via a nonlocal BVP produces the same non-Hermitian Hamiltonian that determines the Periels–Kapur states [17] [or constant-flux (CF) states [18] for the Maxwell’s equations specifically], which constitutes an alternative approach to derive the standard -matrix result in this basis. Hence our scheme provides a convenient framework to explore the benefits of both approaches when constructing and comprehending the matrix.
This paper is organized as follows: In Section 2 we discuss the BVP and -matrix approaches in 1D using the scalar Helmholtz equation. Despite the simplicity of the boundary conditions, the discussion already reveals the fundamental connection between the matrix and the CF states in our approach. We also explicitly show that the (normal) derivative of an arbitrary scattering state cannot be accurately captured by the -matrix approach with a finite number of basis functions. In Section 3 we exemplify the nonlocal BVP approach for the scattering of TM waves in 2D, and the treatment of TE waves is similar. We then apply this scheme to parity-time () [27–47] and rotation-time () symmetric [40,48] optical microcavities, focusing on the spontaneous symmetry breaking of the -matrix eigenvalues. Finally, we give concluding remarks in Section 4.
2. BVP IN 1D
We begin by considering the 1D Helmholtz equation where is the electric field, is the dielectric constant for an optical microcavity placed between and , and is the free-space wave vector. For an incoming wave from the left {denoted by }, we write the formal solution of as where are the outgoing wave functions on the left and right of the system and , are the reflection and transmission coefficients. By requiring that both and are continuous at (denoted by ), the boundary conditions are then simply by eliminating and , which are local without involving both or in a single expression.
Before we embark on our quest to higher dimensions, we note an important feature of Eq. (3): without the constant term , the boundary conditions become the same as those imposed by the CF states [18], i.e., with purely outgoing waves. Similarly, an incoming wave from the right simply adds an additional constant term in Eq. (4). Therefore, starting from the non-Hermitian Hamiltonian that determines the (outgoing) CF states in the interior of the system, one can obtain the wave function in the scattering problem by turning an eigenvalue problem to an inhomogeneous equation. Below we give the specific forms of this non-Hermitian Hamiltonian and the inhomogeneous term using the finite-difference method.
To start, we discretize the 1D space into equally spaced points, with the left (right) boundary of the optical microcavity placed at the middle of the 0th and 1st [th and th] points. The separation of two neighboring grid points is then given by , and the Helmholtz equation takes the following form: where are the values of the wave function and the dielectric constant at these points. The boundary conditions in Eqs. (3) and (4) can be rewritten as where is dimensionless. The constant term in Eq. (6) is due to the incoming wave , as mentioned previously, and by dropping it we recover the generalized eigenvalue problem that determines the CF states with purely outgoing boundary condition [49]: is a column vector containing the values in the th CF state, and is a diagonal matrix with elements . The CF frequencies (similar to the resonances or the poles of the matrix) is given by . Note that the real-valued free-space wave vector , instead of the complex-valued CF frequencies ’s, appears in the non-Hermitian Hamiltonian : which is tri-diagonal. We note that it is not possible to write the corresponding equation for resonances as such a generalized eigenvalue problem because would contain the complex-valued resonance frequencies yet to be determined.
Now with the constant term in the boundary condition in Eq. (6), the wave function in the scattering problem is given by and the inhomogeneous term is a column vector with a single non-zero element, i.e., when light is incident from the left. Similarly, for the scattering of a right-incident wave, the only non-zero element of is . The equation above can be put into a more explicit form to obtain :
The transmission and reflection coefficients can then be calculated using for left incidence (and similarly for right incidence), and the matrix is given by with when the system has Lorentz reciprocity [50–52].
For comparison, here we also discuss the modal expansion approach to construct . By inserting to Eq. (10) and utilizing Eq. (8), we find where we have used the “self-orthogonality” of the CF states [53] in the following form:
For left incidence and taking , we find
This expression is identical to that used in the standard derivation of the -matrix method in the CF basis, which we outline below.
As mentioned in the introduction, the expansion of in a finite number of basis functions introduces an artifact to the normal derivative at the LSS. Therefore, the standard derivation of the -matrix method resorts to the Green’s theorem instead to take into consideration the boundary condition of , resulting in
We note that the scattered waves in , as well as the CF states, produce () at () after taking the derivative, and the corresponding boundary terms above are all canceled. Hence, we find , with which we immediately recover Eq. (16). Once ’s and ’s are known, the matrix can be constructed again by applying Eq. (12) and the corresponding expressions for , .
In Fig. 1(a) we show the total wave function inside a half-gain-half-loss optical microcavity with symmetry [28–30] and a left incident wave, whose refractive index satisfies [27,31–48]. Good agreement between ’s given by Eqs. (11) and (14) are obtained using 50 CF states. Nevertheless, the artifact of at the boundary of the microcavity in the -matrix approach can be readily seen in Fig. 1(b), where we plot the optical flux given by (up to a prefactor). The BVP approach, on the other hand, gives a good agreement with the analytical result [54] where , , , and , , , , , .
Figure 1.(a) Total wave function and (b) its flux depicted by black thick lines for a half-gain-half-loss microcavity with light incident from the left. The wave vector and refractive indices are used. The expansion in Eq. (14) with 50 CF states is plotted by the red thin lines as a comparison, which can barely be distinguished from the black line in (a) but shows a significant deviation near the left boundary in (b). The black dot in (b) shows the analytical result at given by Eq. (18).
In this section, we elucidate how the matrix is constructed in our scheme as a nonlocal BVP in 2D. Similar to the 1D case discussed in the previous section, we show that the non-Hermitian Hamiltonian in our approach is also the one that determines the CF states with a purely outgoing boundary condition.
A. Construction of the S Matrix
Before we proceed, we note that, in the calculation of a CF state or a resonance, one basically assumes a homogenous source residing inside an optical microcavity, reflected by the imaginary part of its complex frequency. In a scattering problem, the source is an external one instead, and one needs to find a way to distinguish in the boundary condition the known incident wave and the unknown scattered waves. In the 1D case shown in the previous section, the incident wave adds an inhomogeneous term (i.e., ) to the non-Hermitian eigenvalue problem that determines the outgoing CF states [c.f. Eqs. (8) and (10)]. This separation of incident and outgoing waves holds even when the boundary condition becomes nonlocal in 2D, as we show below.
To illustrate this property, we again consider the scalar Helmholtz equation and use the polar coordinates. A circular LSS of radius encloses the optical microcavity with dielectric constant (see Fig. 2), outside of which we assume and adopt as our incoming and outgoing channels. Here are the second and first Hankel functions and is the angular momentum number, which also serves as the channel index. Note that we do not restrict the free-space wave vector to be real, which enables the calculation of the resonances as the complex-valued poles of the matrix.
Figure 2.Schematic of an optical microcavity (shaded area) and the circular LSS (solid line) in 2D. The finite-difference grid is indicated by the dots and dashed lines.
Suppose the incident wave impinges on the LSS in the th channel . The total field and its radial derivative outside the LSS can then be written as where . Next we discretize the 2D space on a polar grid [55], with the circular LSS placed at the middle of the th and th rings (see Fig. 2). On each ring, there are equally spaced grid points with spacing . We then write the total field on the th and th rings as where is the index for the grid points in the azimuthal direction, is the corresponding azimuthal angle, and is the uniform spacing between two consecutive grid points in the radial direction.
To derive the nonlocal boundary condition and the matrix, we eliminate the two coefficients , in the example of TM polarization (with the electric field perpendicular to the 2D cavity plane), and the case of TE polarization can be treated in a similar fashion. Using the continuity of both and its radial derivative, the left-hand sides of Eqs. (20) and (21) on the LSS can be approximated by which lead to and we have dropped the argument in . The product can be eliminated to derive a more concise form of the matrix:
This is the expression we will use in our numerical examples, which requires obtaining using the Fourier transform of ’s:
To find , we eliminate in Eqs. (26) and (27) and derive an expression for , which when substituted into Eq. (23) gives our nonlocal scattering boundary condition: where is the radius of the th ring. Note that the term in Eq. (30) is the manifestation of the incoming wave, and by dropping it we again recover the (nonlocal) outgoing boundary condition that defines the CF states [56], similar to Eq. (6) in 1D.
Equation (30) can then be inserted into the discretized Helmholtz equation on the th ring to eliminate [49], which gives rise to the following matrix equation:
The column vector represents the total wave function, i.e., and is a column vector of zeros except for the last elements, which are given by . has the same form as its 1D counterpart, i.e., a diagonal matrix with the values of the dielectric constant on the discretized grid. has rows and columns; it is the same non-Hermitian Hamiltonian that determines the CF states [49,56]: and it consists a banded matrix and a block in the lower right corner. is symmetric with nonzero elements on the 0, , and diagonals and comes from the nonlocal boundary condition in Eq. (30): can be checked to be also symmetric using and the definition of in Eq. (30). Once is obtained for each incoming channel by solving Eq. (33), i.e., as in the 1D case, we immediately know the Fourier coefficients from Eq. (29), with which the construction of the matrix is completed using Eq. (28).
In the limit of a fine grid (), Eq. (28) becomes We note that the second term in this expression does not depend on the dielectric constant inside the LSS; hence it can be regarded as the result of a “direct scattering” process [6]. The first term then corresponds to the “resonance-assisted” scattering process, and, to understand its determining factors, we resort to the modal expansion of in the CF basis, which takes the following form in 2D: Note that we have used the following normalization of the CF basis such that is dimensionless, and its value does not scale with the discretization, i.e., the expression above becomes in the continuous limit.
given by Eq. (40) has the typical resonant denominator with ’s being the CF frequencies. If we project each CF state at the LSS onto the outgoing channel function (which is equivalent to a Fourier transform), i.e., the inner product singles out the th Fourier coefficient : and, consequently, where is the matrix [15]. This expression clearly indicates that the contribution of a particular CF state to the scattering process is not only dependent on the resonant denominator; it also depends on the spatial overlaps between this CF state and the incoming and outgoing channels at the LSS, represented by and , respectively. In the simplest example where the system is isotropic and the angular momentum is a good quantum number (e.g., a circular microdisk cavity), only the CF states with the same angular momentum as the incoming (and outgoing) channel contribute to the scattering process.
To show that given by Eq. (45) is consistent with the standard -matrix result in the CF basis, we apply the Green’s theorem to the interior of the LSS, which gives us
Similar to the 1D case, the outgoing channels are cancelled in the boundary integral, which can be proved rigorously by applying the Green’s theorem again to the exterior of the system, where both and satisfy
Note that the same wave vector for the total field and the CF states is crucial to remove the outgoing waves from the boundary integral in Eq. (47), and we end up with for the incoming wave in the th channel. Unlike the matrix in other bases [15], here the matrix does not appear in the expansion coefficient . By realizing that the expansion in Eq. (43) holds not just on the LSS but also in the exterior region, we find and the total wave function in the interior of the system is given by
Once we substitute the on the LSS by Eq. (20) and project both sides of the equation above onto the outgoing channels, we immediately recover the matrix given by Eq. (45).
B. Results
To test our approach based on a nonlocal BVP in 2D, we first calculate the poles of the matrix for a circular microdisk cavity with a uniform index. As mentioned in the introduction, the poles of the matrix correspond to complex-valued resonances of the optical microcavity. This connection is due to the diverging eigenvalues of the matrix at its poles, meaning that an infinitesimal incoming amplitude leads to finite outgoing waves. For a circular microdisk cavity with a uniform index, the angular momentum number is a conserved quantity, and the LSS is chosen as the disk boundary. The TM resonances can be found by solving the following analytical expression:
In Fig. 3 we compare the poles of the matrix calculated by the BVP approach and this analytical expression, and good agreement is found for different poles with .
Figure 3.Resonances of a microdisk cavity with a uniform index . The crosses are the analytical results given by Eq. (52), and the circles are the poles of the matrix constructed using Eq. (28).
Next, we inspect the matrix constructed by the BVP approach from a different perspective, i.e., the symmetry property of its eigenvalues in the presence of and symmetries. It was found that, in a -symmetric system, ’s undergo spontaneous symmetry breaking as a function of frequency or system size [37]: symmetry warrants . When in this expression are the same, the corresponding eigenstate of the matrix is in the -symmetric phase with , i.e., the optical flux in the corresponding scattering eigenstate is conserved, even though the system is non-Hermitian in the presence of gain and loss. In the -broken phase , and we find instead (or equivalently, ), which represent a pair of amplified and attenuated scattering eigenstates.
In Fig. 4 we show the eigenvalues of a microdisk cavity with refractive index , which satisfies not only symmetry (here changes to ) but also symmetry [40,48], i.e., . The eigenvalues of the matrix for an -symmetric structure display similar spontaneous breaking as those in their -symmetric counterparts, and these properties are nicely manifested by the matrix given by Eq. (28) using the BVP approach [Fig. 4(a)].
Figure 4.(a) Spontaneous symmetry breaking of -matrix eigenvalues in a microdisk cavity with and symmetries. Its refractive index is given by , the imaginary part of which is shown schematically by the inset in (b). (b) A real-valued eigenvector of at in the - and -symmetric phase. The blue (pink) bars show symmetric and antisymmetric components with opposite ’s. The corresponding wave function is shown in (d), where the cavity boundary is marked by the white circle. The wave function of a scattering eigenstate in the broken-symmetry phase at is shown in (c) as a comparison.
It can be easily seen that, when a system has both and symmetries, their symmetric (broken) phases for the matrix coincide due to the unimodular property of . Therefore, the scattering eigenstates in the symmetric phase should also simultaneously possess the properties of both and symmetries. To understand and differentiate these properties, we turn to the eigenvectors of the matrix, which are the projection coefficients of the scattering eigenstates onto the incoming and outgoing channels. The cylindrical channels specified by Eq. (19) are -symmetric, i.e., , where the minus sign introduced by the parity operator (again ) in the exponent is canceled by performing a time reversal (i.e., becomes in the exponent; it also changes to ). Now if is the incident wave in an eigenstate of with eigenvalue , then by performing a combined operation on its scattered wave [i.e., ] the new incoming state should also be an eigenstate of the matrix due to symmetry. It then follows that should hold for all ’s in the -symmetric phase, and the proportional constant can be set as 1 by choosing a proper global phase of . In other words, all ’s can be made real in the -symmetric phase. If we apply a similar analysis of symmetry, we find that the channel functions transform according to , which leads to . Therefore, we find as a result of these two symmetries, which is nicely captured by the result of the BVP approach [Fig. 4(b)].
We also note that a 2D structure with both and symmetries also has mirror symmetry about the axis (i.e., the axis perpendicular to the parity axis in symmetry) [40], which imposes the following property:
The overall sign corresponds to scattering eigenstates that are even and odd functions about the axis, respectively. Again, the mirror symmetry about the axis is nicely observed in the BVP approach, both in the symmetry-broken phase [Fig. 4(c)] and symmetric phase [Fig. 4(d)]. In the former , and hence ; in the latter, we find is symmetric about both the horizontal and vertical axes instead.
The corresponding resonant modes with resonant frequencies are shown in Fig. 5, which are calculated as the poles of the matrix constructed using the BVP method. These poles differ by less than 0.01% from the results of an iterative method we outline below, in both their real and imaginary parts. As briefly mentioned in Section 2, the difference between a CF state and a resonance lies in the wave vector in the exterior of an optical microcavity: A CF state features a real-valued , while a resonance has the same complex-valued resonant frequency as in the interior of the microcavity. Therefore, if we replace by a CF frequency in the eigenvalue problem in Eq. (35), which determines the CF states, and repeat this procedure until converges, we end up with the same complex-valued frequency in both the interior and exterior of the microcavity, which is a resonance.
Figure 5.Resonant modes corresponding to the scattering eigenstates in Figs. 4(c) and 4(d).
It is important to note that, while the resonant modes of the system have mirror symmetry about the axis, here, they do not possess a - and -symmetric phase: performing a combined or operation does not leave a resonant mode unchanged. Its outgoing waves outside the microcavity are now turned into incoming waves, which, by definition, give a zero of the matrix, whose complex-valued frequency is the complex conjugate of the original resonance [37]. Nevertheless, because the resonant modes are the scattering eigenstates at the poles of the matrix, they bear resemblance to the latter when is evaluated at a real-valued frequency, as can be seen by comparing Figs. 4(c) and 4(d) and Figs. 5(a) and 5(b).
4. CONCLUSION AND DISCUSSION
In summary, we have presented a finite-difference scheme to construct the matrix for optical microcavities as a BVP. The boundary condition for the total field is simple in 1D but becomes nonlocal in 2D, which appears as an inhomogeneous term and also in the non-Hermitian Hamiltonian that determines the CF states. Although the uniqueness and existence of the solutions for a nonlocal BVP are not guaranteed in general, here we can rest assured as the underlying physical process (i.e., scattering) is deterministic. We have verified that our approach is consistent with the -matrix method in the basis of CF states, and it addresses the artifact in the normal derivative of the total field typically found in the -matrix approach.
For applications such as enhancing light–matter interactions and sensing, often it requires accurate knowledge of wave function inside and on the boundary of optical microcavities. In such cases, the BVP method proposed here provides an economic alternative to the modal expansion approach, as the latter requires a large number of basis functions to provide the same level of accuracy. For example, at least 500 basis functions and five times more computational time are needed to capture the symmetry properties of the scattering eigenvalues shown in Fig. 4(a), whether CF states or the orthogonal states with a vanishing radial derivative at the LSS are used [37,57].
We also note that there are several other efficient numerical approaches to construct the matrix, such as finite-different-time-domain methods [21,22] already mentioned in the introduction and the method of auxiliary sources [58,59]. The advantages of our approach are that it provides a conceptually clear construction and a numerically straightforward implementation, and it can be applied to 3D structures using techniques similar to those developed for binary gratings [60]. Our approach can also be applied to a network of optical microcavities [61], and it can treat continuous variations of the refractive index both inside and between these cavities, all enclosed by the LSS.
Finally, we note that, while the poles of the matrix are independent of the choice of the incoming and outgoing channels, the eigenvalues of do depend on such choices in general. Only when the incoming and outgoing channels are transformed in the same way do the eigenvalues of stay unchanged because then merely experiences a similar transformation. In our discussion of 2D TM waves, the specific forms of the channels have been chosen to simplify the notations in the derivation of the nonlocal boundary condition and the matrix. When different channels are used, for example, by changing the angular dependence of to [37], we effectively perform a permutation on the incoming channels, which is not a similar transformation with unchanged outgoing channels. Therefore, the -matrix eigenvalues and their symmetric (symmetry-broken) phases change as a result in general. Exploring this freedom of choosing the channel functions may lead to a close resemblance between the spontaneous symmetry breaking of the matrix and the corresponding close-cavity modes in - and -symmetric systems, similar to the finding in 1D heterostructures [38].
References
[1] R. K. Chang, A. J. Campillo. Optical Processes in Microcavities(1996).