Menu contact home

Chapter 4
Nonlinear propagation in coupled waveguide arrays

If two waveguides are placed in close proximity, light will be transferred between them, due to the evanescent field surrounding one overlapping with the core of the other. Such a device is known as a directional coupler, and has been realised in both fibres [176] and in SOI [177]. This can be generalised to an array of waveguides, which again has been realised in both fibres [178] and SOI.

The use of silicon adds a novel feature to such devices, arising from the strong dependence of the coupling on frequency. The frequency dependence of couplers is well known [179], but in silicon it is so strong that the second derivative of the coupling with respect to frequency becomes substantial, giving it a group velocity dispersion (GVD).

The GVD of the coupling is almost always normal, but by choosing the relative phases within the wires, its effective sign can be changed. By exciting a coupler mode where the two wires are half a cycle out of phase, the effective coupling GVD can be made anomalous. This can be used to realise both solitons (section 4.3) and modulational instability (section 4.5) in coupled systems where the individual waveguides are normally dispersive.

In waveguide arrays, this effect generalises into an interplay between group velocity dispersion and diffraction. Anomalous GVD can be achieved by using anomalous diffraction, in which neighboring waveguides are out of phase. Again, this can be used to realise multi-wire solitons, as is shown in section 4.4.2.

Anomalous diffraction has been a widely studied topic in recent years [180]. A substantial fraction of this interest is due to its close association with negative index refraction [181182]. (Materials with negative refractive index have been hypothesised since the 1960s [183], but have only recently been realised [184], and even then in suboptimal highly-lossy form. They are of prime importance to modern optics, as they promise exotic applications including "superlenses" which can resolve images at less than the wavelength of light [185], and even invisibility devices [186].) The similarity between refraction and dispersion results from the fact that both relate the wavenumber of propagation to another wavenumber. In the case of refraction, this wavenumber is in time (i.e. the frequency), whilst in diffraction it is the transverse spatial wavenumber. Therefore, reversing the sign of diffraction is analogous to reversing the sign of the refractive index. This chapter generalises that result, by showing that reversing the sign of diffraction is also akin to reversing the sign of dispersion.

The key results of this chapter have been published in Optics Express [187].

4.1 Modelling coupled SOI waveguides

The light guided by a silicon nanowire isn’t totally confined to the silicon itself, and will in part travel in the surrounding medium. This can be seen in figure 4.1, which shows the waveguide modes to have evanescent tails which extend into the surrounding air and silica. If two waveguides are placed in close proximity, then the evanescent field of one will significantly overlap with the core of the other, allowing light to be transferred between them.

A coupled waveguide system can be thought of as being one big waveguide having two principal modes, known as supermodes. In the case of identical waveguides (which is the only case considered in this report), one of these modes will be perfectly symmetric (in that the field profiles in the two wires will be mirror images of one another), whilst the other will be perfectly antisymmetric (in that the mirror-imaging will be accompanied by a reversal of field direction). An example of such a mode pairing is given in figure 4.1. If a pulse of light is fired into just one wire, a superposition of the two supermodes will be excited. These will (in general) have different propagation constants and so beating between the two modes will occur. Each wire will alternately see constructive and destructive interference, and so the pulse will move back and forth between wires [179].



Figure 4.1: Symmetric (left) and antisymmetric (right) mode profiles, displayed over a 2μm × 1μm cross section for 220nm × 330nm waveguides placed 330nm apart. Silica is beneath the horizontal line, with the rectangles denoting the silicon. The electric field vector is split into Cartesian components, with transverse components parallel to the silica-air interface shown top, transverse perpendicular components shown middle, and longitudinal components shown bottom. Colour saturation gives absolute value. The + and signs denote relative phase.

To extend the model derived in chapter 2 to a coupled-wire system, we start by making what is known as the tight-binding approximation [188]. This assumes that the supermodes are linear superpositions of the single-wire modes. (It is named after a similar approximation in solid state physics, in which the wavefunction of a crystal is assumed to be a linear superposition of the wavefunctions of the individual sites [189190].) This approach suffers from two main limitations: Firstly, it ignores the existence of higher order photonic bands in which the peaks of intensity no longer correspond to the waveguide cores [191]. Secondly, there is an upper limit to the coupling strength that can be considered, as when waveguides are in very close proximity, slot-guiding modes will form, in which light is trapped between the waveguides by total internal reflection [192].

To procede, we write the envelopes of the symmetric and antisymmetric supermodes (given by As and Aa respectively) as

        1
As  =   2 (A1 + A2)                         (4.1)
        1
Aa  =   2 (A1 – A2)                         (4.2)
where A1 and A2 are the slowly varying envelopes for the two wires. Subject to the tight-binding approximation, the propagation coefficients can be written as
 βs(ω ) =   β(ω)+ δ (ω )                        (4.3)
βa (ω ) =   β(ω)– δ (ω )                        (4.4)
where δ is a parameter that corresponds to the strength of the coupling. It is almost always positive, meaning that the symmetric supermode has a higher effective index than the antisymmetric supermode. The sign can however be changed with certain exotic waveguide configurations. An example is of AlGaAs waveguides surrounded by a metamaterial consisting of AlGaAs rods in a sapphire matrix [193181].

For the study of solitons, the coupling coefficient is usually assumed to be a constant [179]. For silicon wires, however, it is necessary to consider the full dependence on frequency. This is represented using the Taylor expansion δ(ω ) = m(δ /m!)
  m(ω – ω )
     0m. Rewriting equations 4.3 and 4.4 using these coefficients gives

βsm  =   βm + δm                            (4.5)

βam  =   βm – δm                            (4.6)
The δm coupling coefficients were obtained by calculating βsm and βam directly from Maxwell’s equations, using the methods described in section 2.1.2. By altering the dimensions of the waveguides, it was possible (to a certain extent) to alter the magnitudes of the coupling coefficients. However, despite considering a large range of waveguides, the signs could not be changed. The reason for this can be seen by noting that the evanescent fields surrounding the waveguide scale roughly with the wavelength. If we then assume that the coupling into a wire is proportional to its overlap with the exponentially decaying tail of another wire, then the frequency dependence of coupling can be approximated as
      – x(ω–ω0)
δ ≈ δ0e
(4.7)

where x is a positive constant. Taylor expanding this in ω gives

                        (    )
δ ≈ δ0 + (– δ0x)(ω – ω0)+ 1 δ0x2 (ω – ω0 )2 +𝒪 (ω – ω0)3
                       2
(4.8)

where we can identify the linear term δ0x with δ1 and the quadratic term δ0x2 with δ2. It therefore follows (assuming that we are not using an exotic waveguide configuration with negative c0) that δ1 is negative and δ2 is positive.

In the absence of all damping and nonlinearity, the two supermodes will propagate independently, with Schrödinger-like equations given by

∂As   ∑M  m –1         ∂mAs
-∂z-+     i   (βm + δm )-∂tm-  =  0                  (4.9)
      m=0
∂Aa   ∑M  m– 1         ∂mAa
-∂z-+    i    (βm – δm )-∂tm  =  0                 (4.10)
      m=0

where (unlike with a single wire) the zeroth and first order terms are still present, as they differ between the two modes and thus can’t be scaled away. (This is highly significant, as it is these differences that cause the coupling.) If we substitute equations 4.1 and 4.2 into equations 4.9 and 4.10, we obtain

  (          )     M   m– 1         (  m      m   )
1  ∂A1-+  ∂A2- + 1 ∑  i----(βm + δm)  ∂-A1-+ ∂--A2    =  0     (4.11)
2   ∂z    ∂z     2 m=0 m!             ∂tm     ∂tm
  (          )     M∑   m– 1         (  m      m   )
1  ∂A1-–  ∂A2- + 1    i----(βm – δm)  ∂-A1-– ∂--A2    =  0     (4.12)
2   ∂z    ∂z     2 m=0 m!             ∂tm     ∂tm

Adding or subtracting these gives

∂A1-  M∑  im–-1(   ∂mA1-     ∂mA2-)
∂z  +     m!   βm  ∂tm  +δm  ∂tm    =   0             (4.13)
     m=0      (                  )
∂A2-  M∑  im–-1    ∂mA1-     ∂mA1-
∂z  +     m!   βm  ∂tm  +δm  ∂tm    =   0             (4.14)
     m=0

These are simply the wave equations for an isolated waveguide, with a coupling term of the form

 M∑   m–1δm ∂mA '
    i   m!--∂tm--
m=0
(4.15)

added on, where A' is the envelope for the neighbouring waveguide.

Coupling length

If we consider the simplest possible case of coupled waveguides, with all nonlinearity, frequency dependence and damping removed (i.e. the case of monochromatic light through a lossless waveguide at low power), we obtain

∂A1-  =  iδ0A2                           (4.16)
 ∂z
∂A2-  =  iδ0A1                           (4.17)
 ∂z
where A1 and A2 are the envelopes for the two guides. This has a solution of the form
A1   =  cos(c0z)                           (4.18)

A2   =  isin (c0z)                          (4.19)
where we have imposed the boundary conditions that A1 = 1 and A2 = 0 at z = 0. (In other words, light has been fired into the first wire, but not into the second.) We can see that after a propagation distance of π/2δ0, all the light has been transferred in the second wire, and at π/δ0, all the light has returned to the first wire (with a phase shift). This will continue, with light being transferred back-and-forth between the two wires. (This back-and-forth light transfer is precisely that alluded to above, and is caused by beating between the symmetric and antisymmetric supermodes.) Using this fact, we can define the coupling length as
       π        π
LC ≡  2|δ| ≡ |β--–-β--|
        0     s0   a0
(4.20)

This gives the distance taken for transfer of light from one wire into another. This (in the same manner as the dispersion length) provides a very useful metric with which to gauge the intrinsic length scales of coupling. It is also possible to calculate the coupling length in a more direct manner, by performing numerical integration of the full Maxwell equations in a coupled wire system [194].

Conversion to dimensionless units

When converted into the familiar dimensionless units, the coupling term becomes

 M∑   m–1   ∂mEc
    i   cm ∂τm--
m=0
(4.21)

where the mth coupling coefficient is given by

      LD
cm = m!Tm-δm
        0
(4.22)

It follows from the definition of the δm coefficients (in equations 4.5 and 4.6) that these coefficients are given by

c  = --LD-- (β   – β  )
 m   2m!T0m   sm    am
(4.23)

By analogy with the dispersion operator Dˆ given by equation 2.71, we can define a coupling operator Ĉ as

    ∑M    (  ∂ )m
Cˆ=     cm  i---
    m=0      ∂τ
(4.24)

This gives (for a single wire interacting with a second wire with envelope E')

∂E
--- – iDˆE = i|E |2E + iCˆE '
 ∂ζ
(4.25)

which after reintroducing damping gives

∂E-   ˆ        2    ˆ  '               2          4
∂ζ – iDE = i|E| E + iCE – εlEn – ε2pa|En| E – ε3pa|En |E – εfcEnν
(4.26)

Waveguide arrays

The procedure for calculating waveguide coupling is linear, and so obeys the principle of superposition. Therefore, when multiple waveguides are present, the coupling between each pair of neighbours can be considered independently, and the resulting coupling terms simply added together. This can be extended to an arbitrarily large set of waveguides, so that the undamped equation for the nth waveguide is given by

∂En-– iˆDE  = i|E |2E  + iˆC (E    + E   )
 ∂ζ       n     n   n       n+1    n– 1
(4.27)

where the out-of-bounds values of En (i.e. those beyond the ends of the row of waveguides) are identically zero. Reintroducing damping gives

∂En-– iˆDE   = i|E |2E + iˆC (E   + E    )– εE  – ε  |E |2E – ε  |E |4E – ε E ν
 ∂ζ       n     n   n       n+1    n–1   l n    2pa  n      3pa n      fc n n
(4.28)

When N wires are present, it follows that there must be N separate equations to describe free carrier evolution. The equation for the nth waveguide is given by

dνn      4  νn
dτ = |En| –  τc
(4.29)

where νn is the carrier density in that particular wire.

4.2 Device specifications

The aim of this chapter is to control dispersion with coupling. To demonstrate this point, a normally dispersive waveguide was chosen, which in isolation would not be capable of exhibiting soliton propagation or modulational instability. Therefore, transverse dimensions of 220nm × 330nm were chosen, giving a normal GVD of 1416 psnm1 km1 at a 1.5μm pump wavelength (Figure 4.2). For 100fs pulses, this gives a dispersion length of 1.93mm. Such waveguides were placed 330nm apart, providing very strong coupling (as is shown in figure 4.2), but remaining within the tight-binding approximation (as is shown in figure 4.3). The mode profiles were shown above (in figure 4.1). The leading coupling coefficients are c0 = 132.7, c1 = 24.22 and c2 = 2.254, whilst the dispersion coefficient is p2 = 0.5.



Figure 4.2: The left hand plot shows dispersion length (as a function of wavelength) of a 220nm × 330nm SOI waveguide for 100fs pulses. The solid line gives anomalous GVD, whilst the dashed line gives normal GVD. The vertical bar gives the position of the 1.5μm pump wavelength. The right hand plot shows the coupling length for the same waveguides placed 330nm apart.



Figure 4.3: Demonstration of the validity of the tight binding approximation for 220nm × 330nm waveguides placed 330nm apart. The power transmission as a function of transverse distance is plotted, with the solid line giving the symmetric mode, and the dashed line giving the antisymmetric mode. The dots show the corresponding quantities calculated by linear superposition of the single-wire modes, showing a very good agreement. The grey blocks show the waveguide dimensions

4.3 Temporal solitons in a directional coupler

Directional couplers in silicon can support solitons, even if the constituent waveguides are normally dispersive at the pump frequency. The coupled system is described by the equations

∂E1
-∂ζ-– iˆDE1   =  i|E1 |2E1 + iCˆE2                    (4.30)

∂E2-– iˆDE2   =  i|E2 |2E2 + iCˆE1                    (4.31)
 ∂ζ
where E1 and E2 are the field envelopes of the two wires. (For clarity the damping terms are not shown.) To find supermodal solitons, we switch to a supermodal basis of the form
        1
X   ≡   2 (E1 + E2 )                       (4.32)
        1
 Y  ≡   2 (E1 – E2 )                       (4.33)
                                           (4.34)
where X and Y are the symmetric and antisymmetric modes respectively. Transforming equations 4.30 and 4.31 gives
      (      )
∂X-– i Dˆ+ Cˆ X   =  i|X |2X + 2i|Y|2X + iX ∗Y 2           (4.35)
∂ζ
∂Y- – i( ˆD – ˆC)Y  =  i|Y |2Y + 2i|X |2Y + iY ∗X2            (4.36)
 ∂ζ
We now have equations for the two modes, which are connected only by nonlinear coupling terms. Truncating the operators to second order gives
∂X            ∂2X           ∂X
--- + i(p2 + c2)-2-– ic0X + c1---  =   i|X |2X + 2i|Y |2X + iX ∗Y2
 ∂ζ           ∂τ             ∂τ
                                                                (4.37)
 ∂Y-+ i(p – c )∂2Y--+ ic Y – c ∂Y- =   i|Y |2Y + 2i|X |2Y + iY∗X2
 ∂ ζ     2   2 ∂τ2    0     1∂τ
                                                                (4.38)
Soliton solutions can now be found by setting either X or Y to zero, whilst retaining the other term. Doing this, we obtain one of the following equations
               2
∂X-+ i(p2 + c2)∂-X2 – ic0X + c1∂X  =  i|X|2X             (4.39)
∂ζ            ∂τ            ∂τ
 ∂Y-+ i(p – c )∂2Y-+ icY – c ∂Y-  =  i|Y|2Y              (4.40)
 ∂ζ     2   2 ∂τ2     0    1∂ τ
These equations have the respective solutions
           √--------     ( √----------        )
X   =   eiqζ 2 (q – c0) sech     --q–-c0--(τ – ζc1)           (4.41)
                         ( √-– (p2 +-c2)      )
        iqζ√--------         --q+-c0---
 Y  =   e   2 (q + c0) sech     – (p2 – c2) (τ +ζc1)         (4.42)
These solutions have the same structure as the familiar E = eiqz√ --
  2qsech(√ -- )
   2qτ solitons (and indeed become identical when we set the coupling terms to zero). Therefore, the waveguide pair is simply acting as a single quasi-waveguide, with modified propagation coefficients. The solutions are a generalisation of previously known solutions [179], to which they reduce in the case c1 = c2 = 0.

Three things about these solutions are notable:



Figure 4.4: a) The dispersion parameter for the coupling between 220nm × 330nm wires placed 330nm apart, plotted over wavelength. This shows the coupling operator to be strongly normally dispersive. b) The dispersion parameters for the symmetric and antisymmetric supermodes (dashed lines) together with that of a single wire (solid line). It can be seen that the spectral range of anomalous GVD is restricted for the symmetric mode, and extended for the antisymmetric mode.

4.3.1 Stability analysis of the antisymmetric mode

Just because equations 4.41 and 4.42 are stationary solutions, doesn’t mean that the corresponding solitons are stable. They could be mathematically analogous to a needle which is placed perfectly upright, balanced upon its point, such that the slightest perturbation will cause it to fall. Therefore, it is necessary to see how the solutions react to perturbation. Such analysis is widely established for models with frequency-independent coupling [196197], but here it is extended to systems where the coupling affects both the group velocity and GVD.

We will start by analysing the stability of solitons in the antisymmetric mode. We take a solution of the form X = 0, Y = y(ζ,τ)eiqζ (where y(ζ,τ)eiqζ is the solution given by equation 4.42 written in the form of a real amplitude y multiplied by a complex phase eiqζ). We then add perturbation terms to give

              iqζ
X   =  εy(ζ,τ )e                                 (4.43)
Y   =  y(ζ,τ)eiqζ + ey(ζ,τ)eiqζ                   (4.44)
where εy represents potential inter-modal instability, whilst ey represents instability within the mode. By only considering the initial stages of instability, we can linearise the problem by rejecting the higher powers of εy and ey. Making the substitution into equations 4.37 and 4.38 gives
 ∂εy           ∂2εy                ∂εy        2     2 ∗
 ∂ζ-+ i(p2 +c2)∂-τ2 + i(q– c0)εy + c1∂τ =  2iy εy + iy εy       (4.45)
∂e            ∂2e                 ∂e
--y + i(p2 – c2)-2y+ i(q+ c0)ey – c1-y  =  2iy2ey + iy2e∗y      (4.46)
 ∂ζ           ∂τ                  ∂τ
It is inconvenient to have y a function of both τ and ζ. However, if we make the substitution τ τy ζc1, where τy is relative time in a shifted frame of reference, we obtain the univariate function
                      (√ ---------- )
y (τ ) = √2-(q-+c-) sech  --q-+-c0--τ
   y           0         – (p2 – c2) y
(4.47)

Transforming equations 4.45 and 4.46 likewise gives

∂εy          ∂2εy                 ∂εy        2      2∗
∂ζ + i(p2 + c2) ∂τ2y + i(q – c0)εy + 2c1∂τy =  2iy εy + iy εy      (4.48)
        ∂e           ∂2e
        --y+ i(p2 – c2)--y2 + i(q + c0)ey  =   2iy2ey + iy2e∗y     (4.49)
        ∂ζ            ∂τy
We assume solutions of the form
εy  =  ˜εy(τy)eλyζ                          (4.50)
              lyζ
ey  =  ˜ey(τy)e                            (4.51)
where λy and ly are constants to be determined. This is an eigenvalue problem (as is shown in appendix C.1) with ˜ε y and y as the eigenfunctions and λy and ly as the eigenvalues.

Equations 4.50 and 4.51 exhibit a continuum of eigenvalues along the imaginary axis, which correspond to the set of all delocalised eigenfunctions. (This is derived in appendix C.2.) However when c12 + (p + c )
  2   2 (q– c )
     0 < 0, a bandgap exists in λy over the range

  |             |            |             |
  ||       --c21--||            ||       --c21--||
– |q– c0 + p2 + c2| < ℑ[λy] < |q– c0 + p2 + c2|
(4.52)

and when (q+ c0)/(p2 – c2) < 0, a bandgap in ly exists over the range

– |q+ c0|  < ℑ[ly] <  |q + c0|
(4.53)

These delocalised modes are not particularly important in themselves, as having zero real part they will not contribute to exponential growth and instability. The band-gap is significant however, as within it discrete eigenvalues can exist, which correspond to localised eigenfunctions [23]. These can have non-zero real part, and if positive will lead to exponential growth and thus instability. A search for discrete eigenvalues was made numerically, as is shown in appendix C.3.

It is known that a bifurcation point should occur at q = c0, corresponding to the existence of asymmetric nonlinear supermodes [196197]. (These modes are considered in more detail in section 4.5.2, in the context of modulation instability.) However no discrete eigenvalues corresponding to instabilities into these modes were found, despite scanning across both q and the pump frequency. This is not entirely unexpected, as the exotic solitons that form in these asymmetric modes have a higher energy than those in the ordinary antisymmetric mode [196197], and thus the latter are already in the more stable state.

Intramodal perturbation

For the intramodal perturbation, the discrete eigenvalues (of which there are four) can be found analytically. (The resulting spectrum is shown in figure 4.5.) There is a 2-fold degenerate solution with eigenvalue ly = 0 corresponding to the eigenfunction

          (√ --q-+-c0-- )
ey = isech    –-(p-–-c-)τy
                 2   2
(4.54)

and a second solution, again with 2-fold degeneracy and ly = 0, with eigenfunction

          ( √ --q+-c0--- )     ( √ --q+-c0--- )
ey = 2tanh    –-(p--– c-) τy sech    –-(p--– c-) τy
                 2   2                2   2
(4.55)



Figure 4.5: Spectrum of equation 4.49 shown for q = 130.9 (i.e. a pulse with unit duration) for the system described in section 4.2. A continuum of delocalised eigenvalues lies along the imaginary axis, with a bandgap in the range 1.754 < ey < 1.754. Four discrete eigenvalues exist at the origin, which are plotted as functions of τy.

None of these will contribute to instability, as they have no real part. A numerical search was made for additional discrete eigenvalues (again, using the method described in appendix C.3), but none were found. This suggests that the antisymmetric mode is also stable with respect to intramodal instability.

4.3.2 Bifurcation of the symmetric supermode

We can analyse the stability of the symmetric mode in the same way. (The solutions will not exist at a 1.5μm pump. Therefore a 1.4μm pump was used.) As before, we have a symmetric solution of the form X = x(ζ,τ)eiqζ, Y = 0 (where x(ζ,τ)eiqζ is the solution given by equation 4.41 written in the form of a real amplitude x multiplied by a complex phase eiqζ), which we perturb to give

X  =   x(ζ,τ)eiqζ + ex(ζ,τ)eiqζ                   (4.56)

Y  =   εx(ζ,τ )eiqζ                               (4.57)
where εx represents potential inter-modal instability, whilst ex represents instability within the mode. Substituting this into equations 4.37 and 4.38 and linearising gives
∂ex           ∂2ex                 ∂ex        2      2∗
∂ζ-+ i(p2 + c2)∂τ2-+ i(q– c0)ex + c1∂τ =  2ix εx + ix εx      (4.58)
∂ε            ∂2ε                 ∂ε
--x + i(p2 – c2)-2x+ i(q+ c0)εx – c1-x  =  2ix2ex + ix2e∗x      (4.59)
 ∂ζ           ∂τ                  ∂τ
As before, we shift into a moving frame of reference in order to avoid x being function of both τ and ζ. We make the substitution τ τx + ζc1, giving
       √ --------     (√ ----------  )
x (τx) =  2(q – c0) sech   --q-–-c0--τx
                         – (p2 + c2)
(4.60)

Transforming equations 4.58 and 4.59 likewise gives

        ∂ex          ∂2ex                    2      2 ∗
        ∂ζ-+ i(p2 + c2)-∂τ2 + i(q – c0)ex  =  2ix ex + ix ex     (4.61)
              2         x
∂εx-+ i(p2 – c2)∂-εx2 +i(q+ c0)εx – 2c1∂εx =  2ix2εx + ix2ε∗x      (4.62)
∂ζ            ∂τx                  ∂τx

Again, we assume solutions of the form

εx  =  ˜εx(τx)eλxζ                          (4.63)
              lxζ
ex  =  ˜ex(τx)e                            (4.64)
where λx and lx are constants to be determined. As before, this is an eigenvalue problem (as shown in appendix C.1) with ˜ε x and x as the eigenfunctions and λx and lx as the eigenvalues.

As with the antisymmetric case, 4.63 and 4.64 exhibit a continuum of eigenvalues along the imaginary axis, corresponding to the set of all delocalised eigenfunctions. (This is derived in appendix C.2.) When c12 + (p2 – c2)(q+ c0) < 0 a bandgap exists in λx over the range

  ||          2  ||            ||          2  ||
– ||q+ c0 +--c1--|| <  ℑ[λx] < ||q+ c0 +--c1--||
          p2 – c2                    p2 – c2
(4.65)

and when (q– c0)/(p2 + c2) < 0, a bandgap in lx exists over the range

– |q– c|  < ℑ[l ] < |q – c|
       0       x        0
(4.66)

It is known that in the absence of coupling GVD, a bifurcation point occurs at q = 5
3c0, which (as for the instability mentioned in section 4.3.1) corresponds to the formation of an asymmetric mode [196197]. If we wish to find similar behaviour, we need to search for a discrete eigenvalue which gains a positive real part when q exceeds a critical value. This search was made numerically (using the method described in appendix C.3), with the results shown in figure 4.6. This shows that as q is increased, a pair of discrete eigenvalues in λx (with values summing to zero) enter the band-gap and migrate along the imaginary axis towards the origin. When they reach it, they start heading in opposite directions along the real axis, and thus the soliton will become unstable.



Figure 4.6: Movement of discrete eigenvalues as q is increased. Starting at q = 1.65, the two eigenvalues lie on the imaginary axis. As q is increased towards 1.74 the eigenvalues move towards the origin. Between q = 1.74 and q 1.75, they reach the origin, and start moving along the real axis. At q = 1.85 they have a strong real part.

We therefore see a bifurcation when λx = 0. In the special case where c1 = 0, we can find an analytical solution

          (√ ---------- )
        2    --q-–-c0---
˜εx = sech     – (p2 + c2)τx
(4.67)

subject to the condition

   2-–-p2 –-c2
q = 2 + p2 + c2c0
(4.68)

Without coupling (when c2 = 0, and p2 must be 1
2), this reduces to q = 5
3c0, and thus the result is a generalisation of that seen in references [196] and [197]. We will return to the self-focussing effects that give rise to this instability in section 4.5.2.

Intramodal perturbation

For the intramodal perturbation, the discrete eigenvalues can (just as for the antisymmetric case) be found analytically. There is a 2-fold degenerate solution with eigenvalue ly = 0 corresponding to the eigenfunction

          (√ --q-–-c0-- )
ey = isech   –-(p-+-c-)τx
                 2   2
(4.69)

and a second solution, again with 2-fold degeneracy and ly = 0, with eigenfunction

          ( √---q–-c0--  )     ( √ --q–-c0--- )
ey = 2tanh    –-(p-+-c-) τx sech     –-(p--+c-) τx
                 2   2                2   2
(4.70)

Again, none of these will contribute to instability, as they have no real part. A numerical search was made for additional discrete eigenvalues, but none were found. This suggests that (as for the antisymmetric case) there is no intramodal instability.

4.3.3 Soliton generation

Soliton evolution was modelled numerically. (See appendix A.1.) The supermodes can be generated directly by firing coherent light into the array such that both wires are excited. The antisymmetric mode can be generated by pumping at a non-zero angle of incidence, so that the differing path lengths cause an incremental phase shift between neighboring waveguides [198199]. It is more interesting, however, to fire light into just one wire, thus creating a superposition of the two modes, and then to rely upon the group-velocity difference noted above to split the modes. Pulse propagation for the linear regime is given in figure 4.7, and it can be seen that the pulse splits cleanly in two. Introducing nonlinearity (but not damping) in figure 4.8 shows that the antisymmetric pulse forms a soliton. The introduction of damping in figure 4.9 shows that the soliton survives.



Figure 4.7: Evolution of a pulse with peak power 15P0 fired into the 1st wire of a 2 wire system. (Amplitude derived as √ ∑-------
    n|En|2, which corresponds to incoherent mixing between the outputs of each wire.) Without damping or nonlinearity, the pulse splits into a pair of supermodal pulses due to the effect of coupling on group velocity.



Figure 4.8: As with figure 4.7, but with nonlinearity turned on. The antisymmetric mode is anomalously dispersive, and so in the presence of nonlinearity, the pulse forms a soliton. Conversely, the symmetric supermode is normally dispersive, and so the nonlinearity accelerates pulse broadening.



Figure 4.9: As with figure 4.8, but with realistic damping (ε2pa = 0.1, εl = 0.05) turned on. Despite this, substantial nonlinear suppression of dispersive broadening of the antisymmetric pulse remains.

The soliton like pulse shown in figure 4.9 can be investigated further, by using the soliton area parameter defined in section 3.3.3. As before, we define a parameter S = TFWHM√P----
   max. For the ideal soliton solution (equation 4.42) the peak power (summed over both of the channels) is given (in terms of the intrinsic unit P0) by

Pmax = 4 (q + c0)P0
(4.71)

Similarly, the FWHM duration is given (in terms of the intrinsic unit T0) by

            (   √ -)√ -------
TFWHM  = 2ln  1+   2   c2 –-p2T0
                       q+ c0
(4.72)

This gives an ideal soliton area parameter of

              (      )
S  = 4T √P--ln 1+ √2- √c--–-p-
 0     0   0             2   2
(4.73)

Figure 4.10 shows the soliton area parameter plotted as a function of distance. It remains roughly constant, having a value close to S0, thus showing that the pulse is a soliton.



Figure 4.10: Soliton area parameter for the soliton in figure 4.9. The solid line gives the value of S along the distance of propagation. This remains roughly constant and close to the ideal value S0 (denoted by the dashed grey line). The dotted line gives the parameter with nonlinearity turned off, showing a stark contrast with the nonlinear regime. (The quantities are scaled such that the power unit P0 = 1W.)

4.4 Temporal solitons in multiwire waveguide arrays

4.4.1 Interplay between diffraction and dispersion

The above can be generalised to a system with an arbitrary number of waveguides, N. We start by rewriting the undamped equations of motion (equations 4.27) in a vector form, where ⃗E contains the values of En. This gives

 ⃗           (  )
∂E-– iDˆ⃗E = if E⃗ + iXˆˆC⃗E
∂ζ
(4.74)

where the function f(⃗x) ≡|⃗x|2⃗x acts independently on each element of the vector, and the matrix operator ˆX is defined as

ˆX     ≡ δ      + δ
 (μ)(ν)   (μ)(ν– 1)   (μ)(ν+1)
(4.75)

where δ is the Kronecker symbol. (It should be noted that the matrix ˆX will be encountered once again in section 5.3, when the radiation emitted by spatiotemporal solitons is considered.)

In order to proceed, we need to diagonalise this matrix equation. We write the eigenvalue equation for Xˆ as

ˆX⃗xk = χk⃗xk
(4.76)

which yields eigenvalues of the form

         (      )
χk ≡ 2cos  -πk---
           N + 1
(4.77)

with corresponding (normalised) eigenvectors

       √ ------   (     )
(⃗xk) =   --2---sin  -πnk--
    n    N  +1     N + 1
(4.78)

It should be noted that the index k is arbitrary, and thus may be replaced with other indices, as is done for χj and ⃗xj below. The eigenvalue are distinct, as the argument of the cosine function is always within the range 0 < 𝜃 < π, over which the cosine function shows one-to-one correspondence. As Xˆ is both self-adjoint and distinct-eigenvalued, it follows that the normalised eigenvectors must form an orthonormal basis. This can be seen explicitly by defining a scalar product operation

      N∑    ∗
⃗a⋅⃗b ≡    (a)n (b)n
      n=1
(4.79)

Using the trigonometric identity

           (      )    (      )
--2---N∑      -πnj--     -πnk--
N + 1    sin  N + 1  sin  N + 1   = δ(j)(k)                (4.80)
      n=1
                     ∀j,k ∈ {1,2,⋅⋅⋅ ,N}
(where δ is the Kronecker symbol) gives ⃗xk ⃗xj = δ(j)(k), as required. We can therefore diagonalise equation 4.74 by expanding ⃗E in this basis as
    √ ---------
⃗     2(N-+-1) N∑  ˜
E =       3       Ek⃗xk
               k=1
(4.81)

where k is the amplitude of each modal component. (The √2-(N-+-1)/3 prefactor is a matter of convenience, chosen so that in a few steps further on, equation 4.92 will have unit nonlinearity.)

The basis conforms to the condition that E0 = EN+1 = 0, meaning that the (non-existent) wires outside of the waveguide boundaries contain no light. It should also be noted that this basis is a generalisation of that used for the N = 2 case in section 4.3, with 1 = X and 2 = Y .

Substituting this into equation 4.27 and then taking the scalar product with any basis vector ⃗xj (to project out the jth mode) gives

                           (        )
∂ ˜Ej          2              ∑N
-∂ζ-– iˆDjE˜j = 3 (N + 1)⃗xj ⋅f    ˜Ek⃗xk
                             k=1
(4.82)

where the effective dispersion of each mode is given by

ˆDj = ˆD + χj ˆC
(4.83)

We have not fully diagonalised the equations, due to the nonlinear terms. However, there are no longer any linear terms linking the components of equation 4.82. Therefore, in the low power limit the system is diagonal and so each component of equation 4.82 will evolve independently.

Normal and anomalous diffraction

It is apparent from equation 4.83 that the dispersion and coupling are interlinked, with the eigenvalue χj controlling the extent of this interaction. The fact that these eigenvalues are distinct means that each mode has a unique dispersion relation. The values of χj occur in pairs having opposite sign (plus a zero-eigenvalue when N is odd) due to the property that cos(𝜃) ≡–cos(π– 𝜃). This is highly significant, as by choosing the appropriate mode, it is possible to effectively reverse the signs of the coupling coefficients.

It is instructive to rewrite equations 4.77 and 4.78 as

χj  =  2cos(κx)                              (4.84)
       √ ------
⃗xj  =    --2---sin(κxn)                       (4.85)
         N + 1
where the periodicity of the waveguide amplitudes is given by the transverse wavenumber
      πj
κx ≡ ------
     N + 1
(4.86)

In the limit of an infinitely large array, the transverse wavenumber becomes a continuous varying quantity. If we were to excite such an array with a coherent beam of light fired along the direction of the waveguides, then a mode with all the wires in phase (and thus κx = 0) would be excited. If the initial excitation was restricted to a finite region of the waveguide, then a spatial wavepacket centred about κx = 0 would be excited. Alternatively, by tilting the beam with respect to the direction of the waveguides, a phase shift would be introduced between each wire (due to the varying path length) thus exciting a mode with non-zero κx [199]. In general, such a beam will spread into the surrounding waveguides, in what is essentially a discrete version of diffraction. It can be shown, for instance, that the width of a Gaussian beam focused onto the array boundary will vary with propagation as [199]

       √ -------------
            ( 2Cχ-''j )2
W  = W0  1+    W 20 ζ
(4.87)

where W0 is the initial width. The diffraction coefficient χj''2χj/∂κx2 = χj specifies the magnitude of the diffraction. It is the spatial equivalent of dispersion, and is almost always negative [198].

At κx = 0, the diffraction coefficient possesses its minimum value of χj'' = 2. As κx is increased from 0, the diffraction coefficient becomes less negative, until at κx = π/2 the diffraction vanishes, and the beam propagates with constant width. Increasing κx beyond π/2 causes the diffraction coefficient to become positive, in what is known as anomalous diffraction [198]. At κx = π the diffraction coefficient reaches its maximum value of χj'' = 2. Notably, equation 4.87 is not dependent on the sign of χj'', and so this state broadens in exactly the same manner as the κx = 0 state. More generally, the states with 0 ≤|κx| < π/2 give normal diffraction, whilst the states with π/2 < |κx|≤ π give anomalous diffraction. It can also be seen that normal diffraction corresponds to positive values of χj, whilst anomalous diffraction corresponds to negative values.

The mode profiles for the κx = 0, π/2 and π cases are plotted in figure 4.11. For κx = 0, the wires are all in phase, whilst for κx = π every mode is maximally out of phase with its neighbours. The diffractionless mode at κx = π/2 corresponds to a mode in which every other wire has zero amplitude, as is shown in figure 4.11.



Figure 4.11: Examples of modes of an infinite waveguide array (shown over a 20-waveguide section). At a transverse wavenumber of κx = 0 (bottom), all the waveguides are in phase, giving normal diffraction. At a transverse wavenumber of κx = π/2 (middle) every other waveguide has zero amplitude (denoted with crosses), giving zero effective coupling and thus zero diffraction. At κx = π every waveguide is out of phase with its neighbours, giving anomalous diffraction.

In the more general case, the relationship between the normally diffractive and anomalously diffractive modes can be seen by defining a matrix Ŝ as

 ˆ            μ
S(μ)(ν) ≡ – (– 1) δ(μ)(ν)
(4.88)

where δ is the Kronecker symbol. This matrix anticommutes with Xˆ and thus multiplication of an eigenvector by Ŝ provides the eigenvector with oppositely signed eigenvalue. Therefore, multiplication by Ŝ (which is self-inverse) switches back-and-forth between states with normal and anomalous diffraction. The operation has the effect of shifting the phase of every other wire by half a cycle, which in turn has the effect of reversing the inter-wire phase relationship between each neighboring pair. Therefore, each normally diffractive mode is paired with an anomalously diffractive mode having the same amount of light in each waveguide, and the same beam widening properties, but opposing inter-wire phase relationships.

It should be noted that the scheme described above is not the only means of achieving anomalous diffraction. The most direct means is to invert the sign of coupling (as mentioned in section 4.1) by using systems consisting of (for example) AlGaAs waveguides separated by a metamaterial consisting of AlGaAs rods in a sapphire matrix [193181]. It can also be achieved by using higher order photonic bands [191200].

When a finite array of wires of wires is used, κx becomes quantised to the values given by equation 4.86. Whilst the extremal κx = 0 and κx = π modes can only exist in an infinite array, the κx = π/2 diffractionless modes can be seen, and exist for all arrays with odd N.

Controlling dispersion with diffraction

From equation 4.83 it is apparent that for normally diffractive modes (with positive χj), the coupling dispersion operator Ĉ will be added to the dispersion operator ˆD, whilst for anomalously diffractive modes (with negative χj), Ĉ will be subtracted from  ˆ
D. In otherwords, normal diffraction will cause the coupling dispersion to reinforce the waveguide dispersion, whilst anomalous diffraction will cause the coupling dispersion to counteract the waveguide dispersion.

As was described in section 4.1, the GVD of coupling is usually normal. However, by using anomalous diffraction, we can use it to create anomalous GVD. Figure 4.12 shows the parameter space (in wavelength and transverse wavenumber) giving anomalous GVD for a set of 220nm × 330nm waveguides placed 330nm apart. Examples of actual dispersion curves that can be created like this, are shown in figure 4.13.



Figure 4.12: The relation between dispersion and diffraction in an array of 220nm × 330nm waveguides placed 330nm apart. The shaded region gives the parameter space (in wavelength and transverse wavenumber) giving anomalous supermode GVD. The transverse wavenumbers corresponding to 2 and 6 wire systems are highlighted with horizontal lines. The mode profiles for the 6 wire system are plotted, with the index j corresponding to that in equation 4.83.



Figure 4.13: Supermodal dispersion plots in an array of 220nm × 330nm waveguides placed 330nm apart. The solid line gives the GVD of an isolated wire. The dashed lines gives the GVDs of the two-wire supermodes. The dotted lines give the GVDs of the six-wire supermodes, with the modes plotted to the right, and the index j corresponding to that in equation 4.83. The shaded area gives the maximum possible extent of dispersion modification, with the boundaries corresponding to the κx = 0 and κx = π cases of an infinite system.

As mentioned previously, there is a correspondence between anomalous diffraction and negative index refraction [181182]. Switching back into the unscaled units (defined in section 4.1), the supermodal wavenumber is given by βmode = β + χjδ, and so the supermodal effective index is given by

nmode ≡ λ-βmode =-λ-(β + χjδ)
        2π       2π
(4.89)

Whilst χj is not capable of changing the sign of the refractive index, it is capable of reversing the sign of one of the contributing terms.

4.4.2 Supermodal soliton solutions

When only one mode j is present, equation 4.82 reduces to

 ˜                     ∑N    (      )
∂Ej-– iDˆj ˜Ej = i---8---    sin4  -kπj-- |˜Ej|2E˜j
 ∂ζ           3 (N  + 1)k=1     N + 1
(4.90)

This can be simplified using the identity

                             (
         N∑     (      )      |{  1  ; j ⁄= N+21-
----8---    sin4  -kπj--   =                              (4.91)
3 (N  + 1) k=1     N + 1       |(  4        N+1-
                                3  ; j =  2
        ∀j ∈ {1,2,⋅⋅⋅ ,N }
giving an uncoupled NLS equation of the form
∂E˜j-– iˆD ˜E  = iΓ   |E˜ |2 ˜E
 ∂ζ     j j     Nj  j  j
(4.92)

where the coefficient ΓNj is given by

      (
      |{  1 ;  j ⁄= N+21-
Γ Nj ≡
      |(  4        N+1-
         3 ;  j =  2
(4.93)

The exceptional cases at j = (N + 1)/2 correspond to the κx = π/2 non-diffractive modes, in which the wires are effectively uncoupled. (They also correspond to the trivial N = 1 case, where only a single wire is present.) Given the lack of any interaction between dispersion and diffraction, these modes are not particularly interesting, and will not be considered further.

Taking this single-mode equation, and truncating the operators to second order gives

∂E˜j-+ i(p  + χ c)∂2E˜j-– iχ cE˜ + χ c ∂ ˜Ej-= iΓ  |E˜ |2 ˜E
 ∂ζ     2    j2  ∂τ2     j0  j   j 1∂τ      Nj  j  j
(4.94)

This has solutions of the form

        √ -----------    ( √ ------------         )
      iqζ  2 (q – χjc0)          q– χjc0
E˜j = e    ---Γ-Nj--- sech     –-(p2-+χjc2) (τ – ζχjc1)
(4.95)

From equations 4.78 and 4.81, it follows that the transformation out of the modal basis is made as

     √ -- N    (      )
En =   4 ∑  sin  -πnk-- E˜k
       3 k=1    N  +1
(4.96)

Doing so, gives multi-wire soliton solutions of the form

        √ --------------   (      )     (√ ------------         )
      iqζ  --8--             -πnj--         --q-–-χjc0---
En = e    3Γ Nj (q– χjc0) sin N +1   sech    – (p2 + χjc2) (τ – ζχjc1)
(4.97)

It can be seen that in the single wire N = 1 case (where p2 = 1
2, ΓNj = 4
3 and χj = 0) this reduces to the conventional solution of E = √ --
  2qsech(√ -- )
   2qτeiqζ . The solutions are a further generalisation of previously known solutions [179], to which they reduce in the case N = 2 and c1 = c2 = 0.

4.4.3 Soliton formation

Soliton evolution was modelled numerically. As with the two wire case, the anomalously diffractive supermodes can be excited directly by pumping at a non-zero angle of incidence [198]. However, as before, the method of pumping a single wire and then letting the resulting superposition of modes separate was used. Pulse propagation for the linear regime is given in figure 4.14, and it can be seen that the input pulse splits into a train of six pulses. Introducing nonlinearity (but not damping) in figure 4.15 shows that the final pulse in the train forms a soliton. The introduction of damping in figure 4.16 shows that the soliton survives.



Figure 4.14: Evolution of a pulse with peak power 150P0 fired into the 3rd wire of a 6 wire system. (Amplitude derived as √ ∑-------
    n|En|2, which corresponds to incoherent mixing between the outputs of each wire.) Without damping or nonlinearity, the pulse splits into a train of 6 supermodal pulses (with numbering corresponding to j in equation 4.83).



Figure 4.15: As with figure 4.14, but with nonlinearity turned on. The 6th pulse forms a soliton.



Figure 4.16: As with figure 4.15, but with realistic damping (ε2pa = 0.1, εl = 0.05) turned on. Despite this, substantial nonlinear supression of pulse broadening of the 6th pulse remains.

The soliton-like j = 6 pulse shown in figure 4.16 can be investigated (as in section 4.3.3) by using the soliton area parameter. For the ideal soliton solution (equation 4.97) the peak power summed over all of the channels is given by

       4(q– χjc0)(N + 1)
Pmax = ------3Γ---------P0
               Nj
(4.98)

and the FWHM duration is given by

            (     -)√ ------------
TFWHM  = 2ln 1 +√ 2    – (p2 +-χjc2)T0
                         q– χjc0
(4.99)

This gives an ideal soliton area parameter of

                        √ -------------------
     4   √ ---  (   √ )   – (p2 + χjc2)(N + 1)
S0 = √3T0  P0 ln  1+   2   -------Γ-Nj-------
(4.100)

Figure 4.17 shows the soliton area parameter plotted as a function of distance. It remains roughly constant, having a value close to S0, thus showing that the pulse is a soliton.



Figure 4.17: Soliton area parameter for the j = 6 soliton shown in figure 4.16. The solid line gives the value of S along the distance of propagation. This remains roughly constant and close to the ideal value S0 (denoted by the dashed grey line). The dotted line gives the parameter with nonlinearity turned off, showing a stark contrast with the nonlinear regime. (The quantities are scaled such that the power unit P0 = 1W.)

4.5 Modulation instability in couplers

Modulation instability (which was considered for a single wire in section 3.4) can also happen in coupled waveguide systems [201202]. As for soliton propagation, the conventional models assume a frequency independent coupling term. Likewise, when coupling GVD is introduced, it can be seen that the dynamics are greatly changed, and that modulation instability can occur in normally dispersive waveguides.

Equations 4.35 and 4.36 possess continuous wave solutions of the form

    √ --i(P+c )ζ
X =   Pe    0
(4.101)

for the symmetric mode, and

    √--
Y =  P ei(P–c0)ζ
(4.102)

for the antisymmetric mode. As in section 3.4, P is related to the optical power. It follows from the transformation in equations 4.32 and 4.33 that P is the power per wire, and so the total power in the system in given by 2P.

As with the single wire case, perturbations from these solutions can be reinforced by the nonlinearity, creating spectral sidebands. However, two very different phenomena can result, depending on whether these sidebands are emitted into the same mode as the pump signal, or into the other mode. Both cases are considered below.

4.5.1 Intra-modal modulation instability

Taking the antisymmetric solution, we can perturb it as

    (  --  )
Y =  √ P + ε ei(P–c0)ζ
(4.103)

where ε(ζ,τ) is a small perturbation into the same mode. Substituting this into equation 4.36 gives

    (          )
∂ε-–  ˆD – ˆC + c0 ε = iP (ε+ ε∗)
∂ζ
(4.104)

where terms in ε2 and ε3 have been removed, as we are only considering the first stage of perturbation. This is the same as equation 3.14, except we have replaced the dispersion operator ˆD with a modified operator ˆD Ĉ + c0. We can therefore use the same procedure to derive an expression for the modulation instability side-lobes. The gain spectrum is given by

           √(----------------)------(----------------)
g (ωm ) = 2ℑ   ˆDeven – ˆCeven + c0 2 + 2P Dˆeven – Cˆeven + c0
(4.105)

where (as before) the even operators are defined as

              1
Deven(ω)  ≡   -[D (ω )+ D (– ω)]                  (4.106)
              21
 Ceven(ω)  ≡   -[C(ω) +C (– ω)]                  (4.107)
              2
so that they only hold components with even powers of ω. Notably, equation 4.105 does not depend on c0, as the explicit c0 term cancels with the implicit term in the Ĉ operator. Therefore, the modulation instability depends only on the GVD of the coupling, and not its overall value. The gain maxima occur when
Dˆ   (ω  )– ˆC   (ω  )+ c + P = 0
  even  m     even  m     0
(4.108)

where as described in section 3.4, this is a necessary but insufficient condition, requiring the explicit removal of pathological solutions. The predicted modulation instability peak positions are given in figure 4.18, whilst the numerical spectra are given in figure 4.19. As with a single wire case, four modulation instability peaks can be seen. However, these become obscured in the presence of free carrier interactions. (As in section 3.2.1, a physically reasonable estimate for the carrier cross section of εfc 103(1+ 7.5i) was used.) It should be noted, however, that techniques exist to sweep away free charge carriers [59], and so a more favourable experimental result may be possible.



Figure 4.18: Predicted modulation instability sideband wavelengths for a 220nm × 330nm SOI waveguide. For an isolated waveguide (denoted by the dashed lines) the modulation instability only occurs for pump wavelengths where the waveguide is anomalously dispersive (between the dashed vertical lines). When coupling GVD is present, by using the antisymmetric supermode of two such waveguides placed 330nm apart (denoted by the solid line) the modulation instability is extended into the anomalous regime.



Figure 4.19: Spectral output of a pair of 220nm × 330nm SOI wires placed 330nm apart pumped with a 10ps rectangular pulse with power 10P0 in the antisymmetric mode after 3.9mm. The top plot includes 2PA and linear damping (ε2pa = 0.1, εl = 0.05) but not free carrier effects, whilst the bottom plot does include them. The dashed lines give the predicted gain spectra. The powers used to calculate these have been reduced to 5P0 and 3.5P0 respectively, to allow for the effect of damping.

4.5.2 Cross-modal modulation instability

In the symmetric supermode, the effect of coupling GVD is to restrict the wavelength range in which conventional modulational instability is possible. A different picture emerges, however, when the effect of cross-modal instability is considered. This can be seen by taking the symmetric supermodal solution, and modifying it as

       √ --i(P +c0)ζ
X   =    Pe                               (4.109)
 Y  =  εei(P+c0)ζ                          (4.110)
where ε is the cross-modal perturbation. Strictly speaking, the equation for X should also be perturbed, to account for the energy loss into the other mode. When analysing ε, however, this is a second-order effect, and can be neglected for the first stages of propagation. Following the same procedure as above gives the gain spectrum as
           √------------------------------------------
g (ω  ) = 2ℑ ( ˆD    – ˆC   – c )2 + 2P (Dˆ  – Cˆ   – c )
    m          even    even   0          even   even   0
(4.111)

Unlike equation 4.105, this does depend on the value of c0, as the explicit and implicit values reinforce one another. This is a result of the wavenumber mismatch between the mode of the signal and the mode of the emitted radiation. The gain maxima occur at the solutions to

Dˆ   (ω  )– ˆC   (ω  )– c + P = 0
  even  m     even  m     0
(4.112)

This gain spectrum is notably different from the inter-modal case, in that we can have non-zero gain at the pump frequency. We can see this by setting ωm = 0, which reduces equation 4.111 to

         √ --------
g(0) = 4ℑ  c20 – Pc0
(4.113)

We therefore have a critical power at Pc = c0, above which the pump itself is unstable.

Modal instability and asymmetric nonlinear supermodes

This phenomenon of the pump itself becoming unstable is well documented, and results from the fact that the modes themselves are altered by the nonlinearity [179]. These modes can be investigated by rewriting equations 4.30 and 4.31 in a different basis as [203]

dS1
-dζ- =   2c0S3                             (4.114)
dS
---2 =   – S1S3                            (4.115)
 dζ
dS3- =   S1S2 – 2c0S1                      (4.116)
 dζ
where the parameters S1, S2 and S3 are defined has
S   =  |E |2 – |E |2                        (4.117)
 1       1      2
S2  =  2ℜ [E1E∗2]                          (4.118)
S   =  2ℑ [E E∗]                          (4.119)
 3          1 2
(We have assumed evolution in the continuous wave regime, and so the time dependent terms are zero.) Conservation of energy subjects these variables to the condition
|S |2 + |S |2 + |S |2 = (2P)2
  1     2     3
(4.120)

where (in keeping with the above notation) P = 12(          )
|E1|2 + |E2 |2 gives the mean power per wire. If S1, S2 and S3, are regarded as the Cartesian coordinates of a three-dimensional vector S⃗, then equation 4.120 defines a sphere of radius 2P, on which ⃗
S must lie. (This is extremely similar to the Stokes formation of optical polarisation, with ⃗S being a modified Stokes vector [204], which is not surprising, as instead of having light split between two principal polarisation states, we have light split between two wires.)

We can proceed by noting that

Γ ≡ S2 + S21/ (4c0)
(4.121)

is a conserved quantity. This allows us to combine equations 4.114, 4.115 and 4.116 into a single second order equation of the form [203]

       (               )
d2S1 +  4c2– 2c Γ + 1 S2 S = 0
 dζ2      0    0   2  1   1
(4.122)

This is an anharmonic oscillator (which oscillates in ζ). We can analyse its dynamics by defining a Lagrangian

  (   dS1  )   1 (dS1 )2  [(  2     ) 2   1 4]
L  S1,-dζ ,ζ = 2  -dζ-  –   2c0 – c0Γ S1 + 8S1
(4.123)

which upon substitution into the Euler Lagrange equation

  (        )
d-( --∂(-L-)) – -∂L-= 0
dζ  ∂  dS1     ∂S1
        dζ
(4.124)

gives equation 4.122 as required. We can separate the Lagrangian, into a quasi-kinetic energy (dS1/dζ dependent) term T and a quasi potential energy (S1 dependent) term V, using the standard energy construction of the Lagrangian (L = T –V). This gives

       1( dS1)2
T  =   2  dζ                                 (4.125)
      (        )     1
V  =   2c20 – c0Γ S21 +-S41                    (4.126)
                     8
The stability is dependent on the shape of the quasi potential energy function V. (See figure 4.20.) When 2c02 > c0Γ it is monostable, with a single potential well centred about S1 = 0. If the two waveguides have equal amplitude (such that S1 is also zero) this mode will sit at the bottom of the potential well, making it stable. However, when c0Γ > 2c02, the potential becomes bistable, with the mode sitting at the critical point between two potential wells (with minima at S1 = ±2√ -------2
  c0Γ – 2c0), making it unstable. It follows that Γ = 2c0 is the seperatrix between the regimes of stability and instability.


Figure 4.20: Quasi potential energy function V (given by equation 4.126), versus Stokes parameter S1. Plotted for c0 = 1 and for varying values of Γ. When Γ = 1 (such that c0Γ < 2c02), the potential is monostable. When Γ = 2 (such that c0Γ = 2c02), the potential is on the seperatrix. When Γ = 4 (such that c0Γ > 2c02), the potential becomes bistable. It follows that the symmetric mode solution (denoted by the dot) is stable for the first two cases, but unstable for the third case.

It should be noted that S1 measures the asymmetry between the intensities in each wire, such that a value different from S1 = 0 corresponds to a mode in which one wire contains more light than the other. Therefore, the potential wells that form when c0Γ > 2c02 correspond to the influence of asymmetric modes. These asymmetric modes are the result of self focussing, whereby the intensity dependence of the refractive index causes a waveguide with more light to become more confining, thus reinforcing any asymmetry. (Self focussing in coupled systems is a well known phenomenon for both continuous wave [205] and for solitons [206], which will be considered in more detail in chapter 5.) This self focussing can only happen when there is an existing asymmetry, and so (when c0Γ > 2c02) the symmetric supermode is metastable.

This behaviour can be understood by returning to the analogy of a needle placed perfectly upright, balanced upon its point (as mentioned in section 4.3.1). If the needle is completely isolated from all external forces, then it will remain in position. However, any perturbation will be reinforced by gravity, and the needle will topple over. Notably, the initial state has rotational symmetry (about the axis defined by the shaft of the needle), as do the equations of motion describing the system. When the needle topples (and the system falls into a "vacuum state"), spontaneous symmetry breaking occurs, in which a new parameter (namely the azimuthal angle of the needle’s position) emerges. In the coupler we also have a symmetry, namely an invariance under swapping the wires. This is present both in the initial conditions, and in the equations of motion (which can be seen by swapping E1 and E2 in equations 4.30 and 4.31. Under perturbation, a new parameter (namely the sign of S1) emerges, and thus we have symmetry breaking into one of two states.

It is also possible to "seed" the instability by introducing a small asymmetric signal, which causes the system to collapse into a particular state. This effect has been suggested as the basis of optical switches, whereby a weak signal can control the direction of a much stronger signal [207].

The symmetric supermodal solution has E1 = E2 = √P--ei(P +c0), giving a value for Γ (from equation 4.121) of Γ = 2P. This sets the seperatrix between the stable and unstable regimes at P = c0, which agrees with the result derived from equation 4.113. We therefore see that the cross-modal modulation instability is a frequency-dependent generalisation of ordinary coupler instability. Notably, this means that self focussing can occur below the critical power, but that it happens indirectly, via the generation of frequencies away from the pump.

Numerical modelling

For a system with 220nm × 330nm wires placed 330nm apart pumped at 1.5μm, equation 4.112 can be solved to give two gain peaks at 1.33μm and 1.74μm. This system was modelled numerically, as is shown in figure 4.21. This shows that sub-critical-power self-focussing does occur via the generation of spectral sidebands. The effect seems to vanish in the presence of free charge carriers. However, as noted previously, it is possible to sweep away free charge carriers [59], and so it may be possible to observe the phenomenon in a real device.



Figure 4.21: Spectral output of a pair of 220nm × 330nm SOI wires placed 330nm apart pumped with a 10ps rectangular pulse with power 10P0 in the symmetric mode after 3.9mm. The top plot includes 2PA and linear damping (ε2pa = 0.1, εl = 0.05) but not free carrier effects, whilst the bottom plot does include them. Spectral lines are generated at 1.33μm and 1.74μm as predicted, but these are obscured when free charge carriers are present.