![]() |
|||


Table of Contents:
3.2 Polarization and Phase relations
3.4 Optical Kerr Effect: Theory
3.5 Cross Phase Modulation (XPM)
4. Soliton Propagation in Optical Fibers
4.2 Optical fiber and its non-linearity.
4.4 Comparison between actual transmission system and
soliton transmission
4.5 Applications: Light Guiding Light
Appendix 1:
Solitons - Historical Background
Some important advances in nonlinear optics:
Ø Townes (1960), invention of the laser.
Ø Franken (1961), First observation ever of nonlinear optical effects, second harmonic generation (SHG).
Ø Terhune (1962), First observation of third harmonic generation (THG).
Ø E. J. Woodbury and W. K. Ng (1962), first demonstration of stimulated Raman scattering.
Ø Armstrong (1962), formulation of general permutation symmetry relations in nonlinear optics.
Ø A. Hasegawa and F. Tappert (1973), first theoretical prediction of soliton generation in optical fibers.
Ø H. M. Gibbs (1976), first demonstration and explanation of optical bistability.
Ø L. F. Mollenauer (1980), first confirmation of soliton generation in optical fibers.
Recently, many advances in nonlinear optics has been made, with a lot of efforts with fields of, for example, Bose-Einstein condensation and laser cooling; these fields are, however, a bit out of focus from the subjects of this course, which can be said to be an introduction to the 1960s and 1970s advances in nonlinear optics. It should also be emphasized that many of the effects observed in nonlinear optics, such as the Raman scattering, were observed much earlier in the microwave range.
Some important applications in nonlinear optics:
Ø Optical parametric amplification (OPA) and oscillation (OPO), hwp=>hws+hwi.
Ø Second harmonic generation (SHG), hw+ hw => h(2w).
Ø Third harmonic generation (THG), hw+ hw+ hw => h(3w).
Ø Pockels effect, or the linear electro-optical effect (applications for optical switching).
Ø Optical bistability (optical logics).
Ø
Optical solitons (ultra long-haul communication).
In dielectric materials, when applying an electric field, we induce a small (compared to atomic dimensions) displacement of positive and negative charges. Every molecule of the material acquire an induced dipole moment and the material as a whole is said polarized. All these dipoles produce their own field, which adds to the external one. If the external field oscillates, with a frequency far from some resonant frequency of the material (i.e. there is no absorption) the polarization, defined as the sum of all the dipole moments, follows the external field. Thus, polarization can be related to the applied electrical field E through the so-called electrical susceptibility tensor X:
where we have separated the three spatial components of the polarization and highlighted the frequency dependence of the various terms. if the tensor X is not diagonal, the polarization, normally, is not parallel to the external field. If the electrical field become strong enough, the linear response regime, described by equation 2.1 doesn't hold anymore and we must consider non-linear terms of the polarization:
where with X(1) and X(2) we indicate the linear and the
second-order susceptibility; from now on we'll drop the tensor formalism. Notice
that if we are in presence of two fields with different frequencies, the
non-linear polarization allows the exchange of energy between a number of
electromagnetic fields of different energies. Second harmonic generation (SHG)
is one application of this phenomenon. In SHG, part of the energy of an optical
wave of frequency ω
propagating through a material is converted to that of a wave of frequency 2ω. Thus assume that electronic
response to a driving electric field can be simulated by that of an electron in
an anharmonic potential well. The equation of motion
for the electron is then
where X is the deviation from the potential minimum, mDX2 is the anharmonic restoring force (corresponding to the (m/3)DX3 in the potential), the driving electric field is Eocos(wt) and γ the dumping term. Notice that in centrosymmetric crystals must be D=0, so that SHG can have place only in non-centrosymetric materials.
|
Figure 2.1: An applied sinusoidal electric field and the resulting polarization; (a) in a linear crystal and (b) in a crystal lacking inversion symmetry. |
This process is illustrated in figures (2.1) and (2.2). In figure (2.1) is shown the difference between the linear response regime and the non-linear one: in the last, the polarization still have the same periodicity of the driving field, but it is not anymore a sinusoidal function. We can, thus, decompose the polarization function in its Fourier components, shown in figure (2.2).
|
Figure 2.2: Analysis of the nonlinear polarization wave: (a) polarization from figure (2.1), (b) first harmonic the P Fourier factorization, (c) second harmonic (2w) and (d) DC component.) |
If the material is not invariant under inversion symmetry, will have a non-zero second-harmonic components and, thus, an oscillation at a frequency w2=2w. This means that an electromagnetic field with frequency 2w will be generated in our system. The amplitude of the second-harmonic field is always several orders of magnitude lower than the incoming (and transmitted) one. Nevertheless, it is possible greatly increase the conversion efficiency, fulfilling the so-called phase-matching condition. It can be demonstrated that for the conversion efficiency holds
where kw
is the wavevector of the incoming wave, k2w
is the wavevector secon-harmonic
generated one and
is
the length of our dielectric material. Thus, to maximize the efficiency
conversion, we must work in an experimental condition for which
,
or k2w =kw. If Δk≠0, the
second-harmonic generated wave at some plane (e.g. X1),
having propagated to some other plane (X2), is not in phase
with the second-harmonic genearated wave at X2.
This results in the interference described by the factor (sin2(x)/x2)in
equation (2.5). The common technique that is used to satisfy the phase-matching
condition take advantage of the natural birefringence of anisotropic crystals,
but it remains quite a stringent condition. A second technique for achieving
phase matching in nonlinear optics is to modulate the nonlinearity periodically
in one dimension.The resulting grating can be used to
compensate for the wave vector mismatch in a way that is called quasi-phase
matching(QPM). In 1998 Berger proposed extending the idea of QPM to
multiple spatial dimensions in much the same way as conventional linear
gratings have been extended to photonic crystals. The result is that the
phase-matching condition is greatly relaxed. This happens because, if we
consider a peridiocally varying nonlinear
susceptibility X(3)=X(2)(r), we can write it as a
Fourier expansion :
where the sum is extended over the whole 2D reciprocal lattice. Inserting the
expansion in the calculations that brought to the phase-matching condition, the
B vectors enter in the sinus argument leading to the quasi-phase
matching condition
where the reciprocal lattice vector B has been labelled with the two integer coordinates (n,m), given in the (b1,b2) basis of the recirocal lattice. For each vector Bn,m and a prescribed kw there is a unique angle of propagation for the second harmonic. The coupling strength of a phase matching process using Bn,m, results proportional to the corresponding Fourier coefficient of X(2). If a particular Fourier coefficient is zero, then no SHG will be observed in that direction. The first experimental observation of SHG by a photonic crystal were made by Broderick in year 2000 and confirmed the Berger theory.
The figure below shows the setup for optical second harmonic generation in LiNbO3

Figure 2.3: The setup for optical second harmonic generation in LiNbO3.
One example on using SHG as shown in figure 2.4:

Figure 2.4: example: generation of fifth harmonic (5w)
Kerr effect was, first discovered by John Kerr in 1875. Very
few materials are known to behave as doubly refracting at normal
condition and certain material become doubly refracting when placed
in strong electric field. This effect is generally termed as Kerr electro-
optic effect. Since it is observed at high electric field, it is not due to
only first order electric field. This nonlinear effect is considered due
to reorientation and rotation of molecular lattice of the medium,
called as Kerr medium.
P = eceE (For linear medium)
Where e is permitivity of medium and ce is electric susceptibility
P = e(c1 E + c2 E 2 + c3 E 3 +….) (For nonlinear medium)
Where c1 is linear susceptibility tensor, c2 is quadratic tensor
(3 x 3 x 3 matrix) and c3 is cubic tensor (cubic matrix with 3 x 3
matrix at each lattice point).
Since non-linear response time is of the order of 2- 4 femto seconds, by exploiting the Kerr effect, one can
construct Kerr Shutter for many practical purposes. Electrostatic Kerr Shutter
has been used for high-speed photography. All Optical Kerr Shutter (everything
controlled by light) can be used in optical communication where the rate of
pulse (data) transfer is of great importance.
High intensity pulsed laser beam can be used
as pump together with low intensity probe beam in a fiber optic cable. Pump
generates required electric field to change the polarization of low intensity
probe beam. With Kerr Fiber, that preserves polarization for pump, and a
polarizer, femto second Optical Kerr Shutter can be
constructed.
If the Kerr medium is aligned
such that the optical axis is parallel to the applied electric field, then the
birefringence can be explained in terms of Dn,
the difference in the refractive indices of extraordinary (E) and Ordinary (O)
rays by empirical formula.
Dn = K l E2 Where E is electric field. In terms of the phase change,
f = 2LK pV2/d2
Where K is Kerr coefficient, L is the length of electrode in “Kerr Cell”, V is potential difference applied across electrodes, d is interaction length of light in Kerr Cell.
By using a Polariscope one can measure the phase change (f) due to electric field and can measure Kerr Coefficient “K”.
![]() |
Fig 3.1. Without applying voltage

Fig 3.2. With voltage
R
is the plane polarized light vector incident on the field perpendicular to the xy plane and at an angle “i” with
the field. The light is traveling in z direction.
Let, R = a sin wt
Then x = a cos i sin wt
y = a sin i
sin wt
With the electric
field applied,
x = a cos i sin(wt + f1)
y = a sin i sin(wt + f2)
Fig 3.3. Diagrammatic section of the Kerr cell
system showing angular positions along optic axis
Let, Df = f1 – f2
ð
(x2/cos2i)-(2xy cos Df)/(cos i sin i)
+(y2/sin2i) = a2 sin2 Df
Expressing in polar
coordinates by letting r as any
component of the polarized vector R, emergent from the field at an angle q in the xy plane.
x = r cos(q +i)
y =
r sin(q +i)
r2 = (a2 sin2 Df/2) / (1-(1- tan2Df/2)
cos2q) : (equation of ellipse)
With the angle i = 450, the semi-axes of the ellipse are therefore,
A = a cosDf/2 for q = 0 B = a sinDf/2 for q = p/2
At Df = p/2, r2 = a2/2, regardless of q and circular polarization exists.
For p/2 < Df < p, the polarization becomes elliptic again with major and minor axes interchanged, B > A.
For Df = p the polarization becomes plane once more and r = B = a.
If an analyzer is placed at – 45o to the electric field (i.e. q = p/2)
Then transmitivity (T) of linearly polarized beam with its electric field vector at an angle
i = 45o ,
T = sin2 Df/2
This is the condition we are going to use for optical Kerr Shutter.
A high intensity pulsed laser beam (pump) can be sent through
a Kerr medium (optical fiber) along with a probe beam so that
the electric field generated by pump beam changes the
optical properties of the medium. If optical fiber is polarization
preserving for pump beam then with proper adjustment of
polarizer, one can make ultra fast optical shutter. When there
is pump beam, the probe beam passes through fiber, otherwise not.

Fig 3.4. Probe polarization
is perpendicular to the direction of polarizer. Without pump beam,
no light passes through polarizer.

Fig
3.5. Probe polarization is changed by pump beam. With proper choice of pump
intensity the polarization of probe beam can be change by 90o making
shutter open for the duration of pump pulse.
Starting from Maxwell’s equation
![]()
………….
(3.1)
Where, Jf and rf are free current density and charge density. P is electric polarization and M is magnetic polarization. In case of optical fiber Jf = 0 and rf = 0 and M = 0
In general to describe the relation between E and P, quantum mechanical approach is required, but phenomenological approach given by following equation can be used in the medium far from resonance.
P = c(1).E + c(2):EE + c(3) : E.E.E +….. ………… (3.2)
For linear case, P can be expressed as,
…………
(3.3)
Where c(1) is linear susceptibility. If E is a monochromatic plane wave with
…………
(3.4)
Then Fourier transformation P(r, t) becomes
…………
(3.5)
Then linear dielectric constant is given by,
e(k , w) = 1 + 4p c(1) (k , w) ………… (3.6)
In electric dipole approximation, c(1) (r, t) is independent of r. Hence c(1) (k, w) and e(k,
w) are independent of k.
For nonlinear case, P can be expressed in power series of E.
…(3.7)
Within electric dipole approximation c(n) (r,t) is independent of r and hence c(n) (k, w) is independent of k. P can also be written as,
P(w) = P(1)(w) + P(2)(w)+ P(3)(w)
Where superscripts denotes the order of nonlinearity in the polarization vector. It is easier to separate the linear part with nonlinear as,
P(r, t) = PL(r, t) + PNL(r, t).
If there is no free charge,
…………
(3.8)
For the fiber cable like fused silica, molecules are symmetric hence c(2) has no contribution. Hence PNL (r, t) comes from c(3) alone.
Approximations:
So that only linear contribution gives
8.
dielectric constant

With nonlinear contribution
,
e’(w) = 1 + c(1)(w) + eNL(w) , ………… (3.9)
In slowly varying envelop approximation: The rapidly varying part of electric filed as well as polarization vectors can be separated
…………
(3.10)
…………
(3.11)
…………
(3.12)
Where c.c. is complex conjugate. E’(r ,t), PL’(r ,t), PNL’(r ,t) are slowly varying parts, x is polarization unit vector. Putting equation (3.11) into (3.5)
…………
(3.13)
Where, E’ (r, w) is Fourier transform of E’(r, t).
Similarly putting equation (3.12) into (3.7) and considering
nonlinear response instantaneous and time dependence of c(3) as three delta function of the form d(t – t1)
PNL(r, t) = eo c(3) :E(r, t) E(r, t)E(r, t) ………… (3.14)
Dispersion of c(3) can be neglected by assuming instantaneous non-linear response. Then from equation (3.10), (3.12) and (3.14)
P’NL(r, t) = eo eNLE’(r,
t) …………
(3.15)
eNL = ¾ c(3)xxxx | E’ (r,t)| 2 ………… (3.16)
eNL non-linear contribution to dielectric constant
Now Fourier transform of E’(r,t) is given by,
…………
(3.17)
When equation (3.17) is plugged in equation (3.8), we get the equation of the form,
…………
(3.18)
Where,
ko = w/c and e’(w) = 1 + cxx(1)(w) + eNL ………… (3.9)
Defining,
n’(w) = n(w)
+ n2|E’|2 …………
(3.19)
Also using e= (n’ + ia/2ko)2 and from equations (3.15), (3.9) and (3.19), and neglecting (n2)2 terms we get,
n2= (3/8n) c(3)xxxx ………… (3.20)
Considering two waves having frequencies w1 and w2 coupled together, the x component of resulting electric field is given by,
…………
(3.21)
Where E1 and E2 are the x
components of the coupling waves. Substituting equation (3.21) in (3.14) and
neglecting the dispersion of cxxxx,
.… (3.22)
Where,
PNL(w1) = ceff (|E1|2
+ 2 |E2|2) E1
PNL (w2)
= ceff
(|E2|2+ 2 |E1|2) E2
PNL (2 w1-
w2) = ceff E12
E2* …………
(3.23)
PNL (2 w2- w1) = ceff E22 E1*
ceff = ¾ eo c(3)xxxx
Phase matching condition has to be satisfied to build up (2w1- w2) and (2w2- w1) frequency components significantly as in for four waves mixing. In general special arrangement is required to satisfy this condition. Hence we neglect those terms containing (2w1- w2) and (2w2- w1).
PNL(wj) = eoejNLEj for (j = 1,2.)
P (wj)
= eoejEj …………
(3.24)
Where,
![]()
………… (3.25)
Here, n = n1 = n2 has been assumed. The phase difference is
given by,
………… (3.26)
Here, first term is Self Phase Modulation (SPM) and second term is Cross Phase Modulation (XPM).
………… (3.27)
Where, Ex and Ey are complex amplitudes.
In isotropic medium
………… (3.28)
With Px and Py given by,
……… (3.29)
Where, i, j = x or y, and
c(3)xxxx = c(3)xxyy + c(3)xyxy + c(3)xyyx …………
(3.30)
Here, c(3)xxyy , c(3)xyxy
and c(3)xyyx are three independent and non zero
components of c(3), which depend on physical mechanism that contribute to
c(3). In silica fiber,
electronic contribution is dominant and all of these components are nearly
equal in magnitude.
…………
(3.31)
Again the last terms of equation (3.31) is analogous to the four
waves mixing and contributes negligibly when the length of the fiber L >>
LB , Where,
is called beat length of the fiber.
Then,
………… (3.32)
The phase difference between x and y component of probe at the output of the fiber of length L is given by
………… (3.33)
Considering the case of pump linearly polarized along x-axis and probe field 45o to pump field and also neglecting the probe contribution. From equation (3.25),
Dnx
= 2n2|Ep|2 , Where, |Ep|2 is pump intensity.

Recalling equation (3.29)
……...(3.29)
There is no pump field component in y-direction. Only first term in the equation (3.29), hence the first term of equation (3.31), contribute to Py and again neglecting self-contribution,
Dny = 2n2
b |Ep|2
Where, b = c(3)xxyy / c(3)xxxx and b= 1/3 for silica fiber
From equation (33),
………… (3.34)
n2B is the Kerr Coefficient of the fiber.
The probe transmitivity Tp is now given by.
…………
(3.34)
Thus, by changing the pump intensity (Ep), fiber length (L) or wavelength of probe (l), transmission of the probe signal through fiber can be controlled.
Kerr Shutter can be used to measure the speed of light. A laser beam is chopped by a fast shutter (the Kerr cell, with the pulser). The oscilloscope traces a rising signal, starting at some definite time after the oscilloscope trace is triggered by the pulser. The delay in receiving the signal can be observed which is partly in the electronics, but partly also in the travel time necessary for the light pulse to reach the photodiode detector. By changing the light path the delay changes by an amount proportional to the change in distance. The constant of proportionality is just the speed of light.

Figure 3.6: The delay is large for longer path length and small for shorter path length
In 1958 Froome got the value of c = 299,792.5 km/s using a microwave interferometer and a Kerr cell shutter. Now adopted value of c = 299,792.458 km/s (1983)
Kerr Shutter can be used as fluorescence rejection system in spectroscopic measurement. If the spectroscopic signals are collected for longer time, fluorescence will pervade the real signal. If the spectroscopic excitation pulse and the Kerr Gating pulse (Pump) are synchronized, pure signal can be collected.

Raman
spectra of acetonitrile and the laser dye DCM
measured fig. (a) with gate and fig. (b) without gate for 1 mM DCM concentration. The probe wavelength was 588 nm
(1 ps, 4.6 mJ) and the
accumulation time 400 s.
The term "soliton" was
introduced in the 1960's, but the scientific research of solitons had started
in the 19th century[1] when
John Scott-Russell observed a large solitary wave in a canal near
Solitons are very stable solitary waves in a solution of those equations. As the term "soliton" suggests, these solitary waves behave like "particles". When they are located mutually far apart, each of them is approximately a traveling wave with constant shape and velocity. As two such solitary waves get closer, they gradually deform and finally merge into a single wave packet; this wave packet, however, soon splits into two solitary waves with the same shape and velocity before "collision".

Figure 4.1: Solitons
The stability of solitons stems from the delicate balance of "nonlinearity" and "dispersion" in the model equations. Nonlinearity drives a solitary wave to concentrate further; dispersion is the effect to spread such a localized wave. If one of these two competing effects is lost, solitons become unstable and, eventually, cease to exist. In this respect, solitons are completely different from "linear waves" like sinusoidal waves. In fact, sinusoidal waves are rather unstable in some model equations of soliton phenomena.
So when both the dispersive and the nonlinear term are present in the equation the two effects can neutralize each other. If the water wave has a special shape the effects are exactly counterbalanced and the wave rolls along undistorted. The soliton shape can be expressed by equation (4.1) as follows:
u(x,t) = a sech2[b(x-vt)], (4.1)
with b=(a/12)1/2 and v=3a. The constant a is the only free parameter in the solution. It defines the amplitude and the width in such a way that a large (tall) soliton will be narrow, while a low soliton will be broad. The constant v defines the velocity of the soliton. Since v=3a a tall soliton will move faster than a low one.

Figure 4.2: Sine wave
Solitons can be classified as follows:
Pulses of light with a certain shape and energy that can propagate unchanged over large distances.
Pulses of "darkness" within a continuous wave, where the pulses are of a certain shape, and possess propagation properties similar to the bright solitons.
Continuous wave beams or pulses, with a transverse extent of the beam that via the refractive index change due to optical Kerr-effect can compensate for the diffraction of the beam. The optically induced change of refractive index works as an effective wave guide for the light.
Also Solitons can be classified into:
Which represents the family of pulses that doesn’t change its shape as it propagates.
Represents those pulses that undergo periodic changes in its shape as it propagates.
It is important to understand that an optical soliton cannot be propagated anywhere except in mater like optical fibers which possess two main properties such as:
Ø The Group Velocity Dispersion (GVD)
Ø Non-linearity term (the Kerr effect)
The
group velocity dispersion
is related to
the introduced dispersion parameter ![]()
as:
(4.2)
and hence the sign of the group velocity dispersion is the opposite of the sign of the dispersion parameter β. In order to get a qualitative picture of the effect of linear dispersion, let us consider the effect of the sign of β:
β> 0:
For this case, the group velocity dispersion is negative, since
(4.3)
This implies that the group velocity decreases with an increasing angular frequency. In other words, the “blue” frequency components of the pulse travel slower than the “red” components. Considering the effects on the pulse as it propagates, the leading edge of the pulse will after some distance contain a higher concentration of low (“red”) frequencies, while the trailing edge rather will contain a higher concentration of high (“blue”) frequencies. This effect is illustrated in Fig. 4.3.

Figure 4.3. Pulse propagation in a linearly dispersive medium with β > 0.
Whenever “red” frequency components travel faster than “blue” components, we usually associate this with so-called normal dispersion.
β < 0:
For this case, the group velocity dispersion is instead positive, since now
(4.4)
This implies that the group velocity increases with an increasing angular frequency. In other words, the “blue” frequency components of the pulse now travel faster than the “red”
components. Considering the effects on the pulse as it propagates, the leading edge of the pulse will after some distance hence contain a higher concentration of high (“blue”) frequencies, while the trailing edge rather will contain a higher concentration of low (“red”) frequencies.
This effect, being the inverse of the one described for a negative group velocity dispersion, is illustrated in Fig. 4.4.
Figure 4.4. Pulse propagation in a linearly dispersive medium with β < 0.
Whenever “blue” frequency components travel faster than “red” components, we usually associate this with so-called anomalous dispersion.
Notice that depending on the distribution of the frequency components of the pulse as it enters a dispersive medium, the pulse may for some propagation distance actually undergo pulse compression.
For β > 0, this occurs if the leading edge of the pulse contain a higher concentration of
“blue” frequencies, while for β < 0, this occurs if the leading edge of the pulse instead contain a higher concentration of “red” frequencies.
Where a Gaussian wave propagating in the presence of SPM
only can be shown as in Fig.4.5 (Note that the envelope remains the same, but
there is a chirping effect):

Figure 4.5: Gaussian wave propagating in the presence of SPM only.
These two properties (GVD and Kerr effect) give us the non-linear Schrödinger equation:
(4.5)
Where q is the complex amplitude of he pulse, z represents the distance along the direction of propagation, t the time.
The second term is originated from the GVD and the third is due to the Kerr effect. Hasegawa in 1973 was the first to show that a pulse of light in a fiber suit exactly to the NLS, to solve this equation and to introduced the optical soliton as solution.
The soliton comes from the exact cancellation between the GVD and the non-linear effect. As a result, a soliton can travel without spreading its shape, through a fiber over thousands kilometers.

Figure 4.6: Solitary wave propagating through non-linear, dispersive
medium. The pulse shape does not change with time.
As a light beam with some limited spatial extent in the transverse direction enter an optical Kerr media, the intensity variation across the beam will via the intensity dependent refractive index n = n0 +n2I form a lensing through the medium. Depending on the sign of the coefficient n2 (the nonlinear refractive index"), the beam will either experience a defocusing lensing effect (if n2 < 0) or a focusing lensing effect (if n2 > 0); in the latter case the beam itself will create a self-induced waveguide in the medium (see Fig. 4.7).
Figure 4.7. An illustration of the effect of self-focusing.
As being the most important case for beams with maximum intensity in the middle of the beam (as we usually encounter them in most situations), we will focus on the case n2 > 0. For this case, highly intense beams may cause such a strong focusing that the beam eventually break up again, due to strong diffraction effects for very narrow beams, or even due to material damage in the nonlinear crystal.
For some situations, however, there exist stationary solutions to the spatial light distribution that exactly balance between the self-focusing and the diffraction of the beam. We can picture this as a balance between two lensing effects, with the first one due to self-focusing, with an effective focal length Lfoc (see Fig.4.8), and the second one due to diffraction, with an effective focal length of Ldefoc (see Fig. 4.9).

Figure 4.8. Self-focusing seen as an effective lensing of the optical beam.

Figure 4.9. Diffraction seen as an effective defocusing of the optical beam.
Whenever these effects balance each other, we in this picture have the effective focal length Lfoc + Ldefoc = 0.
In the electromagnetic wave picture, the propagation of an optical continuous wave in optical Kerr-media is governed by the wave equation:
(4.6)
For simplicity we will from now on consider the spatial extent of the beam in only one transverse Cartesian coordinate x.
By introducing the spatial envelope Aw(x; z) according to
(4.7)
and using the slowly varying envelope approximation in the direction of propagation z, the wave equation takes the form
(4.8)
First, whatever is the bits’ format to send, it is important to precise that different wavelengths (colors) travel at the same time in the same fiber (with approximately the same speed), the multiplexing, more or less dense (DWDM), is made to send more data at the same time.
Actually most of transmission use NRZ (Non Return to Zero) and RZ (Return to Zero) method.(see figure 4.10)

Figure 4.10: Schematic illustration of NRZ and RZ format for a bit sequence
Both NRZ and RZ system are limited by the spreading of their shape, which requires a large pulses separation and fiber spans of limited length. This also imply to use electronic components at the end of each span (to combat the deformation of the bits), to receive the signal, to analyze it, then to rebuild it and to send it in the new span.
But nowadays, recent applications use more data such as videoconference, Internet etc…which imply an increasing request in higher rate system and what are the physical solutions?
First of all, a physical approach is to increase the frequency (data rate) in a channel.
For that we send pulses with less separation between them. A problem occurs: the broadening which gets worse as data rate increase (GVD).

Figure 4.11 What’s happens with higher frequency
Secondly, another way should be to increase the multiplexing in one band (add new colors). If we move away from 1550 nm, we are in the area where the losses of the fiber are more important so we have to add more power at the injection, and problems of NLS appear.
Finally, a conceivable solution should be to add another band (L band at longer wavelength) to increase the data rate, but a new problem occurs: the Raman effect which should distort the actual C band transmission (the L band pump the energy of the C band). As a conclusion of actual system, to go from 2.5 Gbit/s to 10 (or even to 40 Gbit/s we need more sensitive receivers and spans less long. But it is too expensive!
The soliton transmission system is a RZ transmission and the time of the pulse is in order to picosecond. This kind of transmission is characterized by two main advantages. First there is a slow rate of change of the pulse, so there is no need of transmitters and receivers but only optical amplifiers. Thus the cost of the installation is less expensive than a classical communication link. Moreover, the fact that there is not use of electronic components eliminates the problem relative to different kind of encoding between countries.
Secondly, we know that soliton travel in non-linear medium, this is important too because it allows higher power to be injected into the fiber. These two advantages induce in 8 dB of extra power margin (5 dB due to better signal over noise and 3 dB from higher power injection). Thus, the span can be longer and we can broaden the width of multiplexing (to move away from 1550 nm, in area where the losses of the fiber are a little bit more important).
With the first experiments, researchers conclude that solitons were suitable at higher bit rate only on submarine links but not for DWDM system (dense wavelength division multiplexing) on terrestrial link because of four-wave mixing (FWM).
The FWM results from interaction between different wavelength (multiplexing). That interaction generates additional optical carriers and is proportional to the spacing of the channel in DWDM system. To combat the FWM, we need a high dispersion when to avoid pulse broadening, a low average dispersion is required.
Technically we will have to use spans with more of 20 ps/nm.km of dispersion but with a total average of a few ps/nm/Km.
Recently, Mollenauer (in May 2000) succeeded to transmit solilton at a rate of 270 Gbit/s along 9000 km staying in the error free domain (less than 1 error for 109 bits).
For that, he has used DSM (Dispersion Shift Management) fibers, filter sliding system, a channel spacing of 75 GHz and 27 channels (range from 1542.5 nm to 1558.1 nm). The amplification was realized with erbium doped fibers and Raman gain control system. Despite the fact that the optical amplification is still a problem, this test is a great success for solitons that have really broken the rules of the actual other techniques’ limits.
Furthermore, Mollenauer expects to send information at 1 THz with the same range but only 0.4 nm spacing, underlying that last experiment was only limited by technical problems.
The present on-going revolution in photonics could see its summit with the ultimate all-optical device. Simply imagine a transparent cube like that of Fig.1 with a myriad of interconnections and components created by light alone. This is our dream. Circuits that lie on top of one another and are reconfigurable. Virtual circuitry. No physical "wires". Either light itself directs and manipulates light without any intervening fabricated components such as optical waveguides, or light is used to write ideal optical components and circuitry in photosensitive materials. Light controls its own destiny.
The story of guiding light with light and of creating virtual circuitry is not just about an emerging technology and the ingenious efforts on the part of many who are presently attempting to make it a reality. Research into this field has also revealed new conceptual and experimental approaches for understanding how curious light beams, known as optical spatial solitons, can be made to remain localized in space while performing some rather amazing acrobatics. Advances of late have been remarkably swift, coming from different groups across the globe: Spatial solitons, once the domain of high power lasers, can now be launched by an incandescent light bulb. Soliton dynamics, once the province of esoteric mathematics, is now accessible with undergraduate physics. Mere theoretical predictions of a few years ago, such as the possibility of one soliton being made to spiral about another, the fusion or the creation of solitons upon collision, and the transportation of a dim beam by a bright beam are now readily observable in the laboratory. Even popular science magazines have questioned whether fabricated optical components would eventually become obsolete for device applications.

Solitons from a Linear Perspective
The building blocks for light guiding light are free-standing beams, known as spatial solitons. Unlike linear waves which diffract, solitons create their own channel as they travel in a uniform nonlinear medium, remaining localized and preserving their shape. Beams in a linear medium do not influence each other. But solitons can attract, repel, spiral around each other and this interaction can even be described by the classical force laws treating the beams as particles with mass Whereas linear waves always pass through one another, solitons can be dramatically altered by collisions. They can annihilate one another, fuse (Fig. 2) or give birth to multiple solitons. These phenomena turn out to be of potential importance to the emerging technology of light guiding light and light written circuitry.

From this elementary perspective, we can appreciate that disparate types of solitons are actually the same animal. In the simplest case, a soliton is one mode of the waveguide it induces. A soliton can be two or more modes of the induced waveguide. This elaboration explains interesting soliton dynamics, incoherent solitons, multi-humped solitons and the coexistence of different classes of solitons.
Now, if a soliton can be composed of a number of
modes, each travelling at a different speed, then it
should be possible to decompose the soliton into its constituent modes in exact
analogy to
The fact that nonlinear propagation has a linear waveguide equivalent provides a powerful conceptual tool, one that guides us in a physical manner to the fundamental equations and to their solutions. It allows us to predict novel phenomena, motivate light written circuitry, and foreshadow the design of lossless waveguide components. Put simply, all soliton dynamics have a linear waveguide analogue, albeit some unusual shaped waveguide system. Vice versa, every linear waveguiding phenomenon has its soliton equivalent in some nonlinear medium. A self-consistency relation unites the linear and nonlinear equivalents.
A Simple Model of Soliton Dynamics
One major challenge is to find a simple analytical description of solitons and their interactions. To achieve this, we need only borrow from the literature of the linear harmonic oscillator.
Because every linear optical waveguide has a soliton equivalent, it is natural to first consider the simplest optical waveguide possible, one whose refractive index falls off parabolically. Light beams obey the linear harmonic oscillator in this medium. This reveals that Gaussian beams remain Gaussian shaped as they propagate. In general the beams undulate periodically, undergoing periodic trajectories.
Now, according to the above linear perspective, Gaussian shaped solitons must also exist in some homogeneous nonlinear medium with the same behaviour as beams in a parabolic index optical waveguide. The particular nonlinear medium is found by using the self-consistency relation. Several candidates exist. But the simplest medium is one whose nonlinear induced refractive index change depends on the beam total power only. This arises, for example, when the medium has a nonlocal response with a correlation length that is much larger than the beam diameter. In such a medium, Gaussian shaped soliton beams remain Gaussian and they are unaltered by colliding with one another.
For a special beam radius and power, a Gaussian beam will propagate without change. Such a beam is called a stationary soliton. It induces a graded index optical fiber which can guide a signal beam. All other beams "breathe" as they propagate with their radius oscillating periodically (Fig. 3).

What happens to two stationary solitons that are initially launched in parallel to each other? In a homogeneous linear medium they would diffract as they travel in a straight trajectory. In this nonlinear medium they can attract and undergo periodic collisions with one another or, if launched skew to each other, spiral about each other as shown in Fig. 4. Finally, a distant "dim" beam can remain localized and be guided and steered by a "bright" soliton beam.

Solitons as Bundles of Classical Particles
Most experiments to date have involved comparatively narrow solitons launched by a coherent source. At the other extreme it is possible to have comparatively large incoherent solitons. And, in a beautiful experiment, Mitchell and Segev have launched them from an ordinary incandescent light bulb!
Such "big" incoherent solitons can be
very neatly viewed as being composed of an enormous number of modes of the multimoded waveguide they induce. But, recall that diffuse
light propagation along multimoded waveguides can be
described by classical geometric optics. So incoherent solitons can also be viewed
as bundles of rays, each ray obeying the paraxial ray equation, or equivalently
as a bundle of classical (non-interacting) point particles, each particle
obeying
This leads to predictions that are unique to incoherent solitons. For example, they can have any shape in their two-dimensional cross-section, even travel in parallel without interacting, unlike coherent beams in the same intensity dependent medium.
Temporal Solitons for Telecommunications
It is insightful to contrast spatial with temporal optical solitons. Temporal solitons are pulses that propagate along optical glass fibersfor long distance telecommunications. Here the material nonlinearity is only weakly perturbed. This is a one-dimensional problem. Whereas, spatial solitons envisaged for device applications in bulk material are typically quasi-monochromatic beams that are localized in two-transverse dimensions and propagate only for millimeters. These beams sufficiently alter the refractive index of the bulk material to actually create their own waveguides. The extra dimension brings additional riches such as the possibility for beams to spiral around each other, but it also demands that the nonlinearity be saturating or nonlocalized if the beams are to be both stable and localized in space.
Device and Logic Applications: Switching Light with Light
The dream of photonics is to have a completely optical technology. Here the traditional carriers of information, electrons, are envisaged to be replaced by photons for devices based on switching and logic. Spatial solitons offer one potential way to achieve this dream. We have described how waveguides are induced by solitons. The challenge is to develop methods for controllable steering of these waveguides by light itself and to produce reconfigurable waveguides.
Because solitons can attract and guide beams, light can be used to switch light for various device and logic applications which are presently being performed by electronics. The solitons can be considered as the information flow itself or as inducing optical waveguides in which the information is carried. This information can take the form of a weak ("dim") probe beam at a different wavelength or different polarisation than the ("bright") soliton beam. In either scheme, intricate virtual circuitry can be written in bulk nonlinear media. Depending on the material, the circuitry has a life span which allows for the possibility of self-reconfigurable circuits. Such plasticity opens the door to adaptive circuits that can be designed to transform themselves to the desired application. We are led to the image of a transparent cube with thousands of dynamically interconnected "wires" all created, maintained and organised by light itself.
The speed required for switching depends on the application. It can be as slow as seconds for circuit reconfiguration in network application or as fast as picoseconds for optical computing. The steering of one soliton by another or of a weak signal beam by a soliton forms the simplest case of spatial switching. Alternatively, the coupling ratio of a soliton induced coupler can be adjusted by changing the pump signal in one arm. Structures can be created to tap a signal, make a copy or reroute it and these processes can be made dynamic. It is also possible for two colliding solitons to fuse or for a soliton to be split into two solitons by a weak probe beam, thus creating additional forms of spatial switching
Designer components and light written circuitry
We have shown how soliton dynamics can be approached from the perspective of linear waveguides. Curiously the reverse is also true. The phenomenon of soliton dynamics provides a method of actually fabricating waveguide components whose design had not even been foreshadowed from our knowledge of linear waveguides. The concept is elementary. Solitons are allowed to interact in the appropriate photosensitive material so that the desired induced waveguide configurations are then permanently written.
For example, consider the optical device known as the X-junction. This is one of many building block components used for processing optical signals. It can be used to mix or split two signals in any desired proportion. Figure 5 shows a signal propagating through a soliton induced X-junction. In this particular case, the junction is designed to be completely transparent.

Light written circuitry offers potential advantages over the more conventional fabrication processes such as ion diffusion, PECVD, and sol-gel. In some materials beams fuse upon colliding. But, unless beams collide at virtual grazing incidence, they always pass through one another without influencing each other. In this way it is possible to compress circuitry into a compact space with many circuits sharing the same physical location. Furthermore, certain photosensitive materials offer the potential for erasing one light written device and replacing it by another. Hence, we have the building blocks for dense reconfigurable virtual circuitry.
In Nonlinear Optics we are concerned about the study of interaction of light in matter, we can see that for a linear optics the material acts in a linear way regarding polarization, because in the linear region the electric field is much lower than Intra-Atomic field, and the refractive index “n” is independent from the light intensity “I”. Whereas in Nonlinear Region the Electric field is approximately equal to Intra-Atomic field resulting with modified electron distribution, so the material reacts nonlinearly with the light, and “n” depends on “I”.
Nonlinear optical phenomena (such as the Kerr effect and second-harmonic generation) could be not bad, if we think about all gains caused by nonlinear devices, so there are many useful applications that benefit from this nonlinear behavior. Applications of this effective nonlinearity is soliton generation (Solitons can exist when dispersion and nonlinearity counteract one another), mode-locking of lasers frequency-shifting, optical switching, and compression of pulses, etc...
Virtual circuitry could be another beautiful application, like a transparent cube with a myriad of interconnections created, maintained, and arranged by light itself. So we can control “n” by the light itself or manipulate one beam with the other, and this leads to a great variety of technical innovations.
v Applied Nonlinear Optics, Frits Zernike and John Midwinter, 1973, John Wiley.
v Optics and Lasers including fibers and integrated optics, M. Young, 2nd revised edition, 1984, Springer-Verlag.
v Laser Age in Optics “translated from Russian”, L.V. Tarasov, 2nd printing 1982, Mir publishers-Moscow.
v Fiber optic Communication, Joseph Palaris, 4th edition 1998, Prentice Hall.
v Optical Fiber Communications, Gerd Keiser, 3rd International Edition 2000,McGraw-Hill.
v
Lecture Notes on Nonlinear Optics,
Fredrik Jonsson, Department of Laser Physics and
Quantum Optics,
v Nonlinear Electro-Optic Effect and Kerr Shutter, Electrodynamics- II
project work, (Advisor: Prof. C.D. Lin)
By Jagat Shakya and Mim Lal Nakarmi,
Department of Physics, Kansas State University, Manhattan, KS 66502, April
2001.
v
Discrete Spatial Optical Solitons in Waveguide Arrays, H. S. Eisenberg and Y. Silberberg R. Morandotti, A. R. Boyd, and J. S. Aitchison,
PHYSICAL REVIEW LETTERS,VOLUME 81, NUMBER 16, 19 OCTOBER 1998
v Optical Solitons, Mr. FOURRIER Jean-christophe, Mr. DUREL Cyrille, Quantum Electronics, Lecturer : Dr. Jean-paul MOSNIER, Applied Physics, Year 4, 2000
v
Light Guiding Light. (Let light be the
master of its own destiny)
by Allan W Snyder and François Ladouceur
(http://people.deas.harvard.edu/~jones/ solitons/papers/lgl/lgl.html)
v Nonlinear optical interactions in optical fibers, R. Boyd and E. Buckland, Journal of nonlinear optical physics and Materials, Vol.7 No.1 1998, pp105-112.
v http://photonics.ece.umd.edu/pubs/misc-presentations/MP-1/solitons.pdf
v
www.sccs.swarthmore.edu/users/02/lisal/
physics/presentations/soliton.pdf
v
www.pma.caltech.edu/Courses/ph136/
yr2002/chap09/0209.1.pdf
v
http://www.elettra.trieste.it
v
http://www.fact-index.com/s/so/soliton.html
v
http://www.ma.hw.ac.uk/solitons/
v
homepages.tversu.ru/~s000154/collision/main.html
v
www.ma.hw.ac.uk/solitons/press.html
v
http://www.lib.kth.se
v http://photonics.ece.umd.edu/pubs/misc-presentations/
MP-1/solitons.pdf
v http://physics.usc.edu/~vongehr/solitons_html/solitons.pdf
Until very recently, physicists were quite content about their understanding of wave motion. They had theories that explained all of their observations. Every type of wave motion possible had been studied, including that resulting from vibrating strings, sound waves, water waves and electromagnetic waves. In fact, most of the information that we receive comes from wave motion.
In the mid sixties at
In 1973 Hasegawa and Tappert proposed that such solitons could be used in optical communications systems. Researchers had already been working on the possibility of using optical fibers as the conduit for long distance communications . Such systems have a greater transmission rate than conventional systems using cables. In the 1970's optical fibers were designed, which had relatively low losses. However, they still had to overcome the fact that wave pulses in optical fibers tended to broaden due to dispersion. This spreading was detrimental, since well separated pulses at the transmitter could begin to overlap by the time they had reached the receiver, causing the information being transmitted to become jumbled. However, solitons had the advantage that the information could be transmitted with very little loss or dispersion.
The Great Wave of Translation
The story begins in 1834 on the bank of the Edinburg-Glasgow canal in
"I believe that I shall best introduce this phenomenon by decribing the circumstances of my own first aquaintance with it. I was observing the motion of a boat which was rapidly drawn along a narrow channel by a pair of horses, when the boat suddenly stopped - not so the mass of water in the channel which it had put into motion; it accumulated round the prow of the vessel in a state of agitation, then suddenly leaving it behind, rolled forward with great velocity, assuming the form of a large solitary elevation, a rounded, smooth and well-defined heap of water, which continued its course along the channel apparently without change of form or diminution of speed. I followed it on horseback, and overtook it still rolling on at a rate of some eight or nine miles an hour, preserving its original feature some thirty feet long and a foot to a foot and a half in height. Its height gradually diminished, and after a chase of one or two miles I lost it in the windings of the channel. Such in the month of August 1834, was my first chance interview with that singular and beautiful phenomenon which I have called the Wave of Translation, a name which it now very generally bears."

Russell was so impressed by the incident, that he went back to his lab and made a model of the canal, with which he could try to reproduce and study water waves of the same type.
In 1955 Fermi, Pasta, and Ulam
were studying the finite heat conductivity of solids. Their model of a solid
consisted of a set of masses connected by springs. In such a model the masses
are the molecules of the solid and the springs represent the forces holding the
molecules together. It was known that if the springs behaved according to Hooke's law, the same law governing the behavior of common
springs, then heat can flow through the solid without there being a temperature
difference between its ends. In this case the thermal conductivity is infinite.
Fermi, Pasta and Ulam undertook a numerical study of
a one-dimensional lattice at
What they had found instead was that the energy
flowed back and forth throughout all the modes and eventually recollected near
the initial state. This surprised everyone, as the system did not behave as
expected. In 1967 Zabusky and Kruskal
at
The dispersive characteristics of a fiber can be
minimized in the operating region of the
system by changing the core radius and the difference in the refractive indices
of the core and the cladding. The main parameter that characterizes dispersion
is the rate of change of the group velocity with respect to the frequency. This
is known as group velocity dispersion, or GVD. The wavlength
at which there is minimum dispersion is the zero-dispersion wavelength. For
wavelengths below this point, the fiber exhibits positive, or normal, GVD;
while, for larger wavelengths, the disperion is known
as negative, or anomalous, GVD.
The relation between the polarization and the applied electric field has been assumed to be linear. However, with the advent of the laser, which produces large electric fields, nonlinear effects should also be included. One type of nonlinear effect, the Kerr effect, occurs when the medium becomes anisotropic due to the polarization of the molecules in the presence of an applied electric field. In particular, the index of refraction is found to depend on the square of the amplitude of the applied electric field. If the optical fibers are designed appropriately, the dispersive and nonlinear effects can be balanced, thereby allowing for the possibility of a localized pulse to propagate through the fiber without distortion due to either effect.
In 1973 Hasegawa and Tappert had proposed that soliton pulses could be used in optical communications through the balance of nonlinearity and dispersion. They showed that these solitons would propagate according to the nonlinear Schrodinger equation (NLS), which had been solved by the inverse scattering method a year earlier by Zakharov and Shabat. At that time there was no capability to produce the fibers with the proper characteristics for doing this and the dispersive properties of optical fibers were not known. Also, the system required a laser which could produce very small wavelengths, which also was unavailable. It wasn't until seven years later, when Mollenauer, Stollen and Gordon at AT&T Bell Laboratories had experimentally demonstrated the propagation of solitons in optical fibers.
The original communications systems employed pulse trains with widths of about one nanosecond. However, there was still some distortion due to fiber loss. This was corrected by placing repeaters every several of tens of kilometers. As the width of the available pulses was decreased, the spacing of the repeaters was increased. In the mid 1980's it was proposed that tby sending in an additional pump wave along the fiber, the dispersion of a soliton could be halted through a process known as Raman scattering. In 1988 Mollenauer and his group had shown that this could be done by propagating a soliton over 6000 km without the need for repeaters.
Research into the use of solitons as information carriers in optical systems is
still being
heavily researched. Soliton pulses are not immune to
fiber losses, leading to pulse broadening. As the pulses broaden, neighboring solitons will overlap and this overlap is not fully understood.
Systems, which allow the propagation of envelope solitons,
operate in the region of negative group velocity dispersion (GVD). However, for
positive GVD it has been recently found that a different type of soliton, the so-called dark soliton,
can propagate through optical fibers. The characteristics of these fibers is
currently being investigated. These are just a few of the directions that
theoretical and experimental investigations have taken over the last decade in
this rapidly growing field. It is clear that solitons
will play a major role in the next generation of optical communication systems.