Gravitational lensing formalism

From testwiki
Jump to navigation Jump to search

Template:Gravitational Lensing Template:Broader

In general relativity, a point mass deflects a light ray with impact parameter b by an angle approximately equal to

α^=4GMc2b

where G is the gravitational constant, M the mass of the deflecting object and c the speed of light. A naive application of Newtonian gravity can yield exactly half this value, where the light ray is assumed as a massed particle and scattered by the gravitational potential well. This approximation is good when 4GM/c2b is small.

In situations where general relativity can be approximated by linearized gravity, the deflection due to a spatially extended mass can be written simply as a vector sum over point masses. In the continuum limit, this becomes an integral over the density ρ, and if the deflection is small we can approximate the gravitational potential along the deflected trajectory by the potential along the undeflected trajectory, as in the Born approximation in quantum mechanics. The deflection is then

α^(ξ)=4Gc2d2ξdzρ(ξ,z)b|b|2,bξξ

where z is the line-of-sight coordinate, and b is the vector impact parameter of the actual ray path from the infinitesimal mass d2ξdzρ(ξ,z) located at the coordinates (ξ,z).[1]

Thin lens approximation

In the limit of a "thin lens", where the distances between the source, lens, and observer are much larger than the size of the lens (this is almost always true for astronomical objects), we can define the projected mass density

Σ(ξ)=ρ(ξ,z)dz

where ξ is a vector in the plane of the sky. The deflection angle is then

α^(ξ)=4Gc2(ξξ)Σ(ξ)|ξξ|2d2ξ
Angles involved in a thin gravitational lens system.

As shown in the diagram on the right, the difference between the unlensed angular position β and the observed position θ is this deflection angle, reduced by a ratio of distances, described as the lens equation

β=θα(θ)=θDdsDsα^(Ddθ)

where Dds is the distance from the lens to the source, Ds is the distance from the observer to the source, and Dd is the distance from the observer to the lens. For extragalactic lenses, these must be angular diameter distances.

In strong gravitational lensing, this equation can have multiple solutions, because a single source at β can be lensed into multiple images.

Convergence and deflection potential

The reduced deflection angle α(θ) can be written as

α(θ)=1πd2θ(θθ)κ(θ)|θθ|2

where we define the convergence

κ(θ)=Σ(θ)Σcr

and the critical surface density (not to be confused with the critical density of the universe)

Σcr=c2Ds4πGDdsDd


We can also define the deflection potential

ψ(θ)=1πd2θκ(θ)ln|θθ|

such that the scaled deflection angle is just the gradient of the potential and the convergence is half the Laplacian of the potential:

θβ=α(θ)=ψ(θ)
κ(θ)=122ψ(θ)

The deflection potential can also be written as a scaled projection of the Newtonian gravitational potential Φ of the lens[2]

ψ(θ)=2DdsDdDsc2Φ(Ddθ,z)dz

Lensing Jacobian

The Jacobian between the unlensed and lensed coordinate systems is

Aij=βiθj=δijαiθj=δij2ψθiθj

where δij is the Kronecker delta. Because the matrix of second derivatives must be symmetric, the Jacobian can be decomposed into a diagonal term involving the convergence and a trace-free term involving the shear γ

A=(1κ)[1001]γ[cos2ϕsin2ϕsin2ϕcos2ϕ]

where ϕ is the angle between α and the x-axis. The term involving the convergence magnifies the image by increasing its size while conserving surface brightness. The term involving the shear stretches the image tangentially around the lens, as discussed in weak lensing observables.

The shear defined here is not equivalent to the shear traditionally defined in mathematics, though both stretch an image non-uniformly.

Effect of the components of convergence and shear on a circular source represented by the solid green circle. The complex shear notation is defined below.

Fermat surface

There is an alternative way of deriving the lens equation, starting from the photon arrival time (Fermat surface)

t=0zsndzccosα(z)

where dz/c is the time to travel an infinitesimal line element along the source-observer straight line in vacuum, which is then corrected by the factor

1/cos(α(z))1+α(z)22

to get the line element along the bended path dl=dzccosα(z) with a varying small pitch angle α(z), and the refraction index Template:Math for the "aether", i.e., the gravitational field. The last can be obtained from the fact that a photon travels on a null geodesic of a weakly perturbed static Minkowski universe

ds2=0=c2dt2(1+2Φc2)(1+2Φc2)1dl2

where the uneven gravitational potential Φc2 drives a changing the speed of light

c=dl/dt=(1+2Φc2)c.

So the refraction index

ncc(12Φc2).

The refraction index greater than unity because of the negative gravitational potential Φ.

Put these together and keep the leading terms we have the time arrival surface

t0zsdzc+0zsdzcα(z)220zsdzc2Φc2.

The first term is the straight path travel time, the second term is the extra geometric path, and the third is the gravitational delay. Make the triangle approximation that α(z)=θβ for the path between the observer and the lens, and α(z)(θβ)DdDds for the path between the lens and the source. The geometric delay term becomes

Ddc(θβ)22+Ddsc[(θβ)DdDds]22=DdDsDdsc(θβ)22.

(How? There is no Ds on the left. Angular diameter distances don't add in a simple way, in general.) So the Fermat surface becomes

t=constant+DdDsDdscτ,τ[(θβ)22ψ]

where τ is so-called dimensionless time delay, and the 2D lensing potential

ψ(θ)=2DdsDdDsc2Φ(Ddθ,z)dz.

The images lie at the extrema of this surface, so the variation of τ with θ is zero,

0=θτ=θβθψ(θ)

which is the lens equation. Take the Poisson's equation for 3D potential

Φ(ξ)=d3ξρ(ξ)|ξξ|

and we find the 2D lensing potential

ψ(θ)=2GDdsDdDsc2dzd3ξρ(ξ)|ξξ|=i2GMiDisDsDic2[sinh1|zDi|Di|θθi|]|DiDs+|Di0.

Here we assumed the lens is a collection of point masses Mi at angular coordinates θi and distances z=Di. Use sinh11/x=ln(1/x+1/x2+1)ln(x/2) for very small Template:Math we find

ψ(θ)i4GMiDisDsDic2[ln(|θθi|2DiDis)].

One can compute the convergence by applying the 2D Laplacian of the 2D lensing potential

κ(θ)=12θ2ψ(θ)=4πGDdsDdc2Dsdzρ(Ddθ,z)=ΣΣcr=i4πGMiDisc2DiDsδ(θθi)

in agreement with earlier definition κ(θ)=ΣΣcr as the ratio of projected density with the critical density. Here we used 21/r=4πδ(r) and θ=Dd.

We can also confirm the previously defined reduced deflection angle

θβ=θψ(θ)=iθEi2|θθi|,πθEi24πGMiDisc2DsDi

where θEi is the so-called Einstein angular radius of a point lens Mi. For a single point lens at the origin we recover the standard result that there will be two images at the two solutions of the essentially quadratic equation

θβ=θE2|θ|.

The amplification matrix can be obtained by double derivatives of the dimensionless time delay

Aij=βjθi=τθiθj=δijψθiθj=[1κγ1γ2γ21κ+γ1]

where we have define the derivatives

κ=ψ2θ1θ1+ψ2θ2θ2,γ1ψ2θ1θ1ψ2θ2θ2,γ2ψθ1θ2

which takes the meaning of convergence and shear. The amplification is the inverse of the Jacobian

A=1/det(Aij)=1(1κ)2γ12γ22

where a positive A means either a maxima or a minima, and a negative A means a saddle point in the arrival surface.

For a single point lens, one can show (albeit a lengthy calculation) that

κ=0,γ=γ12+γ22=θE2|θ|2,θE2=4GMDdsc2DdDs.

So the amplification of a point lens is given by

A=(1θE4θ4)1.

Note A diverges for images at the Einstein radius θE.

In cases there are multiple point lenses plus a smooth background of (dark) particles of surface density Σcrκsmooth, the time arrival surface is

ψ(θ)12κsmooth|θ|2+iθE2[ln(|θθi|24DdDds)].

To compute the amplification, e.g., at the origin (0,0), due to identical point masses distributed at (θxi,θyi) we have to add up the total shear, and include a convergence of the smooth background,

A=[(1κsmooth)2(i(θxi2θyi2)θE2(θxi2+θyi2)2)2(i(2θxiθyi)θE2(θxi2+θyi2)2)2]1

This generally creates a network of critical curves, lines connecting image points of infinite amplification.

General weak lensing

In weak lensing by large-scale structure, the thin-lens approximation may break down, and low-density extended structures may not be well approximated by multiple thin-lens planes. In this case, the deflection can be derived by instead assuming that the gravitational potential is slowly varying everywhere (for this reason, this approximation is not valid for strong lensing). This approach assumes the universe is well described by a Newtonian-perturbed FRW metric, but it makes no other assumptions about the distribution of the lensing mass.

As in the thin-lens case, the effect can be written as a mapping from the unlensed angular position β to the lensed position θ. The Jacobian of the transform can be written as an integral over the gravitational potential Φ along the line of sight [3]

βiθj=δij+0rdrg(r)2Φ(x(r))xixj

where r is the comoving distance, xi are the transverse distances, and

g(r)=2rrrdr(1rr)W(r)

is the lensing kernel, which defines the efficiency of lensing for a distribution of sources W(r).

The Jacobian Aij can be decomposed into convergence and shear terms just as with the thin-lens case, and in the limit of a lens that is both thin and weak, their physical interpretations are the same.

Weak lensing observables

In weak gravitational lensing, the Jacobian is mapped out by observing the effect of the shear on the ellipticities of background galaxies. This effect is purely statistical; the shape of any galaxy will be dominated by its random, unlensed shape, but lensing will produce a spatially coherent distortion of these shapes.

Measures of ellipticity

In most fields of astronomy, the ellipticity is defined as 1q, where q=ba is the axis ratio of the ellipse. In weak gravitational lensing, two different definitions are commonly used, and both are complex quantities which specify both the axis ratio and the position angle ϕ:

χ=1q21+q2e2iϕ=a2b2a2+b2e2iϕ
ϵ=1q1+qe2iϕ=aba+be2iϕ

Like the traditional ellipticity, the magnitudes of both of these quantities range from 0 (circular) to 1 (a line segment). The position angle is encoded in the complex phase, but because of the factor of 2 in the trigonometric arguments, ellipticity is invariant under a rotation of 180 degrees. This is to be expected; an ellipse is unchanged by a 180° rotation. Taken as imaginary and real parts, the real part of the complex ellipticity describes the elongation along the coordinate axes, while the imaginary part describes the elongation at 45° from the axes.

The ellipticity is often written as a two-component vector instead of a complex number, though it is not a true vector with regard to transforms:

χ={|χ|cos2ϕ,|χ|sin2ϕ}
ϵ={|ϵ|cos2ϕ,|ϵ|sin2ϕ}

Real astronomical background sources are not perfect ellipses. Their ellipticities can be measured by finding a best-fit elliptical model to the data, or by measuring the second moments of the image about some centroid (x¯,y¯)

qxx=(xx¯)2I(x,y)I(x,y)
qyy=(yy¯)2I(x,y)I(x,y)
qxy=(xx¯)(yy¯)I(x,y)I(x,y)

The complex ellipticities are then

χ=qxxqyy+2iqxyqxx+qyy
ϵ=qxxqyy+2iqxyqxx+qyy+2qxxqyyqxy2

This can be used to relate the second moments to traditional ellipse parameters:

qxx=a2cos2θ+b2sin2θ
qyy=a2sin2θ+b2cos2θ
qxy=(a2b2)sinθcosθ

and in reverse:

a2=qxx+qyy+(qxxqyy)2+4qxy22
b2=qxx+qyy(qxxqyy)2+4qxy22
tan2θ=2qxyqxxqyy

The unweighted second moments above are problematic in the presence of noise, neighboring objects, or extended galaxy profiles, so it is typical to use apodized moments instead:

qxx=(xx¯)2w(xx¯,yy¯)I(x,y)w(xx¯,yy¯)I(x,y)
qyy=(yy¯)2w(xx¯,yy¯)I(x,y)w(xx¯,yy¯)I(x,y)
qxy=(xx¯)(yy¯)w(xx¯,yy¯)I(x,y)w(xx¯,yy¯)I(x,y)

Here w(x,y) is a weight function that typically goes to zero or quickly approaches zero at some finite radius.

Image moments cannot generally be used to measure the ellipticity of galaxies without correcting for observational effects, particularly the point spread function.[4]

Shear and reduced shear

Recall that the lensing Jacobian can be decomposed into shear γ and convergence κ. Acting on a circular background source with radius R, lensing generates an ellipse with major and minor axes

a=R1κγ
b=R1κ+γ

as long as the shear and convergence do not change appreciably over the size of the source (in that case, the lensed image is not an ellipse). Galaxies are not intrinsically circular, however, so it is necessary to quantify the effect of lensing on a non-zero ellipticity.

We can define the complex shear in analogy to the complex ellipticities defined above

γ=|γ|e2iϕ

as well as the reduced shear

gγ1κ

The lensing Jacobian can now be written as

A=[1κRe[γ]Im[γ]Im[γ]1κ+Re[γ]]=(1κ)[1Re[g]Im[g]Im[g]1+Re[g]]

For a reduced shear g and unlensed complex ellipticities χs and ϵs, the lensed ellipticities are

χ=χs+2g+g2χs*1+|g|2+2Re(gχs*)
ϵ=ϵs+g1+g*ϵs

In the weak lensing limit, γ1 and κ1, so

χχs+2gχs+2γ
ϵϵs+gϵs+γ

If we can assume that the sources are randomly oriented, their complex ellipticities average to zero, so

χ=2γ and ϵ=γ.

This is the principal equation of weak lensing: the average ellipticity of background galaxies is a direct measure of the shear induced by foreground mass.

Magnification

While gravitational lensing preserves surface brightness, as dictated by Liouville's theorem, lensing does change the apparent solid angle of a source. The amount of magnification is given by the ratio of the image area to the source area. For a circularly symmetric lens, the magnification factor μ is given by

μ=θβdθdβ

In terms of convergence and shear

μ=1detA=1[(1κ)2γ2]

For this reason, the Jacobian A is also known as the "inverse magnification matrix".

The reduced shear is invariant with the scaling of the Jacobian A by a scalar λ, which is equivalent to the transformations

1κ=λ(1κ)

and

γ=λγ.

Thus, κ can only be determined up to a transformation κλκ+(1λ), which is known as the "mass sheet degeneracy." In principle, this degeneracy can be broken if an independent measurement of the magnification is available because the magnification is not invariant under the aforementioned degeneracy transformation. Specifically, μ scales with λ as μλ2.

References

Template:Reflist