Velocity Moments

From testwiki
Jump to navigation Jump to search

In the field of computer vision, velocity moments are weighted averages of the intensities of pixels in a sequence of images, similar to image moments but in addition to describing an object's shape also describe its motion through the sequence of images. Velocity moments can be used to aid automated identification of a shape in an image when information about the motion is significant in its description. There are currently two established versions of velocity moments: Cartesian[1] and Zernike.[2]

Cartesian velocity moments

Cartesian moments for single images

A Cartesian moment of a single image is calculated by

mpq=x=1My=1NxpyqPxy

where M and N are the dimensions of the image, Pxy is the intensity of the pixel at the point (x,y) in the image, and xpyq is the basis function.

Cartesian velocity moments for sequences of images

Cartesian velocity moments are based on these Cartesian moments. A Cartesian velocity moment vmpqμγ is defined by

vmpqμγ=i=2imagesx=1My=1NU(i,μ,γ)C(i,p,g)Pixy

where M and N are again the dimensions of the image, images is the number of images in the sequence, and Pixy is the intensity of the pixel at the point (x,y) in image i.

C(i,p,q) is taken from Central moments, added so the equation is translation invariant, defined as

C(i,p,q)=(xxi)p(yyi)q

where xi is the x coordinate of the centre of mass for image i, and similarly for y.

U(i,μ,γ) introduces velocity into the equation as

U(i,μ,γ)=(xixi1)μ(yiyi1)γ

where xi1 is the x coordinate of the centre of mass for the previous image, i1, and again similarly for y.

After the Cartesian velocity moment is calculated, it can be normalised by

vmpqμγ=vmpqμγA*I

where A is the average area of the object, in pixels, and I is the number of images. Now the value is not affected by the number of images in the sequence or the size of the object.

As Cartesian moments are non-orthogonal, so are Cartesian velocity moments, so different moments can be closely correlated. These velocity moments do however provide translation and scale invariance (unless the scale changes within the sequence of images).

Zernike velocity moments

Zernike moments for single images

A Zernike moment of a single image is calculated by

Amn=m+1πxy[Vmn(r,θ)]*Pxy

where * denotes the complex conjugate, m is an integer between 0 and , and n is an integer such that m|n| is even and |n|<m. For calculating Zernike moments, the image, or section of the image which is of interest is mapped to the unit disc, then Pxy is the intensity of the pixel at the point (x,y) on the disc and x2+y21 is a restriction on values of x and y. The coordinates are then mapped to polar coordinates, and r and θ are the polar coordinates of the point (x,y) on the unit disc map.

Vmn(r,θ) is derived from Zernike polynomials and is defined by

Vmn(r,θ)=Rmn(r)ejnθ
Rmn(r)=s=0m|n|2(1)sF(m,n,s,r)
F(m,n,s,r)=(ms)!s!(m+|n|2s)!(m|n|2s)!rm2s

Zernike velocity moments for sequences of images

Zernike velocity moments are based on these Zernike moments. A Zernike velocity moment Amnμγ is defined by

Amnμγ=m+1πi=2imagesx=1y=1U(i,μ,γ)[Vmn(r,θ)]*Pixy

where images is again the number of images in the sequence, and Pixy is the intensity of the pixel at the point (x,y) on the unit disc mapped from image i.

U(i,μ,γ) introduces velocity into the equation in the same way as in the Cartesian velocity moments and [Vmn(r,θ)]* is from the Zernike moments equation above.

Like the Cartesian velocity moments, Zernike velocity moments can be normalised by

Amnμγ=AmnμγA*I

where A is the average area of the object, in pixels, and I is the number of images.

As Zernike velocity moments are based on the orthogonal Zernike moments, they produce less correlated and more compact descriptions than Cartesian velocity moments. Zernike velocity moments also provide translation and scale invariance (even when the scale changes within the sequence).

Comparison of methods

Velocity moment type Translation invariance Scale invariance Orthogonal
Cartesian Yes Yes (if the object does not change scale within the sequence) No
Zernike Yes Yes Yes

References

  1. J. D. Shutler, M. S. Nixon, C. J. Harris, "Statistical Gait Description via Temporal Moments", Proc. SSIAI 2000 - Austin, Texas, :pp. 291-295, 2000
  2. J. D. Shutler and M. S. Nixon, "Zernike Velocity Moments for Description and Recognition of Moving Shapes", Proc. BMVC 2001, Manchester, UK, 2:pp. 705-714, 2001