Matrix representation of conic sections

From testwiki
Jump to navigation Jump to search

In mathematics, the matrix representation of conic sections permits the tools of linear algebra to be used in the study of conic sections. It provides easy ways to calculate a conic section's axis, vertices, tangents and the pole and polar relationship between points and lines of the plane determined by the conic. The technique does not require putting the equation of a conic section into a standard form, thus making it easier to investigate those conic sections whose axes are not parallel to the coordinate system.

Conic sections (including degenerate ones) are the sets of points whose coordinates satisfy a second-degree polynomial equation in two variables, Q(x,y)=Ax2+Bxy+Cy2+Dx+Ey+F=0. By an abuse of notation, this conic section will also be called Q when no confusion can arise.

This equation can be written in matrix notation, in terms of a symmetric matrix to simplify some subsequent formulae, as[1]

(xy)(AB/2B/2C)(xy)+(DE)(xy)+F=0.

The sum of the first three terms of this equation, namely Ax2+Bxy+Cy2=(xy)(AB/2B/2C)(xy), is the quadratic form associated with the equation, and the matrix A33=(AB/2B/2C) is called the matrix of the quadratic form. The trace and determinant of A33 are both invariant with respect to rotation of axes and translation of the plane (movement of the origin).[2][3]

The quadratic equation can also be written as

𝐱𝖳AQ𝐱=0,

where 𝐱 is the homogeneous coordinate vector in three variables restricted so that the last variable is 1, i.e.,

(xy1)

and where AQ is the matrix

AQ=(AB/2D/2B/2CE/2D/2E/2F). The matrix AQ is called the matrix of the quadratic equation.[4] Like that of A33, its determinant is invariant with respect to both rotation and translation.[3]

The 2 × 2 upper left submatrix (a matrix of order 2) of AQ, obtained by removing the third (last) row and third (last) column from AQ is the matrix of the quadratic form. The above notation A33 is used in this article to emphasize this relationship.

Classification

Proper (non-degenerate) and degenerate conic sections can be distinguished[5][6] based on the determinant of AQ=(ACB24)F+BDECD2AE24:

If detAQ=0, the conic is degenerate.

If detAQ0 so that Q is not degenerate, we can see what type of conic section it is by computing the minor, detA33=ACB24:

  • Q is a hyperbola if and only if detA33<0,
  • Q is a parabola if and only if detA33=0, and
  • Q is an ellipse if and only if detA33>0.

In the case of an ellipse, we can distinguish the special case of a circle by comparing the last two diagonal elements corresponding to the coefficients of x2 , xy and y2:

  • If A=C and B=0, then Q is a circle.

Moreover, in the case of a non-degenerate ellipse (with detA33>0 and detAQ0), we have a real ellipse if (A+C)detAQ<0 but an imaginary ellipse if (A+C)detAQ>0. An example of the latter is x2+y2+10=0, which has no real-valued solutions.

If the conic section is degenerate (detAQ=0), detA33 still allows us to distinguish its form:

  • Two intersecting lines (a hyperbola degenerated to its two asymptotes) if and only if detA33<0.
  • Two parallel straight lines (a degenerate parabola) if and only if detA33=0. These lines are distinct and real if D2+E2>4(A+C)F, coincident if D2+E2=4(A+C)F, and non-existent in the real plane if D2+E2<4(A+C)F.
  • A single point (a degenerate ellipse) if and only if detA33>0.

The case of coincident lines occurs if and only if the rank of the 3 × 3 matrix AQ is 1; in all other degenerate cases its rank is 2.[2]

Central conics

When detA330 a geometric center of the conic section exists and such conic sections (ellipses and hyperbolas) are called central conics.[7]

Center

The center of a conic, if it exists, is a point that bisects all the chords of the conic that pass through it. This property can be used to calculate the coordinates of the center, which can be shown to be the point where the gradient of the quadratic function Template:Math vanishes—that is,[8] Q=[Qx,Qy]=[0,0]. This yields the center as given below.

An alternative approach that uses the matrix form of the quadratic equation is based on the fact that when the center is the origin of the coordinate system, there are no linear terms in the equation. Any translation to a coordinate origin Template:Math, using Template:Math, Template:Math gives rise to

(x*+x0y*+y0)(AB/2B/2C)(x*+x0y*+y0)+(DE)(x*+x0y*+y0)+F=0.

The condition for Template:Math to be the conic's center Template:Math is that the coefficients of the linear Template:Mvar and Template:Mvar terms, when this equation is multiplied out, are zero. This condition produces the coordinates of the center: (xcyc)=(AB/2B/2C)1(D/2E/2)=((BE2CD)/(4ACB2)(DB2AE)/(4ACB2)).

This calculation can also be accomplished by taking the first two rows of the associated matrix Template:Math, multiplying each by Template:Math and setting both inner products equal to 0, obtaining the following system:

Ax+(B/2)y+D/2=0,(B/2)x+Cy+E/2=0.

This yields the above center point.

In the case of a parabola, that is, when Template:Math, there is no center since the above denominators become zero (or, interpreted projectively, the center is on the line at infinity.)

Centered matrix equation

A central (non-parabola) conic Ax2+Bxy+Cy2+Dx+Ey+F=0 can be rewritten in centered matrix form as (xxcyyc)(AB/2B/2C)(xxcyyc)=K, where K=det(AQ)AC(B/2)2=det(AQ)det(A33).

Then for the ellipse case of Template:Math, the ellipse is real if the sign of Template:Math equals the sign of Template:Math (that is, the sign of each of Template:Math and Template:Math), imaginary if they have opposite signs, and a degenerate point ellipse if Template:Math. In the hyperbola case of Template:Math, the hyperbola is degenerate if and only if Template:Math.

Standard form of a central conic

Template:Main article

The standard form of the equation of a central conic section is obtained when the conic section is translated and rotated so that its center lies at the center of the coordinate system and its axes coincide with the coordinate axes. This is equivalent to saying that the coordinate system's center is moved and the coordinate axes are rotated to satisfy these properties. In the diagram, the original Template:Mvar-coordinate system with origin Template:Mvar is moved to the Template:Mvar-coordinate system with origin Template:Mvar.

Translating and rotating coordinates

The translation is by the vector 𝐭=(xcyc).

The rotation by angle Template:Mvar can be carried out by diagonalizing the matrix Template:Math. Thus, if λ1 and λ2 are the eigenvalues of the matrix A33, the centered equation can be rewritten in new variables Template:Mvar and Template:Mvar as[9]

λ1x'2+λ2y'2=detAQdetA33.

Dividing by K=detAQdetA33 we obtain a standard canonical form.

For example, for an ellipse this form is x2a2+y2b2=1. From here we get Template:Math and Template:Math, the lengths of the semi-major and semi-minor axes in conventional notation.

For central conics, both eigenvalues are non-zero and the classification of the conic sections can be obtained by examining them.[10]

Axes

By the principal axis theorem, the two eigenvectors of the matrix of the quadratic form of a central conic section (ellipse or hyperbola) are perpendicular (orthogonal to each other) and each is parallel to (in the same direction as) either the major or minor axis of the conic. The eigenvector having the smallest eigenvalue (in absolute value) corresponds to the major axis.[11]

Specifically, if a central conic section has center Template:Math and an eigenvector of Template:Math is given by Template:Math then the principal axis (major or minor) corresponding to that eigenvector has equation, xxcv1=yycv2.

Vertices

The vertices of a central conic can be determined by calculating the intersections of the conic and its axes — in other words, by solving the system consisting of the quadratic conic equation and the linear equation for alternately one or the other of the axes. Two or no vertices are obtained for each axis, since, in the case of the hyperbola, the minor axis does not intersect the hyperbola at a point with real coordinates. However, from the broader view of the complex plane, the minor axis of an hyperbola does intersect the hyperbola, but at points with complex coordinates.[12]

Poles and polars

Template:Main article Using homogeneous coordinates,[13] the points[14] 𝐩=(p0p1p2) and 𝐫=(r0r1r2) are conjugate with respect to the conic Template:Mvar provided 𝐩𝖳AQ𝐫=0.

The conjugates of a fixed point Template:Math either form a line or consist of all the points in the plane of the conic. When the conjugates of Template:Math form a line, the line is called the polar of Template:Math and the point Template:Math is called the pole of the line, with respect to the conic. This relationship between points and lines is called a polarity.

If the conic is non-degenerate, the conjugates of a point always form a line and the polarity defined by the conic is a bijection between the points and lines of the extended plane containing the conic (that is, the plane together with the points and line at infinity).

If the point Template:Math lies on the conic Template:Mvar, the polar line of Template:Math is the tangent line to Template:Mvar at Template:Math.

The equation, in homogeneous coordinates, of the polar line of the point Template:Math with respect to the non-degenerate conic Template:Mvar is given by 𝐩TAQ(xyz)=0.

Just as Template:Math uniquely determines its polar line (with respect to a given conic), so each line determines a unique pole Template:Math. Furthermore, a point Template:Math is on a line Template:Math which is the polar of a point Template:Math, if and only if the polar of Template:Math passes through the point Template:Math (La Hire's theorem).[15] Thus, this relationship is an expression of geometric duality between points and lines in the plane.

Several familiar concepts concerning conic sections are directly related to this polarity. The center of a non-degenerate conic can be identified as the pole of the line at infinity. A parabola, being tangent to the line at infinity, would have its center being a point on the line at infinity. Hyperbolas intersect the line at infinity in two distinct points and the polar lines of these points are the asymptotes of the hyperbola and are the tangent lines to the hyperbola at these points of infinity. Also, the polar line of a focus of the conic is its corresponding directrix.[16]

Tangents

Let line Template:Math be the polar line of point Template:Math with respect to the non-degenerate conic Template:Mvar. By La Hire's theorem, every line passing through Template:Math has its pole on Template:Math. If Template:Math intersects Template:Mvar in two points (the maximum possible) then the polars of those points are tangent lines that pass through Template:Math and such a point is called an exterior or outer point of Template:Mvar. If Template:Math intersects Template:Mvar in only one point, then it is a tangent line and Template:Math is the point of tangency. Finally, if Template:Math does not intersect Template:Mvar then Template:Math has no tangent lines passing through it and it is called an interior or inner point.[17]

The equation of the tangent line (in homogeneous coordinates) at a point Template:Mvar on the non-degenerate conic Template:Mvar is given by,

𝐩𝖳AQ(xyz)=0.

If Template:Math is an exterior point, first find the equation of its polar (the above equation) and then the intersections of that line with the conic, say at points Template:Math and Template:Math. The polars of Template:Math and Template:Math will be the tangents through Template:Math.

Using the theory of poles and polars, the problem of finding the four mutual tangents of two conics reduces to finding the intersection of two conics.

See also

Notes

Template:Reflist

References

Template:Matrix classes

  1. Template:Harvnb
  2. 2.0 2.1 Template:Harvnb
  3. 3.0 3.1 Template:Harvnb
  4. It is also a matrix of a quadratic form, but this form has three variables and is Ax2+Bxy+Cy2+Dxz+Eyz+Fz2.
  5. Template:Harvnb
  6. Template:Harvnb
  7. Template:Harvnb
  8. Template:Harvnb
  9. Template:Harvnb
  10. Template:Harvnb
  11. Template:Harvnb
  12. Template:Citation
  13. This permits the algebraic inclusion of infinite points and a line at infinity which are necessary to have for some of the following results
  14. This section follows Template:Citation
  15. Template:Harvnb
  16. Template:Citation
  17. Interpreted in the complex plane such a point is on two complex tangent lines that meet Template:Mvar in complex points.