Hermite normal form

From testwiki
Revision as of 03:24, 21 December 2024 by imported>Citation bot (Altered template type. | Use this bot. Report bugs. | Suggested by Dominic3203 | Category:Matrix normal forms | #UCB_Category 10/10)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

In linear algebra, the Hermite normal form is an analogue of reduced echelon form for matrices over the integers Z. Just as reduced echelon form can be used to solve problems about the solution to the linear system Ax=b where x is in Rn, the Hermite normal form can solve problems about the solution to the linear system Ax=b where this time x is restricted to have integer coordinates only. Other applications of the Hermite normal form include integer programming,[1] cryptography,[2] and abstract algebra.[3]

Definition

Various authors may prefer to talk about Hermite normal form in either row-style or column-style. They are essentially the same up to transposition.

Row-style Hermite normal form

An m by n matrix A with integer entries has a (row) Hermite normal form H if there is a square unimodular matrix U where H=UA and H has the following restrictions:[4][5][6]

  1. H is upper triangular (that is, hij = 0 for i > j), and any rows of zeros are located below any other row.
  2. The leading coefficient (the first nonzero entry from the left, also called the pivot) of a nonzero row is always strictly to the right of the leading coefficient of the row above it; moreover, it is positive.
  3. The elements below pivots are zero and elements above pivots are nonnegative and strictly smaller than the pivot.

The third condition is not standard among authors, for example some sources force non-pivots to be nonpositive[7][8] or place no sign restriction on them.[9] However, these definitions are equivalent by using a different unimodular matrix U. A unimodular matrix is a square invertible integer matrix whose determinant is 1 or −1.

Column-style Hermite normal form

A m-by-n matrix A with integer entries has a (column) Hermite normal form H if there is a square unimodular matrix U where H=AU and H has the following restrictions:[8][10]

  1. H is lower triangular, hij = 0 for i < j, and any columns of zeros are located on the right.
  2. The leading coefficient (the first nonzero entry from the top, also called the pivot) of a nonzero column is always strictly below of the leading coefficient of the column before it; moreover, it is positive.
  3. The elements to the right of pivots are zero and elements to the left of pivots are nonnegative and strictly smaller than the pivot.

Note that the row-style definition has a unimodular matrix U multiplying A on the left (meaning U is acting on the rows of A), while the column-style definition has the unimodular matrix action on the columns of A. The two definitions of Hermite normal forms are simply transposes of each other.

Existence and uniqueness of the Hermite normal form

Every full row rank m-by-n matrix A with integer entries has a unique m-by-n matrix H in Hermite normal form, such that H=UA for some square unimodular matrix U.[5][11][12]

Examples

In the examples below, Template:Mvar is the Hermite normal form of the matrix Template:Mvar, and Template:Mvar is a unimodular matrix such that Template:Math. A=(331401000019160003)H=(30110100001910003)U=(1301010000150001)

A=(236256168311)H=(10501103282006113)U=(9515201161)

If Template:Mvar has only one row then either Template:Math or Template:Math, depending on whether the single row of Template:Mvar has a positive or negative leading coefficient.

Algorithms

There are many algorithms for computing the Hermite normal form, dating back to 1851. One such algorithm is described in.[13]Template:Rp But only in 1979 an algorithm for computing the Hermite normal form that ran in strongly polynomial time was first developed;[14] that is, the number of steps to compute the Hermite normal form is bounded above by a polynomial in the dimensions of the input matrix, and the space used by the algorithm (intermediate numbers) is bounded by a polynomial in the binary encoding size of the numbers in the input matrix.

One class of algorithms is based on Gaussian elimination in that special elementary matrices are repeatedly used.[11][15][16] The LLL algorithm can also be used to efficiently compute the Hermite normal form.[17][18]

Applications

Lattice calculations

A typical lattice in Rn has the form L={i=1nαi𝐚i|αi<mi fromhbox="1">Z</mi>} where the ai are in Rn. If the columns of a matrix A are the ai, the lattice can be associated with the columns of a matrix, and A is said to be a basis of L. Because the Hermite normal form is unique, it can be used to answer many questions about two lattice descriptions. For what follows, LA denotes the lattice generated by the columns of A. Because the basis is in the columns of the matrix A, the column-style Hermite normal form must be used. Given two bases for a lattice, A and A', the equivalence problem is to decide if LA=LA. This can be done by checking if the column-style Hermite normal form of A and A' are the same up to the addition of zero columns. This strategy is also useful for deciding if a lattice is a subset (LALA if and only if L[AA]=LA), deciding if a vector v is in a lattice (vLA if and only if L[vA]=LA), and for other calculations.[19]

Integer solutions to linear systems

The linear system Template:Math has an integer solution Template:Math if and only if the system Template:Math has an integer solution Template:Math where Template:Math and Template:Math is the column-style Hermite normal form of Template:Math. Checking that Template:Math has an integer solution is easier than Template:Math because the matrix Template:Math is triangular.[11]Template:Rp

Implementations

Many mathematical software packages can compute the Hermite normal form:

Over an arbitrary Dedekind domain

Hermite normal form can be defined when we replace Z by an arbitrary Dedekind domain.[20] (for instance, any principal-ideal domain). For instance, in control theory it can be useful to consider Hermite normal form for the polynomials Template:Math over a given field Template:Math.

See also

References

Template:Reflist