Covering number

From testwiki
Revision as of 22:51, 12 February 2025 by imported>Raikou04 (growthexperiments-addlink-summary-summary:2|0|0)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Distinguish

In mathematics, a covering number is the number of balls of a given size needed to completely cover a given space, with possible overlaps between the balls. The covering number quantifies the size of a set and can be applied to general metric spaces. Two related concepts are the packing number, the number of disjoint balls that fit in a space, and the metric entropy, the number of points that fit in a space when constrained to lie at some fixed minimum distance apart.

Definition

Let (M, d) be a metric space, let K be a subset of M, and let r be a positive real number. Let Br(x) denote the ball of radius r centered at x. A subset C of M is an r-external covering of K if:

KxCBr(x).

In other words, for every yK there exists xC such that d(x,y)r.

If furthermore C is a subset of K, then it is an r-internal covering.

The external covering number of K, denoted Nrext(K), is the minimum cardinality of any external covering of K. The internal covering number, denoted Nrint(K), is the minimum cardinality of any internal covering.

A subset P of K is a packing if PK and the set {Br(x)}xP is pairwise disjoint. The packing number of K, denoted Nrpack(K), is the maximum cardinality of any packing of K.

A subset S of K is r-separated if each pair of points x and y in S satisfies d(x, y) ≥ r. The metric entropy of K, denoted Nrmet(K), is the maximum cardinality of any r-separated subset of K.

Examples

Template:Ordered list

Properties

Template:Ordered list

The following properties relate to covering numbers in the standard Euclidean space, m:[1]Template:Rp Template:Ordered list

Application to machine learning

Let K be a space of real-valued functions, with the l-infinity metric (see example 3 above). Suppose all functions in K are bounded by a real constant M. Then, the covering number can be used to bound the generalization error of learning functions from K, relative to the squared loss:[2]Template:Rp

Prob[suphK|GeneralizationError(h)EmpiricalError(h)|ϵ]Nrint(K)2expmϵ22M4

where r=ϵ8M and m is the number of samples.

See also

References

Template:Reflist

  1. Cite error: Invalid <ref> tag; no text was provided for refs named book14
  2. Cite error: Invalid <ref> tag; no text was provided for refs named book12