Prophet inequality

From testwiki
Revision as of 11:33, 9 December 2024 by imported>Citation bot (Alter: doi, pages. Add: doi, isbn. Formatted dashes. | Use this bot. Report bugs. | Suggested by Abductive | Category:Mechanism design | #UCB_Category 23/46)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description In the theory of online algorithms and optimal stopping, a prophet inequality is a bound on the expected value of a decision-making process that handles a sequence of random inputs from known probability distributions, relative to the expected value that could be achieved by a "prophet" who knows all the inputs (and not just their distributions) ahead of time.Template:R These inequalities have applications in the theory of algorithmic mechanism design and mathematical finance.Template:R

Single item

The classical single-item prophet inequality was published by Template:Harvtxt, crediting its tight form to D. J. H. (Ben) Garling. It concerns a process in which a sequence of random variables Xi arrive from known distributions 𝒟i. When each Xi arrives, the decision-making process must decide whether to accept it and stop the process, or whether to reject it and go on to the next variable in the sequence. The value of the process is the single accepted variable, if there is one, or zero otherwise. It may be assumed that all variables are non-negative; otherwise, replacing negative values by zero does not change the outcome. This can model, for instance, financial situations in which the variables are offers to buy some indivisible good at a certain price, and the seller must decide which (if any) offer to accept. A prophet, knowing the whole sequence of variables, can obviously select the largest of them, achieving value maxiXi for any specific instance of this process, and expected value Template:Nowrap The prophet inequality states the existence of an online algorithm for this process whose expected value is at least half that of the prophet: Template:Nowrap No algorithm can achieve a greater expected value for all distributions of Template:Nowrap

One method for proving the single-item prophet inequality is to use a "threshold algorithm" that sets a parameter τ and then accepts the first random variable that is at least as large Template:Nowrap If the probability that this process accepts an item is p, then its expected value is pτ plus the expected excess over τ that the selected variable (if there is one) has. Each variable Xi will be considered by the threshold algorithm with probability at least Template:Nowrap and if it is considered will contribute max(Xiτ,0) to the excess, so by linearity of expectation the expected excess is at least 𝔼[i(1p)max(Xiτ,0)](1p)(𝔼[maxiXi]τ). Setting τ to the median of the distribution of Template:Nowrap so that Template:Nowrap and adding pτ to this bound on expected excess, causes the pτ and (1p)(τ) terms to cancel each other, showing that for this setting of τ the threshold algorithm achieves an expected value of at least Template:Nowrap A different threshold, Template:Nowrap also achieves at least this same expected value.Template:R

Generalizations

Various generalizations of the single-item prophet inequality to other online scenarios are known, and are also called prophet inequalities.Template:R

Comparison to competitive analysis

Prophet inequalities are related to the competitive analysis of online algorithms, but differ in two ways. First, much of competitive analysis assumes worst case inputs, chosen to maximize the ratio between the computed value and the optimal value that could have been achieved with knowledge of the future, whereas for prophet inequalities some knowledge of the input, its distribution, is assumed to be known. And second, in order to achieve a certain competitive ratio, an online algorithm must perform within that ratio of the optimal performance on all inputs. Instead, a prophet inequality only bounds the performance in expectation, allowing some input sequences to produce worse performance as long as the average is good.Template:R

References

Template:Reflist