posted 23 Jan 2024
Success Boosting
§ Motivation
Suppose we have a randomized algorithm which outputs a value that is correct with probability
Suppose for our application, we want to obtain the correct result with probability or or
§ Median of Means
To illustrate the usefulness of Chernoff bounds and other concentration inequalities, we can take the aforementioned algorithm . We run a total of times and take the median; with probability it arrives at the correct answer.
§ Framework
Suppose we design a randomized algorithm to estimate a hidden statistic of a dataset and we know in advance
Suppose each time we use it outputs a number with and
Suppose we want to estimate within tolerance and with probability
- Accuracy boosting: Repeat a total of times and take the mean
- Success boosting: Find the mean of a total of trials and take the median to be correct with probability
§ Max Load
Suppose we have a fair ‑sided die that is rolled times. On average, what is the largest number of times any outcome is rolled?
Let be a fixed value and
Then, The total number of rolls with value is and Using Chernoff bounds, we can let and find that
Using union bound, we have at least a probability that no outcome will be rolled more than times.
§ Coupon Collector
Suppose we have a fair ‑sided die. On average, how many times should we roll the die before we see all possible outcomes among the rolls?