# Chapter 6

# Hashing and Randomized Algorithms

### Probability

6.1. You are given $n$ unbiased coins, and perform the following process to generate all heads. Toss all $n$ coins independently at random onto a table. Each round consists of picking up all the tails-up coins and tossing them onto the table again. You repeat until all coins are heads.
(a) What is the expected number of rounds performed by the process?
(b) What is the expected number of coin tosses performed by the process?

6.2. Suppose we flip $n$ coins, each of known bias, such that $p_{i}$ is the probability of the $i$th coin being a head. Present an efficient algorithm to determine the exact probability of getting exactly $k$ heads given $p_{1},\ldots,p_{n}\in [0,1]$.
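
One possible approach to 6.2, sketched here rather than given as the intended solution, is a dynamic program over the coins: after processing each coin, keep the probability of every head count from $0$ to $k$. The function name `prob_exactly_k_heads` is an assumption made for illustration, and the coins are assumed to be independent.

```python
def prob_exactly_k_heads(p, k):
    """Probability of exactly k heads among independent coins
    whose head probabilities are listed in p."""
    n = len(p)
    if k < 0 or k > n:
        return 0.0
    # dp[j] = probability of exactly j heads among the coins seen so far
    dp = [0.0] * (k + 1)
    dp[0] = 1.0
    for pi in p:
        # Go downward so dp[j - 1] still holds the value from before this coin
        for j in range(k, 0, -1):
            dp[j] = dp[j] * (1 - pi) + dp[j - 1] * pi
        dp[0] *= (1 - pi)
    return dp[k]
```

This sketch uses $O(nk)$ time and $O(k)$ space; with floating-point inputs the result is exact only up to rounding.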

6.3. An inversion of a permutation is a pair of elements that are out of order.
(a) Show that a permutation of $n$ items has at most $n(n-1)/2$ inversions. Which permutation(s) have exactly $n(n-1)/2$ inversions?
(b) Let $P$ be a permutation and $P^{r}$ be the reversal of this permutation. Show that $P$ and $P^{r}$ have a total of exactly $n(n-1)/2$ inversions.
(c) Use the previous result to argue that the expected number of inversions in a random permutation is $n(n-1)/4$.
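
The claims in 6.3 can be sanity-checked by brute force for small $n$ before attempting a proof. The helper names `count_inversions` and `average_inversions` below are assumptions for illustration, and the quadratic counter is meant only for tiny inputs.

```python
from itertools import permutations

def count_inversions(perm):
    """Count pairs (i, j) with i < j and perm[i] > perm[j]."""
    n = len(perm)
    return sum(1 for i in range(n) for j in range(i + 1, n)
               if perm[i] > perm[j])

def average_inversions(n):
    """Average inversion count over all n! permutations of range(n)."""
    perms = list(permutations(range(n)))
    return sum(count_inversions(p) for p in perms) / len(perms)
```

For example, with $n=4$ every permutation and its reversal should together have $4\cdot 3/2=6$ inversions, which can be checked by comparing `count_inversions(p) + count_inversions(p[::-1])` against 6 for each `p`.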

6.4. A derangement is a permutation $p$ of $\{1,\ldots,n\}$ such that no item is in its proper position, that is, $p_{i}\neq i$ for all $1\leq i\leq n$. What is the probability that a random permutation is a derangement?

### Hashing

6.5. An all-Beatles radio station plays nothing but recordings by the Beatles, selecting the next song at random (uniformly with replacement). They get through about ten songs per hour. I listened for 90 minutes before hearing a repeated song. Estimate how many songs the Beatles recorded.

6.6. Given strings $S$ and $T$ of lengths $n$ and $m$ respectively, find the shortest window in $S$ that contains all the characters in $T$ in expected $O(n+m)$ time.
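
A common way to attack 6.6 is a sliding window with a hash table of character counts. The sketch below is one such attempt, not necessarily the intended solution; it assumes that a character occurring several times in $T$ must occur at least that many times in the window, and the name `shortest_window` is hypothetical.

```python
from collections import Counter

def shortest_window(s, t):
    """Shortest substring of s containing every character of t
    (counted with multiplicity), or "" if no such window exists."""
    if not t:
        return ""
    need = Counter(t)      # need[c] > 0: still missing; < 0: surplus in window
    missing = len(t)       # characters of t not yet covered by the window
    best = None
    left = 0
    for right, ch in enumerate(s):
        if need[ch] > 0:
            missing -= 1
        need[ch] -= 1
        if missing == 0:
            # Drop surplus characters from the left end
            while need[s[left]] < 0:
                need[s[left]] += 1
                left += 1
            if best is None or right - left + 1 < best[1] - best[0]:
                best = (left, right + 1)
            # Give up the leftmost required character and keep scanning
            need[s[left]] += 1
            missing += 1
            left += 1
    return "" if best is None else s[best[0]:best[1]]
```

Each index of $S$ enters and leaves the window at most once, so the scan takes expected $O(n+m)$ time given constant expected-time hash operations.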

6.7. Design and implement an efficient data structure to maintain a least recently used (LRU) cache of $n$ integer elements. An LRU cache discards the least recently accessed element once the cache has reached its capacity, and supports the following operations:
get(k) – Return the value associated with the key $k$ if it currently exists in the cache, otherwise return -1.
put(k,v) – Set the value associated with key $k$ to $v$, or insert if $k$ is not already present. If there are already $n$ items in the cache, delete the least recently used item before inserting $(k,v)$.
Both operations should be done in $O(1)$ expected time.
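
As a minimal sketch of the interface in 6.7, the class below leans on Python's `collections.OrderedDict`, which combines a hash table with a doubly linked list internally. The class name `LRUCache` and the attribute names are assumptions for illustration, and the capacity is assumed to be at least 1; a from-scratch solution would build the hash table and linked list explicitly.

```python
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity):
        self.capacity = capacity        # assumed to be >= 1
        self.items = OrderedDict()      # oldest entry first, newest last

    def get(self, k):
        if k not in self.items:
            return -1
        self.items.move_to_end(k)       # mark k as most recently used
        return self.items[k]

    def put(self, k, v):
        if k in self.items:
            self.items.move_to_end(k)   # updating counts as a use
        elif len(self.items) >= self.capacity:
            self.items.popitem(last=False)  # evict least recently used
        self.items[k] = v
```

For instance, with capacity 2, after `put(1, 1)`, `put(2, 2)`, `get(1)`, and `put(3, 3)`, key 2 is the one evicted, since key 1 was accessed more recently.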

### Randomized Algorithms

6.8. A pair of English words $(w_{1},w_{2})$ is called a rotodrome if one can be circularly shifted (rotated) to create the other word. For example, the words (windup, upwind) are a rotodrome pair, because we can rotate “windup” two positions to the right to get “upwind.”
Give an efficient algorithm to find all rotodrome pairs among $n$ words of length $k$, with a worst-case analysis. Also give a faster expected-time algorithm based on hashing.

6.9. Given an array $w$ of positive integers, where $w[i]$ describes the weight of index $i$ , propose an algorithm that randomly picks an index in proportion to its weight.
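
One standard technique for 6.9 is to precompute prefix sums of the weights and binary-search a uniform random integer against them. The sketch below assumes a non-empty list of positive integer weights; the class name `WeightedPicker` and the optional `rng` parameter are hypothetical, added to make the behavior reproducible.

```python
import bisect
import itertools
import random

class WeightedPicker:
    def __init__(self, w, rng=None):
        # prefix[i] = w[0] + ... + w[i]
        self.prefix = list(itertools.accumulate(w))
        self.total = self.prefix[-1]
        self.rng = rng or random.Random()

    def pick(self):
        # r is uniform on 1..total; index i owns exactly w[i] values of r
        r = self.rng.randint(1, self.total)
        # First index whose prefix sum is >= r
        return bisect.bisect_left(self.prefix, r)
```

Preprocessing takes $O(n)$ time and each pick takes $O(\log n)$; with weights `[1, 3]`, index 1 should be returned about three times as often as index 0.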

6.10. You are given a function rand7, which generates a uniform random integer in the range 1 to 7. Use it to produce a function rand10, which generates a uniform random integer in the range 1 to 10.
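
A well-known approach to 6.10 is rejection sampling: combine two calls to rand7 into a uniform value over 49 outcomes, and retry whenever the value falls outside a range whose size is a multiple of 10. In the sketch below, `rand7` is a stand-in built on Python's `random` module, assumed here only so the example runs.

```python
import random

def rand7():
    # Stand-in for the given generator: uniform integer in 1..7
    return random.randint(1, 7)

def rand10():
    while True:
        # Two calls give a uniform value in 0..48
        x = 7 * (rand7() - 1) + (rand7() - 1)
        if x < 40:
            # 0..39 splits evenly into ten residue classes
            return x % 10 + 1
        # Otherwise reject and try again
```

Each attempt succeeds with probability $40/49$, so the expected number of rand7 calls per rand10 call is $2\cdot 49/40 = 2.45$.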

6.11. Let $0<\alpha <1/2$ be some constant, independent of the input array length $n$. What is the probability that, with a randomly chosen pivot element, the partition subroutine from quicksort produces a split in which each of the two resulting subproblems has size at least $\alpha$ times the size of the original array?

6.12. Show that for any given load $m/n$, the error probability of a Bloom filter is minimized when the number of hash functions is $k=\exp(-1)/(m/n)$.