• Ei tuloksia

Continues on the next page!

N/A
N/A
Info
Lataa
Protected

Academic year: 2022

Jaa "Continues on the next page!"

Copied!
2
0
0

Kokoteksti

(1)

582650 Information-Theoretic Modeling (Autumn 2012) Homework 2 (due 18 September)

You should turn in your solutions on paper at the start of the exercise session.

1. (a) Provide an alternative proof for Gibb’s Inequality where you use Jensen’s Inequality (b) Prove that for any n positive numbers x1, . . . , xn, their geometric mean is at most their

arithmetic mean:

n

Y

i=1

xi

!1/n

≤ 1 n

n

X

i=1

xi.

Hint: concavity oflogworks well here, too.

2. Consider two binary-valued random variablesX andY with the following joint distribution:

Y = 0 Y = 1

X= 0 1/3 1/3

X= 1 0 1/3.

In other words, the caseX = 1, Y = 0 never occurs, but the other combinations are all equally probable.

Calculate the values of (a) H(X)andH(Y) (b) H(X |Y)andH(Y |X)

(c) H(X, Y)

(d) H(Y)−H(Y |X) (e) I(X;Y).

Optionally, you may wish to draw a graphical representation to illustrate these quantities (as on page 30 of lecture notes for 12 September, or as drawn on the blackboard during the lecture).

3. Give a proof for the chain rule of mutual information:

I(X;Y1, . . . , YN) =

N

X

i=1

I(X;Yi|Y1, . . . , Yi−1).

Show also that in the special case where the random variables Yi are all independent, this simplifies to

I(X;Y1, . . . , YN) =

N

X

i=1

I(X;Yi).

Continues on the next page!

(2)

4. There are 12 balls that look exactly the same, one of which is either heavier or lighter than the others. Your task is to determine which one of the balls is different, and whether it is heavier or lighter. To do this, you are given a balance. You can put any number of balls on each side of the balance, and the balance will tell you whether the sides are of equal weight or if the left or the right side is heavier. What is the fewest number of weighings you can use to solve the problem? Please describe your strategy and prove that it always works. If you can, also prove that the problem cannot be solved using fewer weighings.

5. Consider the simple Bernoulli model that generates independent random bits withPr[Xi= 1] = pfor some fixed 0≤p≤1.

For sequence lengthn, and some > 0, the typical set An is defined as the set of sequences x1, . . . , xn such that

2−n(H(X)+)≤p(x1, . . . , xn)≤2−n(H(X)−) .

What are the sequences in the typical setA150.1 under the Bernoulli model when p= 0.1? How aboutp= 0.3, andp= 0.5? What can you say about the sizes of these sets?

Bonus ProblemWe consider a binary symmetric channel with error ratep. In other words, we transmit single bits, and each bit gets flipped with probability pindependently of anything else. For error correction, we use a simplek-fold repetition code, where each bit is just repeatedktimes. Then an error occurs for a given bit, i.e., the recipient is unable to recover its correct value, if at leastk/2 of the sent bits were flipped.

(a) Give an exact expression in terms ofpandkfor the probability of error in transmitting a single bit. Can you come up with an approximate formula that’s easier to understand and evaluate?

(b) Supposing p= 0.1 and we want the error probability to be at most 0.00001 %, what value of k would be sufficient? A reasonably tight estimate is a sufficient answer.

(c) Assume now that we actually want to transmit a byte (8 bits) so that all the bits are correct.

What value ofk would be sufficient, with other parameters as in the previous part?

2

Viittaukset

LIITTYVÄT TIEDOSTOT

In practice you should just calculate the values exactly (if you need specific numerical values) or use Chernoff bounds or similar (if you need an asymptotically fairly

(a) What is the optimal strategy if you want to minimize the expected number of questions required.. What can you say about the expected number of questions that

(This has been counted as two problems to keep the total work load more manageable. For purposes of grading, and perhaps also for planning your work, you may think that Problem 4 is

The goal is to make the Weighted Majority algorithm a bit more concrete and prepare for later exercises with other online

Your solution should consist of a brief explanation of the observations you made, a couple of representative plots to support this, and a printout of your program

Remark: The idea is to always update the weight vector so that the latest data point is classified with a positive margin, but try to keep close to the old weight vector since

Give your solution as a fairly high-level pseudocode, but explain the data structures in sufficient detail so that the running times are justified.. Remark: On this course we do

You can send your solutions as a PDF file to jyrki.kivinen@cs.helsinki.fi, drop a paper version to a box outside Jyrki Kivinen’s office B229a, or bring a paper version to the