• Ei tuloksia

Continues on the next page!

N/A
N/A
Info
Lataa
Protected

Academic year: 2022

Jaa "Continues on the next page!"

Copied!
2
0
0

Kokoteksti

(1)

582650 Information-Theoretic Modeling (Autumn 2012) Homework 1 (due 11 September)

You should turn in your solutions on paper at the start of the exercise session. Please try to actually attend the first homework session on 11 September, we’ll discuss what’s the best way of managing the future homework. If you cannot make it, contact the lecturer about the possibility of turning in your homework by e-mail.

General comments: Problems 4 and 5 give some practice with basic mathematical techniques we’ll need. If you have routine in this kind of work, they should be fairly straightforward. If not, feel free to seek additional help from textbooks, the net etc.

The parameters in the Bonus Problem are chosen so that there should be no issues with numerical accuracy or computation time, if you think a little about how to arrange the computations sensibly.

1. Explain briefly your background and what made you interested in this course. What is your ma- jor subject (computer science, math, ...), and at what stage of your studies you are? How much mathematics have you studied (name some courses you have taken)? How about programming—

what language(s) you use, and how would you describe your programming skills? If you have studied something else relevant, like statistics, signal processing, etc., you can also mention that.

Have you studied any information theory before?

2. Find out what compression algorithms are used in the gadgets you use—DVD player, digital TV, CD, iPod, cell-phone, laptop (WinZip), etc. You can simply list the names of the algorithms, you don’t have to study how these algorithms work.

3. In a programming language of your choice, find as short a program as you can for printing out the sequence 141592653589793. . . 375678 (the first 10000 digits of π after the decimal point).

Search the web to find an algorithm for calculating π. Your solution to the problem should contain your source code and a brief description of the algorithm you use (the formula and where it’s from).

4. For0≤p≤1, we define thebinary entropy H(p)as

H(p) =−plog2p−(1−p) log2(1−p).

This is the entropy of a random variable that gets value 0 with probabilitypand value1 with probability1−p, and will appear frequently during the rest of the course.

Prove that H(p)is strictly concave and achieves its maximum at p= 1/2. (Hint: Remember that a sufficient condition for concavity is that the second derivative is negative.)

Continues on the next page!

(2)

5. We writeµ= E[X]for the expectation andσ2 = Var[X]for the variance of a random variable X. (Recall thatσ2= E[(X−µ)2].)

(a) Prove Markov’s Inequality: for any non-negative random variable X and any δ > 0, we have

P(X ≥δ)≤µ δ.

(b) Apply Markov’s Inequality to the random variable(X−µ)2to obtainChebyshev’s Inequal- ity: for anyδ >0we have

P(|X−µ| ≥δ)≤ σ2 δ2.

(c) Consider nowX that follows binomial distribution with parametersnandp. That is,X is the number of heads innindependent tosses of a biased coin with probabilitypof getting heads. We know from a basic course in probability that now µ=np and σ2=np(1−p).

Use Chebyshev’s Inequality to give an upper bound for the probability that the value of X differs from its expectation by at least 5%; that is, |X−µ|/µ ≥ 0.05. Give a general formula and calculate numerical values forp= 0.5andp= 0.8, with n= 20,n= 100and n= 1000.

Warning: This gives very loose upper bounds. In practice you should just calculate the values exactly (if you need specific numerical values) or use Chernoff bounds or similar (if you need an asymptotically fairly accurate closed-form formula for theoretical purposes);

see the Bonus Problem.

Bonus ProblemConsider a binomially distributed random variableX with parametersnandp, as in Problem 5(c).

(a) We know that fork∈ {0, . . . , n} we have

P(X=k) =

n

k

pk(1−p)n−k.

Use this to calculate the probability for X differing from its expectation by at least 5%, for the same combinations ofnand pas in 5(c). That is, you should calculate the actual values for the probabilities you upper bounded in 5(c) using Chebyshev’s Inequality.

(b) Compare the probabilities you calculated in part (a) with upper bounds you get from theChernoff bound

P(|X−µ| ≥δµ)≤2 exp(−µδ2/3).

Remark: In the literature you can find a large number of inequalities called “Chernoff bound.”

The one given here is one of the simplest, but perhaps not the tightest.

2

Viittaukset

LIITTYVÄT TIEDOSTOT

Remark: The idea is to always update the weight vector so that the latest data point is classified with a positive margin, but try to keep close to the old weight vector since

Give your solution as a fairly high-level pseudocode, but explain the data structures in sufficient detail so that the running times are justified.. Remark: On this course we do

You can send your solutions as a PDF file to jyrki.kivinen@cs.helsinki.fi, drop a paper version to a box outside Jyrki Kivinen’s office B229a, or bring a paper version to the

• When you use a modifying word like ”set”, vector”, ”model” etc. before the

If you need more space (either horizontal or vertical) you can define it by commands \vspace{2cm}(vertical space of 2 cm) and \hspace{13mm} (a horizontal space of 13mm).

If you add this observation to the fact that historians of religion, like all historians, need to work with written sources, you are in a working situation which is philological:

If the original pixel values are of UINT-type, what kind of problems may arise with filtering.. Present how you calculated you answer, not just the

Now, type your student number and press Enter. If your student number ends with an alphabetical letter, you should type the numerical part of your student number, e.g., if