<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Big Red Bits &#187; renatoppl</title>
	<atom:link href="http://www.bigredbits.com/archives/author/renatoppl/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bigredbits.com</link>
	<description>Theory, Distributed Systems, and Other Random Bits</description>
	<lastBuildDate>Thu, 29 Sep 2011 07:13:29 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Three interesting puzzles</title>
		<link>http://www.bigredbits.com/archives/678</link>
		<comments>http://www.bigredbits.com/archives/678#comments</comments>
		<pubDate>Thu, 29 Sep 2011 06:12:42 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[puzzles]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[algorithms]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=678</guid>
		<description><![CDATA[Here are three puzzles I got from 3 different people recently. The first I got from Charles, who got it from a problem set of the Brazilian Programming Competition. Puzzle #1: Given and a vector of in-degrees and out-degrees and for , find if there is a simple directed graph on nodes with those in [...]]]></description>
			<content:encoded><![CDATA[<p>Here are three puzzles I got from 3 different people recently. The first I got from Charles, who got it from a problem set of the Brazilian Programming Competition.</p>
<blockquote><p><strong>Puzzle #1:</strong> Given <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> and a vector of in-degrees and out-degrees <img src='http://s.wordpress.com/latex.php?latex=%5Ctext%7Bin%7D_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\text{in}_i' title='\text{in}_i' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%5Ctext%7Bout%7D_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\text{out}_i' title='\text{out}_i' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=i%3D1..n&#038;bg=T&#038;fg=000000&#038;s=0' alt='i=1..n' title='i=1..n' class='latex' />, find if there is a simple directed graph on <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> nodes with those in and out-degrees in time <img src='http://s.wordpress.com/latex.php?latex=O%28n%20%5Clog%20n%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(n \log n)' title='O(n \log n)' class='latex' />. By a simple directed graph, I mean at most one edge between each pair <img src='http://s.wordpress.com/latex.php?latex=%28i%2Cj%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(i,j)' title='(i,j)' class='latex' />, allowing self loops <img src='http://s.wordpress.com/latex.php?latex=%28i%2Ci%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(i,i)' title='(i,i)' class='latex' />.</p></blockquote>
<p>Solving this problem in time <img src='http://s.wordpress.com/latex.php?latex=O%28n%5E3%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(n^3)' title='O(n^3)' class='latex' /> is easy using a max-flow computation &#8211; simply consider a bipartite graph with and edge <img src='http://s.wordpress.com/latex.php?latex=%28i%2Cj%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(i,j)' title='(i,j)' class='latex' /> between each pair with capacity <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />. Add a source and connect to each node in the left size with capacity <img src='http://s.wordpress.com/latex.php?latex=%5Ctext%7Bin%7D_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\text{in}_i' title='\text{in}_i' class='latex' /> and add also a sink in the natural way, compute max-flow and check if it is <img src='http://s.wordpress.com/latex.php?latex=%5Csum_i%20%5Ctext%7Bin%7D_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\sum_i \text{in}_i' title='\sum_i \text{in}_i' class='latex' />. But it turns out we can do it in a lot more efficient way. The solution I thought works in time <img src='http://s.wordpress.com/latex.php?latex=O%28n%20%5Clog%20n%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(n \log n)' title='O(n \log n)' class='latex' /> but maybe there is a linear solution out there. If you know of one, I am curious.</p>
<p>The second puzzle was given to me by Hyung-Chan An:</p>
<blockquote><p><strong>Puzzle #2:</strong> There is grid of size <img src='http://s.wordpress.com/latex.php?latex=n%20%5Ctimes%20m&#038;bg=T&#038;fg=000000&#038;s=0' alt='n \times m' title='n \times m' class='latex' /> formed by rigid bars. Some cells of the grid have a rigid bar in the diagonal, making that whole square rigid. The question is to decide, given a grid and the location of the diagonal bars if the entire structure is rigid or not. By rigid I mean, being able to be deformed.</p></blockquote>
<p style="text-align: left;"><a rel="attachment wp-att-679" href="http://www.bigredbits.com/archives/678/post_fig_1"><img class="aligncenter size-full wp-image-679" title="post_fig_1" src="http://www.bigredbits.com/wp-content/uploads/2011/09/post_fig_1.png" alt="" width="544" height="186" /></a></p>
<p style="text-align: left;">We thought right away in a linear algebraic formulation: look at each node and create a variable for each of the 4 angles around it. Now, write linear equations saying that some variables sum to 360, since they are around one node. Equations saying that some variable must be 90 (because it is in a rigid cell). Now, for the variables internal to each square, write that opposite angles must be equal (since all the edges are of equal length) and then you have a linear system of type <img src='http://s.wordpress.com/latex.php?latex=Ax%20%3D%20b&#038;bg=T&#038;fg=000000&#038;s=0' alt='Ax = b' title='Ax = b' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> are the variables (angles). Now, we need to check if this system admits more then one solution. We know a trivial solution to it, which is all variable is 90. So, we just need to check if the matrix <img src='http://s.wordpress.com/latex.php?latex=A&#038;bg=T&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> has full rank.</p>
<p style="text-align: left;">It turns out this problem has a much more beautiful and elegant solution and it is totally combinatorial &#8211; it is based on verifying that a certain bipartite graph is connected. You can read more about this solution in <a href="http://www.jstor.org/pss/2100966">Bracing rectangular frameworks. I</a> by (Bolker and Crapo 1979). A cute idea is to use the the following more general linear system (which works for rigidity in any number of dimensions). Consider a rigid bar from point <img src='http://s.wordpress.com/latex.php?latex=p_a%20%5Cin%20%5Cmathbb%7BR%7D%5En&#038;bg=T&#038;fg=000000&#038;s=0' alt='p_a \in \mathbb{R}^n' title='p_a \in \mathbb{R}^n' class='latex' /> to point <img src='http://s.wordpress.com/latex.php?latex=p_b%20%5Cin%20%5Cmathbb%7BR%7D%5En&#038;bg=T&#038;fg=000000&#038;s=0' alt='p_b \in \mathbb{R}^n' title='p_b \in \mathbb{R}^n' class='latex' />. If the structure is not rigid, then there is a movement it can make: let <img src='http://s.wordpress.com/latex.php?latex=v_a&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_a' title='v_a' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=v_b&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_b' title='v_b' class='latex' /> be the instantaneous velocities of points <img src='http://s.wordpress.com/latex.php?latex=a&#038;bg=T&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=b&#038;bg=T&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />. If <img src='http://s.wordpress.com/latex.php?latex=p_a%28t%29%2C%20p_b%28t%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='p_a(t), p_b(t)' title='p_a(t), p_b(t)' class='latex' /> are the movements of points <img src='http://s.wordpress.com/latex.php?latex=a%2Cb&#038;bg=T&#038;fg=000000&#038;s=0' alt='a,b' title='a,b' class='latex' />, then it must hold that: <img src='http://s.wordpress.com/latex.php?latex=%5CVert%20p_a%28t%29%20-%20p_b%28t%29%20%5CVert%20%3D%20%5CVert%20p_a%280%29%20-%20p_b%280%29%20%5CVert&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Vert p_a(t) - p_b(t) \Vert = \Vert p_a(0) - p_b(0) \Vert' title='\Vert p_a(t) - p_b(t) \Vert = \Vert p_a(0) - p_b(0) \Vert' class='latex' />, so taking derivatives we have:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%28p_a%280%29%20-%20p_b%280%29%29%20%5Ccdot%20%28v_a%20-%20v_b%29%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='(p_a(0) - p_b(0)) \cdot (v_a - v_b) = 0' title='(p_a(0) - p_b(0)) \cdot (v_a - v_b) = 0' class='latex' /></p>
<p style="text-align: left;">This is a linear system in the velocities. Now, our job is to check if there are non zero velocities, which again is to check that the matrix of the linear system is or is not full-rank.  An interesting thing is that if we look at this question for the grid above, this matrix will be the matrix of a combinatorial problem! So we can simply check if it has full rank by solving the combinatorial problem. Look at the paper for more details.</p>
<p style="text-align: left;">The third puzzle I found in the amazing website called <a href="http://www.cs.cmu.edu/puzzle/">The Puzzle Toad</a>, which is CMU&#8217;s puzzle website:</p>
<blockquote>
<p style="text-align: left;"><strong>Puzzle #3:</strong> There is a game played between Arthur and Merlin. There is a table with <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> lamps disposed in a circle, initially some are on and some are off. In each timestep, Arthur writes down the position of the lamps that are off. Then Merlin (in an adversarial way) rotates the table. The Arthur&#8217;s servant goes and flips (on &#8211;&gt; off, off &#8211;&gt; on) the lamps whose position Arthur wrote down (notice now he won&#8217;t be flipping the correct lamps, since Merlin rotated the table. if Arthur wrote lamp 1 and Merlin rotated the table by 3 positions, the servant will actually be flipping lamp 4. The question is: given <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> and an initial position of the table, is there a strategy for Merlin  such that Arthur never manages to turn all the lamps on.</p>
</blockquote>
<p style="text-align: left;">See <a href="http://www.cs.cmu.edu/puzzle/puzzle2.html">here</a> for a better description and a link to the solution. For <img src='http://s.wordpress.com/latex.php?latex=n%20%3D%202%5Ek&#038;bg=T&#038;fg=000000&#038;s=0' alt='n = 2^k' title='n = 2^k' class='latex' /> no matter what Merlin does, Arthur always manages to turn on all the lamps eventually, where eventually means in <img src='http://s.wordpress.com/latex.php?latex=O%28n%5E2%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(n^2)' title='O(n^2)' class='latex' /> time. The solution is a very pretty (and simple) algebraic argument. I found this problem really nice.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/678/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bayesian updates and the Lake Wobegon effect</title>
		<link>http://www.bigredbits.com/archives/643</link>
		<comments>http://www.bigredbits.com/archives/643#comments</comments>
		<pubDate>Mon, 26 Sep 2011 01:48:27 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=643</guid>
		<description><![CDATA[We seem to have a good mathematical understanding of Bayesian updates, but somehow a very poor understanding of its practical implications. There are many situations in practice that we easily perceive as irrational, one of the most famous is the so called Lake Wobegon effect, named after the fictional town in Minnesota, where &#8220;all the women are [...]]]></description>
			<content:encoded><![CDATA[<p>We seem to have a good mathematical understanding of Bayesian updates, but somehow a very poor understanding of its practical implications. There are many situations in practice that we easily perceive as irrational, one of the most famous is the so called <a href="http://en.wikipedia.org/wiki/Illusory_superiority">Lake Wobegon effect</a>, named after the <a href="http://en.wikipedia.org/wiki/Lake_Wobegon">fictional town in Minnesota</a>, where &#8220;all the women are strong, all the men are good looking, and all the children are above average&#8221;. It is described as a cognitive bias where individuals tend to overestimate their own capabilities. In fact, when drivers are asked to rate their own skilled compared to the average in three groups: low-skilled, medium-skilled and high-skilled, most rate themselves above the average.</p>
<p>In fact, the behavioral economics literate is full of examples like this where the observed data is far from what you would expect to observe if all agents were rational &#8211; and those are normally attributed to cognitive biases. I was always a bit suspicious of such arguments: it was never clear if agents were simply not being rational or whether their true objective wasn&#8217;t being captured by the model. I always thought the second was a lot more likely.</p>
<p>One of the main problems of the irrationality argument is that they ignore the fact that agents live in a world where its states are not completely observed. In a beautiful paper in Econometrica called &#8220;<a href="http://www2.um.edu.uy/dubraj/documentos/Apparentfinal.pdf">Apparent Overconfidence</a>&#8220;, Benoit and Dubra argue that:</p>
<blockquote><p>&#8220;But the simple truism that most people cannot be better than the median does not imply that most people cannot rationally rate themselves above the median.&#8221;</p></blockquote>
<p>The authors show that it is possible to reverse engineer a signaling scheme such that the data is mostly consistent with the observation. Let me try to give a simple example they give in the introduction: consider that each driver has one of three types of skill: low, medium or high: <img src='http://s.wordpress.com/latex.php?latex=%5C%7BL%2CM%2CH%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\{L,M,H\}' title='\{L,M,H\}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28L%29%20%3D%20%5Cmathbb%7BP%7D%28M%29%20%3D%20%5Cmathbb%7BP%7D%28H%29%20%3D%20%5Cfrac%7B1%7D%7B3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(L) = \mathbb{P}(M) = \mathbb{P}(H) = \frac{1}{3}' title='\mathbb{P}(L) = \mathbb{P}(M) = \mathbb{P}(H) = \frac{1}{3}' class='latex' />. However, they can&#8217;t observe this. They can only observe some sample of their driving. Let&#8217;s say for simplicity that they can observe a signal <img src='http://s.wordpress.com/latex.php?latex=A&#038;bg=T&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> that says if they caused an accident or not. Assume also that the larger that skill of a driver, the higher it is his probability of causing an accident, say:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28A%20%5Cvert%20L%29%20%3D%20%5Cfrac%7B47%7D%7B80%7D%2C%20%5Cmathbb%7BP%7D%28A%20%5Cvert%20L%29%20%3D%20%5Cfrac%7B9%7D%7B16%7D%2C%20%5Cmathbb%7BP%7D%28A%20%5Cvert%20L%29%20%3D%20%5Cfrac%7B1%7D%7B20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(A \vert L) = \frac{47}{80}, \mathbb{P}(A \vert L) = \frac{9}{16}, \mathbb{P}(A \vert L) = \frac{1}{20}' title='\mathbb{P}(A \vert L) = \frac{47}{80}, \mathbb{P}(A \vert L) = \frac{9}{16}, \mathbb{P}(A \vert L) = \frac{1}{20}' class='latex' /></p>
<p style="text-align: left;">Before observing <img src='http://s.wordpress.com/latex.php?latex=A&#038;bg=T&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> each driver things of himself as having probability $\frac{1}{3}$ of having each type of skill. Now, after observing <img src='http://s.wordpress.com/latex.php?latex=A&#038;bg=T&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' />, they update their belief according to Bayes rule, i.e.,</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28s%20%5Cvert%20A%29%20%3D%20%5Cfrac%7B%20%5Cmathbb%7BP%7D%28A%20%5Cvert%20s%29%20%20%20%5Cmathbb%7BP%7D%28s%29%20%20%7D%7B%20%5Csum_%7Bs%27%7D%20%20%5Cmathbb%7BP%7D%28A%20%5Cvert%20s%27%29%20%20%20%5Cmathbb%7BP%7D%28s%27%29%20%20%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(s \vert A) = \frac{ \mathbb{P}(A \vert s)   \mathbb{P}(s)  }{ \sum_{s&#039;}  \mathbb{P}(A \vert s&#039;)   \mathbb{P}(s&#039;)  } ' title='\mathbb{P}(s \vert A) = \frac{ \mathbb{P}(A \vert s)   \mathbb{P}(s)  }{ \sum_{s&#039;}  \mathbb{P}(A \vert s&#039;)   \mathbb{P}(s&#039;)  } ' class='latex' /></p>
<p style="text-align: left;">doing the calculations, we have that <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28A%29%20%3D%20%5Cfrac%7B2%7D%7B5%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(A) = \frac{2}{5}' title='\mathbb{P}(A) = \frac{2}{5}' class='latex' /> and for the <img src='http://s.wordpress.com/latex.php?latex=%5Cfrac%7B3%7D%7B5%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\frac{3}{5}' title='\frac{3}{5}' class='latex' /> of the drivers that didn&#8217;t suffer an accident, they&#8217;ll evaluate <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28L%20%5Cvert%20%5Cneg%20A%29%20%3D%20%5Cfrac%7B11%7D%7B48%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(L \vert \neg A) = \frac{11}{48}' title='\mathbb{P}(L \vert \neg A) = \frac{11}{48}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28M%20%5Cvert%20%5Cneg%20A%29%20%3D%20%5Cfrac%7B35%7D%7B144%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(M \vert \neg A) = \frac{35}{144}' title='\mathbb{P}(M \vert \neg A) = \frac{35}{144}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28H%20%5Cvert%20%5Cneg%20A%29%20%3D%20%5Cfrac%7B19%7D%7B36%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(H \vert \neg A) = \frac{19}{36}' title='\mathbb{P}(H \vert \neg A) = \frac{19}{36}' class='latex' />, so:</p>
<div id="_mcePaste" style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%28H%20%5Cvert%20%5Cneg%20A%29%20%3E%20%5Cmathbb%7BP%7D%28L%20%5Ccup%20M%20%5Cvert%20%5Cneg%20A%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}(H \vert \neg A) &gt; \mathbb{P}(L \cup M \vert \neg A)' title='\mathbb{P}(H \vert \neg A) &gt; \mathbb{P}(L \cup M \vert \neg A)' class='latex' /></div>
<p>and therefore will report high-skill. Notice this is totally consistent with rational Bayesian-updaters. The main question in the paper is: &#8220;when it is possible to reverse engineer a signaling scheme ?&#8221;. More formally, let <img src='http://s.wordpress.com/latex.php?latex=%5CTheta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Theta' title='\Theta' class='latex' /> be a set of types of users and let <img src='http://s.wordpress.com/latex.php?latex=%5Ctheta%20%5Csim%20H%20%5Cin%20%5CDelta%28%5CTheta%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\theta \sim H \in \Delta(\Theta)' title='\theta \sim H \in \Delta(\Theta)' class='latex' />, i.e., <img src='http://s.wordpress.com/latex.php?latex=H&#038;bg=T&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> is a distribution on the types which is common knowledge. Now, if we ask agents to report their type, their report is some <img src='http://s.wordpress.com/latex.php?latex=H%27%20%5Cin%20%5CDelta%28%5CTheta%29%2C%20H%27%20%5Cneq%20H&#038;bg=T&#038;fg=000000&#038;s=0' alt='H&#039; \in \Delta(\Theta), H&#039; \neq H' title='H&#039; \in \Delta(\Theta), H&#039; \neq H' class='latex' />. Is there a signaling scheme <img src='http://s.wordpress.com/latex.php?latex=S&#038;bg=T&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> which can be interpreted as a random variable correlated with <img src='http://s.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\theta' title='\theta' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=H%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='H&#039;' title='H&#039;' class='latex' /> is the distribution rational Bayesian updaters would report based on what they observed from <img src='http://s.wordpress.com/latex.php?latex=S&#038;bg=T&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> ? The authors give necessary and sufficient condition on when this is possible given <img src='http://s.wordpress.com/latex.php?latex=H%2CH%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='H,H&#039;' title='H,H&#039;' class='latex' />.</p>
<p style="text-align: center;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</p>
<p style="text-align: left;">A note also related to the Lake Wobegon effect: I started reading a very nice book by Duncan Watts called &#8220;<a href="http://www.amazon.com/Everything-Obvious-Once-Know-Answer/dp/0385531680/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1317001355&amp;sr=1-1">Everything Is Obvious: *Once You Know the Answer</a>&#8221; about traps of the common-sense. The discussion is different then above, but it also talks about the dangers of applying our usual common sense, which is very useful to our daily life, to scientific results. I highly recommend reading the intro of the book, which is open in Amazon. He gives examples of social phenomena where, once you are told them, you think: &#8220;oh yeah, this is obvious&#8221;. But then if you were told the exact opposite (in fact, he begins the example by telling you the opposite from the observed in data), you&#8217;d also think &#8220;yes, yes, this is obvious&#8221; and come up with very natural explanations. His point is that common sense is very useful to explaining data observations, specially observations of social data. On the other hand, it is performs very poorly on predicting how the data will look like before actually seeing it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/643/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Restaurants in Jerusalem</title>
		<link>http://www.bigredbits.com/archives/599</link>
		<comments>http://www.bigredbits.com/archives/599#comments</comments>
		<pubDate>Tue, 31 May 2011 09:31:50 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=599</guid>
		<description><![CDATA[The Algorithmic Game Theory Semester in Jerusalem has been amazing &#8211; made a lot of good friends and learned many interesting things. Now, ready to go back to the US, I feel I should share a bit about nice places to go in Jerusalem. Together with Vasilis Syrgkanis, I decided to compile a list of [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">The <a href="http://www.cs.huji.ac.il/conferences/agt-semester/people.php">Algorithmic Game Theory Semester</a> in <a href="http://en.wikipedia.org/wiki/Jerusalem">Jerusalem</a> has been amazing &#8211; made a lot of good friends and learned many interesting things. Now, ready to go back to the US, I feel I should share a bit about nice places to go in Jerusalem. Together with <a href="http://www.cs.cornell.edu/~vasilis/">Vasilis Syrgkanis</a>, I decided to compile a list of good restaurants, cafes, &#8230; specially a list of places open in shabbat (which is quite useful for the visitor).</p>
<p style="text-align: left;"><strong>My Favourite Restaurants</strong></p>
<ul>
<li><a href="http://www.2eat.co.il/eng/machne-yuda/">Machne-yuda</a> : Next to the <a href="http://en.wikipedia.org/wiki/Mahane_Yehuda">Mahane Yehuda market</a>, the restaurant menu changes everyday (with a couple of dishes which are always there). The food is whatever is in that day in the market. Everything was perfect both times I went. I strongly recommend the polenta and the black risotto. We took almost 1 month to make a reservation and eventually we got one. Eventually we learned a nice trick &#8211; if you arrive there very late (say aroud midnight), it is not impossible to get a table (we got one in the bar right away, arriving at 11:30pm). Elisa Celis suggested this place, and I think Omer Reingold suggested it to her.</li>
<li><a href="http://www.chakra-rest.com/AboutEng.aspx">Chakra</a> : There is a tasting menu that is amazing. It is around 160 NIS, but worth every cent. It started with chicken livers, salads, calamari, ceviche, shrimp (the shrimp was awesome), fish kebab and mussels. After the fish part ended, they brought us many meat dishes, like a beef strogonoff (I am still puzzled by how soft the meat was), lamb chops and beef kebab. It ended with a simple, yet perfect, chocolate dessert and ice-cream with tahina. Thanks to Shahar Dobzinski and Sigal Oren for the suggestion.</li>
<li>Carousella: It is a small restaurant in the corner of the street I used to live in Jerusalem (it is located in the corner of Azza and Metudela in Rehavia). It is a vegetarian French/Israeli restaurant and has my favorite shakshouka in town &#8211; it is actually a mix o <a href="http://en.wikipedia.org/wiki/Shakshouka">shakshouka</a> and <a href="http://en.wikipedia.org/wiki/Ratatouille">ratatouille</a>. I also like very much their risotto. We used to have breakfast/brunch there quite a lot &#8211; usually getting either the shakshouka or the musli. The house cake is definitely recommended too&#8230; It is my favourite place to work (they have wifi and a nice record player and a huge collection of records). Thanks to Omer Tamuz for this suggestion.</li>
</ul>
<p><strong>Hummus Places</strong></p>
<ul>
<li>Lina: It is known as the best hummus in the city (some claim the best hummus in the world). It is located in the Via Dolorosa and not hard to find. Arriving in the Via Dolorosa, you can just ask directions and everyone knows. Everything is recommended.</li>
<li><a href="http://www.gojerusalem.com/discover/item_563/From-Gaza-to-Berlin">From Gaza to Berlin</a>: is a small and super cheap restaurant in Rehavia &#8211; their hummus is very tasty (specially the hummus with meat), the kubbeh soup is very nice and their falafel is famous around here.</li>
<li><a href="http://www.gojerusalem.com/discover/item_448/Marvad-Haksamim-Emek-Refaim">Marvad Haksamim</a> (The Magic Carpet): a very nice arabic place &#8211; great bread, hummus, &#8230;  The soups are very good (I tried the lentil and the kubeh soup) and the meat is usually good as well. I tried the chopped livers and the kebab. The Mixed Jerusalem Grill is a famous plate, I guess.</li>
</ul>
<p><strong>Places to eat/work on shabbat:</strong> on Saturday and Friday night most of the things in the city are closed, so it is good to know some places to go:</p>
<ul>
<li><a href="http://maps.google.com/maps/place?cid=10965397664033307606&amp;q=restobar&amp;hl=en&amp;sll=32.048018,34.937554&amp;sspn=0.546879,0.564476&amp;ie=UTF8&amp;ll=32.560704,33.700562&amp;spn=0,0&amp;z=9">Restobar</a>: good food and generally open</li>
<li><a href="http://maps.google.com/maps/place?cid=7169989799078614900&amp;q=zuni&amp;hl=en&amp;sll=31.782098,35.222784&amp;sspn=0.015039,0.010627&amp;ie=UTF8&amp;ll=31.790163,35.203457&amp;spn=0,0&amp;z=15">Zuni</a>: very nice and open 24 hours a day, 7 days a week. Has every type of meal (brunch, lunch, coffee, drinks), served all the time. Good for working too&#8230;</li>
<li><a href="http://www.monas.co.il/en.html">Mona</a>: a very cool American/Israeli restaurant inside a former art school. Food is very good and has also a very nice bar.</li>
<li><a href="http://maps.google.com/maps/place?cid=17307252824416707487&amp;q=spaghettim&amp;hl=en&amp;sll=31.779659,35.220148&amp;sspn=1.053374,0.895184&amp;ie=UTF8&amp;ll=32.805745,32.750244&amp;spn=0,0&amp;z=8">Spaguettim</a>: nice location, wifi, good coffee and snacks. I never tried the food, though.</li>
<li>many places on Hillel Street: for example the <a href="http://www.gojerusalem.com/discover/item_504/Iwo-Meat-Burger-">Iwo Meat Burger</a> (a pretty good non-kosher burger place), the pizza place next to it and a couple of bars nearby</li>
</ul>
<p><strong>In the old city</strong></p>
<ul>
<li>Cheese pie: I don&#8217;t know how to give an exact location, but this should help you: very near to the Church of the Holy Sepulcher in the Old City, you can find the Coptic Patriarchate. Try to locate the stairs leading to the Patriarchate and before going up, there is a place that sells only one thing: a cheese pie. There is a big marble table and a man with a bowl of dough, some cheese and syrup. He prepares everything in front of you and then puts it in the oven. It takes around 20 minutes. It is very impressive.</li>
<li><a href="http://www.gojerusalem.com/discover/item_61/Armenian-Tavern">Armenian Tavern</a>: a very beautiful place hidden in the old city. Food is nice  (the lemonade is very refreshing and tastes great) but the better thing is the feeling that you are dining inside a museum.</li>
</ul>
<p><strong>Hotel Bars and Cafes</strong></p>
<ul>
<li><a href="http://www.mamillahotel.com/">Mamilla Terrace</a>: the restaurant in the terrace of Mamilla Hotel is pretty nice and has a good view of the old city. It is a good place to go for drinks as well. It is closed (I think) on shabbat.</li>
<li><a href="http://www.notredamecenter.org/index.phtml">Notre Dame Terrace</a>: the Notre Dame hotel has also a great view to the Temple Mount and pretty nice food. It is very relaxing to sit there and look the old city. There is another restaurant in the hotel called La Rotisserie, which I very much appreciated. It is also open on shabbat.</li>
</ul>
<p><strong>In Tel Aviv</strong></p>
<ul>
<li><a href="http://www.telavivguide.net/Restaurants/Reasonable_Restaurants/Raphael_Restau-Bistro_2005091122/">Raphael</a>: inside the Dan Hotel and overlooking the sea. Most of the times I went to Tel Aviv I ended up eating there.</li>
<li><a href="http://maps.google.com/maps/place?cid=13539005368885765189&amp;q=tel+aviv+boya&amp;hl=en&amp;sll=32.097239,34.775617&amp;sspn=0.06838,0.006295&amp;ie=UTF8&amp;ll=32.140008,34.698429&amp;spn=0,0&amp;z=13">Boya</a>: Thanks to Lior Seeman for this suggestion. We had many small dishes (something like tapas) and all were amazing. I think it was my favourite place in Tel Aviv.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/599/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MHR, Regular Distributions and Myerson&#8217;s Lemma</title>
		<link>http://www.bigredbits.com/archives/539</link>
		<comments>http://www.bigredbits.com/archives/539#comments</comments>
		<pubDate>Mon, 30 May 2011 10:46:08 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[probability]]></category>
		<category><![CDATA[profit maximization]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=539</guid>
		<description><![CDATA[Monotone Hazard Rate (MHR) distributions and its superclass regular distributions keep appearing in the Mechanism Design literature and this is due to a very good reason: they are the class of distributions for which Myerson&#8217;s Optimal Auction is simple and natural. Let&#8217;s brief discuss some properties of those distributions. First, two definitions: Hazard rate of [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Monotone Hazard Rate</strong> (MHR) distributions and its superclass <strong>regular distributions</strong> keep appearing in the Mechanism Design literature and this is due to a very good reason: they are the class of distributions for which <a href="http://www.econ.yale.edu/~dirkb/teach/521b-08-09/reading/1981%20optimal%20auction.pdf">Myerson&#8217;s Optimal Auction</a> is simple and natural. Let&#8217;s brief discuss some properties of those distributions. First, two definitions:</p>
<ol>
<li>Hazard rate of a distribution <img src='http://s.wordpress.com/latex.php?latex=f&#038;bg=T&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> : <img src='http://s.wordpress.com/latex.php?latex=h%28z%29%20%3D%20%5Cfrac%7Bf%28z%29%7D%7B1-F%28z%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='h(z) = \frac{f(z)}{1-F(z)}' title='h(z) = \frac{f(z)}{1-F(z)}' class='latex' /></li>
<li>Myerson virtual value of a distribution <img src='http://s.wordpress.com/latex.php?latex=f&#038;bg=T&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> : <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%3D%20z%20-%20%5Cfrac%7B1-F%28z%29%7D%7Bf%28z%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) = z - \frac{1-F(z)}{f(z)}' title='\phi(z) = z - \frac{1-F(z)}{f(z)}' class='latex' /></li>
</ol>
<p>We can interpret the hazard rate in the following way: think of <img src='http://s.wordpress.com/latex.php?latex=T%20%5Csim%20f&#038;bg=T&#038;fg=000000&#038;s=0' alt='T \sim f' title='T \sim f' class='latex' /> as a random variable that indicates the time that a light bulb will take to extinguish. If we are in time <img src='http://s.wordpress.com/latex.php?latex=t&#038;bg=T&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' /> and the light bulb hasn&#8217;t extinguished so far, what is the probability it will extinguish in the next <img src='http://s.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=T&#038;fg=000000&#038;s=0' alt='\delta' title='\delta' class='latex' /> time:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%5BT%20%5Cleq%20t%2B%5Cdelta%20%5Cvert%20T%20%3E%20t%5D%20%5Capprox%20%5Cfrac%7Bf%28t%29%20%5Cdelta%7D%7B1-F%28t%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}[T \leq t+\delta \vert T &gt; t] \approx \frac{f(t) \delta}{1-F(t)}' title='\mathbb{P}[T \leq t+\delta \vert T &gt; t] \approx \frac{f(t) \delta}{1-F(t)}' class='latex' /></p>
<p style="text-align: left;">We say that a distribution is monotone hazard rate, if <img src='http://s.wordpress.com/latex.php?latex=h%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='h(z)' title='h(z)' class='latex' /> is non-decreasing. This is very natural for light bulbs, for example. Many of the distributions that we are used to are MHR, for example, uniform, exponential and normal. The way that I like to think about MHR distributions is the following: if some distribution has hazard rate <img src='http://s.wordpress.com/latex.php?latex=h%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='h(z)' title='h(z)' class='latex' />, then it means that <img src='http://s.wordpress.com/latex.php?latex=F%27%28z%29%20%3D%20%281-F%28z%29%29%20h%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='F&#039;(z) = (1-F(z)) h(z)' title='F&#039;(z) = (1-F(z)) h(z)' class='latex' />. If we define <img src='http://s.wordpress.com/latex.php?latex=G%28z%29%20%3D%201-F%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='G(z) = 1-F(z)' title='G(z) = 1-F(z)' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%28log%20G%28z%29%29%27%20%3D%20%5Cfrac%7BG%27%28z%29%7D%7BG%28z%29%7D%20%3D%20-h%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(log G(z))&#039; = \frac{G&#039;(z)}{G(z)} = -h(z)' title='(log G(z))&#039; = \frac{G&#039;(z)}{G(z)} = -h(z)' class='latex' />, so:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=F%28z%29%20%3D%201-%5Ctext%7Bexp%7D%28-%5Cint_0%5Ez%20h%28u%29%20du%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='F(z) = 1-\text{exp}(-\int_0^z h(u) du)' title='F(z) = 1-\text{exp}(-\int_0^z h(u) du)' class='latex' /></p>
<p style="text-align: left;">From this characterization, it is simple to see that the extremal distributions for this class, i.e. the distributions that are in the edge of being MHR and non-MHR are constant hazard rate, which correspond to the exponential distribution <img src='http://s.wordpress.com/latex.php?latex=F%28z%29%20%3D%201-e%5E%7B-%5Clambda%20z%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='F(z) = 1-e^{-\lambda z}' title='F(z) = 1-e^{-\lambda z}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=z%20%5Cin%20%5B0%2C%5Cinfty%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \in [0,\infty)' title='z \in [0,\infty)' class='latex' />. They way I like to think about those distributions is that whenever you are able to prove something about the exponential distribution, then you can prove a similar statement about MHR distributions. Consider those three examples:</p>
<p style="text-align: left;"><strong>Example 1: </strong><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%5B%5Cphi%28z%29%20%5Cgeq%200%5D%20%5Cgeq%20%5Cfrac%7B1%7D%7Be%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}[\phi(z) \geq 0] \geq \frac{1}{e}' title='\mathbb{P}[\phi(z) \geq 0] \geq \frac{1}{e}' class='latex' /> for MHR distributions. This fact is straightforward for the exponential distribution. For the exponential distribution <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%3D%20z-%5Clambda%5E%7B-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) = z-\lambda^{-1}' title='\phi(z) = z-\lambda^{-1}' class='latex' /> and therefore</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BP%7D%5B%5Cphi%28z%29%20%5Cgeq%200%5D%20%5Cgeq%20%5Cmathbb%7BP%7D%5Bz%20%3E%20%5Clambda%5E%7B-1%7D%5D%20%3D%201-F%28%5Clambda%5E%7B-1%7D%29%20%3D%20e%5E%7B-1%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{P}[\phi(z) \geq 0] \geq \mathbb{P}[z &gt; \lambda^{-1}] = 1-F(\lambda^{-1}) = e^{-1} ' title='\mathbb{P}[\phi(z) \geq 0] \geq \mathbb{P}[z &gt; \lambda^{-1}] = 1-F(\lambda^{-1}) = e^{-1} ' class='latex' /></p>
<p style="text-align: left;">but the proof for MHR is equally simple: Let <img src='http://s.wordpress.com/latex.php?latex=r%20%3D%20%5Cinf%20%5C%7Bz%3B%20%5Cphi%28z%29%20%5Cgeq%200%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='r = \inf \{z; \phi(z) \geq 0\}' title='r = \inf \{z; \phi(z) \geq 0\}' class='latex' />, therefore <img src='http://s.wordpress.com/latex.php?latex=r%20h%28r%29%20%5Cleq%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='r h(r) \leq 1' title='r h(r) \leq 1' class='latex' />.<br />
<img src='http://s.wordpress.com/latex.php?latex=P%5C%7B%20%5Cphi%28v%29%20%5Cgeq%200%5C%7D%20%3D%20P%5C%7B%20v%20%5Cgeq%20r%20%5C%7D%20%3D%201%20-%20F%28r%29%20%3D%20e%5E%7B-%5Cint_0%5Er%20h%28u%29%20du%7D%20%5Cgeq%20e%5E%7B-r%20h%28r%29%7D%20%5Cgeq%20e%5E%7B-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='P\{ \phi(v) \geq 0\} = P\{ v \geq r \} = 1 - F(r) = e^{-\int_0^r h(u) du} \geq e^{-r h(r)} \geq e^{-1}' title='P\{ \phi(v) \geq 0\} = P\{ v \geq r \} = 1 - F(r) = e^{-\int_0^r h(u) du} \geq e^{-r h(r)} \geq e^{-1}' class='latex' /></p>
<p><strong>Example 2</strong>: Given <img src='http://s.wordpress.com/latex.php?latex=z_1%2C%20z_2%20%5Csim%20f&#038;bg=T&#038;fg=000000&#038;s=0' alt='z_1, z_2 \sim f' title='z_1, z_2 \sim f' class='latex' /> iid where <img src='http://s.wordpress.com/latex.php?latex=f&#038;bg=T&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is MHR and <img src='http://s.wordpress.com/latex.php?latex=v_1%20%3D%20%5Cmax%20%5C%7Bz_1%2C%20z_2%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_1 = \max \{z_1, z_2\}' title='v_1 = \max \{z_1, z_2\}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=v_2%20%3D%20%5Cmin%20%5C%7Bz_1%2C%20z_2%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_2 = \min \{z_1, z_2\}' title='v_2 = \min \{z_1, z_2\}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bv_2%5D%20%5Cgeq%20%5Cfrac%7B1%7D%7B3%7D%20%5Cmathbb%7BE%7D%5Bv_1%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[v_2] \geq \frac{1}{3} \mathbb{E}[v_1]' title='\mathbb{E}[v_2] \geq \frac{1}{3} \mathbb{E}[v_1]' class='latex' />. The proof for the exponential distribution is trivial, and in fact, this is tight for the exponential, the trick is to use the convexity of <img src='http://s.wordpress.com/latex.php?latex=z%20%5Cmapsto%20%5Cint_0%5Ez%20h%28u%29%20du&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \mapsto \int_0^z h(u) du' title='z \mapsto \int_0^z h(u) du' class='latex' />. We use that <img src='http://s.wordpress.com/latex.php?latex=%5Cint_0%5E%7B2z%7D%20h%20%5Cgeq%202%20%5Cint_0%5Ez%20h&#038;bg=T&#038;fg=000000&#038;s=0' alt='\int_0^{2z} h \geq 2 \int_0^z h' title='\int_0^{2z} h \geq 2 \int_0^z h' class='latex' /> in the following way:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%20%5Bv_2%5D%20%3D%20%5Cint_0%5E%5Cinfty%20%281%20-%20F%28z%29%29%5E2%20dz%20%3D%20%5Cint_0%5E%5Cinfty%20e%5E%7B-2%20%5Cint_0%5Ez%20h%7D%20dz%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E} [v_2] = \int_0^\infty (1 - F(z))^2 dz = \int_0^\infty e^{-2 \int_0^z h} dz ' title='\mathbb{E} [v_2] = \int_0^\infty (1 - F(z))^2 dz = \int_0^\infty e^{-2 \int_0^z h} dz ' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cgeq%20%5Cint_0%5E%5Cinfty%20e%5E%7B-%5Cint_0%5E%7B2z%7D%20h%7D%20dz%3D%20%5Cfrac%7B1%7D%7B2%7D%20%5Cint_0%5E%5Cinfty%201%20-%20F%28z%29%20dz%20%3D%20%5Cfrac%7B1%7D%7B2%7D%20%5Cmathbb%7BE%7D%20%5Bz%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\geq \int_0^\infty e^{-\int_0^{2z} h} dz= \frac{1}{2} \int_0^\infty 1 - F(z) dz = \frac{1}{2} \mathbb{E} [z]' title='\geq \int_0^\infty e^{-\int_0^{2z} h} dz= \frac{1}{2} \int_0^\infty 1 - F(z) dz = \frac{1}{2} \mathbb{E} [z]' class='latex' /></p>
<p>Since <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%20%5Bv_1%20%2B%20v_2%5D%20%3D%20%5Cmathbb%7BE%7D%20%5Bz_1%20%2B%20z_2%5D%20%3D%202%20%5Cmathbb%7BE%7D%20%5Bz%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E} [v_1 + v_2] = \mathbb{E} [z_1 + z_2] = 2 \mathbb{E} [z]' title='\mathbb{E} [v_1 + v_2] = \mathbb{E} [z_1 + z_2] = 2 \mathbb{E} [z]' class='latex' />, we have that <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bv_1%5D%20%3D%202%20%5Cmathbb%7BE%7D%5Bz%5D%20-%20%5Cmathbb%7BE%7D%5Bv_2%5D%20%5Cleq%20%5Cfrac%7B3%7D%7B2%7D%20%5Cmathbb%7BE%7D%5Bz%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[v_1] = 2 \mathbb{E}[z] - \mathbb{E}[v_2] \leq \frac{3}{2} \mathbb{E}[z]' title='\mathbb{E}[v_1] = 2 \mathbb{E}[z] - \mathbb{E}[v_2] \leq \frac{3}{2} \mathbb{E}[z]' class='latex' />. This way, we get: <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bv_2%5D%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%5Cmathbb%7BE%7D%5Bz%5D%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Ccdot%20%5Cfrac%7B2%7D%7B3%7D%20%5Cmathbb%7BE%7D%5Bv_1%5D%20%3D%20%5Cfrac%7B1%7D%7B3%7D%20%5Cmathbb%7BE%7D%5Bv_1%5D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[v_2] \geq \frac{1}{2}\mathbb{E}[z] \geq \frac{1}{2} \cdot \frac{2}{3} \mathbb{E}[v_1] = \frac{1}{3} \mathbb{E}[v_1] ' title='\mathbb{E}[v_2] \geq \frac{1}{2}\mathbb{E}[z] \geq \frac{1}{2} \cdot \frac{2}{3} \mathbb{E}[v_1] = \frac{1}{3} \mathbb{E}[v_1] ' class='latex' /></p>
<p><strong>Example 3: </strong>For MHR distributions, there is a simple lemma that relates the virtual value and the real value and this lemma is quite useful in various settings: let <img src='http://s.wordpress.com/latex.php?latex=r%20%3D%20%5Cinf%20%5C%7Bz%3B%20%5Cphi%28z%29%20%3E%200%20%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='r = \inf \{z; \phi(z) &gt; 0 \}' title='r = \inf \{z; \phi(z) &gt; 0 \}' class='latex' />, then for <img src='http://s.wordpress.com/latex.php?latex=z%20%5Cgeq%20r&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \geq r' title='z \geq r' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%5Cgeq%20z%20-%20r&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) \geq z - r' title='\phi(z) \geq z - r' class='latex' />. Again, this is tight for exponential distribution. The proof is quite trivial:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=x%20-%20%5Cphi%28x%29%20%3D%20%5Cfrac%7B1-F%28x%29%7D%7Bf%28x%29%7D%20%5Cleq%20%5Cfrac%7B1-F%28r%29%7D%7Bf%28r%29%7D%20%3D%20r&#038;bg=T&#038;fg=000000&#038;s=0' alt='x - \phi(x) = \frac{1-F(x)}{f(x)} \leq \frac{1-F(r)}{f(r)} = r' title='x - \phi(x) = \frac{1-F(x)}{f(x)} \leq \frac{1-F(r)}{f(r)} = r' class='latex' /></p>
<p>Now, MHR distributions are a subclass of regular distributions, which are the distributions for which Myerson&#8217;s virtual value <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z)' title='\phi(z)' class='latex' /> is a monotone function. I usually find harder to think about regular distributions than to think about MHR (in fact, I don&#8217;t know so many examples that are regular, but not MHR. Here is one, though, called the <em>equal-revenue-distribution</em>. Consider <img src='http://s.wordpress.com/latex.php?latex=z%20%5Cin%20%5B1%2C%20%5Cinfty%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \in [1, \infty)' title='z \in [1, \infty)' class='latex' /> distributed according to <img src='http://s.wordpress.com/latex.php?latex=f%28z%29%20%3D%201%2Fz%5E2&#038;bg=T&#038;fg=000000&#038;s=0' alt='f(z) = 1/z^2' title='f(z) = 1/z^2' class='latex' />. The cumulative distribution is given by <img src='http://s.wordpress.com/latex.php?latex=F%28z%29%20%3D%201-1%2Fz&#038;bg=T&#038;fg=000000&#038;s=0' alt='F(z) = 1-1/z' title='F(z) = 1-1/z' class='latex' />. The interesting thing of this distribution is that posted prices get the same revenue regardless of the price. For example, if we post any price <img src='http://s.wordpress.com/latex.php?latex=r%20%5Cin%20%5B1%2C%5Cinfty%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='r \in [1,\infty)' title='r \in [1,\infty)' class='latex' />, then a customer with valuations <img src='http://s.wordpress.com/latex.php?latex=z%20%5Csim%20f&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \sim f' title='z \sim f' class='latex' /> buys the item if <img src='http://s.wordpress.com/latex.php?latex=z%20%3E%20r&#038;bg=T&#038;fg=000000&#038;s=0' alt='z &gt; r' title='z &gt; r' class='latex' /> by price <img src='http://s.wordpress.com/latex.php?latex=r&#038;bg=T&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />, getting  revenue is <img src='http://s.wordpress.com/latex.php?latex=r%20%281-F%28r%29%29%20%3D%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='r (1-F(r)) = 1' title='r (1-F(r)) = 1' class='latex' />. This can be expressed by the fact that <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) = 0' title='\phi(z) = 0' class='latex' />. I was a bit puzzled by this fact, because of Myerson&#8217;s Lemma:</p>
<blockquote>
<p style="text-align: left;"><strong>Myerson Lemma: </strong>If a mechanism sells to some player that has valuation <img src='http://s.wordpress.com/latex.php?latex=v%20%5Csim%20f&#038;bg=T&#038;fg=000000&#038;s=0' alt='v \sim f' title='v \sim f' class='latex' /> with probability <img src='http://s.wordpress.com/latex.php?latex=x%28v%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='x(v)' title='x(v)' class='latex' /> when he has value <img src='http://s.wordpress.com/latex.php?latex=v&#038;bg=T&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' />, then the revenue is <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%20%5Bx%28v%29%20%5Cphi%28v%29%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E} [x(v) \phi(v)]' title='\mathbb{E} [x(v) \phi(v)]' class='latex' />.</p>
</blockquote>
<p style="text-align: left;">And it seemed that the auctioneers was doomed to get zero revenue, since <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) = 0' title='\phi(z) = 0' class='latex' />. For example, suppose we fix some price <img src='http://s.wordpress.com/latex.php?latex=r&#038;bg=T&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and we sell the item if <img src='http://s.wordpress.com/latex.php?latex=v%20%5Cgeq%20r&#038;bg=T&#038;fg=000000&#038;s=0' alt='v \geq r' title='v \geq r' class='latex' /> by price <img src='http://s.wordpress.com/latex.php?latex=r&#038;bg=T&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />. Then it seems that Myerson&#8217;s Lemma should go through by a derivation like that (for this special case, although the general proof is quite similar):</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%20%5Bx%28v%29%20%5Cphi%28v%29%5D%20%3D%20%5Cint_r%5E%5Cinfty%20%5Cphi%28z%29%20f%28z%29%20dz%20%3D%20%5Cint_r%5E%5Cinfty%20z%20f%28z%29%20-%20%281-F%28z%29%29%20dz%20%3D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E} [x(v) \phi(v)] = \int_r^\infty \phi(z) f(z) dz = \int_r^\infty z f(z) - (1-F(z)) dz = ' title='\mathbb{E} [x(v) \phi(v)] = \int_r^\infty \phi(z) f(z) dz = \int_r^\infty z f(z) - (1-F(z)) dz = ' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%3D%20%5Cint_r%5E%5Cinfty%20%5B%20z%20f%28z%29%20-%20%5Cint_z%5E%5Cinfty%20f%28u%29%20du%20%5D%20dz%20%3D%20%5Cint_r%5E%5Cinfty%20z%20f%28z%29%20dz%20-%20%5Cint_r%5E%5Cinfty%20%5Cint_r%5Eu%20f%28u%29%20dz%20du%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='= \int_r^\infty [ z f(z) - \int_z^\infty f(u) du ] dz = \int_r^\infty z f(z) dz - \int_r^\infty \int_r^u f(u) dz du ' title='= \int_r^\infty [ z f(z) - \int_z^\infty f(u) du ] dz = \int_r^\infty z f(z) dz - \int_r^\infty \int_r^u f(u) dz du ' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%3D%20r%20%281-F%28r%29%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='= r (1-F(r)) ' title='= r (1-F(r)) ' class='latex' /></p>
<p style="text-align: left;">but those don&#8217;t seem to match, since one side is zero and the other is 1. The mistake we did above is classic, which is to calculate <img src='http://s.wordpress.com/latex.php?latex=%5Cinfty%20-%20%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='\infty - \infty' title='\infty - \infty' class='latex' />. We wrote:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5B%5Cphi%28v%29%5D%20%3D%20%5Cint_r%5E%5Cinfty%20z%20f%28z%29%20dz%20-%20%5Cint_r%5E%5Cinfty%201-F%28z%29%20dz&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[\phi(v)] = \int_r^\infty z f(z) dz - \int_r^\infty 1-F(z) dz' title='\mathbb{E}[\phi(v)] = \int_r^\infty z f(z) dz - \int_r^\infty 1-F(z) dz' class='latex' /></p>
<p style="text-align: left;">but both are infinity! This made me realize that Myerson&#8217;s Lemma needs the condition that <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bz%5D%20%3C%20%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[z] &lt; \infty' title='\mathbb{E}[z] &lt; \infty' class='latex' />, which is quite a natural a distribution over valuations of a good. So, one of the bugs of the the equal-revenue-distribution is that <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bz%5D%20%3D%20%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[z] = \infty' title='\mathbb{E}[z] = \infty' class='latex' />. A family that is close to this, but doesn&#8217;t suffer this bug is: <img src='http://s.wordpress.com/latex.php?latex=f%28z%29%20%3D%20%5Cfrac%7B%5Calpha-1%7D%7Bz%5E%5Calpha%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='f(z) = \frac{\alpha-1}{z^\alpha}' title='f(z) = \frac{\alpha-1}{z^\alpha}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=z%20%5Cin%20%5B1%2C%5Cinfty%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='z \in [1,\infty)' title='z \in [1,\infty)' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=F%28z%29%20%3D%201%20-%20z%5E%7B1-%5Calpha%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='F(z) = 1 - z^{1-\alpha}' title='F(z) = 1 - z^{1-\alpha}' class='latex' />. For <img src='http://s.wordpress.com/latex.php?latex=%5Calpha%20%3E%202&#038;bg=T&#038;fg=000000&#038;s=0' alt='\alpha &gt; 2' title='\alpha &gt; 2' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%5Bv%5D%20%3C%20%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathbb{E}[v] &lt; \infty' title='\mathbb{E}[v] &lt; \infty' class='latex' />, then we get <img src='http://s.wordpress.com/latex.php?latex=%5Cphi%28z%29%20%3D%20%5Cfrac%7B%5Calpha-2%7D%7B%5Calpha-1%7D%20z&#038;bg=T&#038;fg=000000&#038;s=0' alt='\phi(z) = \frac{\alpha-2}{\alpha-1} z' title='\phi(z) = \frac{\alpha-2}{\alpha-1} z' class='latex' />.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/539/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bimatrix games</title>
		<link>http://www.bigredbits.com/archives/491</link>
		<comments>http://www.bigredbits.com/archives/491#comments</comments>
		<pubDate>Wed, 25 May 2011 16:02:17 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[algorithms]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[linear algebra]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=491</guid>
		<description><![CDATA[This week in Israel is the Workshop on Innovations in Algorithmic Game Theory, which has been a collection of amazing talks and I thought of blogging about something new I learned here. First, Paul Goldberg gave an amazing talk about the Lemke-Howson algorithm and Homotopy Methods. Later, during the poster session Jugal Garg presented the [...]]]></description>
			<content:encoded><![CDATA[<p>This week in Israel is the <a href="http://www.cs.huji.ac.il/conferences/agt-semester/iagt2011/schedule.php">Workshop on Innovations in Algorithmic Game Theory</a>, which has been a collection of amazing talks and I thought of blogging about something new I learned here. First, Paul Goldberg gave an amazing talk about the <a href="http://arxiv.org/abs/1006.5352">Lemke-Howson algorithm and Homotopy Methods</a>. Later, during the poster session Jugal Garg presented the impressive work on an <a href="http://arxiv.org/abs/1010.3083">exact algorithm for finding Nash equilibria in rank 1 bimatrix games</a>. Both have in common the use of homotopy, which I found I quite cool idea &#8211; and has been present in Game Theory for a while, but I didn&#8217;t know about. They also have to do with improving the understanding we have on the <a href="http://www.stanford.edu/~saberi/lecture4.pdf">Lemke-Howson Algorithm</a> &#8211; a (potentially exponential time) algorithm to find Nash equilibrium in 2-person bi-matrix games, but that seems to work pretty well in practice. As always, I though that blogging about it would be a good idea in order to understand those concepts well.</p>
<p><strong>Bimatrix Game and How to compute equilibrium</strong></p>
<p>Bimatrix game is the simplest class of games studied. Basically it is a <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> player game, with <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> strategies for player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=m&#038;bg=T&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> strategies for player 2 which is represented by a pair of <img src='http://s.wordpress.com/latex.php?latex=n%20%5Ctimes%20m&#038;bg=T&#038;fg=000000&#038;s=0' alt='n \times m' title='n \times m' class='latex' /> matrices <img src='http://s.wordpress.com/latex.php?latex=%28A%2CB%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(A,B)' title='(A,B)' class='latex' />. Let <img src='http://s.wordpress.com/latex.php?latex=x%20%5Cin%20%5Cmathbb%7BR%7D_%2B%5En%2C%20%5CVert%20x%20%5CVert_1%20%3D%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='x \in \mathbb{R}_+^n, \Vert x \Vert_1 = 1' title='x \in \mathbb{R}_+^n, \Vert x \Vert_1 = 1' class='latex' /> represent a probability distribution that player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> assigns to his strategies and <img src='http://s.wordpress.com/latex.php?latex=y%20%5Cin%20%5Cmathbb%7BR%7D_%2B%5Em%2C%20%5CVert%20y%20%5CVert_1%20%3D%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='y \in \mathbb{R}_+^m, \Vert y \Vert_1 = 1' title='y \in \mathbb{R}_+^m, \Vert y \Vert_1 = 1' class='latex' /> be the same for player <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' />. This way, the players experience utilities:</p>
<p style="text-align: center;"><img style="display: inline;" title=" u_1 (x,y) = x^t A y,  u_2(x,y) = x^t B y" src="http://www.codecogs.com/gif.latex?%20u_1%20%28x,y%29%20=%20x%5Et%20A%20y,%C2%A0%20u_2%28x,y%29%20=%20x%5Et%20B%20y" alt="[; u_1 (x,y) = x^t A y,  u_2(x,y) = x^t B y;]" /></p>
<p>The best understood class of those games is the one where <img src='http://s.wordpress.com/latex.php?latex=A%2BB%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='A+B = 0' title='A+B = 0' class='latex' />, called zero-sum games. For this class, computing a Nash equilibrium is very easy and it is given by the famous min-max theorem: player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> finds <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> maximizing <img src='http://s.wordpress.com/latex.php?latex=%5Cmin_j%20%28x%5Et%20A%20e_j%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\min_j (x^t A e_j)' title='\min_j (x^t A e_j)' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=e_j&#038;bg=T&#038;fg=000000&#038;s=0' alt='e_j' title='e_j' class='latex' /> is the <img src='http://s.wordpress.com/latex.php?latex=j&#038;bg=T&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' />-th unit vector. Similarly player <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> finds <img src='http://s.wordpress.com/latex.php?latex=y&#038;bg=T&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> maximizing <img src='http://s.wordpress.com/latex.php?latex=%5Cmin_i%20%28e_i%5Et%20B%20y%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\min_i (e_i^t B y)' title='\min_i (e_i^t B y)' class='latex' />. Then the pair of strategies obtained is a Nash equilibrium &#8211; and this verifying that is not hard.</p>
<p>When <img src='http://s.wordpress.com/latex.php?latex=A%2BB%20%5Cneq%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='A+B \neq 0' title='A+B \neq 0' class='latex' />, the problems gets a lot more complicated. Proving that equilibrium exist can be done using various fixed point theorems, as <a href="http://en.wikipedia.org/wiki/Brouwer_fixed_point_theorem">Brouwer</a> or <a href="http://en.wikipedia.org/wiki/Kakutani_theorem">Kakutani</a>. There is a very simple exponential time algorithm for finding it and the key observation is the following, if <img src='http://s.wordpress.com/latex.php?latex=%28x%2Cy%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(x,y)' title='(x,y)' class='latex' /> is a Nash equilibrium then:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cforall%20i%3A%20x_i%20%3D%200%20%5Ctext%7B%20or%20%7D%20%5Cforall%20i%27%3A%20e_i%5Et%20A%20y%20%5Cgeq%20e_%7Bi%27%7D%5Et%20A%20y&#038;bg=T&#038;fg=000000&#038;s=0' alt='\forall i: x_i = 0 \text{ or } \forall i&#039;: e_i^t A y \geq e_{i&#039;}^t A y' title='\forall i: x_i = 0 \text{ or } \forall i&#039;: e_i^t A y \geq e_{i&#039;}^t A y' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cforall%20j%3A%20y_j%20%3D%200%20%5Ctext%7B%20or%20%7D%20%5Cforall%20j%27%3A%20x%5Et%20B%20e_j%20%5Cgeq%20x%5Et%20B%20e_%7Bj%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\forall j: y_j = 0 \text{ or } \forall j&#039;: x^t B e_j \geq x^t B e_{j&#039;}' title='\forall j: y_j = 0 \text{ or } \forall j&#039;: x^t B e_j \geq x^t B e_{j&#039;}' class='latex' /></p>
<p style="text-align: left;">which means that each strategy for player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> is either in the support or is a best response. Proving that is trivial (if some strategy is in the support and is not a best response, then reducing the probability we play it is an improving deviation). Therefore if we just guess the support of <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and the support of <img src='http://s.wordpress.com/latex.php?latex=y&#038;bg=T&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> we just need to find some strategies with this support satisfying the inequalities above. This can be done using a simple LP. This is clearly not very efficient, since it involves solving <img src='http://s.wordpress.com/latex.php?latex=2%5E%7Bn%2Bm%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='2^{n+m}' title='2^{n+m}' class='latex' /> LPs. A still exponential, but a lot more practical method, is:</p>
<p style="text-align: left;"><strong>Lemke-Howson Algorithm</strong></p>
<p style="text-align: left;">A good overview of the L-H algorithm can be found in <a href="http://www.stanford.edu/~saberi/lecture4.pdf">those lecture notes</a> or in a more detailed version in third chapter of the <a href="http://www.cambridge.org/us/knowledge/isbn/item1174320/?site_locale=en_US">AGT book</a> (a pdf  can be found in <a href="http://theory.stanford.edu/~tim/">Tim&#8217;s website</a>). Here I&#8217;ll present a quick overview. The main idea is to define the <strong>best-response polytopes</strong> <img src='http://s.wordpress.com/latex.php?latex=P&#038;bg=T&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=Q&#038;bg=T&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' />:</p>
<p style="text-align: center;"><img style="display: inline;" title=" P = \{(x,v) \in \mathbb{R}^{n+1} \vert x_i \geq 0, x^t B e_j \leq v; \textbf{1}^t x = 1 \}" src="http://www.codecogs.com/gif.latex?%20P%20=%20%5C%7B%28x,v%29%20%5Cin%20%5Cmathbb%7BR%7D%5E%7Bn+1%7D%20%5Cvert%20x_i%20%5Cgeq%200,%20x%5Et%20B%20e_j%20%5Cleq%20v;%20%5Ctextbf%7B1%7D%5Et%20x%20=%201%20%5C%7D" alt="[; P = \{(x,v) \in \mathbb{R}^{n+1} \vert x_i \geq 0, x^t B e_j \leq v; \textbf{1}^t x = 1 \};]" /></p>
<p style="text-align: center;"><img style="display: inline;" title=" Q = \{(y,u) \in \mathbb{R}^{m+1} \vert e_i^t A y \leq u, y_j \geq 0; \textbf{1}^t y = 1 \} " src="http://www.codecogs.com/gif.latex?%20Q%20=%20%5C%7B%28y,u%29%20%5Cin%20%5Cmathbb%7BR%7D%5E%7Bm+1%7D%20%5Cvert%20e_i%5Et%20A%20y%20%5Cleq%20u,%20y_j%20%5Cgeq%200;%20%5Ctextbf%7B1%7D%5Et%20y%20=%201%20%5C%7D" alt="[; Q = \{(y,u) \in \mathbb{R}^{m+1} \vert e_i^t A y \leq u, y_j \geq 0; \textbf{1}^t y = 1 \} ;]" /></p>
<p style="text-align: left;">The intuition is that a point <img src='http://s.wordpress.com/latex.php?latex=%28x%2Cv%29%20%5Cin%20P&#038;bg=T&#038;fg=000000&#038;s=0' alt='(x,v) \in P' title='(x,v) \in P' class='latex' /> represents the fact that the payoff <img src='http://s.wordpress.com/latex.php?latex=v&#038;bg=T&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' /> of player <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> when player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> plays <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> is at least <img src='http://s.wordpress.com/latex.php?latex=v&#038;bg=T&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' />. We could re-write as <img src='http://s.wordpress.com/latex.php?latex=v%20%5Cgeq%20%5Cmax_j%20x%5Et%20B%20e_j&#038;bg=T&#038;fg=000000&#038;s=0' alt='v \geq \max_j x^t B e_j' title='v \geq \max_j x^t B e_j' class='latex' />.</p>
<p style="text-align: left;">Each of the polytopes has <img src='http://s.wordpress.com/latex.php?latex=m%2Bn&#038;bg=T&#038;fg=000000&#038;s=0' alt='m+n' title='m+n' class='latex' /> inequalities and <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> equality. Given <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> we define <img src='http://s.wordpress.com/latex.php?latex=L%28x%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='L(x)' title='L(x)' class='latex' /> as the indices of the tight inequalities in <img src='http://s.wordpress.com/latex.php?latex=P&#038;bg=T&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> and similarly we define <img src='http://s.wordpress.com/latex.php?latex=L%28y%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='L(y)' title='L(y)' class='latex' /> as the indices of the tight inequalities in <img src='http://s.wordpress.com/latex.php?latex=Q&#038;bg=T&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' />. The theorem in the previous section can be rephrased as:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%28x%2Cy%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(x,y)' title='(x,y)' class='latex' /> is Nash equilibrium iff <img src='http://s.wordpress.com/latex.php?latex=L%28x%29%20%5Ccup%20L%28y%29%20%3D%20%5C%7B1%2C%202%2C%20%5Chdots%2C%20n%2Bm%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='L(x) \cup L(y) = \{1, 2, \hdots, n+m\}' title='L(x) \cup L(y) = \{1, 2, \hdots, n+m\}' class='latex' /></p>
<p style="text-align: left;">So we need to look at points in the polytope below that are <strong>fully-labeled</strong>. And the way that this is done in L-H is quite ingenious &#8211; it is a similar idea that is used by the <a href="http://en.wikipedia.org/wiki/Simplex_algorithm">Simplex Method</a> &#8211; walking through the vertices of the polytope looking for the desired vertex. In order to do it, let&#8217;s define another polytope:</p>
<p style="text-align: center;"><img style="display: inline;" title="P' = \{x \in \mathbb{R}^n \vert x_i \geq 0, x^t B e_j \leq 1\} " src="http://www.codecogs.com/gif.latex?P%27%20=%20%5C%7Bx%20%5Cin%20%5Cmathbb%7BR%7D%5En%20%5Cvert%20x_i%20%5Cgeq%200,%20x%5Et%20B%20e_j%20%5Cleq%201%5C%7D" alt="[;P' = \{x \in \mathbb{R}^n \vert x_i \geq 0, x^t B e_j \leq 1\} ;]" /></p>
<p style="text-align: center;"><img style="display: inline;" title="Q' = \{y \in \mathbb{R}^m \vert e_i^t A y \leq 1, y_j \geq 0\} " src="http://www.codecogs.com/gif.latex?Q%27%20=%20%5C%7By%20%5Cin%20%5Cmathbb%7BR%7D%5Em%20%5Cvert%20e_i%5Et%20A%20y%20%5Cleq%201,%20y_j%20%5Cgeq%200%5C%7D" alt="[;Q' = \{y \in \mathbb{R}^m \vert e_i^t A y \leq 1, y_j \geq 0\} ;]" /></p>
<p style="text-align: left;">Note that there is a clear bijection between <img src='http://s.wordpress.com/latex.php?latex=P%27%20%5Csetminus%200%20%5Cleftrightarrow%20P&#038;bg=T&#038;fg=000000&#038;s=0' alt='P&#039; \setminus 0 \leftrightarrow P' title='P&#039; \setminus 0 \leftrightarrow P' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=Q%27%20%5Csetminus%200%20%5Cleftrightarrow%20Q&#038;bg=T&#038;fg=000000&#038;s=0' alt='Q&#039; \setminus 0 \leftrightarrow Q' title='Q&#039; \setminus 0 \leftrightarrow Q' class='latex' />. This is a projective transformation given by <img src='http://s.wordpress.com/latex.php?latex=x%20%5Cmapsto%20%28%5Cfrac%7Bx%7D%7B%5CVert%20x%20%5CVert_1%7D%2C%5CVert%20x%20%5CVert_1%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='x \mapsto (\frac{x}{\Vert x \Vert_1},\Vert x \Vert_1)' title='x \mapsto (\frac{x}{\Vert x \Vert_1},\Vert x \Vert_1)' class='latex' />. Notice that the labels are preserved by this transformation and the vertex and edges of the polytope are mapped almost 1-1 (except the vertex <img src='http://s.wordpress.com/latex.php?latex=0&#038;bg=T&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' />). Notice a couple of details:</p>
<p style="text-align: left;"><strong>1.</strong> A vertex in <img src='http://s.wordpress.com/latex.php?latex=x%20%5Cin%20P%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='x \in P&#039;' title='x \in P&#039;' class='latex' /> corresponds to a point with <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> labels. A vertex in <img src='http://s.wordpress.com/latex.php?latex=y%20%5Cin%20Q%27%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='y \in Q&#039;&#039;' title='y \in Q&#039;&#039;' class='latex' /> corresponds to a point with <img src='http://s.wordpress.com/latex.php?latex=m&#038;bg=T&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> labels.</p>
<p style="text-align: left;"><strong>2. </strong>The point <img src='http://s.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' /> corresponds to a fully-labeled point of <img src='http://s.wordpress.com/latex.php?latex=P%27%20%5Ctimes%20Q%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='P&#039; \times Q&#039;' title='P&#039; \times Q&#039;' class='latex' />, unfortunately this is the only fully-labeled point that doesn&#8217;t correspond to a Nash equilibrium.</p>
<p style="text-align: left;"><strong>3. </strong>By taking an edge of the polytope (which is define by <img src='http://s.wordpress.com/latex.php?latex=n-1&#038;bg=T&#038;fg=000000&#038;s=0' alt='n-1' title='n-1' class='latex' /> labels in <img src='http://s.wordpress.com/latex.php?latex=P%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='P&#039;' title='P&#039;' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=m-1&#038;bg=T&#038;fg=000000&#038;s=0' alt='m-1' title='m-1' class='latex' /> labels in <img src='http://s.wordpress.com/latex.php?latex=Q%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='Q&#039;' title='Q&#039;' class='latex' />. We can move between two nodes that are almost fully labeled.</p>
<p style="text-align: left;">The idea is to consider the set of nodes that are almost fully-labeled: fix some label <img src='http://s.wordpress.com/latex.php?latex=k&#038;bg=T&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and consider all the nodes such that <img src='http://s.wordpress.com/latex.php?latex=L%28x%29%20%5Ccup%20L%28y%29%20%5Csupseteq%20%5C%7B1%20..%20k-1%2C%20k%2B1%20..%20n%2Bm%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='L(x) \cup L(y) \supseteq \{1 .. k-1, k+1 .. n+m\}' title='L(x) \cup L(y) \supseteq \{1 .. k-1, k+1 .. n+m\}' class='latex' />. Those points are either <img src='http://s.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' />, a Nash equilibrium or they have one duplicated label, since <img src='http://s.wordpress.com/latex.php?latex=%5Cvert%20L%28x%29%20%5Cvert%20%2B%20%5Cvert%20L%28y%29%20%5Cvert%20%3D%20n%2Bm&#038;bg=T&#038;fg=000000&#038;s=0' alt='\vert L(x) \vert + \vert L(y) \vert = n+m' title='\vert L(x) \vert + \vert L(y) \vert = n+m' class='latex' />. An edge is composed of <img src='http://s.wordpress.com/latex.php?latex=n%2Bm-1&#038;bg=T&#038;fg=000000&#038;s=0' alt='n+m-1' title='n+m-1' class='latex' /> labels (no duplicated labels). So, one almost fully-labeled point that is not a Nash or <img src='http://s.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' /> is only connected to two other vertices via an edge (which correspond to dropping one of the duplicated labels and following the corresponding edge).</p>
<p style="text-align: left;">This gives us a topology of the space of Nash equilibria. This tells us a couple of facts: (1) we can find a Nash equilibrium by starting in <img src='http://s.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' /> and following the L-H path. In the end of the path, we must find a Nash equilibrium. (ii) If the polytope is not degenerate, then there is an odd number of Nash equilibria and they are connected by L-H edges in the following way:</p>
<p style="text-align: left;"><a rel="attachment wp-att-526" href="http://www.bigredbits.com/archives/491/lh_path"><img class="aligncenter size-full wp-image-526" title="LH_path" src="http://www.bigredbits.com/wp-content/uploads/2011/05/LH_path.png" alt="" width="330" height="244" /></a>where the blue dots are the Nash equilibria, the white dots are the almost fully-labeled points and the edges are the L-H edges. The number of Nash equilibria are odd by a simple parity argument.</p>
<p style="text-align: left;">The classic way in which simplex-like methods walk through the vertices of a polytope is by <a href="http://en.wikipedia.org/wiki/Pivot_element">pivoting</a>. The same way we can implement Lemke-Howson. For an explanation on the implementation, please look at the chapter 3 of the AGT book.</p>
<p style="text-align: left;">One can ask if we could modify the L-H path following to go through the path faster. Recently, <a href="http://arxiv.org/abs/1006.5352">Goldberg, Papadimitriou and Savani</a> proved that finding the Nash equilibrium that L-H outputs is <strong>PSPACE</strong>-complete. So, in principle, finding this specific equilibrium, seems harder than finding any Nash, which is PPAD-complete.</p>
<p style="text-align: left;">My current plan is on some other blog posts on Bimatrix games. I wanted to discuss two things: first the homotopy methods and homotopy fixed-point-theorems (which is the heart of the two papers mentioned above) and about new techniques for solving zero-matrix where the strategy space is exponential.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/491/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DP and the Erdős–Rényi model</title>
		<link>http://www.bigredbits.com/archives/443</link>
		<comments>http://www.bigredbits.com/archives/443#comments</comments>
		<pubDate>Mon, 16 May 2011 21:41:28 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=443</guid>
		<description><![CDATA[Yesterday I was in a pub with Vasilis Syrgkanis and Elisa Celis and we were discussing about how to calculate the expected size of a connected component in , the Erdős–Rényi model. is the classical random graph obtained by considering nodes and adding each edge independently with probability . A lot is known about its [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I was in a pub with <a href="http://www.cs.cornell.edu/~vasilis/">Vasilis Syrgkanis</a> and <a href="http://www.cs.washington.edu/homes/ecelis/other.html">Elisa Celis</a> and we were discussing about how to calculate the expected size of a connected component in <img src='http://s.wordpress.com/latex.php?latex=G%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='G(n,p)' title='G(n,p)' class='latex' />, the <a href="http://en.wikipedia.org/wiki/Erd%C5%91s%E2%80%93R%C3%A9nyi_model">Erdős–Rényi model</a>. <img src='http://s.wordpress.com/latex.php?latex=G%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='G(n,p)' title='G(n,p)' class='latex' /> is the classical random graph obtained by considering <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> nodes and adding each edge <img src='http://s.wordpress.com/latex.php?latex=%28i%2Cj%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='(i,j)' title='(i,j)' class='latex' /> independently with probability <img src='http://s.wordpress.com/latex.php?latex=p&#038;bg=T&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />. A lot is known about its properties, which very interestingly change qualitatively as the value of <img src='http://s.wordpress.com/latex.php?latex=p&#038;bg=T&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> changes relativeto <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />. For example, for <img src='http://s.wordpress.com/latex.php?latex=p%20%3C%5Cfrac%7B1%7D%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='p &lt;\frac{1}{n}' title='p &lt;\frac{1}{n}' class='latex' /> then there is no component greater than <img src='http://s.wordpress.com/latex.php?latex=O%28%5Clog%20n%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(\log n)' title='O(\log n)' class='latex' /> with high probability. When <img src='http://s.wordpress.com/latex.php?latex=p%20%3D%20%5Cfrac%7Bc%7D%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='p = \frac{c}{n}' title='p = \frac{c}{n}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=c%3E1&#038;bg=T&#038;fg=000000&#038;s=0' alt='c&gt;1' title='c&gt;1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=n%20%5Crightarrow%20%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='n \rightarrow \infty' title='n \rightarrow \infty' class='latex' />, then the graph has a giant component. All those phenomena are very well studied in the context of probabilistic combinatorics and also in social networks. I remember learning about them in Jon Kleinberg&#8217;s <a href="http://www.cs.cornell.edu/Courses/cs6850/2011sp/">Structure of Information Networks</a> class.</p>
<p style="text-align: center;"><a rel="attachment wp-att-447" href="http://www.bigredbits.com/archives/443/pic1"><img class="aligncenter size-full wp-image-447" title="pic1" src="http://www.bigredbits.com/wp-content/uploads/2011/05/pic1.png" alt="" width="444" height="234" /></a></p>
<p>So, coming back to our conversation, we were thinking on how to calculate the size of a connected component. Fix some node <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=G%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='G(n,p)' title='G(n,p)' class='latex' /> &#8211; it doesn&#8217;t matter which node, since all nodes are equivalent before we start tossing the random coins. Now, let <img src='http://s.wordpress.com/latex.php?latex=C_u&#038;bg=T&#038;fg=000000&#038;s=0' alt='C_u' title='C_u' class='latex' /> be the size of the connected component of node <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' />. The question is how to calculate <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29%20%3D%20%5Cmathbb%7BE%7D%20%5BC_u%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p) = \mathbb{E} [C_u]' title='C(n,p) = \mathbb{E} [C_u]' class='latex' />.</p>
<p>Recently I&#8217;ve been learning MATLAB (actually, I am learning <a href="http://www.gnu.org/software/octave/">Octave</a>, but it is the same) and I am very amazed by it and impressed about why I haven&#8217;t learned it before. It is a programming language that somehow knows exactly how mathematicians think and the syntax is very intuitive. All the operations that you think of performing when doing mathematics, they have implemented. Not that you can&#8217;t do that in C++ or Python, in fact, I&#8217;ve been doing that all my life, but in Octave, things are so simple. So, I thought this was a nice opportunity for playing a bit with it.</p>
<p>We can calculate <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p)' title='C(n,p)' class='latex' /> using a dynamic programming algorithm in time <img src='http://s.wordpress.com/latex.php?latex=O%28n%5E2%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='O(n^2)' title='O(n^2)' class='latex' /> &#8211; well, maybe we can do it more efficiently, but the DP I thought was the following: let&#8217;s calculate <img src='http://s.wordpress.com/latex.php?latex=%5Cmathcal%7BC%7D%28n%2Cs%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathcal{C}(n,s,p)' title='\mathcal{C}(n,s,p)' class='latex' /> where it is the expected size of the <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' />-connected component of a random graph with <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> nodes where the edges between  <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> and other nodes have probability <img src='http://s.wordpress.com/latex.php?latex=p%27%20%3D%201%20-%20%281-p%29%5Es&#038;bg=T&#038;fg=000000&#038;s=0' alt='p&#039; = 1 - (1-p)^s' title='p&#039; = 1 - (1-p)^s' class='latex' /> and an edge between <img src='http://s.wordpress.com/latex.php?latex=v_1&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_1' title='v_1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=v_2&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_2' title='v_2' class='latex' /> have probability <img src='http://s.wordpress.com/latex.php?latex=p&#038;bg=T&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />. What we want to compute is <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29%20%3D%20%5Cmathcal%7BC%7D%28n%2C1%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p) = \mathcal{C}(n,1,p)' title='C(n,p) = \mathcal{C}(n,1,p)' class='latex' />.</p>
<p style="text-align: left;"><a rel="attachment wp-att-454" href="http://www.bigredbits.com/archives/443/pic2"><img class="aligncenter size-full wp-image-454" title="pic2" src="http://www.bigredbits.com/wp-content/uploads/2011/05/pic2.png" alt="" width="338" height="187" /></a>What we can do is to use the <a href="http://en.wikipedia.org/wiki/Principle_of_deferred_decision">Principle of Deferred Decisions</a>,  and toss the coins for the <img src='http://s.wordpress.com/latex.php?latex=n-1&#038;bg=T&#038;fg=000000&#038;s=0' alt='n-1' title='n-1' class='latex' /> edges between <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> and the other nodes. With probability <img src='http://s.wordpress.com/latex.php?latex=bin%28k%2Cn%2Cp%27%29%20%3D%20%7Bn%20%5Cchoose%20k%7D%20%28p%27%29%5Ek%20%281-%27p%29%5E%7Bn-k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='bin(k,n,p&#039;) = {n \choose k} (p&#039;)^k (1-&#039;p)^{n-k}' title='bin(k,n,p&#039;) = {n \choose k} (p&#039;)^k (1-&#039;p)^{n-k}' class='latex' />, there are <img src='http://s.wordpress.com/latex.php?latex=k&#038;bg=T&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> edges between <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> and the other nodes, say nodes <img src='http://s.wordpress.com/latex.php?latex=w_1%2C%20%5Chdots%2C%20w_k&#038;bg=T&#038;fg=000000&#038;s=0' alt='w_1, \hdots, w_k' title='w_1, \hdots, w_k' class='latex' />. If we collapse those nodes to <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> we end up with a graph of <img src='http://s.wordpress.com/latex.php?latex=n-k&#038;bg=T&#038;fg=000000&#038;s=0' alt='n-k' title='n-k' class='latex' /> nodes and the problem is equivalent to <img src='http://s.wordpress.com/latex.php?latex=k&#038;bg=T&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> plus the size of the connected component of <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> in the collapsed graph.</p>
<p style="text-align: left;"><a rel="attachment wp-att-462" href="http://www.bigredbits.com/archives/443/pic3"><img class="aligncenter size-full wp-image-462" title="pic3" src="http://www.bigredbits.com/wp-content/uploads/2011/05/pic3.png" alt="" width="489" height="238" /></a>One difference, however is that the probability that the collapsed node <img src='http://s.wordpress.com/latex.php?latex=u&#038;bg=T&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> is connected to a node <img src='http://s.wordpress.com/latex.php?latex=v&#038;bg=T&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' /> of the <img src='http://s.wordpress.com/latex.php?latex=n-1-k&#038;bg=T&#038;fg=000000&#038;s=0' alt='n-1-k' title='n-1-k' class='latex' /> nodes is the probability that at least one of <img src='http://s.wordpress.com/latex.php?latex=w_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='w_i' title='w_i' class='latex' /> is connected to <img src='http://s.wordpress.com/latex.php?latex=v&#038;bg=T&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' />, which is <img src='http://s.wordpress.com/latex.php?latex=1-%281-p%29%5Ek&#038;bg=T&#038;fg=000000&#038;s=0' alt='1-(1-p)^k' title='1-(1-p)^k' class='latex' />. In this way, we can write:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cmathcal%7BC%7D%28n%2Cs%2Cp%29%20%3D%201%20%5Ccdot%20bin%280%2Cn-1%2Cp%27%29%20%2B%20%5Csum_%7Bk%3D1%7D%5E%7Bn-1%7D%20bin%28k%2Cn-1%2Cp%27%29%20%5B%20k%20%2B%20%20%5Cmathcal%7BC%7D%28n-k%2Ck%2Cp%29%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mathcal{C}(n,s,p) = 1 \cdot bin(0,n-1,p&#039;) + \sum_{k=1}^{n-1} bin(k,n-1,p&#039;) [ k +  \mathcal{C}(n-k,k,p)]' title='\mathcal{C}(n,s,p) = 1 \cdot bin(0,n-1,p&#039;) + \sum_{k=1}^{n-1} bin(k,n-1,p&#039;) [ k +  \mathcal{C}(n-k,k,p)]' class='latex' /></p>
<p style="text-align: left;">where <img src='http://s.wordpress.com/latex.php?latex=p%27%20%3D%201-%281-p%29%5Es&#038;bg=T&#038;fg=000000&#038;s=0' alt='p&#039; = 1-(1-p)^s' title='p&#039; = 1-(1-p)^s' class='latex' />. Now, we can calculate <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p)' title='C(n,p)' class='latex' /> by using DP, simply by filling an <img src='http://s.wordpress.com/latex.php?latex=n%20%5Ctimes%20n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n \times n' title='n \times n' class='latex' /> table. In Octave, we can do it this way:</p>
<pre><code>
<strong>function</strong> component = C(N,p)
  C_table = zeros(N,N);
  <strong>for</strong> n = 1:N <strong>for</strong> s =1:N
    C_table(n,s) = binopdf(0,n-1,1-((1-p)^s)) ;
    <strong>for</strong> k = 1:n-1
      C_table(n,s) += binopdf(k,n-1,1-((1-p)^s)) * (k + C_table(n-k,k));
    <strong>end</strong>
  <strong>end end</strong>
  component = C_table(N,1);
<strong>endfunction</strong>
</code>
</pre>
<p style="text-align: left;">And in fact we can call <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p)' title='C(n,p)' class='latex' /> for say <img src='http://s.wordpress.com/latex.php?latex=n%20%3D%20200&#038;bg=T&#038;fg=000000&#038;s=0' alt='n = 200' title='n = 200' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=p%20%3D%200.01%20..%200.3&#038;bg=T&#038;fg=000000&#038;s=0' alt='p = 0.01 .. 0.3' title='p = 0.01 .. 0.3' class='latex' /> and see how <img src='http://s.wordpress.com/latex.php?latex=C%28n%2Cp%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='C(n,p)' title='C(n,p)' class='latex' /> varies. This allows us, for example, to observe the sharp transition that happens before the giant component is formed. The plot we get is:</p>
<p style="text-align: left;"><a rel="attachment wp-att-481" href="http://www.bigredbits.com/archives/443/component_size"><a rel="attachment wp-att-484" href="http://www.bigredbits.com/archives/443/pic4"><img class="aligncenter size-full wp-image-484" title="pic4" src="http://www.bigredbits.com/wp-content/uploads/2011/05/pic4.png" alt="" width="233" height="279" /></a><br />
</a></p>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow: hidden;">
<h1 id="firstHeading" class="firstHeading">Erdős–Rényi model</h1>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/443/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reasoning about Knowledge</title>
		<link>http://www.bigredbits.com/archives/382</link>
		<comments>http://www.bigredbits.com/archives/382#comments</comments>
		<pubDate>Sat, 16 Apr 2011 20:52:42 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[logic]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=382</guid>
		<description><![CDATA[Last week, I went to Petra in Jordan together with Vasilis Syrgkanis and in the road we kept discussing about the Blue-Eyed Islanders Puzzle, posted by Terry Tao on his blog. It is a nice puzzle because it allows us to reflect on how to think about knowledge and how to make statements like &#8220;A [...]]]></description>
			<content:encoded><![CDATA[<p>Last week, I went to Petra in Jordan together with <a href="http://www.cs.cornell.edu/~vasilis/">Vasilis Syrgkanis</a> and in the road we kept discussing about the <a href="http://terrytao.wordpress.com/2011/04/07/the-blue-eyed-islanders-puzzle-repost/">Blue-Eyed Islanders Puzzle</a>, posted by Terry Tao on his blog. It is a nice puzzle because it allows us to reflect on how to think about knowledge and how to make statements like &#8220;A knows that B knows that C doesn&#8217;t know fact F&#8221;. My goal in this post is not to discuss about the <a href="http://terrytao.wordpress.com/2011/04/07/the-blue-eyed-islanders-puzzle-repost/">puzzle</a> itself, but about a framework that allows us to think about it in a structured way. Nevertheless, I strongly recommend the puzzle, first because it is a lot of fun, and second, because it gives a great non-trivial example to think about this kind of thing. But first, a photo from Petra:</p>
<p><a rel="attachment wp-att-383" href="http://www.bigredbits.com/archives/382/img_20110409_142559"><img class="aligncenter size-large wp-image-383" title="IMG_20110409_142559" src="http://www.bigredbits.com/wp-content/uploads/2011/04/IMG_20110409_142559-1024x768.jpg" alt="" width="636" height="476" /></a></p>
<p>And then some references:</p>
<ul>
<li>Classic: &#8220;<a href="http://www.ma.huji.ac.il/raumann/pdf/Agreeing%20to%20Disagree.pdf">Agreeing to Disagree</a>&#8221; (Robert Aumann)</li>
<li>Survey:  &#8220;<a href="http://cowles.econ.yale.edu/~gean/art/p0882.pdf">Common Knowledge</a>&#8221; (John Geanakoplos)</li>
<li>Book: &#8220;<a href="http://www.amazon.com/Reasoning-About-Knowledge-Ronald-Fagin/dp/0262562006/ref=sr_1_1?ie=UTF8&amp;qid=1302982140&amp;sr=8-1">Reasoning about Knowledge</a>&#8221; (Fagin, Halpern, Moses, Vardi)</li>
</ul>
<p>I first read Aumann&#8217;s classic, which is a very beautiful and short paper &#8212; but where I got most intuition (and fun) was reading Chapter 2 of Reasoning About Knowledge.  So, I&#8217;ll show here something in between of what Chapter 2 presents and what Geanakoplos&#8217; survey presents (which is also an amazing source).</p>
<p>We want to reason about the world and the first thing we need is a set <img src='http://s.wordpress.com/latex.php?latex=%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> representing all possible states the world can take. Each <img src='http://s.wordpress.com/latex.php?latex=%5Comega%20%5Cin%20%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega \in \Omega' title='\omega \in \Omega' class='latex' /> is called a state of the world and completely described the world we are trying to reason about. To illustrate the example, consider the situation where there are <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> people in a room and each person has a number <img src='http://s.wordpress.com/latex.php?latex=x_i%20%5Cin%20%5C%7B0%2C1%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_i \in \{0,1\}' title='x_i \in \{0,1\}' class='latex' /> in his head. Person <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> can see the number of everyone else, except his. We want to reason about this situation, so a good way to describe the world is to simply define <img src='http://s.wordpress.com/latex.php?latex=%5COmega%20%3D%20%5C%7B0%2C1%5C%7D%5En&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Omega = \{0,1\}^n' title='\Omega = \{0,1\}^n' class='latex' /> of all <img src='http://s.wordpress.com/latex.php?latex=0%2C1&#038;bg=T&#038;fg=000000&#038;s=0' alt='0,1' title='0,1' class='latex' />-strings. We define an event to be simply a subset of the possible states of the world, i.e., some set <img src='http://s.wordpress.com/latex.php?latex=E%20%5Csubseteq%20%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='E \subseteq \Omega' title='E \subseteq \Omega' class='latex' />. For example, the even that player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> has number <img src='http://s.wordpress.com/latex.php?latex=0&#038;bg=T&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> in his head is simply: <img src='http://s.wordpress.com/latex.php?latex=E%20%3D%20%5C%7B%20x%20%5Cin%20%5COmega%3B%20x_1%20%3D%200%20%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='E = \{ x \in \Omega; x_1 = 0 \}' title='E = \{ x \in \Omega; x_1 = 0 \}' class='latex' />. We could also think about the event <img src='http://s.wordpress.com/latex.php?latex=E%27&#038;bg=T&#038;fg=000000&#038;s=0' alt='E&#039;' title='E&#039;' class='latex' /> that the sum of the numbers is odd, which would be: <img src='http://s.wordpress.com/latex.php?latex=E%27%20%3D%20%5C%7B%20x%20%5Cin%20%5COmega%3B%20%5Csum_i%20x_i%20%5Ctext%7B%20mod%20%7D%202%20%3D%201%20%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='E&#039; = \{ x \in \Omega; \sum_i x_i \text{ mod } 2 = 1 \}' title='E&#039; = \{ x \in \Omega; \sum_i x_i \text{ mod } 2 = 1 \}' class='latex' />. Now, we need to define what it means for some person to know some event.</p>
<p>For each person <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, his knowledge structure is defined by a partition <img src='http://s.wordpress.com/latex.php?latex=P_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i' title='P_i' class='latex' /> of  <img src='http://s.wordpress.com/latex.php?latex=%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Omega' title='\Omega' class='latex' />. The rough intuition is that player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> is unable to distinguish two elements <img src='http://s.wordpress.com/latex.php?latex=%5Comega_1%2C%20%5Comega_2&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega_1, \omega_2' title='\omega_1, \omega_2' class='latex' /> in the same cell of partition <img src='http://s.wordpress.com/latex.php?latex=P_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i' title='P_i' class='latex' />. For each <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=P_i%28%5Comega%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i(\omega)' title='P_i(\omega)' class='latex' /> is the cell of partition <img src='http://s.wordpress.com/latex.php?latex=P_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i' title='P_i' class='latex' /> containing <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> . The way I see knowledge representation is that if <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> is the true state of the world, then person <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows that the true state of the world is some element in <img src='http://s.wordpress.com/latex.php?latex=P_i%28%5Comega%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i(\omega)' title='P_i(\omega)' class='latex' />.</p>
<blockquote><p><strong>Definition:</strong> We say that person <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> on the state of the world <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> is <img src='http://s.wordpress.com/latex.php?latex=P_i%28%5Comega%29%20%5Csubseteq%20E&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i(\omega) \subseteq E' title='P_i(\omega) \subseteq E' class='latex' />. Therefore, if person <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' />, the world must be in some state <img src='http://s.wordpress.com/latex.php?latex=K_i%28E%29%20%3A%3D%20%5C%7B%5Comega%20%5Cin%20%5COmega%3B%20P_i%28%5Comega%29%20%5Csubseteq%20E%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(E) := \{\omega \in \Omega; P_i(\omega) \subseteq E\}' title='K_i(E) := \{\omega \in \Omega; P_i(\omega) \subseteq E\}' class='latex' />.</p></blockquote>
<p>Above we define the knowledge operator <img src='http://s.wordpress.com/latex.php?latex=K_i%28%5Ccdot%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(\cdot)' title='K_i(\cdot)' class='latex' />. Below, there is a picture in which we represent its action:</p>
<p><a rel="attachment wp-att-414" href="http://www.bigredbits.com/archives/382/knowledge_operator"><img class="aligncenter size-full wp-image-414" title="knowledge_operator" src="http://www.bigredbits.com/wp-content/uploads/2011/04/knowledge_operator.png" alt="" width="613" height="226" /></a>Now, this allows us to represent the fact that person <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> knows that person <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> knows of event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> as the event <img src='http://s.wordpress.com/latex.php?latex=K_1%28K_2%28E%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_1(K_2(E))' title='K_1(K_2(E))' class='latex' />. Now, the fact the person <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> knows that person <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> doesn&#8217;t know that person <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> knows event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> can be represented as: <img src='http://s.wordpress.com/latex.php?latex=K_1%28%5Cneg%20K_2%28K_1%28E%29%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_1(\neg K_2(K_1(E)))' title='K_1(\neg K_2(K_1(E)))' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%5Cneg%20E%20%3A%3D%20%5COmega%20%5Csetminus%20E&#038;bg=T&#038;fg=000000&#038;s=0' alt='\neg E := \Omega \setminus E' title='\neg E := \Omega \setminus E' class='latex' />.</p>
<p>An equivalent and axiomatic way of defining the knowledge operator is by defining it as an operator <img src='http://s.wordpress.com/latex.php?latex=K_i%3A%202%5E%5COmega%20%5Crightarrow%202%5E%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i: 2^\Omega \rightarrow 2^\Omega' title='K_i: 2^\Omega \rightarrow 2^\Omega' class='latex' /> such that:</p>
<ol>
<li><img src='http://s.wordpress.com/latex.php?latex=K_i%28%5COmega%29%20%3D%20%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(\Omega) = \Omega' title='K_i(\Omega) = \Omega' class='latex' /></li>
<li><img src='http://s.wordpress.com/latex.php?latex=K_i%28A%29%20%5Ccap%20K_i%28B%29%20%3D%20K_i%28A%20%5Ccap%20B%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(A) \cap K_i(B) = K_i(A \cap B)' title='K_i(A) \cap K_i(B) = K_i(A \cap B)' class='latex' /></li>
<li><img src='http://s.wordpress.com/latex.php?latex=K_i%28A%29%20%5Csubseteq%20A&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(A) \subseteq A' title='K_i(A) \subseteq A' class='latex' /></li>
<li><img src='http://s.wordpress.com/latex.php?latex=K_i%28K_i%28A%29%29%20%3D%20K_i%28A%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='K_i(K_i(A)) = K_i(A)' title='K_i(K_i(A)) = K_i(A)' class='latex' /></li>
<li><img src='http://s.wordpress.com/latex.php?latex=%5Cneg%20K_i%28A%29%20%3D%20K_i%28%5Cneg%20K_i%28A%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\neg K_i(A) = K_i(\neg K_i(A))' title='\neg K_i(A) = K_i(\neg K_i(A))' class='latex' /></li>
</ol>
<p>Notice that axioms 1-4 define exactly a <a href="http://en.wikipedia.org/wiki/Topological_space">topology</a> and together with 5 it is a topology that is closed under complement. The last two properties are more interesting: they say that if player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows something, then he knows that he knows and if the doesn&#8217;t know something, he knows that he doesn&#8217;t know. Aumann goes ahead and defines the notion of common knowledge:</p>
<blockquote><p><strong>Definition:</strong> We say that an event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> is common knowledge at <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> if for any <img src='http://s.wordpress.com/latex.php?latex=k&#038;bg=T&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and for any sequence <img src='http://s.wordpress.com/latex.php?latex=i%281%29%2C%20%5Chdots%2C%20i%28k%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='i(1), \hdots, i(k)' title='i(1), \hdots, i(k)' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=i%28j%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='i(j)' title='i(j)' class='latex' /> are players, then <img src='http://s.wordpress.com/latex.php?latex=%5Comega%20%5Cin%20K_%7Bi%281%29%7D%20K_%7Bi%282%29%7D%20%5Chdots%20K_%7Bi%28k%29%7D%20E&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega \in K_{i(1)} K_{i(2)} \hdots K_{i(k)} E' title='\omega \in K_{i(1)} K_{i(2)} \hdots K_{i(k)} E' class='latex' />.</p></blockquote>
<p>Suppose that <img src='http://s.wordpress.com/latex.php?latex=P%27%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='P&#039; ' title='P&#039; ' class='latex' /> is a partition that is a simultaneous coarsening of <img src='http://s.wordpress.com/latex.php?latex=P_1%2C%20%5Chdots%2C%20P_k&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_1, \hdots, P_k' title='P_1, \hdots, P_k' class='latex' />, then for all cells <img src='http://s.wordpress.com/latex.php?latex=C&#038;bg=T&#038;fg=000000&#038;s=0' alt='C' title='C' class='latex' /> of this partition, either <img src='http://s.wordpress.com/latex.php?latex=C&#038;bg=T&#038;fg=000000&#038;s=0' alt='C' title='C' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%5Cneg%20C&#038;bg=T&#038;fg=000000&#038;s=0' alt='\neg C' title='\neg C' class='latex' /> is common knowledge.</p>
<p>An alternative representation is to represent $\Omega$ as nodes in a graph and add an edge between <img src='http://s.wordpress.com/latex.php?latex=%5Comega_1&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega_1' title='\omega_1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%5Comega_2&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega_2' title='\omega_2' class='latex' /> labeled with <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> if they are in the same cell of <img src='http://s.wordpress.com/latex.php?latex=P_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i' title='P_i' class='latex' />. Now, given the true state of the world <img src='http://s.wordpress.com/latex.php?latex=%5Comega%20%5Cin%20%5COmega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega \in \Omega' title='\omega \in \Omega' class='latex' />, one can easily calculate the smallest event <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows <img src='http://s.wordpress.com/latex.php?latex=E&#038;bg=T&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' />: this is exactly the states that are reached from <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> just following edges labeled with <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, which is easily recognizable as <img src='http://s.wordpress.com/latex.php?latex=P_i%28%5Comega%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='P_i(\omega)' title='P_i(\omega)' class='latex' />.</p>
<p>Now, what is the smallest set that <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> knows that <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> knows ? Those are the elements that we can arrive from a path following first an edge labeled <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> and then an edge labeled <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' />. Extending this reasoning, it is easy to see that the smallest set that is common knowledge at <img src='http://s.wordpress.com/latex.php?latex=%5Comega&#038;bg=T&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> are all the elements reachable from some path in this graph.</p>
<p>More about knowledge representation and reasoning about knowledge in future posts. In any case, I can&#8217;t recommend enough the references above.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/382/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Do we believe the Axiom of Choice ?</title>
		<link>http://www.bigredbits.com/archives/353</link>
		<comments>http://www.bigredbits.com/archives/353#comments</comments>
		<pubDate>Sun, 10 Apr 2011 20:17:32 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[puzzles]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[measure theory; puzzles]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=353</guid>
		<description><![CDATA[Continuing on my series of posts from Israel, I&#8217;d like to share some exciting puzzle that I heard today from Omer Tamuz. I&#8217;ve learned before about the Axiom of Choice in a Measure Theory class, but never saw a so striking and counter-intuitive application of it. Ok, you might say the Banach–Tarski paradox is a [...]]]></description>
			<content:encoded><![CDATA[<p>Continuing on my series of posts from Israel, I&#8217;d like to share some exciting puzzle that I heard today from Omer Tamuz. I&#8217;ve learned before about the <a href="http://en.wikipedia.org/wiki/Axiom_of_choice">Axiom of Choice</a> in a Measure Theory class, but never saw a so striking and counter-intuitive application of it. Ok, you might say the <a href="http://en.wikipedia.org/wiki/Tarski_Banach_paradox">Banach–Tarski paradox</a> is a pehaps better example &#8211; but since it&#8217;s proof is so complicated, it is not as striking as seeing how a simple application of it can generate un-intuitive results. First, let me present two puzzles:</p>
<blockquote><p><strong>Puzzle #0:</strong> There are <img src='http://s.wordpress.com/latex.php?latex=n%2B1&#038;bg=T&#038;fg=000000&#038;s=0' alt='n+1' title='n+1' class='latex' /> people in a line, and each has a <img src='http://s.wordpress.com/latex.php?latex=0%2F1&#038;bg=T&#038;fg=000000&#038;s=0' alt='0/1' title='0/1' class='latex' /> number on his hat. Each player can look to the numbers of the players in front of him. So, if <img src='http://s.wordpress.com/latex.php?latex=x_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_i' title='x_i' class='latex' /> is the number of player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, then player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows <img src='http://s.wordpress.com/latex.php?latex=x_%7Bi%2B1%7D%2C%5Chdots%2Cx_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_{i+1},\hdots,x_n' title='x_{i+1},\hdots,x_n' class='latex' />. Now, from <img src='http://s.wordpress.com/latex.php?latex=i%20%3D%200%20%5Chdots%20n&#038;bg=T&#038;fg=000000&#038;s=0' alt='i = 0 \hdots n' title='i = 0 \hdots n' class='latex' /> the players will say his own number. Is there a protocol such that players <img src='http://s.wordpress.com/latex.php?latex=1%2C%5Chdots%2Cn&#038;bg=T&#038;fg=000000&#038;s=0' alt='1,\hdots,n' title='1,\hdots,n' class='latex' /> will get their own number right? (Notice that they hear what the players before him said).</p></blockquote>
<blockquote><p><strong>Puzzle #1:</strong> Consider the same puzzle with an <strong>infinite</strong> number of players. I.e. there are <img src='http://s.wordpress.com/latex.php?latex=x_i%3B%20i%20%5Cin%20%5Cmathbb%7BZ%7D_%2B&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_i; i \in \mathbb{Z}_+' title='x_i; i \in \mathbb{Z}_+' class='latex' /> and player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> knows <img src='http://s.wordpress.com/latex.php?latex=x_j&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_j' title='x_j' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=j%20%3E%20i&#038;bg=T&#038;fg=000000&#038;s=0' alt='j &gt; i' title='j &gt; i' class='latex' />. Show a protocol for all players, except the first to get the answer right?</p></blockquote>
<blockquote><p><strong>Puzzle #2:</strong> Still the same setting, but now players don&#8217;t hear what the previous player said. Is there a protocol such that only a finite number of players get it wrong ? (notice that it needs to be finite, not bounded).</p></blockquote>
<p>Puzzle #0 is very easy and the answer is simply <strong>parity check</strong>. Player <img src='http://s.wordpress.com/latex.php?latex=0&#038;bg=T&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> could  simply declares <img src='http://s.wordpress.com/latex.php?latex=y%20%3D%20x_1%20%5Cotimes%5Chdots%20%5Cotimes%20x_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='y = x_1 \otimes\hdots \otimes x_n' title='y = x_1 \otimes\hdots \otimes x_n' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%5Cotimes&#038;bg=T&#038;fg=000000&#038;s=0' alt='\otimes' title='\otimes' class='latex' /> stands for XOR. Now, player <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> can for example reconstruct <img src='http://s.wordpress.com/latex.php?latex=x_1&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_1' title='x_1' class='latex' /> by <img src='http://s.wordpress.com/latex.php?latex=y%20%5Cotimes%20x_2%20%5Cotimes%20%5Chdots%20%5Cotimes%20x_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='y \otimes x_2 \otimes \hdots \otimes x_n' title='y \otimes x_2 \otimes \hdots \otimes x_n' class='latex' />. Now, player <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' /> can do the same computation and figure out <img src='http://s.wordpress.com/latex.php?latex=x_1&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_1' title='x_1' class='latex' />. Now, he can calculate <img src='http://s.wordpress.com/latex.php?latex=x_2%20%3D%20y%20%5Cotimes%20x_1%20%5Cotimes%20x_2%20%5Cotimes%20%5Chdots%20%5Cotimes%20x_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_2 = y \otimes x_1 \otimes x_2 \otimes \hdots \otimes x_n' title='x_2 = y \otimes x_1 \otimes x_2 \otimes \hdots \otimes x_n' class='latex' /> and so on&#8230; When we move to an infinite number of players, however, we can&#8217;t do that anymore because taking the XOR of an infinite number of bits is not well defined. However, we can still can solve Puzzles #1 and #2 if we believe and are willing to accept the Axiom of Choice.</p>
<blockquote><p><strong>Axiom of Choice: </strong>Given a family of sets <img src='http://s.wordpress.com/latex.php?latex=%5C%7B%20S_i%20%5C%7D_%7Bi%20%5Cin%20I%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\{ S_i \}_{i \in I}' title='\{ S_i \}_{i \in I}' class='latex' /> there is a set <img src='http://s.wordpress.com/latex.php?latex=%5C%7Bx_i%3B%20i%20%5Cin%20I%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\{x_i; i \in I\}' title='\{x_i; i \in I\}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=x_i%20%5Cin%20S_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='x_i \in S_i' title='x_i \in S_i' class='latex' />, i.e. a set that takes a <em>representative</em> from each element in the family.</p></blockquote>
<p>It is used, for example to show that there is no measure <img src='http://s.wordpress.com/latex.php?latex=%5Cmu%20%3A%202%5E%7B%5B0%2C1%29%7D%20%5Crightarrow%20%5Cmathbb%7BR%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mu : 2^{[0,1)} \rightarrow \mathbb{R}' title='\mu : 2^{[0,1)} \rightarrow \mathbb{R}' class='latex' /> that is shift invariant (say under addition modulo <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />) and <img src='http://s.wordpress.com/latex.php?latex=%5Cmu%28%5B0%2C1%29%29%20%3D%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mu([0,1)) = 1' title='\mu([0,1)) = 1' class='latex' />. The proof goes the following way: define the following equivalence relation on <img src='http://s.wordpress.com/latex.php?latex=%5B0%2C1%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='[0,1)' title='[0,1)' class='latex' />: <img src='http://s.wordpress.com/latex.php?latex=x%20%5Csim%20y&#038;bg=T&#038;fg=000000&#038;s=0' alt='x \sim y' title='x \sim y' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=x%20-%20y%20%5Cin%20%5Cmathbb%7BQ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='x - y \in \mathbb{Q}' title='x - y \in \mathbb{Q}' class='latex' />. Now, consider the family of all the equivalence classes and invoke the Axiom of Choice. Let <img src='http://s.wordpress.com/latex.php?latex=K&#038;bg=T&#038;fg=000000&#038;s=0' alt='K' title='K' class='latex' /> be the set obtained. Now, we can write the interval as a disjoint union:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5B0%2C1%29%20%3D%20%5Ccup_%7Bx%20%5Cin%20%5Cmathbb%7BQ%7D%7D%20%28K%2Bx%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='[0,1) = \cup_{x \in \mathbb{Q}} (K+x)' title='[0,1) = \cup_{x \in \mathbb{Q}} (K+x)' class='latex' /></p>
<p style="text-align: left;">where all operations are modulo <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=K%2Bx%20%3D%20%5C%7By%2Bx%3B%20y%20%5Cin%20K%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='K+x = \{y+x; y \in K\}' title='K+x = \{y+x; y \in K\}' class='latex' />. Since it is an enumerable union, if such a measure existed, then: <img src='http://s.wordpress.com/latex.php?latex=%5Cmu%28%5B0%2C1%29%29%20%3D%20%5Csum_%7Bx%20%5Cin%20%5Cmathbb%7BQ%7D%7D%20%5Cmu%28K%2Bx%29%20%3D%20%5Csum_%7Bx%20%5Cin%20%5Cmathbb%7BQ%7D%7D%20%5Cmu%28K%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mu([0,1)) = \sum_{x \in \mathbb{Q}} \mu(K+x) = \sum_{x \in \mathbb{Q}} \mu(K)' title='\mu([0,1)) = \sum_{x \in \mathbb{Q}} \mu(K+x) = \sum_{x \in \mathbb{Q}} \mu(K)' class='latex' /> which is either <img src='http://s.wordpress.com/latex.php?latex=0&#038;bg=T&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=%5Cmu%28K%29%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mu(K) = 0' title='\mu(K) = 0' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%5Cinfty&#038;bg=T&#038;fg=000000&#038;s=0' alt='\infty' title='\infty' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=%5Cmu%28K%29%20%3E0&#038;bg=T&#038;fg=000000&#038;s=0' alt='\mu(K) &gt;0' title='\mu(K) &gt;0' class='latex' />.</p>
<p style="text-align: left;">This is kinda surprising, but more surprising is how we can use the exact same technique to solve the puzzles: first, let&#8217;s solve Puzzle #2: let <img src='http://s.wordpress.com/latex.php?latex=S&#038;bg=T&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> be the set of all infinite <img src='http://s.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\{0,1\}' title='\{0,1\}' class='latex' />-strings and consider the equivalence relation on <img src='http://s.wordpress.com/latex.php?latex=S&#038;bg=T&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=x%20%5Csim%20y&#038;bg=T&#038;fg=000000&#038;s=0' alt='x \sim y' title='x \sim y' class='latex' /> if the strings differ in a finite number of positions. Now, invoke the axiom of choice in the equivalence classes and let <img src='http://s.wordpress.com/latex.php?latex=S_0&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_0' title='S_0' class='latex' /> be the set of representatives. Now, if <img src='http://s.wordpress.com/latex.php?latex=F&#038;bg=T&#038;fg=000000&#038;s=0' alt='F' title='F' class='latex' /> is the set of all strings with finite number of <img src='http://s.wordpress.com/latex.php?latex=1&#038;bg=T&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />&#8216;s and <img src='http://s.wordpress.com/latex.php?latex=%5Cotimes&#038;bg=T&#038;fg=000000&#038;s=0' alt='\otimes' title='\otimes' class='latex' /> the operation such that <img src='http://s.wordpress.com/latex.php?latex=z%20%3D%20x%20%5Cotimes%20z&#038;bg=T&#038;fg=000000&#038;s=0' alt='z = x \otimes z' title='z = x \otimes z' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=z_i%20%3D%20x_i%20%5Cotimes%20y_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='z_i = x_i \otimes y_i' title='z_i = x_i \otimes y_i' class='latex' />. We can therefore write:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=S%20%3D%20%5Ccup_%7Bx%20%5Cin%20F%7D%20%28S_o%20%5Cotimes%20x%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='S = \cup_{x \in F} (S_o \otimes x)' title='S = \cup_{x \in F} (S_o \otimes x)' class='latex' /></p>
<p style="text-align: left;">Now, a protocol the players could use is to look ahead and since they are seeing an infinite number of bits, they can figure out which equivalence class from <img src='http://s.wordpress.com/latex.php?latex=S%2F%5Csim&#038;bg=T&#038;fg=000000&#038;s=0' alt='S/\sim' title='S/\sim' class='latex' /> they the entire string is. Now, they take <img src='http://s.wordpress.com/latex.php?latex=y%20%5Cin%20S_0&#038;bg=T&#038;fg=000000&#038;s=0' alt='y \in S_0' title='y \in S_0' class='latex' /> the representative of this class and guess <img src='http://s.wordpress.com/latex.php?latex=y_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='y_i' title='y_i' class='latex' />. Notice that <img src='http://s.wordpress.com/latex.php?latex=y&#038;bg=T&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> will differ from the real string by at most a finite number of bits.</p>
<p style="text-align: left;">Now, to solve puzzle #1, the player <img src='http://s.wordpress.com/latex.php?latex=0&#038;bg=T&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> simply looks at <img src='http://s.wordpress.com/latex.php?latex=x%20%3D%20x_1%20x_2%20%5Chdots&#038;bg=T&#038;fg=000000&#038;s=0' alt='x = x_1 x_2 \hdots' title='x = x_1 x_2 \hdots' class='latex' /> and figure out the equivalence class he is and let <img src='http://s.wordpress.com/latex.php?latex=y%20%5Cin%20S_0&#038;bg=T&#038;fg=000000&#038;s=0' alt='y \in S_0' title='y \in S_0' class='latex' /> be the representative of this class. Now, since <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=y&#038;bg=T&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> differ by a finite number of bits, he can simply calculate XOR of <img src='http://s.wordpress.com/latex.php?latex=x&#038;bg=T&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=y&#038;bg=T&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> (now, since it is a finite number of them, XOR is well defined) and announce it. With this trick, it just becomes like Puzzle #0.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/353/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Submodular Allocation Problem</title>
		<link>http://www.bigredbits.com/archives/293</link>
		<comments>http://www.bigredbits.com/archives/293#comments</comments>
		<pubDate>Wed, 06 Apr 2011 21:18:55 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[algorithms]]></category>
		<category><![CDATA[submodular]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=293</guid>
		<description><![CDATA[I am in Israel for the Algorithmic Game Theory Semester in the Center for the Study of Rationality. It is great to both explore Jerusalem and learn about games and algorithms. I think it is a great opportunity to start blogging again. To start, I decided to write about simple and beautiful algorithm by Lehman, [...]]]></description>
			<content:encoded><![CDATA[<p>I am in Israel for the Algorithmic Game Theory Semester in the <a href="http://www.ratio.huji.ac.il/">Center for the Study of Rationality</a>. It is great to both explore Jerusalem and learn about games and algorithms. I think it is a great opportunity to start blogging again. To start, I decided to write about simple and beautiful algorithm by <a href="http://www.cs.huji.ac.il/~noam/submodular.pdf">Lehman, Lehman and Nisan</a> on the allocation problem when players have submodular valuations.</p>
<p><a rel="attachment wp-att-298" href="http://www.bigredbits.com/archives/293/img_20110402_183321"></a><a rel="attachment wp-att-315" href="http://www.bigredbits.com/archives/293/img_20110402_183321-2"><img class="aligncenter size-large wp-image-315" title="IMG_20110402_183321" src="http://www.bigredbits.com/wp-content/uploads/2011/04/IMG_20110402_1833211-1024x768.jpg" alt="" width="514" height="384" /></a></p>
<p>Consider a set of <img src='http://s.wordpress.com/latex.php?latex=m&#038;bg=T&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> items and <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> agents. Each agent has a monotone <a href="http://en.wikipedia.org/wiki/Submodular">submodular</a> <img src='http://s.wordpress.com/latex.php?latex=v_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_i' title='v_i' class='latex' /> valuation over the items,  i.e., <img src='http://s.wordpress.com/latex.php?latex=v_i%3A2%5E%7B%5Bm%5D%7D%20%5Crightarrow%20%5Cmathbb%7BR%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_i:2^{[m]} \rightarrow \mathbb{R}' title='v_i:2^{[m]} \rightarrow \mathbb{R}' class='latex' /> s.t. <img src='http://s.wordpress.com/latex.php?latex=v_i%28S%29%20%2B%20v_i%28T%29%20%5Cgeq%20v_i%28S%20%5Ccap%20T%29%20%2B%20v_i%28S%20%5Ccup%20T%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_i(S) + v_i(T) \geq v_i(S \cap T) + v_i(S \cup T)' title='v_i(S) + v_i(T) \geq v_i(S \cap T) + v_i(S \cup T)' class='latex' /> for any subsets <img src='http://s.wordpress.com/latex.php?latex=S%2CT&#038;bg=T&#038;fg=000000&#038;s=0' alt='S,T' title='S,T' class='latex' /> of <img src='http://s.wordpress.com/latex.php?latex=%5Bm%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='[m]' title='[m]' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=f%28S%29%20%5Cleq%20f%28T%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='f(S) \leq f(T)' title='f(S) \leq f(T)' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=S%20%5Csubseteq&#038;bg=T&#038;fg=000000&#038;s=0' alt='S \subseteq' title='S \subseteq' class='latex' /> T. Now, the goal is to partition the items in <img src='http://s.wordpress.com/latex.php?latex=n&#038;bg=T&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> sets <img src='http://s.wordpress.com/latex.php?latex=S_1%2C%20%5Chdots%2C%20S_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_1, \hdots, S_n' title='S_1, \hdots, S_n' class='latex' /> in order to maximize <img src='http://s.wordpress.com/latex.php?latex=%5Csum_i%20v_i%28S_i%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\sum_i v_i(S_i)' title='\sum_i v_i(S_i)' class='latex' />.</p>
<p style="text-align: center;">
<p>This problem is clearly NP-hard (for example, we can reduce from Maximum Coverage or any similar problem), but is has a very simples Greedy Approximation. The approximation goes as follows: start with all sets being empty, i.e., start with <img src='http://s.wordpress.com/latex.php?latex=S_i%20%3D%20%5Cemptyset%2C%20%5Cforall%20i&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_i = \emptyset, \forall i' title='S_i = \emptyset, \forall i' class='latex' /> then for each item <img src='http://s.wordpress.com/latex.php?latex=j&#038;bg=T&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' />, find the player <img src='http://s.wordpress.com/latex.php?latex=i&#038;bg=T&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> with maximum <img src='http://s.wordpress.com/latex.php?latex=v_i%28j%20%5Cvert%20S_i%29%20%3D%20v_i%28S_i%20%5Ccup%20%5C%7Bj%5C%7D%29%20-%20v_i%28S_i%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_i(j \vert S_i) = v_i(S_i \cup \{j\}) - v_i(S_i)' title='v_i(j \vert S_i) = v_i(S_i \cup \{j\}) - v_i(S_i)' class='latex' /> and add <img src='http://s.wordpress.com/latex.php?latex=j&#038;bg=T&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> to this player. This is a <img src='http://s.wordpress.com/latex.php?latex=2&#038;bg=T&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' />-approximation algorithm. The proof is simple:</p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=S_1%2C%20%5Chdots%2C%20S_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_1, \hdots, S_n' title='S_1, \hdots, S_n' class='latex' /> be the sets returned by the algorithm and <img src='http://s.wordpress.com/latex.php?latex=O_1%2C%20%5Chdots%2C%20O_n&#038;bg=T&#038;fg=000000&#038;s=0' alt='O_1, \hdots, O_n' title='O_1, \hdots, O_n' class='latex' /> the optimal solution. Let also <img src='http://s.wordpress.com/latex.php?latex=S_i%5E%7B%3Cj%7D%20%3D%20S_i%20%5Ccap%20%5C%7B1%2C%202%2C%20%5Chdots%2C%20j-1%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_i^{&lt;j} = S_i \cap \{1, 2, \hdots, j-1\}' title='S_i^{&lt;j} = S_i \cap \{1, 2, \hdots, j-1\}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=O_i%5E%7B%3Cj%7D%20%3D%20O_i%20%5Ccap%20%5C%7B1%2C%202%2C%20%5Chdots%2C%20j-1%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='O_i^{&lt;j} = O_i \cap \{1, 2, \hdots, j-1\}' title='O_i^{&lt;j} = O_i \cap \{1, 2, \hdots, j-1\}' class='latex' />. We can write:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Csum_i%20v_i%28S_i%29%20%3D%20%5Csum_i%20%5Csum_%7Bj%20%5Cin%20S_i%7D%20v_i%28j%20%5Cvert%20S_i%5E%7B%3Cj%7D%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\sum_i v_i(S_i) = \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}) ' title='\sum_i v_i(S_i) = \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}) ' class='latex' /></p>
<p style="text-align: left;">if we added <img src='http://s.wordpress.com/latex.php?latex=j&#038;bg=T&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> to set <img src='http://s.wordpress.com/latex.php?latex=S_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='S_i' title='S_i' class='latex' /> it means that <img src='http://s.wordpress.com/latex.php?latex=v_i%28j%20%5Cvert%20S_i%5E%7B%3Cj%7D%29%20%5Cgeq%20v_k%28j%20%5Cvert%20S_k%5E%7B%3Cj%7D%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='v_i(j \vert S_i^{&lt;j}) \geq v_k(j \vert S_k^{&lt;j}) ' title='v_i(j \vert S_i^{&lt;j}) \geq v_k(j \vert S_k^{&lt;j}) ' class='latex' /> by the Greedy rule. Therefore we can write:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Csum_i%20v_i%28S_i%29%20%5Cgeq%20%5Csum_k%20%5Csum_%7Bj%20%5Cin%20O_k%7D%20v_k%28j%20%5Cvert%20S_k%5E%7B%3Cj%7D%29%20%5Cgeq%20%5Csum_k%20%5Csum_%7Bj%20%5Cin%20O_k%7D%20v_k%28j%20%5Cvert%20S_k%5E%7B%3Cj%7D%20%5Ccup%20O_k%5E%7B%3Cj%7D%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\sum_i v_i(S_i) \geq \sum_k \sum_{j \in O_k} v_k(j \vert S_k^{&lt;j}) \geq \sum_k \sum_{j \in O_k} v_k(j \vert S_k^{&lt;j} \cup O_k^{&lt;j}) ' title='\sum_i v_i(S_i) \geq \sum_k \sum_{j \in O_k} v_k(j \vert S_k^{&lt;j}) \geq \sum_k \sum_{j \in O_k} v_k(j \vert S_k^{&lt;j} \cup O_k^{&lt;j}) ' class='latex' /></p>
<p style="text-align: left;">where the first inequality follows from the Greedy rule and the second follows from submodularity. Now, we can simply write:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cbegin%7Baligned%7D%20%5Csum_i%20v_i%28S_i%29%20%26%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_i%20%5Csum_%7Bj%20%5Cin%20S_i%7D%20v_i%28j%20%5Cvert%20S_i%5E%7B%3Cj%7D%29%20%2B%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_k%20%5Csum_%7Bj%20%5Cin%20O_k%7D%20%20v_k%28j%20%5Cvert%20S_k%5E%7B%3Cj%7D%20%5Ccup%20O_k%5E%7B%3Cj%7D%29%20%5C%5C%20%26%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_i%20%5Csum_%7Bj%20%5Cin%20S_i%7D%20v_i%28j%20%5Cvert%20S_i%5E%7B%3Cj%7D%20%20%5Ccup%20O_i%5E%7B%3Cj%7D%29%20%2B%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_k%20%5Csum_%7Bj%20%5Cin%20O_k%7D%20%20v_k%28j%20%5Cvert%20%20S_k%5E%7B%3Cj%7D%20%5Ccup%20O_k%5E%7B%3Cj%7D%29%20%5C%5C%20%26%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_i%20%5Csum_%7Bj%20%5Cin%20S_i%20%5Ccup%20O_i%7D%20v_i%28j%20%5Cvert%20S_i%5E%7B%3Cj%7D%20%5Ccup%20O_i%5E%7B%3Cj%7D%29%20%5C%5C%20%26%20%3D%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_i%20v_i%28S_i%20%5Ccup%20O_i%29%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Csum_i%20v_i%28O_i%29%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\begin{aligned} \sum_i v_i(S_i) &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}) + \frac{1}{2} \sum_k \sum_{j \in O_k}  v_k(j \vert S_k^{&lt;j} \cup O_k^{&lt;j}) \\ &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}  \cup O_i^{&lt;j}) + \frac{1}{2} \sum_k \sum_{j \in O_k}  v_k(j \vert  S_k^{&lt;j} \cup O_k^{&lt;j}) \\ &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i \cup O_i} v_i(j \vert S_i^{&lt;j} \cup O_i^{&lt;j}) \\ &amp; = \frac{1}{2} \sum_i v_i(S_i \cup O_i) \geq \frac{1}{2} \sum_i v_i(O_i) \end{aligned}' title='\begin{aligned} \sum_i v_i(S_i) &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}) + \frac{1}{2} \sum_k \sum_{j \in O_k}  v_k(j \vert S_k^{&lt;j} \cup O_k^{&lt;j}) \\ &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i} v_i(j \vert S_i^{&lt;j}  \cup O_i^{&lt;j}) + \frac{1}{2} \sum_k \sum_{j \in O_k}  v_k(j \vert  S_k^{&lt;j} \cup O_k^{&lt;j}) \\ &amp; \geq \frac{1}{2} \sum_i \sum_{j \in S_i \cup O_i} v_i(j \vert S_i^{&lt;j} \cup O_i^{&lt;j}) \\ &amp; = \frac{1}{2} \sum_i v_i(S_i \cup O_i) \geq \frac{1}{2} \sum_i v_i(O_i) \end{aligned}' class='latex' /></p>
<p style="text-align: left;">An improved algorithm was given by <a href="http://sites.google.com/site/dobzin/papers/xos.ps?attredirects=0">Dobzinski and Shapira</a> achieving an <img src='http://s.wordpress.com/latex.php?latex=1%20-%5Cfrac%7B1%7D%7Be%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='1 -\frac{1}{e}' title='1 -\frac{1}{e}' class='latex' /> approximation using demand queries &#8211; that are used as a separation oracle for a suitable linear program.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/293/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Probability Puzzles</title>
		<link>http://www.bigredbits.com/archives/278</link>
		<comments>http://www.bigredbits.com/archives/278#comments</comments>
		<pubDate>Wed, 17 Feb 2010 02:54:53 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[puzzles]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=278</guid>
		<description><![CDATA[Today in a dinner with Thanh, Hu and Joel I heard about a paradox I haven&#8217;t heard so far. Probability is full of cute problems that challenge our understanding of the basic concepts. The most famous of them is the Monty Hall Problem, which asks: You are on a TV game show and there are [...]]]></description>
			<content:encoded><![CDATA[<p>Today in a dinner with Thanh, Hu and Joel I heard about a paradox I haven&#8217;t heard so far. Probability is full of cute problems that challenge our understanding of the basic concepts. The most famous of them is the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Monty_Hall_problem">Monty Hall Problem</a>, which asks:</p>
<blockquote><p>You are on a TV game show and there are <img src='http://s.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> doors &#8211; one of them contains a prize, say a car and the other two door contain things you don&#8217;t care about, say goats. You choose a door. Then the TV host, who knows where the prize is, opens one door you haven&#8217;t chosen and that he knows has a goat. Then he asks if you want to stick to the door you have chosen or if you want to change to the other door. What should you do?</p></blockquote>
<p>Probably you&#8217;ve already came across this question in some moment of your life and the answer is that changing doors would double your probability of getting the price. There are several ways of convincing your intuitions:</p>
<ul>
<li> Do the math: when you chose the door, there were three options so the prize is in the door you chose with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B3%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{3}}' title='{\frac{1}{3}}' class='latex' /> probability and in the other door with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B2%7D%7B3%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{2}{3}}' title='{\frac{2}{3}}' class='latex' /> (note that the presenter can always open some door with a goat, so conditioning on that event doesn&#8217;t give you any new information).</li>
<li> Do the actual experiment (computationally) as done <a class="snap_noshots" href="http://igor-nav.livejournal.com/16784.html">here</a>. One can always ask a friend to help, get some goats and perform the actual experiment.</li>
<li> To convince yourself that &#8220;it doesn&#8217;t matter&#8221; is not correct, think <img src='http://s.wordpress.com/latex.php?latex=%7B100%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{100}' title='{100}' class='latex' /> doors. You choose one and the TV host open <img src='http://s.wordpress.com/latex.php?latex=%7B98%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{98}' title='{98}' class='latex' /> of them and asks if you want to change or stick with your first choice. Wouldn&#8217;t you change?</li>
</ul>
<p>I&#8217;ve seen TV shows where this happened and I acknowledge that other things may be involved: there might be behavioral and psychologic issues associated with the Monty Hall problem &#8211; and possibly those would interest <a class="snap_noshots" href="http://www.amazon.com/Predictably-Irrational-Revised-Intl-Decisions/dp/0062018205/ref=sr_1_2?ie=UTF8&amp;s=books&amp;qid=1266295712&amp;sr=8-2">Dan Ariely</a>, whose book I began reading today &#8211; and looks quite fun. But the problem they told me about today in dinner was another: <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Two_envelope_problem">the envelope problem</a>:</p>
<blockquote><p>There are two envelopes and you are told that in one of them there is twice the amount that there is in the other. You choose one of the envelopes at random and open it: it contains <img src='http://s.wordpress.com/latex.php?latex=%7B100%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{100}' title='{100}' class='latex' /> bucks. Now, you don&#8217;t know if the other envelope has <img src='http://s.wordpress.com/latex.php?latex=%7B50%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{50}' title='{50}' class='latex' /> bucks or <img src='http://s.wordpress.com/latex.php?latex=%7B200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{200}' title='{200}' class='latex' /> bucks. Then someone asks you if you wanted to pay <img src='http://s.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' /> bucks and change to the other envelope. Should you change?</p></blockquote>
<p>Now, consider two different solutions to this problem: the first is fallacious and the second is correct:</p>
<ol>
<li> If I don&#8217;t change, I get <img src='http://s.wordpress.com/latex.php?latex=%7B100%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{100}' title='{100}' class='latex' /> bucks, if I change I pay a penalty of <img src='http://s.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' /> and I get either <img src='http://s.wordpress.com/latex.php?latex=%7B50%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{50}' title='{50}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7B200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{200}' title='{200}' class='latex' /> with equal probability, so my expected prize if I change is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B200%2B50%7D%7B2%7D-10%20%3D%20115%20%3E%20100%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{200+50}{2}-10 = 115 &gt; 100}' title='{\frac{200+50}{2}-10 = 115 &gt; 100}' class='latex' />, so I should change.</li>
<li> I know there is one envelope with <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> and one with <img src='http://s.wordpress.com/latex.php?latex=%7B2x%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2x}' title='{2x}' class='latex' />, then my expected prize if I don&#8217;t change is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bx%20%2B%202x%7D%7B2%7D%20%3D%20%5Cfrac%7B3%7D%7B2%7Dx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{x + 2x}{2} = \frac{3}{2}x}' title='{\frac{x + 2x}{2} = \frac{3}{2}x}' class='latex' />. If I change, my expected prize is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bx%20%2B%202x%7D%7B2%7D%20-%2010%20%3C%20%5Cfrac%7B3%7D%7B2%7Dx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{x + 2x}{2} - 10 &lt; \frac{3}{2}x}' title='{\frac{x + 2x}{2} - 10 &lt; \frac{3}{2}x}' class='latex' />, so I should not change.</li>
</ol>
<p>The fallacy in the first argument is perceiving a probability distribution where there is no one. Either the other envelope contains <img src='http://s.wordpress.com/latex.php?latex=%7B50%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{50}' title='{50}' class='latex' /> bucks or it contains <img src='http://s.wordpress.com/latex.php?latex=%7B200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{200}' title='{200}' class='latex' /> bucks &#8211; we just don&#8217;t know, but there is no probability distribution there &#8211; it is a deterministic choice by the game designer. Most of those paradoxes are a result of either an ill-defined probability space, as <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Bertrand%27s_paradox_%28probability%29">Bertrand&#8217;s Paradox</a> or a wrong comprehension of the probability space, as in Monty Hall or in several paradoxes exploring the same idea as: <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Three_Prisoners_problem">Three Prisioners</a>, <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Sleeping_Beauty_problem">Sleeping Beauty</a>, <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Boy_or_Girl_paradox">Boy or Girl Paradox</a>, &#8230;</p>
<p style="text-align: center;"><img class="aligncenter size-medium wp-image-280" title="73a_humpty-dumpty" src="http://www.bigredbits.com/wp-content/uploads/2010/02/73a_humpty-dumpty-300x184.jpg" alt="73a_humpty-dumpty" width="300" height="184" /></p>
<p>There was very recently a thrilling discussion about a variant on the envelope paradox in the <a class="snap_noshots" href="http://blag.xkcd.com/">xkcd blag </a> &#8211; which is the blog accompaning that <a class="snap_noshots" href="http://xkcd.com/">amazing webcomic</a>. There was a recent blog post with <a class="snap_noshots" href="http://blog.xkcd.com/2010/02/09/math-puzzle/">a very intriguing problem</a>. A better idea is to go there and read the discussion, but if you are not doing so, let me summarize it here. The problem is:</p>
<blockquote><p>There are two envelopes containing each of them a distinct real number. You pick one envelope at random, open it and see the number, then you are asked to guess if the number in the other envelope is larger or smaller then the previous one. Can you guess correctly with more than <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> probability?</p>
<p>A related problem is: given that you are playing the envelope game and there are number <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BB%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B}' title='{B}' class='latex' /> (with <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%3C%20B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A &lt; B}' title='{A &lt; B}' class='latex' />). You pick one envelope at random and then you are able to look at the content of the first envelope you open and then decide to switch or not. Is there a strategy that gives you expected earnings greater than <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7BA%2BB%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{A+B}{2}}' title='{\frac{A+B}{2}}' class='latex' /> ?</p></blockquote>
<p>The very unexpected answers is <strong>yes</strong> !!! The strategy that Randall presents in the blog and there is a <a class="snap_noshots" href="http://www.iwr.uni-heidelberg.de/groups/ngg/People/winckler/PU/p008.html">link to the source here</a> is: let <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> be a random variable on <img src='http://s.wordpress.com/latex.php?latex=%7B%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{\mathbb R}}' title='{{\mathbb R}}' class='latex' /> such that for each <img src='http://s.wordpress.com/latex.php?latex=%7Ba%3Cb%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a&lt;b}' title='{a&lt;b}' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%7BP%28a%20%3C%20X%20%3C%20b%29%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{P(a &lt; X &lt; b) &gt; 0}' title='{P(a &lt; X &lt; b) &gt; 0}' class='latex' />, for example, the normal distribution or the logistic distribution.</p>
<p>Sample <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> then open the envelope and find a number <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> now, if <img src='http://s.wordpress.com/latex.php?latex=%7BX%20%3C%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X &lt; S}' title='{X &lt; S}' class='latex' /> say the other number is lower and if <img src='http://s.wordpress.com/latex.php?latex=%7BX%20%3E%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X &gt; S}' title='{X &gt; S}' class='latex' /> say the other number is higher. You get it right with probability</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20P%28%5Ctext%7Bpicked%20%7DA%29%20P%28X%20%3E%20A%29%20%2B%20P%28%5Ctext%7Bpicked%20%7DB%29%20P%28X%20%3C%20B%29%20%3D%20%5Cfrac%7B1%7D%7B2%7D%20%281%20%2B%20P%28A%20%3C%20X%20%3C%20B%29%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  P(\text{picked }A) P(X &gt; A) + P(\text{picked }B) P(X &lt; B) = \frac{1}{2} (1 + P(A &lt; X &lt; B)) ' title='\displaystyle  P(\text{picked }A) P(X &gt; A) + P(\text{picked }B) P(X &lt; B) = \frac{1}{2} (1 + P(A &lt; X &lt; B)) ' class='latex' /></p>
<p>which is impressive. If you follow your guess, your expected earning <img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> is:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%26P%28%5Ctext%7Bpicked%20%7DA%29%20%5Cmathop%7B%5Cmathbb%20E%7D%5BY%20%5Cvert%20%5Ctext%7Bpicked%20%7DA%5D%20%2B%20P%28%5Ctext%7Bpicked%20%7DB%29%20%5Cmathop%7B%5Cmathbb%20E%7D%5BY%20%5Cvert%20%5Ctext%7Bpicked%20%7DB%5D%20%3D%20%5C%5C%20%26%20%3D%20%5Cfrac%7B1%7D%7B2%7D%20%5BP%28X%3CA%29%20A%20%2B%20P%28X%3EA%29%20B%5D%20%2B%20%5Cfrac%7B1%7D%7B2%7D%20%5BP%28X%3CB%29%20B%20%2B%20P%28X%3EB%29%20A%5D%20%5C%5C%20%26%3D%20%5Cfrac%7B1%7D%7B2%7D%5BA%20%5BP%28X%3CA%29%20%2B%20P%28X%3EB%29%5D%20%2B%20B%20%5BP%28X%3EA%29%20%2B%20P%28X%3CB%29%5D%5D%20%3E%20%5Cfrac%7BA%2BB%7D%7B2%7D%20%5C%5C%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} &amp;P(\text{picked }A) \mathop{\mathbb E}[Y \vert \text{picked }A] + P(\text{picked }B) \mathop{\mathbb E}[Y \vert \text{picked }B] = \\ &amp; = \frac{1}{2} [P(X&lt;A) A + P(X&gt;A) B] + \frac{1}{2} [P(X&lt;B) B + P(X&gt;B) A] \\ &amp;= \frac{1}{2}[A [P(X&lt;A) + P(X&gt;B)] + B [P(X&gt;A) + P(X&lt;B)]] &gt; \frac{A+B}{2} \\ \end{aligned}' title='\displaystyle \begin{aligned} &amp;P(\text{picked }A) \mathop{\mathbb E}[Y \vert \text{picked }A] + P(\text{picked }B) \mathop{\mathbb E}[Y \vert \text{picked }B] = \\ &amp; = \frac{1}{2} [P(X&lt;A) A + P(X&gt;A) B] + \frac{1}{2} [P(X&lt;B) B + P(X&gt;B) A] \\ &amp;= \frac{1}{2}[A [P(X&lt;A) + P(X&gt;B)] + B [P(X&gt;A) + P(X&lt;B)]] &gt; \frac{A+B}{2} \\ \end{aligned}' class='latex' /></p>
<p>The xkcd pointed to this cool <a class="snap_noshots" href="http://www.iwr.uni-heidelberg.de/groups/ngg/People/winckler/PU/">archive of puzzles and riddles</a>. I was also told that the <a class="snap_noshots" href="http://forums.xkcd.com/viewforum.php?f=3&amp;sid=0a47f2eeadd72be7890309b1c685c503">xkcd puzzle forum</a> is also a source of excellent puzzles, as this:</p>
<blockquote><p>You are the most eligible bachelor in the kingdom, and as such the King has invited you to his castle so that you may choose one of his three daughters to marry. The eldest princess is honest and always tells the truth. The youngest princess is dishonest and always lies. The middle princess is mischievous and tells the truth sometimes and lies the rest of the time. As you will be forever married to one of the princesses, you want to marry the eldest (truth-teller) or the youngest (liar) because at least you know where you stand with them. The problem is that you cannot tell which sister is which just by their appearance, and the King will only grant you ONE yes or no question which you may only address to ONE of the sisters. What yes or no question can you ask which will ensure you do not marry the middle sister?</p></blockquote>
<p>copied from <a class="snap_noshots" href="http://forums.xkcd.com/viewtopic.php?f=3&amp;t=87">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/278/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Walrasian Equilibrium I</title>
		<link>http://www.bigredbits.com/archives/274</link>
		<comments>http://www.bigredbits.com/archives/274#comments</comments>
		<pubDate>Tue, 16 Feb 2010 04:32:53 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[equilibrium theory]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[market design]]></category>
		<category><![CDATA[microeconomics]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=274</guid>
		<description><![CDATA[Currently I&#8217;ve been trying to understand more about the dynamics of markets and basic concepts of microeconomic theory and, as always, writing a blog post will help me to keep my ideas clear. First, why are markets interesting from a computer scientist/mathematician point of view? Markets are multi-objective optimization problems: one can think of the [...]]]></description>
			<content:encoded><![CDATA[<p>Currently I&#8217;ve been trying to understand more about the dynamics of markets and basic concepts of <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Microeconomic">microeconomic theory</a> and, as always, writing a blog post will help me to keep my ideas clear. First, why are markets interesting from a computer scientist/mathematician point of view?</p>
<ul>
<li> Markets are multi-objective optimization problems: one can think of the possible state of a market some point in a space of possible <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%3D%20%28x_1%2C%20%5Chdots%2C%20x_N%29%20%5Cin%20%5COmega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x = (x_1, \hdots, x_N) \in \Omega}' title='{x = (x_1, \hdots, x_N) \in \Omega}' class='latex' />. Each player of the market controls one variable, say <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i}' title='{x_i}' class='latex' /> and is interested in maximizing one objective function <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(x)}' title='{f_i(x)}' class='latex' />. So, player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is trying to set <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%3D%20%5Ctext%7Bargmax%7D_%7Bx_i%7D%20f_i%28x_i%2C%20x_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i = \text{argmax}_{x_i} f_i(x_i, x_{-i})}' title='{x_i = \text{argmax}_{x_i} f_i(x_i, x_{-i})}' class='latex' />.</li>
<li> Markets are a computational model: one can think of a market as a way of performing a certain computation &#8211; as extracting some kind of information, as a <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Prediction_market">prediction market</a>, <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Stock_exchange">stock exchanges</a>, &#8230; If we think of it as a computational device, we are asking the same questions: given those preferences which are implicit functions to each of the agents, calculate &#8220;fair&#8221; prices of items.</li>
<li> Markets are distributed systems where each part of the system has a selfish interest.</li>
</ul>
<p>A market is composed by a set <img src='http://s.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> of commodities, <img src='http://s.wordpress.com/latex.php?latex=%7BI%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{I}' title='{I}' class='latex' /> of consumers and <img src='http://s.wordpress.com/latex.php?latex=%7BJ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{J}' title='{J}' class='latex' /> of producers. Now, we describe how to characterize each of them:</p>
<ul>
<li> Each <strong>consumer</strong> is defined by a set <img src='http://s.wordpress.com/latex.php?latex=%7BX_i%20%5Csubseteq%20%7B%5Cmathbb%20R%7D%5EL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X_i \subseteq {\mathbb R}^L}' title='{X_i \subseteq {\mathbb R}^L}' class='latex' /> of commodities combinations he is interested (typically we take <img src='http://s.wordpress.com/latex.php?latex=%7BX_i%20%3D%20%7B%5Cmathbb%20R%7D%5EL_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X_i = {\mathbb R}^L_+}' title='{X_i = {\mathbb R}^L_+}' class='latex' />) and an utility function <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%20%3A%20X_i%20%5Crightarrow%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i : X_i \rightarrow {\mathbb R}}' title='{u_i : X_i \rightarrow {\mathbb R}}' class='latex' /> expressing his interest for this bundle of commodities. Consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> will try to maximize <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28%5Ccdot%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(\cdot)}' title='{u_i(\cdot)}' class='latex' /> in a further restricted <img src='http://s.wordpress.com/latex.php?latex=%7BX_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X_i}' title='{X_i}' class='latex' />.</li>
<li> Each <strong>producer</strong> is define by a set <img src='http://s.wordpress.com/latex.php?latex=%7BY_j%20%5Csubset%20%7B%5Cmathbb%20R%7D%5EL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y_j \subset {\mathbb R}^L}' title='{Y_j \subset {\mathbb R}^L}' class='latex' /> it has the capacity to produce.</li>
<li> <strong>Endowments</strong>: Each consumer comes to the market with an initial endowment <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_i%20%5Cin%20%7B%5Cmathbb%20R%7D%5EL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_i \in {\mathbb R}^L}' title='{\omega_i \in {\mathbb R}^L}' class='latex' />, so for <img src='http://s.wordpress.com/latex.php?latex=%7Bj%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j \in L}' title='{j \in L}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_%7Bij%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_{ij}}' title='{\omega_{ij}}' class='latex' /> is the amount of commodity <img src='http://s.wordpress.com/latex.php?latex=%7Bj%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j}' title='{j}' class='latex' /> that consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> originally has. The initial total endowment of the market is given by <img src='http://s.wordpress.com/latex.php?latex=%7B%5Coverline%7B%5Comega%7D%20%3D%20%5Csum_%7Bi%20%5Cin%20I%7D%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\overline{\omega} = \sum_{i \in I} \omega_i}' title='{\overline{\omega} = \sum_{i \in I} \omega_i}' class='latex' />, which is a vector indicating how much of each commodity originally exists in the market.</li>
<li> <strong>Shares</strong>: consumers have shares in the companies, so for <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20I%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in I}' title='{i \in I}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bj%20%5Cin%20J%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j \in J}' title='{j \in J}' class='latex' />, consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> has <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta_%7Bij%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta_{ij}}' title='{\theta_{ij}}' class='latex' /> shares of company <img src='http://s.wordpress.com/latex.php?latex=%7Bj%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j}' title='{j}' class='latex' />, such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20%5Ctheta_%7Bij%7D%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i \theta_{ij} = 1}' title='{\sum_i \theta_{ij} = 1}' class='latex' />.</li>
</ul>
<p>Something very crucial is missing in this picture: a way to compare commodities and something that makes exchanges possible: the answer to that is to attribute <strong>prices</strong> to the items. How to attribute prices to the items so that the market works fine? A price vector is a vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%7B%5Cmathbb%20R%7D%5EL_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in {\mathbb R}^L_+}' title='{p \in {\mathbb R}^L_+}' class='latex' />. Consider the following scenario after prices <img src='http://s.wordpress.com/latex.php?latex=%7Bp_%5Cell%2C%20%5Cell%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_\ell, \ell \in L}' title='{p_\ell, \ell \in L}' class='latex' /> are established to commodities:</p>
<ul>
<li> by producing <img src='http://s.wordpress.com/latex.php?latex=%7By_j%20%5Cin%20Y_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y_j \in Y_j}' title='{y_j \in Y_j}' class='latex' />, company <img src='http://s.wordpress.com/latex.php?latex=%7Bj%20%5Cin%20J%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j \in J}' title='{j \in J}' class='latex' /> gets profit <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20y_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot y_j}' title='{p \cdot y_j}' class='latex' />, so each company will try to maximize its profit producing <img src='http://s.wordpress.com/latex.php?latex=%7By_j%5E%2A%20%5Cin%20%5Ctext%7Bargmax%7D_%7By_j%20%5Cin%20Y_j%7D%20p%20%5Ccdot%20y_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y_j^* \in \text{argmax}_{y_j \in Y_j} p \cdot y_j}' title='{y_j^* \in \text{argmax}_{y_j \in Y_j} p \cdot y_j}' class='latex' />.</li>
<li> each consumer sells its initial endowment and gets the profit respective to the companies he owns. So, consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> gets <img src='http://s.wordpress.com/latex.php?latex=%7Bw_i%20%3D%20p%20%5Ccdot%20%5Comega_i%20%2B%20%5Csum_%7Bj%20%5Cin%20J%7D%20%5Ctheta_%7Bij%7D%20p%20%5Ccdot%20y_j%5E%2A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w_i = p \cdot \omega_i + \sum_{j \in J} \theta_{ij} p \cdot y_j^*}' title='{w_i = p \cdot \omega_i + \sum_{j \in J} \theta_{ij} p \cdot y_j^*}' class='latex' />.</li>
<li> now consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> uses the money he has to buy the best bundle he can afford, which is <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%5E%2A%20%3D%20%5Ctext%7Bargmax%7D%20%5C%7Bu_i%28x_i%29%3B%20x_i%20%5Cin%20X_i%2C%20p%20%5Ccdot%20x_i%20%5Cleq%20w_i%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i^* = \text{argmax} \{u_i(x_i); x_i \in X_i, p \cdot x_i \leq w_i\}}' title='{x_i^* = \text{argmax} \{u_i(x_i); x_i \in X_i, p \cdot x_i \leq w_i\}}' class='latex' />.</li>
</ul>
<p>The amount of commodities in the market must conserve, so that is possible only if we get:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Csum_%7Bi%20%5Cin%20I%7D%20x_i%5E%2A%20%3D%20%5Csum_%7Bi%20%5Cin%20I%7D%20%5Comega_i%20%2B%20%5Csum_%7Bj%20%5Cin%20J%7D%20y_j%5E%2A&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \sum_{i \in I} x_i^* = \sum_{i \in I} \omega_i + \sum_{j \in J} y_j^*' title='\displaystyle \sum_{i \in I} x_i^* = \sum_{i \in I} \omega_i + \sum_{j \in J} y_j^*' class='latex' />
<p>First, it is not clear if such a price vector exists. If it exists, is it unique? If this is an equilibrium, is it the best thing for the consumers? How those prices can be set in practice without a centralized authority? Can people lie? Below, let&#8217;s collect a couple of questions I&#8217;ll try to answer (yes, no or unknown) in this and the following posts.</p>
<p><strong>Question 1:</strong> Does a price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \geq 0}' title='{p \geq 0}' class='latex' /> always exist that generates an equilibrium?</p>
<p><strong>Question 2:</strong> If it exists, is it unique?</p>
<p><strong>Question 3:</strong> Can we describe an efficent method to find <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> ?</p>
<p><strong>Question 4:</strong> Is it the best thing for the consumers in the following sense: if <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%5E%2A%2Cy%5E%2A%2Cp%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x^*,y^*,p)}' title='{(x^*,y^*,p)}' class='latex' /> is an equilibrium, are there feasible <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%2Cy%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x,y)}' title='{(x,y)}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%20%5Cgeq%20u_i%28x_i%5E%2A%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i) \geq u_i(x_i^*)}' title='{u_i(x_i) \geq u_i(x_i^*)}' class='latex' /> and for at least one consumer <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%20%3E%20u_i%28x_i%5E%2A%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i) &gt; u_i(x_i^*)}' title='{u_i(x_i) &gt; u_i(x_i^*)}' class='latex' />? (This is called Pareto improvement)</p>
<p><strong>Question 5:</strong> A central authority could use the knowledge about functions <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> and endowments <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_i}' title='{\omega_i}' class='latex' /> to calculate the price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> using some method. Can consumers <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> be better off by lieing about their utility and endowments?</p>
<p><strong>Question 6:</strong> How prices get defined without a central authority? Is there a dynamic/game-theoretical model to that?</p>
<p>For simplicity, let&#8217;s think of <strong>Exchange Economies</strong>, which are economies with no producers. Let&#8217;s define it formally:</p>
<blockquote><p><strong>Definition 1</strong> <em> An exchange economy is composed by a set of commodities <img src='http://s.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> and a set of consumers <img src='http://s.wordpress.com/latex.php?latex=%7BI%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{I}' title='{I}' class='latex' /> each with an utility <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%20%3A%20X_i%20%5Crightarrow%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i : X_i \rightarrow {\mathbb R}}' title='{u_i : X_i \rightarrow {\mathbb R}}' class='latex' /> and an initial endowment <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_i%20%5Cin%20%7B%5Cmathbb%20R%7D%5EL_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_i \in {\mathbb R}^L_+}' title='{\omega_i \in {\mathbb R}^L_+}' class='latex' />. </em></p></blockquote>
<blockquote><p><strong>Definition 2</strong> <em> A price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%7B%5Cmathbb%20R%7D%5EL_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in {\mathbb R}^L_+}' title='{p \in {\mathbb R}^L_+}' class='latex' /> is a Walrasian equilibrium for an exchange economy <img src='http://s.wordpress.com/latex.php?latex=%7B%28L%2C%20I%2C%20%5C%7BX_i%2C%20u_i%2C%20%5Comega_i%5C%7D_%7Bi%20%5Cin%20I%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(L, I, \{X_i, u_i, \omega_i\}_{i \in I})}' title='{(L, I, \{X_i, u_i, \omega_i\}_{i \in I})}' class='latex' /> if there is <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%5E%2A%20%5Cin%20X_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i^* \in X_i}' title='{x_i^* \in X_i}' class='latex' /> such that: </em></p>
<ol>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%5E%2A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i^*}' title='{x_i^*}' class='latex' /> s.t. <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%5E%2A%29%20%3D%20%5Cmax%20%5C%7Bu_i%28x_i%29%3B%20x_i%20%5Cin%20X_i%2C%20p%20%5Ccdot%20x_i%20%5Cleq%20p%20%5Ccdot%20%5Comega_i%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i^*) = \max \{u_i(x_i); x_i \in X_i, p \cdot x_i \leq p \cdot \omega_i\}}' title='{u_i(x_i^*) = \max \{u_i(x_i); x_i \in X_i, p \cdot x_i \leq p \cdot \omega_i\}}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20x_i%5E%2A%20%5Cleq%20%5Csum_i%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i x_i^* \leq \sum_i \omega_i}' title='{\sum_i x_i^* \leq \sum_i \omega_i}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20%28%5Csum_i%20x_i%5E%2A%20-%20%5Comega_i%29%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot (\sum_i x_i^* - \omega_i)= 0}' title='{p \cdot (\sum_i x_i^* - \omega_i)= 0}' class='latex' /> </em></li>
</ol>
<p><em> </em></p></blockquote>
<p>The first condition says that each consumer is maximizing his utility given his prices, the second says that we can&#8217;t buy more commodities than what is available in the market and the third, called Walras&#8217; Law, says that if there is surplus of a certain product, it should have price zero. It is by far the most unnatural of those, but it can be easily justifiable in some circumnstances: suppose we say that utilities are non-satiated if for each <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%5Cin%20X_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i \in X_i}' title='{x_i \in X_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cepsilon%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\epsilon &gt; 0}' title='{\epsilon &gt; 0}' class='latex' />, there is <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27_i%20%5Cin%20X_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;_i \in X_i}' title='{x&#039;_i \in X_i}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7B%5CVert%20x_i%20-%20x%27_i%20%5CVert%20%3C%20%5Cepsilon%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Vert x_i - x&#039;_i \Vert &lt; \epsilon}' title='{\Vert x_i - x&#039;_i \Vert &lt; \epsilon}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x%27_i%29%20%3E%20u_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x&#039;_i) &gt; u_i(x_i)}' title='{u_i(x&#039;_i) &gt; u_i(x_i)}' class='latex' />. If <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> are differentiable, that would mean <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cnabla%20u_i%20%5Cneq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\nabla u_i \neq 0}' title='{\nabla u_i \neq 0}' class='latex' />, for example a linear function <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%20%3D%20%5Csum_%5Cell%20u_%7Bi%5Cell%7D%20x_%7Bi%5Cell%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i) = \sum_\ell u_{i\ell} x_{i\ell}}' title='{u_i(x_i) = \sum_\ell u_{i\ell} x_{i\ell}}' class='latex' /> with some <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7Bi%5Cell%7D%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{i\ell} &gt; 0}' title='{u_{i\ell} &gt; 0}' class='latex' />. In that case, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20%28p%20%5Ccdot%20x_i%5E%2A%20-%20p%20%5Ccdot%20%5Comega_i%29%20%3C%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i (p \cdot x_i^* - p \cdot \omega_i) &lt; 0}' title='{\sum_i (p \cdot x_i^* - p \cdot \omega_i) &lt; 0}' class='latex' /> and some player has money surplus and therefore he could increase his utility.</p>
<p>Now, we define for each price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> the excess demand function <img src='http://s.wordpress.com/latex.php?latex=%7Bz_i%28p%29%20%3D%20x_i%28p%29%20-%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_i(p) = x_i(p) - \omega_i}' title='{z_i(p) = x_i(p) - \omega_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%3D%20%5Csum_i%20z_i%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) = \sum_i z_i(p)}' title='{z(p) = \sum_i z_i(p)}' class='latex' />. Now, under non-satiated utilities, by the last argument, we have that <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> is an equilibrium vector iff <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%5Cleq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) \leq 0}' title='{z(p) \leq 0}' class='latex' />. Actually, if <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> are also strong monotone, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%20%2B%20%5Cxi%29%20%5Cgeq%20u_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i + \xi) \geq u_i(x_i)}' title='{u_i(x_i + \xi) \geq u_i(x_i)}' class='latex' /> for each <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cxi%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\xi \geq 0}' title='{\xi \geq 0}' class='latex' />, then it becomes: <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> is an equilibrium iff <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) = 0}' title='{z(p) = 0}' class='latex' />, which means that the market clears:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Csum_i%20x_i%5E%2A%20%3D%20%5Csum_i%20%5Comega_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \sum_i x_i^* = \sum_i \omega_i' title='\displaystyle \sum_i x_i^* = \sum_i \omega_i' class='latex' />
<p>The question that is easier to answer is Question 4 and it is sometimes refered as the First Fundamental Theorem of Welfare Economics:</p>
<blockquote><p><strong>Theorem 3</strong> <em> Given non-satiated preferences, each equilibrium <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%2Cp%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x,p)}' title='{(x,p)}' class='latex' /> is Pareto, i.e. there is no other feasible allocation <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;}' title='{x&#039;}' class='latex' /> such that for all <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x%27_i%29%20%5Cgeq%20u_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x&#039;_i) \geq u_i(x_i)}' title='{u_i(x&#039;_i) \geq u_i(x_i)}' class='latex' /> with the inequality strict for at least one component. </em></p></blockquote>
<p><em>Proof:</em> Suppose there were, since <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x%27_i%29%20%5Cgeq%20u_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x&#039;_i) \geq u_i(x_i)}' title='{u_i(x&#039;_i) \geq u_i(x_i)}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20x%27_i%20%5Cgeq%20p%20%5Ccdot%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot x&#039;_i \geq p \cdot \omega_i}' title='{p \cdot x&#039;_i \geq p \cdot \omega_i}' class='latex' />, because if <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20x%27_i%20%3C%20p%20%5Ccdot%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot x&#039;_i &lt; p \cdot \omega_i}' title='{p \cdot x&#039;_i &lt; p \cdot \omega_i}' class='latex' /> then we could improve the utility of <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;_i}' title='{x&#039;_i}' class='latex' /> still within the budget, contradicting the optimality of <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i)}' title='{u_i(x_i)}' class='latex' /> for that budget. And clearly <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x%27_i%29%20%3E%20u_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x&#039;_i) &gt; u_i(x_i)}' title='{u_i(x&#039;_i) &gt; u_i(x_i)}' class='latex' /> implies <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20x%27_i%20%3E%20p%20%5Ccdot%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot x&#039;_i &gt; p \cdot \omega_i}' title='{p \cdot x&#039;_i &gt; p \cdot \omega_i}' class='latex' />.</p>
<p>Summing over <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />, we get <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20p%20x%27_i%20%3E%20%5Csum_i%20p%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i p x&#039;_i &gt; \sum_i p \omega_i}' title='{\sum_i p x&#039;_i &gt; \sum_i p \omega_i}' class='latex' />, what is a contradiction, because since <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;}' title='{x&#039;}' class='latex' /> is feasible, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20x%27_i%20%5Cleq%20%5Csum_i%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i x&#039;_i \leq \sum_i \omega_i}' title='{\sum_i x&#039;_i \leq \sum_i \omega_i}' class='latex' /> and therefore <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20p%20%5Ccdot%20x%27_i%20%5Cleq%20%5Csum_i%20p%20%5Ccdot%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i p \cdot x&#039;_i \leq \sum_i p \cdot \omega_i}' title='{\sum_i p \cdot x&#039;_i \leq \sum_i p \cdot \omega_i}' class='latex' />. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>Now, let&#8217;s tackle Question 1. We assume linearly of utility: <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%20%3D%20%5Csum_%7B%5Cell%20%5Cin%20L%7D%20u_%7Bi%20%5Cell%7D%20x_%7Bi%20%5Cell%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i) = \sum_{\ell \in L} u_{i \ell} x_{i \ell}}' title='{u_i(x_i) = \sum_{\ell \in L} u_{i \ell} x_{i \ell}}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7Bi%20%5Cell%7D%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{i \ell} &gt; 0}' title='{u_{i \ell} &gt; 0}' class='latex' />. This gives us strong monotonicity and local nonsatiated preferences.</p>
<blockquote><p><strong>Theorem 4</strong> <em> Under linear utilities, there is always an equilibrium price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' />. </em></p></blockquote>
<p>Consider the function <img src='http://s.wordpress.com/latex.php?latex=%7Bz%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z}' title='{z}' class='latex' /> defined above: <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%3D%20%5Csum_i%20%28x_i%28p%29%20-%20%5Comega_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) = \sum_i (x_i(p) - \omega_i)}' title='{z(p) = \sum_i (x_i(p) - \omega_i)}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(p)}' title='{x_i(p)}' class='latex' /> is the bundle of best possible utility. Now, since we are using linear utilities we can&#8217;t guarantee there will be only one such bundle, so instead of considering a function, consider <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i}' title='{x_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bz%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z}' title='{z}' class='latex' /> as being correspondences: <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%2C%20z%3A%20%7B%5Cmathbb%20R%7D%5EL_%2B%20%5Crightarrow%20%5Cmathcal%7BP%7D%28%7B%5Cmathbb%20R%7D%5EL%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i, z: {\mathbb R}^L_+ \rightarrow \mathcal{P}({\mathbb R}^L)}' title='{x_i, z: {\mathbb R}^L_+ \rightarrow \mathcal{P}({\mathbb R}^L)}' class='latex' />, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28p%29%20%5Csubseteq%20%7B%5Cmathbb%20R%7D%5EL_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(p) \subseteq {\mathbb R}^L_+}' title='{x_i(p) \subseteq {\mathbb R}^L_+}' class='latex' /> is the set of all allocations that maximize <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x_i)}' title='{u_i(x_i)}' class='latex' /> subject to <img src='http://s.wordpress.com/latex.php?latex=%7Bp%5Ccdot%20x_i%20%5Cleq%20p%20%5Ccdot%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p\cdot x_i \leq p \cdot \omega_i}' title='{p\cdot x_i \leq p \cdot \omega_i}' class='latex' />. Since <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> are linear functionals, we can calculate <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(p)}' title='{x_i(p)}' class='latex' /> by a Fractional Knapsack algorithm: we sort commodities <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cell%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\ell \in L}' title='{\ell \in L}' class='latex' /> by <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bp_%5Cell%7D%7Bu_%7Bi%5Cell%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{p_\ell}{u_{i\ell}}}' title='{\frac{p_\ell}{u_{i\ell}}}' class='latex' /> and start buying in the cost-benefit order (the ones that provide more utility per buck spent). Most of the time there will be just one solution, but in points where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bp_%5Cell%7D%7Bu_%7Bi%5Cell%7D%7D%20%3D%20%5Cfrac%7Bp_k%7D%7Bu_%7Bi%20k%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{p_\ell}{u_{i\ell}} = \frac{p_k}{u_{i k}}}' title='{\frac{p_\ell}{u_{i\ell}} = \frac{p_k}{u_{i k}}}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(p)}' title='{x_i(p)}' class='latex' /> might be a convex region. This correpondence is <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Hemicontinuous">upper hemicontinuous</a>, which is the correspondence analogue to continuity for functions. As Wikipedia defines:</p>
<blockquote><p><strong>Definition 5</strong> <em> A correspondence <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%20%3A%20A%20%5Crightarrow%20B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma : A \rightarrow B}' title='{\Gamma : A \rightarrow B}' class='latex' /> is said to be <strong>upper hemicontinuous</strong> at the point <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \in A}' title='{a \in A}' class='latex' /> if for any open neighbourhood <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' /> of <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%28a%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma(a)}' title='{\Gamma(a)}' class='latex' /> there exists a neighbourhood <img src='http://s.wordpress.com/latex.php?latex=%7BU%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{U}' title='{U}' class='latex' /> of a such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma(x)}' title='{\Gamma(x)}' class='latex' /> is a subset of <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7BU%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{U}' title='{U}' class='latex' />. </em></p></blockquote>
<p>It is not hard to see that <img src='http://s.wordpress.com/latex.php?latex=%7Bz%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z}' title='{z}' class='latex' /> is upper hemicontinuous according to that definition. Our goal is to prove that there is one price vector <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> for which <img src='http://s.wordpress.com/latex.php?latex=%7B%5Coverline%7B%5Comega%7D%20%5Cin%20x_i%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\overline{\omega} \in x_i(p)}' title='{\overline{\omega} \in x_i(p)}' class='latex' /> or: <img src='http://s.wordpress.com/latex.php?latex=%7B0%20%5Cin%20z%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0 \in z(p)}' title='{0 \in z(p)}' class='latex' />. To prove that we use <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Kakutani_fixed_point_theorem">Kakutani&#8217;s Fixed Point Theorem</a>. Before we go into that, we&#8217;ll explore some other properties of <img src='http://s.wordpress.com/latex.php?latex=%7Bz%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z}' title='{z}' class='latex' />:</p>
<ul>
<li> <strong>0-Homogeneous:</strong> <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28%5Calpha%20p%29%20%3D%20z%28p%29%2C%20%5Cforall%20%5Calpha%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(\alpha p) = z(p), \forall \alpha &gt; 0}' title='{z(\alpha p) = z(p), \forall \alpha &gt; 0}' class='latex' /></li>
<li> <strong>Walras&#8217; Law:</strong> <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Ccdot%20z%28p%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \cdot z(p) = 0}' title='{p \cdot z(p) = 0}' class='latex' />. For any <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20%28x_i%20-%20%5Comega_i%29%20%5Cin%20z%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i (x_i - \omega_i) \in z(p)}' title='{\sum_i (x_i - \omega_i) \in z(p)}' class='latex' /> we know <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20%28p%20%5Ccdot%20x_i%20-%20p%20%5Ccdot%20%5Comega_i%29%20%5Cleq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i (p \cdot x_i - p \cdot \omega_i) \leq 0}' title='{\sum_i (p \cdot x_i - p \cdot \omega_i) \leq 0}' class='latex' /> by the definition of <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i}' title='{x_i}' class='latex' />. So, if it not zero, some <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> has money surplus what is absurd given that preferences are strongly monotone.</li>
<li> <strong>Bounded:</strong> <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p)}' title='{z(p)}' class='latex' /> is bounded from below, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7Bz_%5Cell%28p%29%20%5Cgeq%20-s%2C%20%5Cforall%20p%2C%20%5Cell%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_\ell(p) \geq -s, \forall p, \ell}' title='{z_\ell(p) \geq -s, \forall p, \ell}' class='latex' /> for some <img src='http://s.wordpress.com/latex.php?latex=%7Bs%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s &gt; 0}' title='{s &gt; 0}' class='latex' />. Simply take <img src='http://s.wordpress.com/latex.php?latex=%7Bs%20%3D%20%5Cmax%20%5Comega_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s = \max \omega_i}' title='{s = \max \omega_i}' class='latex' /></li>
<li> <strong>Boundary behavior:</strong> if <img src='http://s.wordpress.com/latex.php?latex=%7Bp%5Ek%20%5Crightarrow%20p%20%5Cneq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p^k \rightarrow p \neq 0}' title='{p^k \rightarrow p \neq 0}' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bp_%5Cell%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_\ell = 0}' title='{p_\ell = 0}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7Bz_%5Cell%28p%5Ek%29%20%5Crightarrow%20%5Cinfty%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_\ell(p^k) \rightarrow \infty}' title='{z_\ell(p^k) \rightarrow \infty}' class='latex' />. That is clear from the fractional knapsack algorithm when one desirable item gets price zero.</li>
</ul>
<p>Now, we are in shape for applying Kakutani&#8217;s Fixed Point Theorem:</p>
<blockquote><p><strong>Theorem 6 (Kakutani, 1941)</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7Bf%3AA%20%5Crightarrow%20%5Cmathcal%7BP%7D%28A%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f:A \rightarrow \mathcal{P}(A)}' title='{f:A \rightarrow \mathcal{P}(A)}' class='latex' /> is an upper hemicontinuous correspondence such that <img src='http://s.wordpress.com/latex.php?latex=%7Bf%28a%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f(a)}' title='{f(a)}' class='latex' /> is a convex non-empty set for all <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \in A}' title='{a \in A}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' /> has a fixed point, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cexists%20x%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\exists x \in A}' title='{\exists x \in A}' class='latex' /> s.t. <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20f%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in f(x)}' title='{x \in f(x)}' class='latex' />. </em></p></blockquote>
<p>Since prices are <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />-homogeneous, consider the simplex <img src='http://s.wordpress.com/latex.php?latex=%7B%5CDelta%20%3D%20%5C%7Bp%20%5Cgeq%200%3B%20%5Csum_%5Cell%20p_%5Cell%20%3D%201%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Delta = \{p \geq 0; \sum_\ell p_\ell = 1\}}' title='{\Delta = \{p \geq 0; \sum_\ell p_\ell = 1\}}' class='latex' />, its relative interior <img src='http://s.wordpress.com/latex.php?latex=%7B%5CDelta%5E0%20%3D%20%5C%7Bp%20%3E%200%3B%20%5Csum_%5Cell%20p_%5Cell%20%3D%201%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Delta^0 = \{p &gt; 0; \sum_\ell p_\ell = 1\}}' title='{\Delta^0 = \{p &gt; 0; \sum_\ell p_\ell = 1\}}' class='latex' /> and the boundary <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpartial%20%5CDelta%20%3D%20%5CDelta%20%5Csetminus%20%5CDelta%5E0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\partial \Delta = \Delta \setminus \Delta^0}' title='{\partial \Delta = \Delta \setminus \Delta^0}' class='latex' />. Now we define the following price correcting correspondence <img src='http://s.wordpress.com/latex.php?latex=%7Bf%3A%5CDelta%20%5Crightarrow%20%5Cmathcal%7BP%7D%28%5CDelta%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f:\Delta \rightarrow \mathcal{P}(\Delta)}' title='{f:\Delta \rightarrow \mathcal{P}(\Delta)}' class='latex' />.</p>
<p>If some price <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5CDelta%5E0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \Delta^0}' title='{p \in \Delta^0}' class='latex' /> is set, it generates demand <img src='http://s.wordpress.com/latex.php?latex=%7Bd%20%5Cin%20z%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d \in z(p)}' title='{d \in z(p)}' class='latex' />. For that demand, the price that would maximize profit would be <img src='http://s.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' />, i.e. <img src='http://s.wordpress.com/latex.php?latex=%7Bq%5Ccdot%20d%20%5Cgeq%20q%27%20%5Ccdot%20d%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q\cdot d \geq q&#039; \cdot d}' title='{q\cdot d \geq q&#039; \cdot d}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bq%27%20%5Cin%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q&#039; \in \Delta}' title='{q&#039; \in \Delta}' class='latex' />. It is natural to re-adjust the prices to <img src='http://s.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' />. So we define for <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5CDelta%5E0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \Delta^0}' title='{p \in \Delta^0}' class='latex' />:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20f%28p%29%20%3D%20%5C%7Bq%20%5Cin%20%5CDelta%3B%20q%20%5Ctext%7B%20is%20a%20best%20response%20price%20to%20some%20%7D%20d%20%5Cin%20z%28p%29%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle f(p) = \{q \in \Delta; q \text{ is a best response price to some } d \in z(p)\}' title='\displaystyle f(p) = \{q \in \Delta; q \text{ is a best response price to some } d \in z(p)\}' class='latex' />
<p>and for <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5Cpartial%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \partial \Delta}' title='{p \in \partial \Delta}' class='latex' />:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20f%28p%29%20%3D%20%5C%7Bq%20%5Cin%20%5CDelta%3B%20q%20%5Ccdot%20p%20%3D%200%5C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle f(p) = \{q \in \Delta; q \cdot p = 0\}' title='\displaystyle f(p) = \{q \in \Delta; q \cdot p = 0\}' class='latex' />
<p>Now, I claim that this correspondence satisfies the conditions in Kakutani&#8217;s Theorem. We skip a formal proof of this fact, but this is intuitive for the interior <img src='http://s.wordpress.com/latex.php?latex=%7B%5CDelta%5E0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Delta^0}' title='{\Delta^0}' class='latex' /> &#8211; let&#8217;s give the intuition why this is true as we approach the boundary: if <img src='http://s.wordpress.com/latex.php?latex=%7B%5CDelta%5E0%20%5Cni%20p%5Ek%20%5Crightarrow%20p%20%5Cin%20%5Cpartial%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Delta^0 \ni p^k \rightarrow p \in \partial \Delta}' title='{\Delta^0 \ni p^k \rightarrow p \in \partial \Delta}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7Bp%5Ek_%5Cell%20%5Crightarrow%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p^k_\ell \rightarrow 0}' title='{p^k_\ell \rightarrow 0}' class='latex' />, therefore the demans explodes: <img src='http://s.wordpress.com/latex.php?latex=%7Bz_%5Cell%28p%5Ek%29%20%5Crightarrow%20%5Cinfty%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_\ell(p^k) \rightarrow \infty}' title='{z_\ell(p^k) \rightarrow \infty}' class='latex' /> and as a result the best thing to do is to set the prices of those commodities much higher than the rest. Therefore, the price of the commodities whose demand explode are positive while the prices of the commodities where the price doesn&#8217;t get value zero.</p>
<p>Now, after waiving our hands about the upper continuity of <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' />, we have by Kakutani&#8217;s Theorem a point <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \Delta}' title='{p \in \Delta}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20f%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in f(p)}' title='{p \in f(p)}' class='latex' />. By the definition of <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' /> we must have <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5CDelta%5E0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \Delta^0}' title='{p \in \Delta^0}' class='latex' /> (because for <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cin%20%5Cpartial%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \in \partial \Delta}' title='{p \in \partial \Delta}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cnotin%20f%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \notin f(p)}' title='{p \notin f(p)}' class='latex' />. Now, I claim <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) = 0}' title='{z(p) = 0}' class='latex' />. In fact if <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%5Cneq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) \neq 0}' title='{z(p) \neq 0}' class='latex' />, still <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20z%28p%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p z(p) = 0}' title='{p z(p) = 0}' class='latex' /> by Walras&#8217; Law. So, if <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%5Cneq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) \neq 0}' title='{z(p) \neq 0}' class='latex' /> then there is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cell%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\ell}' title='{\ell}' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bz_%5Cell%20%28p%29%20%3C%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_\ell (p) &lt; 0}' title='{z_\ell (p) &lt; 0}' class='latex' /> and therefore <img src='http://s.wordpress.com/latex.php?latex=%7Bq_%5Cell%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q_\ell = 0}' title='{q_\ell = 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bq%20%5Cin%20f%28p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q \in f(p)}' title='{q \in f(p)}' class='latex' />, and <img src='http://s.wordpress.com/latex.php?latex=%7Bf%28p%29%20%5Csubseteq%20%5Cpartial%20%5CDelta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f(p) \subseteq \partial \Delta}' title='{f(p) \subseteq \partial \Delta}' class='latex' />. For this reason <img src='http://s.wordpress.com/latex.php?latex=%7Bz%28p%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z(p) = 0}' title='{z(p) = 0}' class='latex' />.</p>
<p>In the next blog post (or serie of blog posts, let&#8217;s see) we discuss issues related to the other questions: uniqueness, dynamics, game-theoretical considerations, &#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/274/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bounded Degree Spanning Tree and an Uncrossing Lemma</title>
		<link>http://www.bigredbits.com/archives/249</link>
		<comments>http://www.bigredbits.com/archives/249#comments</comments>
		<pubDate>Wed, 18 Nov 2009 04:07:18 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[approximation algorithms]]></category>
		<category><![CDATA[combinatorics]]></category>
		<category><![CDATA[iterated rounding]]></category>
		<category><![CDATA[linear algebra]]></category>
		<category><![CDATA[linear programming]]></category>
		<category><![CDATA[uncrossing]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=249</guid>
		<description><![CDATA[I&#8217;ve been reading about the Bounded Degree Spanning Tree problem and I thought of writing some of what I am learning here. It illustrates a beautiful techique called Iterated Rounding and uses the combinatorial idea of uncrossing. I&#8217;ll try to give a high-level idea of the argument and give references on the details. The first [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been reading about the Bounded Degree Spanning Tree problem and I thought of writing some of what I am learning here. It illustrates a beautiful techique called Iterated Rounding and uses the combinatorial idea of uncrossing. I&#8217;ll try to give a high-level idea of the argument and give references on the details. The first result of this kind was given by Goemans (although there were previous results with weaker guarantees) by Goemans in <a class="snap_noshots" href="http://math.mit.edu/~goemans/PAPERS/bmst-focs06.pdf">Minimum Bounded Degree Spanning Trees</a>, but the result based on iterated rounding and a subsequent improvement are due to Singh and Lau in a serie of papers. A main reference is <a class="snap_noshots" href="http://portal.acm.org/citation.cfm?id=1250887">Approximating minimum bounded degree spanning trees to within one of optimal</a>.</p>
<p>The problem of bounded degree spanning tree is as follows: consider a graph <img src='http://s.wordpress.com/latex.php?latex=%7BG%20%3D%20%28V%2CE%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G = (V,E)}' title='{G = (V,E)}' class='latex' /> with edge weights and we for some nodes <img src='http://s.wordpress.com/latex.php?latex=%7BW%20%5Csubseteq%20V%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W \subseteq V}' title='{W \subseteq V}' class='latex' /> a degree bound <img src='http://s.wordpress.com/latex.php?latex=%7Bb_v%2C%20v%20%5Cin%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_v, v \in W}' title='{b_v, v \in W}' class='latex' />. We want to find, among the spanning trees for which the degree of <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20b_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq b_v}' title='{\leq b_v}' class='latex' /> the one with minimum cost. It is clearly a hard problem, since taking all weights equal to <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bb_v%20%3D%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_v = 2}' title='{b_v = 2}' class='latex' /> for all nodes is the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Hamiltonian_path">Hamiltonian Path problem</a>, which is NP-complete. We will get a different kind of approximation. Let OPT be the optimal solution: we will show an algorithm that gives a spanning tree of cost <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20OPT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq OPT}' title='{\leq OPT}' class='latex' /> such that each node <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> has degree <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20b_v%20%2B%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq b_v + 2}' title='{\leq b_v + 2}' class='latex' /> (this can be improved to <img src='http://s.wordpress.com/latex.php?latex=%7Bb_v%20%2B%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_v + 1}' title='{b_v + 1}' class='latex' /> with a more sofisticated algorithm, also based on Iterated Rounding).</p>
<p>As always, the first step to design an approximation algorithm is to relax it to an LP. We consider the following LP:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20%5Cmin%20%5Csum_%7Be%5Cin%20E%7D%20c_e%20x_e%20%5Ctext%7B%20s.t.%20%7D%20%5C%5C%20%26%20%5Cqquad%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20%5Csum_%7Be%20%5Cin%20E%7D%20x_e%20%3D%20%5Cvert%20V%20%5Cvert%20-%201%20%5C%5C%20%26%20%5Csum_%7Be%20%5Cin%20E%28S%29%7D%20x_e%20%5Cleq%20%5Cvert%20S%20%5Cvert%20-%201%20%26%20%5Cforall%20S%20%5Csubseteq%20V%2C%20%5Cvert%20S%20%5Cvert%20%5Cgeq%202%20%5C%5C%20%26%20%5Csum_%7Be%20%5Cin%20%5Cdelta%28v%29%7D%20x_e%20%5Cleq%20b_v%20%26%20%5Cforall%20v%20%5Cin%20W%5C%5C%20%26%20x_e%20%5Cgeq%200%20%26%20%5Cforall%20e%20%5Cin%20E%20%5Cend%7Baligned%7D%20%5Cright.%20%5Cend%7Baligned%7D%20%5Cright.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \left. \begin{aligned} &amp; \min \sum_{e\in E} c_e x_e \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; \sum_{e \in E} x_e = \vert V \vert - 1 \\ &amp; \sum_{e \in E(S)} x_e \leq \vert S \vert - 1 &amp; \forall S \subseteq V, \vert S \vert \geq 2 \\ &amp; \sum_{e \in \delta(v)} x_e \leq b_v &amp; \forall v \in W\\ &amp; x_e \geq 0 &amp; \forall e \in E \end{aligned} \right. \end{aligned} \right. ' title='\displaystyle \left. \begin{aligned} &amp; \min \sum_{e\in E} c_e x_e \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; \sum_{e \in E} x_e = \vert V \vert - 1 \\ &amp; \sum_{e \in E(S)} x_e \leq \vert S \vert - 1 &amp; \forall S \subseteq V, \vert S \vert \geq 2 \\ &amp; \sum_{e \in \delta(v)} x_e \leq b_v &amp; \forall v \in W\\ &amp; x_e \geq 0 &amp; \forall e \in E \end{aligned} \right. \end{aligned} \right. ' class='latex' /></p>
<p>The first constraint expresses that in a spanning tree, there are at most <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20V%20%5Cvert%20-%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert V \vert - 1}' title='{\vert V \vert - 1}' class='latex' /> edges, the second prevent the formation of cycles and the third guarantees the degree bounds. For <img src='http://s.wordpress.com/latex.php?latex=%7BW%20%3D%20%5Cemptyset%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W = \emptyset}' title='{W = \emptyset}' class='latex' /> we have the standard Minimal Spanning Tree problem and for this problem the polytope is integral. With the degree bounds, we lose this nice property. We can solve this LP using the Ellipsoid Method. The separation oracle for the <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Be%20%5Cin%20E%28S%29%7D%20x_e%20%5Cleq%20%5Cvert%20S%20%5Cvert%20-%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{e \in E(S)} x_e \leq \vert S \vert - 1}' title='{\sum_{e \in E(S)} x_e \leq \vert S \vert - 1}' class='latex' /> is done by a flow computation.</p>
<p><strong> Iterated Rounding </strong></p>
<p>Now, let&#8217;s go ahead and solve the LP. It would be great if we had an integral solution: we would be done. It is unfortunately not the case, but we can still hope it is almost integral in some sense: for example, some edges are integral and we can take them to the final solution and recurse the algorithm on a smaller graph. This is not far from truth and that&#8217;s the main idea of the iterated rounding. We will show that the support of the optimal solution <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%20%3D%20%5C%7Be%20%5Cin%20E%3B%20x_e%20%3E%200%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x) = \{e \in E; x_e &gt; 0\}}' title='{E(x) = \{e \in E; x_e &gt; 0\}}' class='latex' /> has some nice structure. Consider the following lemma:</p>
<blockquote><p><strong>Lemma 1</strong> <em><a name="structure_lemma"></a> For any basic solution <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> of the LP, either there is <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> with just one incident edge in the support <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' /> or there is one <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in W}' title='{v \in W}' class='latex' /> such that that at most <img src='http://s.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> edges are incident to it. </em></p></blockquote>
<p>If we can prove this lemma, we can solve the problem in the following way: we begin with an empty tree: then we solve the LP and look at the support <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' />. There are two possibilities according to the lemma:</p>
<ul>
<li> If there is one node <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> with just one edge <img src='http://s.wordpress.com/latex.php?latex=%7Be%20%3D%20%28u%2Cv%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{e = (u,v)}' title='{e = (u,v)}' class='latex' /> incident to it in the support, we add it to the tree, remove <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> from <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' />, decrease <img src='http://s.wordpress.com/latex.php?latex=%7Bb_u%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_u}' title='{b_u}' class='latex' />, make <img src='http://s.wordpress.com/latex.php?latex=%7BE%20%3D%20E%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E = E(x)}' title='{E = E(x)}' class='latex' /> (the trick is to remove in each iteration edges from <img src='http://s.wordpress.com/latex.php?latex=%7BE%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E}' title='{E}' class='latex' /> that are not in the support. Clearly, removing those edges doesn&#8217;t hurt the objective value) and run the algorithm again. Notice that the LP called in the recursion has value less or equal then the actual LP <img src='http://s.wordpress.com/latex.php?latex=%7B-%20c_e%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{- c_e}' title='{- c_e}' class='latex' />. So if by induction we get a spanning tree respecting the new degree bounds plus two and value less or equal than the new LP value, we can just add <img src='http://s.wordpress.com/latex.php?latex=%7Be%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{e}' title='{e}' class='latex' /> and we have a solution with value less or equal than the one of the original LP respecting the degree bounds plus two.</li>
<li> Otherwise, there is one node <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in W}' title='{v \in W}' class='latex' /> that has degree <img src='http://s.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> in the support. So, we just remove that degree bound on that vertex (i.e. remove <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> from <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' />), make <img src='http://s.wordpress.com/latex.php?latex=%7BE%20%3D%20E%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E = E(x)}' title='{E = E(x)}' class='latex' /> (again,eliminate the edges not in the support) and run the algorithm again. Clearly, if one node is still in <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' />, it has <img src='http://s.wordpress.com/latex.php?latex=%7Bb_v%20%5Cgeq%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_v \geq 1}' title='{b_v \geq 1}' class='latex' />, since there are only three edges in the support, there will be for the rest of the computation, just three edges incident to it, so there will be at most three edges more incident to it. So it will exceed its original <img src='http://s.wordpress.com/latex.php?latex=%7Bb_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_v}' title='{b_v}' class='latex' /> by at most <img src='http://s.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />.</li>
</ul>
<p>The algorithm eventually stops, since in each iteration we have less edges or less nodes in <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' /> and the solution is as desired. The main effort is therefore to prove the lemma. But before, let&#8217;s look at the lemma: it is of the following kind: &#8220;any basic solution of the LP has some nice properties, which envolve having a not too big (at least in some point) support&#8221;. So, it involves proving that the support is not too large. That is our next task as we are trying to prove the lemma. And we will be done with:</p>
<blockquote><p><strong>Theorem 2</strong> <em> The algorithm described above produces a spanning tree of cost </em><em><img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20Z%5E%2A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq Z^*}' title='{\leq Z^*}' class='latex' /></em><em> (the LP values and therefore </em><em><img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20OPT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq OPT}' title='{\leq OPT}' class='latex' />)</em><em>in which each node <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in W}' title='{v \in W}' class='latex' /> has degree <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20b_v%20%2B%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq b_v + 2}' title='{\leq b_v + 2}' class='latex' />. </em></p></blockquote>
<p><strong> Bounding the size of the support </strong></p>
<p>We would like now to prove some result like the Lemma above: that in the solution of the LP we have either one <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> with degree <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' /> or we have a node in <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' /> with degree <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%203%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq 3}' title='{\leq 3}' class='latex' />. First, we suppose the opposite, that <img src='http://s.wordpress.com/latex.php?latex=%7B%28V%2CE%28x%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(V,E(x))}' title='{(V,E(x))}' class='latex' /> has all the nodes with degree <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgeq%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\geq 2}' title='{\geq 2}' class='latex' /> and all the nodes in <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' /> have degree <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgeq%204%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\geq 4}' title='{\geq 4}' class='latex' />. This implies that we have a large number of edges in the support. From the degrees, we know that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cvert%20E%28x%29%20%5Cvert%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%20%5Cleft%28%202%28%20%5Cvert%20V%20%5Cvert%20-%20%5Cvert%20W%20%5Cvert%20%29%20%2B%204%20%5Cvert%20W%20%5Cvert%20%5Cright%29%20%3D%20%5Cvert%20V%20%5Cvert%20%2B%20%5Cvert%20W%20%5Cvert&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \vert E(x) \vert \geq \frac{1}{2} \left( 2( \vert V \vert - \vert W \vert ) + 4 \vert W \vert \right) = \vert V \vert + \vert W \vert' title='\displaystyle \vert E(x) \vert \geq \frac{1}{2} \left( 2( \vert V \vert - \vert W \vert ) + 4 \vert W \vert \right) = \vert V \vert + \vert W \vert' class='latex' /></p>
<p>We want to prove that the support <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20E%28x%29%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert E(x) \vert}' title='{\vert E(x) \vert}' class='latex' /> of the LP can&#8217;t be too large. The first question is: how to estimate the size of the support of a basic solution. The constraints look like that:</p>
<p><img class="aligncenter size-full wp-image-251" title="bst_fig1" src="http://www.bigredbits.com/wp-content/uploads/2009/11/bst_fig1.png" alt="bst_fig1" width="506" height="248" /></p>
<p>A basic solution can be represented by picking <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20E%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert E \vert}' title='{\vert E \vert}' class='latex' /> rows of the matrix and making them tight. So, if we have a general <img src='http://s.wordpress.com/latex.php?latex=%7BAx%20%5Cleq%20b%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Ax \leq b}' title='{Ax \leq b}' class='latex' /> LP, we pick some submatrix <img src='http://s.wordpress.com/latex.php?latex=%7BA%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A&#039;}' title='{A&#039;}' class='latex' /> of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> which is <img src='http://s.wordpress.com/latex.php?latex=%7Bn%20%5Ctimes%20n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n \times n}' title='{n \times n}' class='latex' /> and the basic solution is just <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%3D%20A%27%5E%7B-1%7D%20b%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x = A&#039;^{-1} b&#039;}' title='{x = A&#039;^{-1} b&#039;}' class='latex' />. The lines of matrix <img src='http://s.wordpress.com/latex.php?latex=%7BA%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A&#039;}' title='{A&#039;}' class='latex' /> can be of three types: they can be <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_S}' title='{\chi_S}' class='latex' />, which are corresponding to <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Be%20%5Cin%20E%28S%29%7D%20x_e%20%5Cleq%20%5Cvert%20S%20%5Cvert%20-%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{e \in E(S)} x_e \leq \vert S \vert - 1}' title='{\sum_{e \in E(S)} x_e \leq \vert S \vert - 1}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_v}' title='{\chi_v}' class='latex' /> that correspond to <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Be%20%5Cin%20%5Cdelta%28v%29%7D%20x_e%20%5Cleq%20b_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{e \in \delta(v)} x_e \leq b_v}' title='{\sum_{e \in \delta(v)} x_e \leq b_v}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cdelta_e%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\delta_e}' title='{\delta_e}' class='latex' /> corresponding to <img src='http://s.wordpress.com/latex.php?latex=%7Bx_e%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_e \geq 0}' title='{x_e \geq 0}' class='latex' />. There are <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20E%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert E \vert}' title='{\vert E \vert}' class='latex' /> vectors in total. The size of the support <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' /> is smaller or equal the number of rows of the form <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_S%2C%20%5Cchi_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_S, \chi_v}' title='{\chi_S, \chi_v}' class='latex' /> in the basic solution. Therefore the idea to bound the size of the support is to prove that &#8220;all basic solutions can be represented by a small number of rows in the form <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_S%2C%20%5Cchi_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_S, \chi_v}' title='{\chi_S, \chi_v}' class='latex' />. And this is done using the following:</p>
<blockquote><p><strong>Lemma 3</strong> <em><a name="basic_sol"></a> Assuming <img src='http://s.wordpress.com/latex.php?latex=%7BE%20%3D%20E%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E = E(x)}' title='{E = E(x)}' class='latex' />, for any basic solution <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' />, there is <img src='http://s.wordpress.com/latex.php?latex=%7BZ%20%5Csubseteq%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Z \subseteq W}' title='{Z \subseteq W}' class='latex' /> and a family <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' /> of sets such that: </em></p>
<ol>
<li><em> The restrictions correspondent to <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{S}}' title='{S \in \mathcal{S}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20Z%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in Z}' title='{v \in Z}' class='latex' /> are tight for <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B%20%5Cchi_S%3B%20S%20%5Cin%20%5Cmathcal%7BS%7D%20%5C%7D%20%5Ccup%20%5C%7B%20%5Cchi_v%3B%20v%20%5Cin%20Z%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{ \chi_S; S \in \mathcal{S} \} \cup \{ \chi_v; v \in Z \}}' title='{\{ \chi_S; S \in \mathcal{S} \} \cup \{ \chi_v; v \in Z \}}' class='latex' /> is an independent set </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20%5Cmathcal%7BS%7D%20%5Cvert%20%2B%20%5Cvert%20Z%20%5Cvert%20%3D%20%5Cvert%20E%28x%29%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert \mathcal{S} \vert + \vert Z \vert = \vert E(x) \vert}' title='{\vert \mathcal{S} \vert + \vert Z \vert = \vert E(x) \vert}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' /> is a laminar family </em></li>
</ol>
<p><em> </em></p></blockquote>
<p>The first 3 items are straightfoward properties of basic solutions. The fourth one, means that for two sets <img src='http://s.wordpress.com/latex.php?latex=%7BS_1%2C%20S_2%20%5Cin%20%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_1, S_2 \in \mathcal{S}}' title='{S_1, S_2 \in \mathcal{S}}' class='latex' />, one of three things happen: <img src='http://s.wordpress.com/latex.php?latex=%7BS_1%20%5Csubseteq%20S_2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_1 \subseteq S_2}' title='{S_1 \subseteq S_2}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7BS_2%20%5Csubseteq%20S_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_2 \subseteq S_1}' title='{S_2 \subseteq S_1}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7BS_1%20%5Ccap%20S_2%20%3D%20%5Cemptyset%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_1 \cap S_2 = \emptyset}' title='{S_1 \cap S_2 = \emptyset}' class='latex' />. Now, we based on the previous lemma and in the following result that can be easily proved by induction, we will prove Lemma <a href="#structure_lemma">1</a>.</p>
<blockquote><p><strong>Lemma 4</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' /> is a laminar family over the set <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' /> where each set <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{S}}' title='{S \in \mathcal{S}}' class='latex' /> contains at least <img src='http://s.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> elements, then <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20%5Cmathcal%7BS%7D%20%5Cvert%20%5Cleq%20%5Cvert%20V%20%5Cvert%20-%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert \mathcal{S} \vert \leq \vert V \vert - 1}' title='{\vert \mathcal{S} \vert \leq \vert V \vert - 1}' class='latex' />. </em></p></blockquote>
<p>Now, the proof of Lemma <a href="#structure_lemma">1</a> is easy. Let&#8217;s do it and then we come back to prove Lemma <a href="#basic_sol">3</a>. Simply see that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20E%28x%29%20%5Cvert%20%3D%20%5Cvert%20%5Cmathcal%7BS%7D%20%5Cvert%20%2B%20%5Cvert%20Z%20%5Cvert%20%5Cleq%20%5Cvert%20V%20%5Cvert%20-%201%20%2B%20%5Cvert%20W%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert E(x) \vert = \vert \mathcal{S} \vert + \vert Z \vert \leq \vert V \vert - 1 + \vert W \vert}' title='{\vert E(x) \vert = \vert \mathcal{S} \vert + \vert Z \vert \leq \vert V \vert - 1 + \vert W \vert}' class='latex' /> what contradicts <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20E%28x%29%20%5Cvert%20%5Cgeq%20%5Cvert%20V%20%5Cvert%20%2B%20%5Cvert%20W%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert E(x) \vert \geq \vert V \vert + \vert W \vert}' title='{\vert E(x) \vert \geq \vert V \vert + \vert W \vert}' class='latex' />.</p>
<p><strong> Uncrossing argument </strong></p>
<p>And now we arrive in the technical heart of the proof, which is proving Lemma <a href="#basic_sol">3</a>. This says that given any basic solution, given any feasible solution, we can write it as a &#8220;structured&#8221; basic solution. We start with any basic feasible solution. This already satifies (1)-(3), then we need to change that solution to satisfy condition (4) as well. We need to get rid crossing elements, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7BS%2CT%20%5Cin%20%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S,T \in \mathcal{S}}' title='{S,T \in \mathcal{S}}' class='latex' /> in the form:</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-252" title="bst_fig2" src="http://www.bigredbits.com/wp-content/uploads/2009/11/bst_fig2.png" alt="bst_fig2" width="246" height="80" /></p>
<p>We do that by the means of the:</p>
<blockquote><p><strong>Lemma 5 (Uncrossing Lemma)</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> are intersecting and tight (tight in the sense that their respective constraint is tight), then <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccup%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cup T}' title='{S \cup T}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccap%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cap T}' title='{S \cap T}' class='latex' /> are also tight and:<br />
</em></p>
<p align="center"><em><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cchi_%7BS%20%5Ccap%20T%7D%20%2B%20%5Cchi_%7BS%20%5Ccup%20T%7D%20%3D%20%5Cchi_S%20%2B%20%5Cchi_T&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' title='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' class='latex' /></em></p>
<p><em> </em></p></blockquote>
<p>Which corresponds to that picture:</p>
<p><img class="aligncenter size-full wp-image-253" title="bst_fig3" src="http://www.bigredbits.com/wp-content/uploads/2009/11/bst_fig3.png" alt="bst_fig3" width="493" height="135" /></p>
<p><em>Proof:</em> First, we note that <img src='http://s.wordpress.com/latex.php?latex=%7Bx%28E%28S%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x(E(S))}' title='{x(E(S))}' class='latex' /> is a supermodular function, i.e.:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20x%28E%28S%29%29%20%2B%20x%28E%28T%29%29%20%5Cleq%20x%28E%28S%20%5Ccap%20T%29%29%20%2B%20x%28E%28S%20%5Ccup%20T%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle x(E(S)) + x(E(T)) \leq x(E(S \cap T)) + x(E(S \cup T))' title='\displaystyle x(E(S)) + x(E(T)) \leq x(E(S \cap T)) + x(E(S \cup T))' class='latex' /></p>
<p>We can see that by case analysis. Every edge appearing in the left side appears in the right side with at least the same multiplicity. Notice also that it holds with strict inequality iff there are edges from <img src='http://s.wordpress.com/latex.php?latex=%7BS%5Csetminus%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S\setminus T}' title='{S\setminus T}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7BT%20%5Csetminus%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T \setminus S}' title='{T \setminus S}' class='latex' />. Now, we have:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%28%5Cvert%20S%20%5Cvert%20-%201%29%20%2B%20%28%5Cvert%20T%20%5Cvert%20-%201%29%20%26%20%3D%20%28%5Cvert%20S%20%5Ccap%20T%20%5Cvert%20-%201%29%20%2B%20%28%5Cvert%20S%20%5Ccup%20T%20%5Cvert%20-%201%29%20%5Cgeq%20%5C%5C%20%26%20%5Cgeq%20x%28E%28S%20%5Ccap%20T%29%29%20%2B%20x%28E%28S%20%5Ccup%20T%29%29%20%5Cgeq%20%5C%5C%20%26%20%5Cgeq%20x%28E%28S%29%29%20%2B%20x%28E%28T%29%29%20%3D%20%28%5Cvert%20S%20%5Cvert%20-%201%29%20%2B%20%28%5Cvert%20T%20%5Cvert%20-%201%29%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} (\vert S \vert - 1) + (\vert T \vert - 1) &amp; = (\vert S \cap T \vert - 1) + (\vert S \cup T \vert - 1) \geq \\ &amp; \geq x(E(S \cap T)) + x(E(S \cup T)) \geq \\ &amp; \geq x(E(S)) + x(E(T)) = (\vert S \vert - 1) + (\vert T \vert - 1) \end{aligned}' title='\displaystyle \begin{aligned} (\vert S \vert - 1) + (\vert T \vert - 1) &amp; = (\vert S \cap T \vert - 1) + (\vert S \cup T \vert - 1) \geq \\ &amp; \geq x(E(S \cap T)) + x(E(S \cup T)) \geq \\ &amp; \geq x(E(S)) + x(E(T)) = (\vert S \vert - 1) + (\vert T \vert - 1) \end{aligned}' class='latex' /></p>
<p>where the first relation is trivial, the second is by feasibility, the third is by supermodularity and the lastone is by tightness. So, all hold with equality and therefore <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccap%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cap T}' title='{S \cap T}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccup%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cup T}' title='{S \cup T}' class='latex' /> are tight. We also proved that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20x%28E%28S%20%5Ccap%20T%29%29%20%2B%20x%28E%28S%20%5Ccup%20T%29%29%20%3D%20x%28E%28S%29%29%20%2B%20x%28E%28T%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle x(E(S \cap T)) + x(E(S \cup T)) = x(E(S)) + x(E(T))' title='\displaystyle x(E(S \cap T)) + x(E(S \cup T)) = x(E(S)) + x(E(T))' class='latex' /></p>
<p>so there can be no edge from <img src='http://s.wordpress.com/latex.php?latex=%7BS%5Csetminus%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S\setminus T}' title='{S\setminus T}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7BT%20%5Csetminus%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T \setminus S}' title='{T \setminus S}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' /> and therefore, thinking just of edges in <img src='http://s.wordpress.com/latex.php?latex=%7BE%28x%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E(x)}' title='{E(x)}' class='latex' /> we have:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cchi_%7BS%20%5Ccap%20T%7D%20%2B%20%5Cchi_%7BS%20%5Ccup%20T%7D%20%3D%20%5Cchi_S%20%2B%20%5Cchi_T&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' title='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' class='latex' /></p>
<img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' />
<p><a class="snap_noshots" href="http://www.science.unitn.it/cirm/Goemans.pdf">Uncrossing arguments</a> are found everywhere in combinatorics. Now, we show how the Uncrossing Lemma can be used to prove Lemma <a href="#structure_lemma">1</a>:</p>
<p><em>Proof:</em> Let <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> be any basic solution. It can be represented by a pair <img src='http://s.wordpress.com/latex.php?latex=%7B%28Y%2C%20%5Cmathcal%7BC%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(Y, \mathcal{C})}' title='{(Y, \mathcal{C})}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Csubseteq%20W%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \subseteq W}' title='{Y \subseteq W}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{C}}' title='{\mathcal{C}}' class='latex' /> is a family of sets. We will show that the same basic solution can be represented by <img src='http://s.wordpress.com/latex.php?latex=%7B%28Y%2C%20%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(Y, \mathcal{L})}' title='{(Y, \mathcal{L})}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> is a laminar family and has the same size of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{C}}' title='{\mathcal{C}}' class='latex' />.</p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' /> be all sets that are tight under <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> a maximal laminar family of tights sets in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' />, such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B%5Cchi_S%3B%20S%20%5Cin%20%5Cmathcal%7BL%7D%20%5C%7D%20%5Ccup%20%5C%7B%5Cchi_v%3B%20v%20%5Cin%20Z%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{\chi_S; S \in \mathcal{L} \} \cup \{\chi_v; v \in Z \}}' title='{\{\chi_S; S \in \mathcal{L} \} \cup \{\chi_v; v \in Z \}}' class='latex' /> are independent. I claim that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20%5Cmathcal%7BL%7D%20%5Cvert%20%3D%20dim%28span%28%5Cmathcal%7BS%7D%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert \mathcal{L} \vert = dim(span(\mathcal{S}))}' title='{\vert \mathcal{L} \vert = dim(span(\mathcal{S}))}' class='latex' />.</p>
<p>In fact, suppose <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20%5Cmathcal%7BL%7D%20%5Cvert%20%3C%20dim%28span%28%5Cmathcal%7BS%7D%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert \mathcal{L} \vert &lt; dim(span(\mathcal{S}))}' title='{\vert \mathcal{L} \vert &lt; dim(span(\mathcal{S}))}' class='latex' />, then there are sets of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{S}}' title='{\mathcal{S}}' class='latex' /> we could add to <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> without violating independence &#8211; the problem is that those sets would cross some set. Pick such <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{S}}' title='{S \in \mathcal{S}}' class='latex' /> intersecting fewer possible sets in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' />. The set <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> intersects some <img src='http://s.wordpress.com/latex.php?latex=%7BT%20%5Cin%20%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T \in \mathcal{L}}' title='{T \in \mathcal{L}}' class='latex' />. Since both are tight we can use the Uncrossing Lemma and we get:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cchi_%7BS%20%5Ccap%20T%7D%20%2B%20%5Cchi_%7BS%20%5Ccup%20T%7D%20%3D%20%5Cchi_S%20%2B%20%5Cchi_T&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' title='\displaystyle \chi_{S \cap T} + \chi_{S \cup T} = \chi_S + \chi_T' class='latex' /></p>
<p>since <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_S%20%5Cnotin%20span%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_S \notin span(\mathcal{L})}' title='{\chi_S \notin span(\mathcal{L})}' class='latex' />, we can&#8217;t have simultaneously <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_%7BS%20%5Ccap%20T%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_{S \cap T}}' title='{\chi_{S \cap T}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_%7BS%20%5Ccup%20T%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_{S \cup T}}' title='{\chi_{S \cup T}}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7Bspan%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{span(\mathcal{L})}' title='{span(\mathcal{L})}' class='latex' />. Let&#8217;s consider two cases:</p>
<ol>
<li> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_%7BS%20%5Ccap%20T%7D%20%5Cnotin%20span%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_{S \cap T} \notin span(\mathcal{L})}' title='{\chi_{S \cap T} \notin span(\mathcal{L})}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccap%20T%20%5Cin%20span%28%5Cmathcal%7BS%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cap T \in span(\mathcal{S})}' title='{S \cap T \in span(\mathcal{S})}' class='latex' /> is in <img src='http://s.wordpress.com/latex.php?latex=%7Bspan%28%5Cmathcal%7BS%7D%29%20%5Csetminus%20span%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{span(\mathcal{S}) \setminus span(\mathcal{L})}' title='{span(\mathcal{S}) \setminus span(\mathcal{L})}' class='latex' /> and intersects fewer sets of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> than <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' />, since all sets that intersect <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccap%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cap T}' title='{S \cap T}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> must intersect <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> as well (since no set can cross <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />).<img class="aligncenter size-full wp-image-254" title="bst_fig4" src="http://www.bigredbits.com/wp-content/uploads/2009/11/bst_fig4.png" alt="bst_fig4" width="253" height="181" /></li>
<li> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cchi_%7BS%20%5Ccup%20T%7D%20%5Cnotin%20span%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\chi_{S \cup T} \notin span(\mathcal{L})}' title='{\chi_{S \cup T} \notin span(\mathcal{L})}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccup%20T%20%5Cin%20span%28%5Cmathcal%7BS%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cup T \in span(\mathcal{S})}' title='{S \cup T \in span(\mathcal{S})}' class='latex' /> is in <img src='http://s.wordpress.com/latex.php?latex=%7Bspan%28%5Cmathcal%7BS%7D%29%20%5Csetminus%20span%28%5Cmathcal%7BL%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{span(\mathcal{S}) \setminus span(\mathcal{L})}' title='{span(\mathcal{S}) \setminus span(\mathcal{L})}' class='latex' /> and intersects fewer sets of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> than <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' />, since all sets that intersect <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Ccup%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \cup T}' title='{S \cup T}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BL%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{L}}' title='{\mathcal{L}}' class='latex' /> must intersect <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' />.</li>
</ol>
<p>In either case we have a contradiction, so we proved that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20%5Cmathcal%7BL%7D%20%5Cvert%20%3D%20dim%28span%28%5Cmathcal%7BS%7D%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert \mathcal{L} \vert = dim(span(\mathcal{S}))}' title='{\vert \mathcal{L} \vert = dim(span(\mathcal{S}))}' class='latex' />. So we can generate all the space of tight sets with a laminar family. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>And this finishes the proof. Let&#8217;s go over all that we&#8217;ve done: we started with an LP and we wanted to prove that the support of each solution was not too large. We wanted that because we wanted to prove that there was one node with degree one in the support or a node in <img src='http://s.wordpress.com/latex.php?latex=%7BW%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{W}' title='{W}' class='latex' /> with small (<img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%203%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq 3}' title='{\leq 3}' class='latex' />) degree. To prove that the degree of the support is small, we show that any basic solution has a representation in terms of a laminar family. Then we use the fact that laminar families can&#8217;t be very large families of sets. For that, we use the celebrated Uncrossing Lemma.</p>
<p><strong>Note:</strong> Most of this is based on my notes on David Williamson&#8217;s Approximation Algorithms class. I spent some time thinking about this algorithm and therefore I decided o post it here.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/249/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Looking at probability distributions</title>
		<link>http://www.bigredbits.com/archives/231</link>
		<comments>http://www.bigredbits.com/archives/231#comments</comments>
		<pubDate>Fri, 13 Nov 2009 03:16:27 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=231</guid>
		<description><![CDATA[I&#8217;ve been taking two classes in probability this semester and in those I saw the proofs of a lot of interesting theorems which I knew about previously but I have never seen the proof, as the Central Limit Theorem, the Laws of Large Numbers and so on&#8230; Also, some theory which is looks somewhat ugly [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been taking two classes in probability this semester and in those I saw the proofs of a lot of interesting theorems which I knew about previously but I have never seen the proof, as the Central Limit Theorem, the Laws of Large Numbers and so on&#8230; Also, some theory which is looks somewhat ugly in the undergrad courses becomes very clear with the proper formal treatment. Today I was thinking what was the main take-home message that a computer scientist could take from those classes and. at ;east for me, this message is the various ways of looking to probability distributions. I&#8217;ve heard about moments, Laplace transform, Fourier transform and other tools like that, but I never realized before their true power. Probably still today, most of their true power is hidden from me, but I am starting to look at them in a different way. Let me try to go over a few examples of different ways we can look at probability distributions and show cases where they are interesting.</p>
<p>Most of ways of looking at probability distributions are associated with multiplicative system: a multiplicative system <img src='http://s.wordpress.com/latex.php?latex=%7BQ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q}' title='{Q}' class='latex' /> is a set of real-valued functions with the property that if <img src='http://s.wordpress.com/latex.php?latex=%7Bf_1%2C%20f_2%20%5Cin%20Q%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_1, f_2 \in Q}' title='{f_1, f_2 \in Q}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7Bf_1%20%5Cdot%20f_2%20%5Cin%20Q%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_1 \dot f_2 \in Q}' title='{f_1 \dot f_2 \in Q}' class='latex' />. Those kinds of sets are powerful because of the Multiplicative Systems Theorem:</p>
<blockquote><p><strong>Theorem 1 (Multiplicative Systems Theorem)</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7BQ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q}' title='{Q}' class='latex' /> is a multiplicative system, <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' /> is a linear space containing <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> (the constant function <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />) and is closed under bounded convergence, then <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%5Csubseteq%20H%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q \subseteq H}' title='{Q \subseteq H}' class='latex' /> implies that <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' /> contains all bounded <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csigma%28Q%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sigma(Q)}' title='{\sigma(Q)}' class='latex' />-measurable functions. </em></p></blockquote>
<p>The theorem might look a bit cryptic if you are not familiar with the definitions, but it boils down to the following translation:</p>
<blockquote><p><strong>Theorem 2 (Translation of the Multiplicative Systems Theorem)</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7BQ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q}' title='{Q}' class='latex' /> is &#8220;general&#8221; multiplicative system, <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> are random variable such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%7Bf%28X%29%7D%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%7Bf%28Y%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}{f(X)} = \mathop{\mathbb E}{f(Y)}}' title='{\mathop{\mathbb E}{f(X)} = \mathop{\mathbb E}{f(Y)}}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bf%20%5Cin%20Q%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f \in Q}' title='{f \in Q}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> have the same distribution. </em></p></blockquote>
<p>where general excludes some troublesome cases like <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7B1%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{1\}}' title='{Q = \{1\}}' class='latex' /> or all constant functions, for example. In technical terms, we wanted <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csigma%28Q%29%20%3D%20%5Csigma%5C%7Bf%5E%7B-1%7D%28%28-%5Cinfty%2Ca%5D%29%3B%20a%5Cin%20%7B%5Cmathbb%20R%7D%2C%20f%20%5Cin%20Q%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sigma(Q) = \sigma\{f^{-1}((-\infty,a]); a\in {\mathbb R}, f \in Q \}}' title='{\sigma(Q) = \sigma\{f^{-1}((-\infty,a]); a\in {\mathbb R}, f \in Q \}}' class='latex' /> to be the Borel <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csigma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sigma}' title='{\sigma}' class='latex' />-algebra. But let&#8217;s not worry about those technical details and just look at the translated version. We now, discuss several kinds of multiplicative systems:</p>
<ol>
<li> The most common description of the a random variable is by the cummulative distribution function <img src='http://s.wordpress.com/latex.php?latex=%7BF%28u%29%20%3D%20P%5C%7BX%20%5Cleq%20u%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{F(u) = P\{X \leq u\}}' title='{F(u) = P\{X \leq u\}}' class='latex' />. This is associated with <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7B%201_%7B%28-%5Cinfty%2Cu%5D%7D%28x%29%3B%20u%20%5Cin%20%7B%5Cmathbb%20R%7D%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{ 1_{(-\infty,u]}(x); u \in {\mathbb R} \}}' title='{Q = \{ 1_{(-\infty,u]}(x); u \in {\mathbb R} \}}' class='latex' /> notice that simply <img src='http://s.wordpress.com/latex.php?latex=%7BF%28u%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%7B1_%7B%28-%5Cinfty%2Cu%5D%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{F(u) = \mathop{\mathbb E}{1_{(-\infty,u]}}}' title='{F(u) = \mathop{\mathbb E}{1_{(-\infty,u]}}}' class='latex' />.</li>
<li> We can characterize a random variable by its moments: the variable <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is characterized by the set <img src='http://s.wordpress.com/latex.php?latex=%7BM_n%20%3D%20%5Cint_%7B%5Cmathbb%20R%7D%20x%5En%20%5Cmu_X%28dx%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M_n = \int_{\mathbb R} x^n \mu_X(dx)}' title='{M_n = \int_{\mathbb R} x^n \mu_X(dx)}' class='latex' />. Given the moemnts <img src='http://s.wordpress.com/latex.php?latex=%7BM_1%2C%20M_2%2C%20%5Chdots%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M_1, M_2, \hdots}' title='{M_1, M_2, \hdots}' class='latex' />, the variable is totally characterized, i.e., if two variables have the same moments, then they have the same distribution by the Multiplicative Systems Theorem. This description is associated with the system <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7Bx%5En%3B%20n%20%3D%201%2C%202%2C%20%5Chdots%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{x^n; n = 1, 2, \hdots\}}' title='{Q = \{x^n; n = 1, 2, \hdots\}}' class='latex' /></li>
<li> <strong>Moment Generating Function</strong>: If <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is a variable that assumes only integer values, we can describe the it as <img src='http://s.wordpress.com/latex.php?latex=%7Bp_0%2C%20p_1%2C%20p_2%2C%20%5Chdots%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_0, p_1, p_2, \hdots}' title='{p_0, p_1, p_2, \hdots}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7Bp_n%20%3D%20P%5C%7B%20X%20%3D%20n%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_n = P\{ X = n \}}' title='{p_n = P\{ X = n \}}' class='latex' />. An interesting way of representing those probabilities is as the moment generating function <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpsi_X%28z%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%7Bz%5EX%7D%20%3D%20%5Csum_%7Bn%3D0%7D%5E%5Cinfty%20p_n%20z%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\psi_X(z) = \mathop{\mathbb E}{z^X} = \sum_{n=0}^\infty p_n z^n}' title='{\psi_X(z) = \mathop{\mathbb E}{z^X} = \sum_{n=0}^\infty p_n z^n}' class='latex' />. This is associated with the multiplicative system <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7Bz%5Ex%2C%200%20%5Cleq%20z%20%3C%201%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{z^x, 0 \leq z &lt; 1\}}' title='{Q = \{z^x, 0 \leq z &lt; 1\}}' class='latex' />.Now suppose we are given two discrete independent variables <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' />. What do we know about <img src='http://s.wordpress.com/latex.php?latex=%7BX%2BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X+Y}' title='{X+Y}' class='latex' />. It is easy to know its expectation, its variance, &#8230; but what about more complicated things? What is the distribution of <img src='http://s.wordpress.com/latex.php?latex=%7BX%2BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X+Y}' title='{X+Y}' class='latex' /> ? Moment generating functions answer this question very easily, since:
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cpsi_%7BX%2BY%7D%20%28z%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%20%5Bz%5E%7BX%2BY%7D%5D%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%20%5Bz%5E%7BX%7D%5D%20%5Ccdot%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bz%5E%7BY%7D%5D%20%3D%20%5Cpsi_X%28z%29%20%5Ccdot%20%5Cpsi_Y%28z%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \psi_{X+Y} (z) = \mathop{\mathbb E} [z^{X+Y}] = \mathop{\mathbb E} [z^{X}] \cdot \mathop{\mathbb E}[z^{Y}] = \psi_X(z) \cdot \psi_Y(z)' title='\displaystyle \psi_{X+Y} (z) = \mathop{\mathbb E} [z^{X+Y}] = \mathop{\mathbb E} [z^{X}] \cdot \mathop{\mathbb E}[z^{Y}] = \psi_X(z) \cdot \psi_Y(z)' class='latex' /></p>
<p>If we know moment generating functions, we can calculate expectation very easily, since <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5BX%5D%20%3D%20%5Cpsi%27_X%281%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[X] = \psi&#039;_X(1)}' title='{\mathop{\mathbb E}[X] = \psi&#039;_X(1)}' class='latex' />. For example, suppose we have a process like that: there is one bacteria in time <img src='http://s.wordpress.com/latex.php?latex=%7Bt%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t = 0}' title='{t = 0}' class='latex' />. In each timestep, either this bacteria dies (with probability <img src='http://s.wordpress.com/latex.php?latex=%7Bp_0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_0}' title='{p_0}' class='latex' />), continues alive without reproducing (with probability <img src='http://s.wordpress.com/latex.php?latex=%7Bp_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_1}' title='{p_1}' class='latex' /> or has <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> offsprings (with probability <img src='http://s.wordpress.com/latex.php?latex=%7Bp_%7B1%2Bk%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_{1+k}}' title='{p_{1+k}}' class='latex' />). In that case <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_0%5E%5Cinfty%20p_n%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_0^\infty p_n = 1}' title='{\sum_0^\infty p_n = 1}' class='latex' />. Each time, the same happens, independently with each of the bacteria alive in that moment. The question is, what is the expected number of bacteria in time <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> ?</p>
<p>It looks like a complicated problem with just elementary tools, but it is a simple problem if we have moment generating functions. Just let <img src='http://s.wordpress.com/latex.php?latex=%7BX_%7Bti%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X_{ti}}' title='{X_{ti}}' class='latex' /> be the variable associated with the <img src='http://s.wordpress.com/latex.php?latex=%7Bi%5E%7Bth%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i^{th}}' title='{i^{th}}' class='latex' /> bacteria of time <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />. It is zero if it dies, <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> if it stays the same and <img src='http://s.wordpress.com/latex.php?latex=%7Bk%2B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k+1}' title='{k+1}' class='latex' /> if it has <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> offsprings. Let also <img src='http://s.wordpress.com/latex.php?latex=%7BN_t%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N_t}' title='{N_t}' class='latex' /> be the number of bacteria in time <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />. We want to know <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%20N_t%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E} N_t}' title='{\mathop{\mathbb E} N_t}' class='latex' />. First, see that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20N_t%20%3D%20%5Csum_%7Bi%3D1%7D%5E%7BN_%7Bt-1%7D%7D%20X_%7Bti%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle N_t = \sum_{i=1}^{N_{t-1}} X_{ti}' title='\displaystyle N_t = \sum_{i=1}^{N_{t-1}} X_{ti}' class='latex' /></p>
<p>Now, let&#8217;s write that in terms of moment generating functions:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cpsi_%7BN_t%7D%28z%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%7Bz%5E%7BX_%7Bt1%7D%20%2B%20%5Chdots%20%2B%20X_%7BtN_t%7D%7D%7D%20%3D%20%5Csum_%7Bk%3D0%7D%5E%5Cinfty%20P%5C%7BN_%7Bt-1%7D%20%3D%20k%5C%7D%20%5Ccdot%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bz%5E%7BX_%7Bt1%7D%20%2B%20%5Chdots%20%2B%20X_%7BtN_t%7D%7D%20%5Cvert%20N_t%20%3D%20k%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \psi_{N_t}(z) = \mathop{\mathbb E}{z^{X_{t1} + \hdots + X_{tN_t}}} = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot \mathop{\mathbb E}[z^{X_{t1} + \hdots + X_{tN_t}} \vert N_t = k]' title='\displaystyle \psi_{N_t}(z) = \mathop{\mathbb E}{z^{X_{t1} + \hdots + X_{tN_t}}} = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot \mathop{\mathbb E}[z^{X_{t1} + \hdots + X_{tN_t}} \vert N_t = k]' class='latex' /></p>
<p>which is just:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cpsi_%7BN_t%7D%28z%29%20%3D%20%5Csum_%7Bk%3D0%7D%5E%5Cinfty%20P%5C%7BN_%7Bt-1%7D%20%3D%20k%5C%7D%20%5Ccdot%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bz%5E%7BX_%7Bt1%7D%20%2B%20%5Chdots%20%2B%20X_%7Btk%7D%7D%5D%20%3D%20%5Csum_%7Bk%3D0%7D%5E%5Cinfty%20P%5C%7BN_%7Bt-1%7D%20%3D%20k%20%5C%7D%20%5Ccdot%20%5Cpsi_X%20%28z%29%5Ek&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \psi_{N_t}(z) = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot \mathop{\mathbb E}[z^{X_{t1} + \hdots + X_{tk}}] = \sum_{k=0}^\infty P\{N_{t-1} = k \} \cdot \psi_X (z)^k' title='\displaystyle \psi_{N_t}(z) = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot \mathop{\mathbb E}[z^{X_{t1} + \hdots + X_{tk}}] = \sum_{k=0}^\infty P\{N_{t-1} = k \} \cdot \psi_X (z)^k' class='latex' /></p>
<p>since the variables are all independent and identically distributed. Now, notice that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cpsi_%7BN_%7Bt-1%7D%7D%28z%29%20%3D%20%5Csum_%7Bk%3D0%7D%5E%5Cinfty%20P%5C%7BN_%7Bt-1%7D%20%3D%20k%5C%7D%20%5Ccdot%20z%5Ek&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \psi_{N_{t-1}}(z) = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot z^k' title='\displaystyle \psi_{N_{t-1}}(z) = \sum_{k=0}^\infty P\{N_{t-1} = k\} \cdot z^k' class='latex' /></p>
<p>by the definition of moment generating function, so we effectively proved that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cpsi_%7BN_%7Bt%7D%7D%28z%29%20%3D%20%5Cpsi_%7BN_%7Bt-1%7D%7D%28%5Cpsi_X%28z%29%29%20%3D%20%5Cpsi_X%20%5Cpsi_X%20%5Chdots%20%5Cpsi_X%28z%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \psi_{N_{t}}(z) = \psi_{N_{t-1}}(\psi_X(z)) = \psi_X \psi_X \hdots \psi_X(z) ' title='\displaystyle \psi_{N_{t}}(z) = \psi_{N_{t-1}}(\psi_X(z)) = \psi_X \psi_X \hdots \psi_X(z) ' class='latex' /></p>
<p>We proved that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpsi_%7BN_%7Bt%7D%7D%28z%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\psi_{N_{t}}(z)}' title='{\psi_{N_{t}}(z)}' class='latex' /> is just <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpsi%28z%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\psi(z)}' title='{\psi(z)}' class='latex' /> iterated <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> times. Now, calculating the expectation is easy, using the fact that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpsi_X%281%29%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\psi_X(1) = 1}' title='{\psi_X(1) = 1}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpsi%27_X%281%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%20X%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\psi&#039;_X(1) = \mathop{\mathbb E} X}' title='{\psi&#039;_X(1) = \mathop{\mathbb E} X}' class='latex' />. Just see that: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%20N_t%20%3D%20%5Cpsi_%7BN_%7Bt%7D%7D%27%281%29%20%3D%20%5Cpsi_%7BN_%7Bt-1%7D%7D%27%28%5Cpsi_X%281%29%29%20%5Ccdot%20%5Cpsi%27_X%281%29%20%3D%20%5Cpsi_%7BN_%7Bt-1%7D%7D%27%281%29%20%5Ccdot%20%5Cmathop%7B%5Cmathbb%20E%7D%20X%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E} N_t = \psi_{N_{t}}&#039;(1) = \psi_{N_{t-1}}&#039;(\psi_X(1)) \cdot \psi&#039;_X(1) = \psi_{N_{t-1}}&#039;(1) \cdot \mathop{\mathbb E} X }' title='{\mathop{\mathbb E} N_t = \psi_{N_{t}}&#039;(1) = \psi_{N_{t-1}}&#039;(\psi_X(1)) \cdot \psi&#039;_X(1) = \psi_{N_{t-1}}&#039;(1) \cdot \mathop{\mathbb E} X }' class='latex' />. Then, clearly <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%20N_t%20%3D%20%28%5Cmathop%7B%5Cmathbb%20E%7D%20X%29%5Et%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E} N_t = (\mathop{\mathbb E} X)^t}' title='{\mathop{\mathbb E} N_t = (\mathop{\mathbb E} X)^t}' class='latex' />. Using similar technique we can prove a lot more things about this process, just by analyzing the behavior of the moment generating function.</li>
<li> <strong>Laplace Tranform</strong>: Now, moving to continuous variables, if <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is a continuous non-negative variable we can define its Laplace tranform as: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cphi_X%28u%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%20%5Be%5E%7Bu%20X%7D%5D%20%3D%20%5Cint_0%5E%5Cinfty%20e%5E%7Bux%7D%20%5Cmu_X%28dx%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\phi_X(u) = \mathop{\mathbb E} [e^{u X}] = \int_0^\infty e^{ux} \mu_X(dx)}' title='{\phi_X(u) = \mathop{\mathbb E} [e^{u X}] = \int_0^\infty e^{ux} \mu_X(dx)}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmu_X%28dx%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mu_X(dx)}' title='{\mu_X(dx)}' class='latex' /> stands for the distribution of <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />, for example, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Crho_X%28x%29%20dx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\rho_X(x) dx}' title='{\rho_X(x) dx}' class='latex' />. This is associated with the multiplicative system <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7Be%5E%7Bux%7D%3B%20u%20%5Cgeq%200%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{e^{ux}; u \geq 0\}}' title='{Q = \{e^{ux}; u \geq 0\}}' class='latex' />. Again, by the Multiplicative Systems Theorem, if <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cphi_X%20%3D%20%5Cphi_Y%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\phi_X = \phi_Y}' title='{\phi_X = \phi_Y}' class='latex' />, then the two variables have the same distribution. The Laplace tranform has the same nice properties as the Moment Generating Function, for example, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cphi_%7BX%2BY%7D%28u%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7B%28X%2BY%29u%7D%5D%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7BXu%7D%5D%20%5Ccdot%20%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7BYu%7D%5D%20%3D%20%5Cphi_X%28u%29%20%5Ccdot%20%5Cphi_Y%28u%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\phi_{X+Y}(u) = \mathop{\mathbb E}[e^{(X+Y)u}] = \mathop{\mathbb E}[e^{Xu}] \cdot \mathop{\mathbb E}[e^{Yu}] = \phi_X(u) \cdot \phi_Y(u)}' title='{\phi_{X+Y}(u) = \mathop{\mathbb E}[e^{(X+Y)u}] = \mathop{\mathbb E}[e^{Xu}] \cdot \mathop{\mathbb E}[e^{Yu}] = \phi_X(u) \cdot \phi_Y(u)}' class='latex' />.And it allows us to do similar tricks than the one I just showed for Moment Generating Functions. One common trick that is used, for example, in the proof of Chernoff bounds is, given independent non-negative random variables:
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20P%5Cleft%5C%7B%5Csum_i%20X_i%20%3E%20u%5Cright%5C%7D%20%3D%20P%5Cleft%5C%7Be%5E%7B%5Csum_i%20X_i%7D%20%3E%20e%5Eu%5Cright%5C%7D%20%5Cleq%20%5Cfrac%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7B%5Csum_i%20X_i%7D%20%5D%7D%7Be%5Eu%7D%20%3D%20%5Cfrac%7B%5Cprod_i%20%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7BX_i%7D%20%5D%7D%7Be%5Eu%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle P\left\{\sum_i X_i &gt; u\right\} = P\left\{e^{\sum_i X_i} &gt; e^u\right\} \leq \frac{\mathop{\mathbb E}[e^{\sum_i X_i} ]}{e^u} = \frac{\prod_i \mathop{\mathbb E}[e^{X_i} ]}{e^u} ' title='\displaystyle P\left\{\sum_i X_i &gt; u\right\} = P\left\{e^{\sum_i X_i} &gt; e^u\right\} \leq \frac{\mathop{\mathbb E}[e^{\sum_i X_i} ]}{e^u} = \frac{\prod_i \mathop{\mathbb E}[e^{X_i} ]}{e^u} ' class='latex' /></p>
<p>where we also used Markov Inequality: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5BX%5D%20%5Cgeq%20%5Cbeta%20P%5C%7BX%20%5Cgeq%20%5Cbeta%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[X] \geq \beta P\{X \geq \beta\}}' title='{\mathop{\mathbb E}[X] \geq \beta P\{X \geq \beta\}}' class='latex' />. Passing to the Laplace transform is the main ingredient in the Chernoff bound and it allows us to sort of &#8220;decouple&#8221; the random variables in the sum. There are several other cases where the Laplace transform proves itsself very useful and turns things that looked very complicated when we saw in undergrad courses into simple and clear things. One clear example of that is the motivation for the Poisson random variable:</p>
<p>If <img src='http://s.wordpress.com/latex.php?latex=%7BT_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T_i}' title='{T_i}' class='latex' /> are independend exponentially distributed random variables with mean <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda}' title='{\lambda}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7B%5Crho_%7BT_i%7D%28t%29%20%3D%20%5Clambda%20e%5E%7B-%20%5Clambda%20t%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\rho_{T_i}(t) = \lambda e^{- \lambda t}}' title='{\rho_{T_i}(t) = \lambda e^{- \lambda t}}' class='latex' />. An elementary calculation shows that its laplace transform is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cphi_%7BT_i%7D%28u%29%20%3D%20%5Cfrac%7B%5Clambda%7D%7B%5Clambda%2Bu%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\phi_{T_i}(u) = \frac{\lambda}{\lambda+u}}' title='{\phi_{T_i}(u) = \frac{\lambda}{\lambda+u}}' class='latex' />. Let <img src='http://s.wordpress.com/latex.php?latex=%7BS_n%20%3D%20T-0%20%2B%20T_1%20%2B%20T_2%20%2B%20%5Chdots%20%2B%20T_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_n = T-0 + T_1 + T_2 + \hdots + T_n}' title='{S_n = T-0 + T_1 + T_2 + \hdots + T_n}' class='latex' />, i.e., the time of the <img src='http://s.wordpress.com/latex.php?latex=%7B%28n%2B1%29%5E%7Bth%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(n+1)^{th}}' title='{(n+1)^{th}}' class='latex' /> arrival. We want to know what is the distribution of <img src='http://s.wordpress.com/latex.php?latex=%7BS_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_n}' title='{S_n}' class='latex' />. How to do that?</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cphi_%7BS_n%7D%28u%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%5Be%5E%7Bu%28T_0%20%2B%20%5Chdots%20%2B%20T_n%29%7D%5D%20%3D%20%5Cphi_T%28u%29%5E%7Bn%2B1%7D%20%3D%20%5Cleft%28%20%5Cfrac%7B%5Clambda%7D%7B%5Clambda%2Bu%7D%20%5Cright%29%5E%7Bn%2B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \phi_{S_n}(u) = \mathop{\mathbb E}[e^{u(T_0 + \hdots + T_n)}] = \phi_T(u)^{n+1} = \left( \frac{\lambda}{\lambda+u} \right)^{n+1}' title='\displaystyle \phi_{S_n}(u) = \mathop{\mathbb E}[e^{u(T_0 + \hdots + T_n)}] = \phi_T(u)^{n+1} = \left( \frac{\lambda}{\lambda+u} \right)^{n+1}' class='latex' /></p>
<p>Now, we need to find <img src='http://s.wordpress.com/latex.php?latex=%7B%5Crho_%7BS_n%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\rho_{S_n}}' title='{\rho_{S_n}}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cint_0%5E%5Cinfty%20%5Crho_%7BS_n%7D%28t%29%20e%5E%7Biu%7D%20dt%20%3D%20%5Cleft%28%20%5Cfrac%7B%5Clambda%7D%7B%5Clambda%2Bu%7D%20%5Cright%29%5E%7Bn%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\int_0^\infty \rho_{S_n}(t) e^{iu} dt = \left( \frac{\lambda}{\lambda+u} \right)^{n+1}}' title='{\int_0^\infty \rho_{S_n}(t) e^{iu} dt = \left( \frac{\lambda}{\lambda+u} \right)^{n+1}}' class='latex' />. Now it is just a matter of solving this equation and we get: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Crho_%7BS_n%7D%28t%29%20%3D%20%5Cfrac%7B%5Clambda%20%28%5Clambda%20t%29%5En%7D%7Bn%21%7D%20e%5E%7B-%5Clambda%20t%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\rho_{S_n}(t) = \frac{\lambda (\lambda t)^n}{n!} e^{-\lambda t}}' title='{\rho_{S_n}(t) = \frac{\lambda (\lambda t)^n}{n!} e^{-\lambda t}}' class='latex' />. Now, the Poisson varible <img src='http://s.wordpress.com/latex.php?latex=%7BN_t%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N_t}' title='{N_t}' class='latex' /> measures the number of arrivals in <img src='http://s.wordpress.com/latex.php?latex=%7B%5B0%2Ct%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{[0,t]}' title='{[0,t]}' class='latex' /> and therefore:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20P%5C%7BN_t%20%3D%20n%5C%7D%20%26%20%3D%20P%5C%7BS_%7Bn-1%7D%20%3C%20t%20%3C%20S_n%5C%7D%20%3D%20P%5C%7BS_n%20%3E%20t%5C%7D%20-%20P%5C%7BS_%7Bn-1%7D%20%5Cgeq%20t%5C%7D%20%5C%5C%20%26%20%3D%20%5Cint_t%5E%5Cinfty%20%5Crho_%7BS_n%7D%28t%29%20dt%20-%20%5Cint_t%5E%5Cinfty%20%5Crho_%7BS_%7Bn-1%7D%7D%28t%29%20dt%20%3D%20%5Cfrac%7B%28%5Clambda%20t%29%5En%7D%7Bn%21%7D%20e%5E%7B-%5Clambda%20t%7D%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} P\{N_t = n\} &amp; = P\{S_{n-1} &lt; t &lt; S_n\} = P\{S_n &gt; t\} - P\{S_{n-1} \geq t\} \\ &amp; = \int_t^\infty \rho_{S_n}(t) dt - \int_t^\infty \rho_{S_{n-1}}(t) dt = \frac{(\lambda t)^n}{n!} e^{-\lambda t} \end{aligned}' title='\displaystyle \begin{aligned} P\{N_t = n\} &amp; = P\{S_{n-1} &lt; t &lt; S_n\} = P\{S_n &gt; t\} - P\{S_{n-1} \geq t\} \\ &amp; = \int_t^\infty \rho_{S_n}(t) dt - \int_t^\infty \rho_{S_{n-1}}(t) dt = \frac{(\lambda t)^n}{n!} e^{-\lambda t} \end{aligned}' class='latex' /></p>
</li>
<li> <strong>Characteristic Function or Fourier Tranform</strong>: Taking <img src='http://s.wordpress.com/latex.php?latex=%7BQ%20%3D%20%5C%7Be%5E%7Bi%5Clambda%20x%7D%3B%20x%20%5Cin%20%5Cmathbb%7BC%7D%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Q = \{e^{i\lambda x}; x \in \mathbb{C}\}}' title='{Q = \{e^{i\lambda x}; x \in \mathbb{C}\}}' class='latex' /> we get the Fourier Transform: <img src='http://s.wordpress.com/latex.php?latex=%7Bf_X%28%5Clambda%29%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%20e%5E%7Bi%20%5Clambda%20x%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_X(\lambda) = \mathop{\mathbb E} e^{i \lambda x}}' title='{f_X(\lambda) = \mathop{\mathbb E} e^{i \lambda x}}' class='latex' /> which also has some of the nice properties of the previous ones and some additional ones. The characteristic functions were the main actors in the development of all the probability techniques that lead to the main result of 19th century Probability Theory: the Central Limit Theorem. We know that moment generating functions and Laplace transforms completely characterize the distributions, but it is not clear how to recover a distribution once we have a transform. For Fourier Transform there is a cleas and simple way of doing that by means of the Inversion Formula:
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Crho_X%28x%29%20%3D%20%5Cfrac%7B1%7D%7B2%20%5Cpi%7D%20%5Cint_%7B%5Cmathbb%20R%7D%20e%5E%7B-i%20%5Clambda%20x%7D%20f_X%28%5Clambda%29%20d%5Clambda&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \rho_X(x) = \frac{1}{2 \pi} \int_{\mathbb R} e^{-i \lambda x} f_X(\lambda) d\lambda' title='\displaystyle \rho_X(x) = \frac{1}{2 \pi} \int_{\mathbb R} e^{-i \lambda x} f_X(\lambda) d\lambda' class='latex' /></p>
<p>One fact that always puzzled me was: why is the normal distribution <img src='http://s.wordpress.com/latex.php?latex=%7B%5Crho_N%20%28x%29%20%3D%20%5Cfrac%7B1%7D%7B%5Csqrt%7B2%20%5Cpi%7D%7D%20e%5E%7B-x%5E2%2F2%7D%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\rho_N (x) = \frac{1}{\sqrt{2 \pi}} e^{-x^2/2} }' title='{\rho_N (x) = \frac{1}{\sqrt{2 \pi}} e^{-x^2/2} }' class='latex' /> so important? What does it have in special to be the limiting distribution in the Central Limit Theorem, i.e., if <img src='http://s.wordpress.com/latex.php?latex=%7BX_1%2C%20X_2%2C%20%5Chdots%2C%20X_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X_1, X_2, \hdots, X_n}' title='{X_1, X_2, \hdots, X_n}' class='latex' /> is a sequence of independent random variables, <img src='http://s.wordpress.com/latex.php?latex=%7BS_n%20%3D%20%5Csum_1%5En%20X_i%20-%20%5Csum_1%5En%20%5Cmathop%7B%5Cmathbb%20E%7D%20X_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_n = \sum_1^n X_i - \sum_1^n \mathop{\mathbb E} X_i}' title='{S_n = \sum_1^n X_i - \sum_1^n \mathop{\mathbb E} X_i}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7BS_n%20%2F%20%5Csqrt%7Bvar%20S_n%7D%20%5Crightarrow%20N%280%2C1%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S_n / \sqrt{var S_n} \rightarrow N(0,1)}' title='{S_n / \sqrt{var S_n} \rightarrow N(0,1)}' class='latex' /> under some natural conditions on the variables. The reason the normal is so special is because it is a &#8220;fixed point&#8221; for the Fourier Transform. We can see that <img src='http://s.wordpress.com/latex.php?latex=%7Bf_N%28%5Clambda%29%20%3D%20e%5E%7B-%5Clambda_2%2F2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_N(\lambda) = e^{-\lambda_2/2}}' title='{f_N(\lambda) = e^{-\lambda_2/2}}' class='latex' />. And there we have something special about it that makes me believe the Central Limit Theorem.</li>
</ol>
<p style="text-align: center;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-</p>
<p>This blog post was based on lectures by Professor Dynkin at Cornell.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/231/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Random Spanning Trees</title>
		<link>http://www.bigredbits.com/archives/226</link>
		<comments>http://www.bigredbits.com/archives/226#comments</comments>
		<pubDate>Wed, 04 Nov 2009 04:51:59 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[combinatorics]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=226</guid>
		<description><![CDATA[BigRedBits is again pleased to have Igor Gorodezky as a guest blogger directly from UCLA. I leave you with his excelent post on the Wilson&#8217;s algorithm. &#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212; Igor again, with another mathematical dispatch from UCLA, where I&#8217;m spending the semester eating and breathing combinatorics as part of the 2009 program on combinatorics and its applications [...]]]></description>
			<content:encoded><![CDATA[<p>BigRedBits is again pleased to have <a href="http://jay.cam.cornell.edu/~igor/">Igor Gorodezky</a> as a guest blogger directly from UCLA. I leave you with his excelent post on the Wilson&#8217;s algorithm.</p>
<p style="text-align: center;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</p>
<p>Igor again, with another mathematical dispatch from UCLA, where I&#8217;m spending the semester eating and breathing combinatorics as part of the 2009 program on combinatorics and its applications at IPAM. In the course of some reading related to a problem with which I&#8217;ve been occupying myself, I ran across a neat algorithmic result &#8211; Wilson&#8217;s algorithm for uniformly generating spanning trees of a graph. With Renato&#8217;s kind permission, let me once again make myself at home here at Big Red Bits and tell you all about this little gem.</p>
<p>The problem is straightforward, and I&#8217;ve essentially already stated it: given an undirected, connected graph <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' />, we want an algorithm that outputs uniformly random spanning trees of <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' />. In the early &#8217;90s, Aldous and Broder independently discovered an algorithm for accomplishing this task. This algorithm generates a tree <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> by, roughly speaking, performing a random walk on <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> and adding the edge <img src='http://s.wordpress.com/latex.php?latex=%7B%28u%2Cv%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(u,v)}' title='{(u,v)}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> every time that the walk steps from <img src='http://s.wordpress.com/latex.php?latex=%7Bu%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u}' title='{u}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> is a vertex that has not been seen before.</p>
<p>Wilson&#8217;s algorithm (D. B. Wilson, <a class="snap_noshots" href="http://dbwilson.com/ja/tau.ps">&#8220;Generating random spanning trees more quickly than the cover time,&#8221;</a> STOC &#8217;96) takes a slightly different approach. Let us fix a root vertex <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. Wilson&#8217;s algorithm can be stated as a <em>loop-erased random walk</em> on <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> as follows.</p>
<blockquote><p><strong>Algorithm 1 (Loop-erased random walk)</strong> Maintain a tree <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />, initialized to consist of <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> alone. While there remains a vertex <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> not in <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />: perform a random walk starting at <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' />, erasing loops as they are created, until the walk encounters a vertex in <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />, then add to <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> the cycle-erased simple path from <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />.</p></blockquote>
<p>We observe that the algorithm halts with probability 1 (its expected running time is actually polynomial, but let&#8217;s not concern ourselves with these issues here), and outputs a random directed spanning tree oriented towards <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. It is a minor miracle that this tree is in fact sampled <em>uniformly</em> from the set of all such trees. Let us note that this offers a solution to the original problem, as sampling <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> randomly and then running the algorithm will produce a uniformly generated spanning tree of <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' />.</p>
<p>It remains, then, to prove that the algorithm produces uniform spanning trees rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> (by which we mean directed spanning trees oriented towards <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />). To this we dedicate the remainder of this post.</p>
<p><strong>1. A &#8220;different&#8221; algorithm </strong></p>
<p>Wilson&#8217;s proof is delightfully sneaky: we begin by stating and analyzing a seemingly different algorithm, the <em>cycle-popping</em> algorithm. We will prove that this algorithm has the desired properties, and then argue that it is equivalent to the loop-erased random walk (henceforth LERW).</p>
<p>The cycle-popping algorithm works as follows. Given <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />, associate with each non-root vertex <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> an infinite <em>stack</em> of neighbors. More formally, to each <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cneq%20r%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \neq r}' title='{v \neq r}' class='latex' /> we associate</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmathcal%20S_v%20%3D%20%5Bu_%7B0%7D%2C%20u_%7B1%7D%2C%20%5Cdots%20%5D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \mathcal S_v = [u_{0}, u_{1}, \dots ] ' title='\displaystyle  \mathcal S_v = [u_{0}, u_{1}, \dots ] ' class='latex' /></p>
<p>where each <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7Bi%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{i}}' title='{u_{i}}' class='latex' /> is uniformly (and independently) sampled from the set of neighbors of <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' />. Note that each stack is <em>not</em> a random walk, just a list of neighbors. We refer to the left-most element above as the <em>top</em> of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' />, and by <em>popping</em> the stack <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' /> we mean removing this top vertex from <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' />.</p>
<p>Define the <em>stack graph</em> <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> to be the directed graph on <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' /> that has an edge from <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bu%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u}' title='{u}' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=%7Bu%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u}' title='{u}' class='latex' /> is at the top of the stack <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' />. Clearly, if <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> has <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> vertices then <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> is an oriented subgraph of <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bn-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n-1}' title='{n-1}' class='latex' /> edges. The following lemma follows immediately.</p>
<blockquote><p><strong>Lemma 1</strong> <em><a name="lemstack_graph"></a> Either <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> is a directed spanning tree oriented towards <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> or it contains a directed cycle. </em></p></blockquote>
<p>If there is a directed cycle <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> we may pop it by popping <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' /> for every <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in C}' title='{v \in C}' class='latex' />. This eliminates <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' />, but of course might create other directed cycles. Without resolving this tension quite yet, let us go ahead and formally state the cycle-popping algorithm.</p>
<blockquote><p><strong>Algorithm 2 (Cycle-popping algorithm)</strong> Create a stack <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' /> for every <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cneq%20r%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \neq r}' title='{v \neq r}' class='latex' />. While <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> contains any directed cycles, pop a cycle from the stacks. If this process ever terminates, output <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' />.</p></blockquote>
<p>Note that by the lemma, if the algorithm ever terminates then its output is a spanning tree rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. We claim that the algorithm terminates with probability 1, and moreover generates spanning trees rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> uniformly.</p>
<p>To this end, some more definitions: let us say that given a stack <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' />, the vertex <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> is at <em>level</em> <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />. The level of a vertex in a stack is static, and is defined when the stack is created. That is, the level of <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> does not change even if <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i}' title='{u_i}' class='latex' /> advances to the top of the stack as a result of the stack getting popped.</p>
<p>We regard the sequence of stack graphs produced by the algorithm as <em>leveled</em> stack graphs: each non-root vertex <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> is assigned the level of its stack. Observe that the level of <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> is the number of times that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20S_v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal S_v}' title='{\mathcal S_v}' class='latex' /> has been popped. In the same way, we regard cycles encountered by the algorithm as leveled cycles, and we can regard the tree produced by the algorithm (if indeed one is produced) as a leveled tree.</p>
<p>The analysis of the algorithm relies on the following key lemma (Theorem 4 in Wilson&#8217;s paper), which tells us that the order in which the algorithm pops cycles is irrelevant.</p>
<blockquote><p><strong>Lemma 2</strong> <em><a name="lemcycles_commute"></a> For a given set of stacks, either the cycle-popping algorithm never terminates, or there exists a unique leveled spanning tree <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> such that the algorithm outputs <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> irrespective of the order in which cycles are popped. </em></p></blockquote>
<p><em>Proof:</em> Fix a set of stacks <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B%20%5Cmathcal%20S_v%20%5C%7D_%7Bv%20%5Cneq%20r%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{ \mathcal S_v \}_{v \neq r}}' title='{\{ \mathcal S_v \}_{v \neq r}}' class='latex' />. Consider a leveled cycle <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> that is pop-able, i.e.~there exist leveled cycles <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20C_2%2C%20%5Cdots%2C%20C_k%3DC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, C_2, \dots, C_k=C}' title='{C_1, C_2, \dots, C_k=C}' class='latex' /> that can be popped in sequence. We claim that if the algorithm pops any cycle not equal to <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' />, then there still must exist a series of cycles that ends in <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> and that can be popped in sequence. In other words, if <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> is pop-able then it remains pop-able, no matter which cycles are popped, until <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> itself is actually popped.</p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> be a cycle popped by the algorithm. If <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%3DC_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;=C_1}' title='{C&#039;=C_1}' class='latex' /> then the claim is clearly true. Also, if <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> shares no vertices with <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20%5Cdots%2C%20C_k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, \dots, C_k}' title='{C_1, \dots, C_k}' class='latex' />, then the claim is true again. So assume otherwise, and let <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' /> be the first in the series <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20%5Cdots%2C%20C_k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, \dots, C_k}' title='{C_1, \dots, C_k}' class='latex' /> to share a vertex with <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' />. Let us show that <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%3DC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;=C_i}' title='{C&#039;=C_i}' class='latex' /> by contradiction.</p>
<p>If <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%20%5Cneq%20C%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i \neq C&#039;}' title='{C_i \neq C&#039;}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> must share a vertex <img src='http://s.wordpress.com/latex.php?latex=%7Bw%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w}' title='{w}' class='latex' /> that has different successors in <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' />. But by definition of <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' />, none of the <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20%5Cdots%2C%20C_%7Bi-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, \dots, C_{i-1}}' title='{C_1, \dots, C_{i-1}}' class='latex' /> contain <img src='http://s.wordpress.com/latex.php?latex=%7Bw%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w}' title='{w}' class='latex' />, and this implies that <img src='http://s.wordpress.com/latex.php?latex=%7Bw%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w}' title='{w}' class='latex' /> has the same level in <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' />. Therefore its successor in both cycles is the same, a contradiction. This proves <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%3DC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i=C&#039;}' title='{C_i=C&#039;}' class='latex' />.</p>
<p>Moreover, the argument above proves that <img src='http://s.wordpress.com/latex.php?latex=%7BC_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_i}' title='{C_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BC%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C&#039;}' title='{C&#039;}' class='latex' /> are equal as <em>leveled</em> cycles (i.e.~every vertex has the same level in both cycles). Hence</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20C%27%3DC_i%2C%20C_1%2C%20C_2%2C%20%5Cdots%2C%20C_%7Bi-1%7D%2C%20C_%7Bi%2B1%7D%2C%20%5Cdots%2C%20C_k%3DC%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  C&#039;=C_i, C_1, C_2, \dots, C_{i-1}, C_{i+1}, \dots, C_k=C ' title='\displaystyle  C&#039;=C_i, C_1, C_2, \dots, C_{i-1}, C_{i+1}, \dots, C_k=C ' class='latex' /></p>
<p>is a series of cycles that can be popped in sequence, which proves the original claim about <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' />.</p>
<p>We conclude that given a set of stacks, either there is an infinite number of pop-able cycles, in which case there will always be an infinite number and the algorithm will never terminate, or there is a finite number of such cycles. In the latter case, every one of these cycles is eventually popped, and the algorithm produces a spanning tree <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. The level of each non-root vertex in <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> is given by (one plus) the number of popped cycles that contained <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' />. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>Wilson summarizes the cycle-popping algorithm thusly: &#8220;[T]he stacks uniquely define a tree together with a partially ordered set of cycles layered on top of it. The algorithm peels off these cycles to find the tree.&#8221;</p>
<blockquote><p><strong>Theorem 3</strong> <em> The cycle-popping algorithm terminates with probability 1, and the tree that it outputs is a uniformly sampled spanning tree rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. </em></p></blockquote>
<p><em>Proof:</em> The first claim is easy: <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> has a spanning tree, therefore it has a directed spanning tree oriented towards <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. The stacks generated in the first step of the algorithm will contain such a tree, and hence the algorithm will terminate, with probability 1.</p>
<p>Now, consider a spanning tree <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' />. We&#8217;ll abuse notation and let <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> be the event that <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' /> is produced by the algorithm. Similarly, given a collection of leveled cycles <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal C}' title='{\mathcal C}' class='latex' />, we will write <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal C}' title='{\mathcal C}' class='latex' /> for the event that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal C}' title='{\mathcal C}' class='latex' /> is the set of leveled cycles popped by the algorithm before it terminates. Finally, let <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20C%20%5Cwedge%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal C \wedge T}' title='{\mathcal C \wedge T}' class='latex' /> be the event that the algorithm popped the leveled cycles in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal C}' title='{\mathcal C}' class='latex' /> and terminated, with the resulting <em>leveled</em> tree being equal to <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />.</p>
<p>By the independence of the stack entries, we have <img src='http://s.wordpress.com/latex.php?latex=%7B%5CPr%5B%5Cmathcal%20C%20%5Cwedge%20T%5D%20%3D%20%5CPr%5B%5Cmathcal%20C%5D%20%5Ccdot%20p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Pr[\mathcal C \wedge T] = \Pr[\mathcal C] \cdot p}' title='{\Pr[\mathcal C \wedge T] = \Pr[\mathcal C] \cdot p}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> is the probability that the algorithm&#8217;s output is a leveled version of <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />, a quantity which a moment&#8217;s reflection will reveal is independent of <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />. Now,</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5CPr%5BT%5D%20%3D%20%5Csum_%7B%5Cmathcal%20C%7D%20%5CPr%5B%5Cmathcal%20C%20%5Cwedge%20T%5D%20%3D%20p%20%5Csum_%7B%5Cmathcal%20C%7D%20%5CPr%5B%5Cmathcal%20C%5D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \Pr[T] = \sum_{\mathcal C} \Pr[\mathcal C \wedge T] = p \sum_{\mathcal C} \Pr[\mathcal C] ' title='\displaystyle  \Pr[T] = \sum_{\mathcal C} \Pr[\mathcal C \wedge T] = p \sum_{\mathcal C} \Pr[\mathcal C] ' class='latex' /></p>
<p>which, as desired, is independent of <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p><strong>2. Conclusion </strong></p>
<p>We have shown that the cycle-popping algorithm generates spanning trees rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> uniformly. It remains to observe that the LERW algorithm is nothing more than an implementation of the cycle-popping algorithm! Instead of initially generating the (infinitely long) stacks and then looking for cycles to pop, the LERW generates stack elements as necessary via random walk (computer scientists might recognize this as the Principle of Deferred Decisions). If the LERW encounters a loop, then it has found a cycle in the stack graph <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7B%5Cmathcal%20S%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{\mathcal S}}' title='{G_{\mathcal S}}' class='latex' /> induced by the stacks that the LERW has been generating. Erasing the loop is equivalent to popping this cycle. We conclude that the LERW algorithm generates spanning trees rooted at <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> uniformly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/226/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More about hats and auctions</title>
		<link>http://www.bigredbits.com/archives/218</link>
		<comments>http://www.bigredbits.com/archives/218#comments</comments>
		<pubDate>Thu, 29 Oct 2009 05:41:52 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[puzzles]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[coding theory]]></category>
		<category><![CDATA[mechanism design]]></category>
		<category><![CDATA[profit maximization]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=218</guid>
		<description><![CDATA[In my last post about hats, I told I&#8217;ll soon post another version with some more problems, which I ended up not doing and would talk a bit more about those kind of problems. I ended up not doing, but here are a few nice problems: Those people are again a room, each with a [...]]]></description>
			<content:encoded><![CDATA[<p>In <a class="snap_noshots" href="http://www.bigredbits.com/?p=161">my last post about hats</a>, I told I&#8217;ll soon post another version with some more problems, which I ended up not doing and would talk a bit more about those kind of problems. I ended up not doing, but here are a few nice problems:</p>
<blockquote><p>Those <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> people are again a room, each with a hat which is either black or white (picked with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> at random) and they can see the color of the other people&#8217;s hats but they can&#8217;t see their own color. They write in a piece of paper either &#8220;BLACK&#8221; or &#8220;WHITE&#8221;. The whole team wins if all of them get their colors right. The whole team loses, if at least one writes the wrong color. Before entering the room and getting the hats, they can strategyze. What is a strategy that makes them win with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> probability?</p></blockquote>
<p>If they all choose their colors at random, the probability of winning is very small: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%5En%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2^n}}' title='{\frac{1}{2^n}}' class='latex' />. So we should try to correlate them somehow. The solution is again related with error correcting codes. We can think of the hats as a string of bits. How to correct one bit if it is lost? The simple engineering solution is to add a parity check. We append to the string <img src='http://s.wordpress.com/latex.php?latex=%7Bx_0%2C%20x_1%2C%20%5Chdots%2C%20x_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_0, x_1, \hdots, x_n}' title='{x_0, x_1, \hdots, x_n}' class='latex' /> a bit <img src='http://s.wordpress.com/latex.php?latex=%7By%20%3D%20%5Csum_i%20x_i%20%5Cmod%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y = \sum_i x_i \mod 2}' title='{y = \sum_i x_i \mod 2}' class='latex' />. So, if bit <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is lost, we know it is <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%3D%20%28y%20%2B%20%5Csum_%7Bj%20%5Cneq%20i%7D%20x_j%29%20%5Cmod%202%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i = (y + \sum_{j \neq i} x_j) \mod 2}' title='{x_i = (y + \sum_{j \neq i} x_j) \mod 2}' class='latex' />. We can use this idea to solve the puzzle above: if hats are places with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> probability, the parity check will be <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' />. They can decide before hand that everyone will use <img src='http://s.wordpress.com/latex.php?latex=%7By%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y = 0}' title='{y = 0}' class='latex' /> and with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> they are right and everyone gets his hat color right. Now, let&#8217;s extend this problem in some ways:</p>
<blockquote><p>The same problem, but there are <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> hat colors, they are choosen independently with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7Bk%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{k}}' title='{\frac{1}{k}}' class='latex' /> and they win if everyone gets his color right. Find a strategy that wins with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7Bk%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{k}}' title='{\frac{1}{k}}' class='latex' />.</p></blockquote>
<blockquote><p>There are again <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> hat colors, they are choosen independently with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7Bk%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{k}}' title='{\frac{1}{k}}' class='latex' /> and they win if at least a fraction <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' /> (<img src='http://s.wordpress.com/latex.php?latex=%7B0%20%3C%20f%20%3C%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0 &lt; f &lt; 1}' title='{0 &lt; f &lt; 1}' class='latex' />) of the people guesses the right color. Find a strategy that wins with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7Bfk%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{fk}}' title='{\frac{1}{fk}}' class='latex' />.</p></blockquote>
<blockquote><p>Again to the problem where we just have BLACK and WHITE colors, they are chosen with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> and everyone needs to find the right color to win, can you prove that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> is the best one can do? And what about the two other problems above?</p></blockquote>
<p>The first two use variations of the parity check idea in the solution. For the second case, given any strategy of the players, for each string <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in \{0,1\}^n}' title='{x \in \{0,1\}^n}' class='latex' /> they have probability <img src='http://s.wordpress.com/latex.php?latex=%7Bp_x%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_x}' title='{p_x}' class='latex' />. Therefore the total probability of winning is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%5En%7D%5Csum_%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D%20p_x%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x}' title='{\frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x}' class='latex' />. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27%20%3D%20%281-x_1%2C%20x_2%2C%20%5Chdots%2C%20x_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039; = (1-x_1, x_2, \hdots, x_n)}' title='{x&#039; = (1-x_1, x_2, \hdots, x_n)}' class='latex' />, i.e., the same input but with the bit <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> flipped. Notice that the answer of player <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> is the same (or at least has the same probabilities) in both <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;}' title='{x&#039;}' class='latex' />, since he can&#8217;t distinguish between <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bx%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x&#039;}' title='{x&#039;}' class='latex' />. Therefore, <img src='http://s.wordpress.com/latex.php?latex=%7Bp_%7Bx%7D%20%2B%20p_%7Bx%27%7D%20%5Cleq%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_{x} + p_{x&#039;} \leq 1}' title='{p_{x} + p_{x&#039;} \leq 1}' class='latex' />. So,</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%202%20%5Cfrac%7B1%7D%7B2%5En%7D%5Csum_%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D%20p_x%20%3D%20%5Cfrac%7B1%7D%7B2%5En%7D%5Csum_%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D%20p_x%20%2B%20%5Cfrac%7B1%7D%7B2%5En%7D%5Csum_%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D%20p_x%27%20%5Cleq%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle 2 \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x = \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x + \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x&#039; \leq 1' title='\displaystyle 2 \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x = \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x + \frac{1}{2^n}\sum_{x \in \{0,1\}^n} p_x&#039; \leq 1' class='latex' /></p>
<p>. This way, no strategy can have more than <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> probability of winning.</p>
<p>Another variation of it:</p>
<blockquote><p>Suppose now we have two colors BLACK and WHITE and the hats are drawn from one distribution <img src='http://s.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' />, i.e., we have a probability distribution over <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%5C%7B0%2C1%5C%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in \{0,1\}^n}' title='{x \in \{0,1\}^n}' class='latex' /> and we draw the colors from that distribution. Notice that now the hats are not uncorrelated. How to win again with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> (to win, everyone needs the right answer).</p></blockquote>
<p>I like a lot those hat problems. A friend of mine just pointed out to me that there is a <a class="snap_noshots" href="http://www.cs.cornell.edu/~rdk/papers/Hats.pdf">very nice paper by Bobby Kleinberg</a> generalizing several aspects of hat problems, for example, when players have limited visibility of other players hats.</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-163" title="hats1" src="http://www.bigredbits.com/wp-content/uploads/2009/10/hats1.png" alt="hats1" width="420" height="70" /></p>
<p>I began being interested by this sort of problem after reading the <a class="snap_noshots" href="www.ece.northwestern.edu/~hartline/papers/auctions-STOC-05.ps">Derandomization of Auctions</a> paper. Hat guessing games are not just a good model for error correcting codes, but they are also a good model for truthful auctions. Consider an auction with a set <img src='http://s.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> single parameter agents, i.e., an auction where each player gives one bid <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i}' title='{b_i}' class='latex' /> indicating how much he is willing to pay to win. We have a set of constraints: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%5Csubseteq%202%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} \subseteq 2^N}' title='{\mathcal{X} \subseteq 2^N}' class='latex' /> of all feasible allocations. Based on the bids <img src='http://s.wordpress.com/latex.php?latex=%7B%28b_i%29_%7Bi%20%5Cin%20N%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(b_i)_{i \in N}}' title='{(b_i)_{i \in N}}' class='latex' /> we choose an allocation <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{X}}' title='{S \in \mathcal{X}}' class='latex' /> and we charge payments to the bidders. An example of a problem like this is the Digital Goods Auction, where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D%202%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} = 2^N}' title='{\mathcal{X} = 2^N}' class='latex' />.</p>
<p>In <a class="snap_noshots" href="http://www.bigredbits.com/?p=151">this blog post</a>, I discussed the concept of truthful auction. If an auction is randomized, an universal truthful auction is an auction that is truthful even if all the random bits in the mechanism are revealed to the bidders. Consider the Digital Goods Auction. We can characterize universal truthful digital goods auction as bid-independent auctions. A bid-independent auction is given by function <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' />, which associated for each <img src='http://s.wordpress.com/latex.php?latex=%7Bb_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_{-i}}' title='{b_{-i}}' class='latex' /> a random variable <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' />. In that auction, we offer the service to player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> at price <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' />. If <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%20%5Cgeq%20f_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i \geq f_i(b_{-i})}' title='{b_i \geq f_i(b_{-i})}' class='latex' /> we allocate to <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> and charge him <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' />. Otherwise, we don&#8217;t allocate and we charge nothing.</p>
<p>It is not hard to see that all universal truthful mechanisms are like that: if <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28b_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(b_i)}' title='{x_i(b_i)}' class='latex' /> is the probability that player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> gets the item bidding <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i}' title='{b_i}' class='latex' /> let <img src='http://s.wordpress.com/latex.php?latex=%7BU%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{U}' title='{U}' class='latex' /> be an uniform random variable on <img src='http://s.wordpress.com/latex.php?latex=%7B%5B0%2C1%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{[0,1]}' title='{[0,1]}' class='latex' /> and define <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%20%3D%20x_i%5E%7B-1%7D%28U%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i}) = x_i^{-1}(U)}' title='{f_i(b_{-i}) = x_i^{-1}(U)}' class='latex' />. Notice that here <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28.%29%20%3D%20x_i%28.%2C%20b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(.) = x_i(., b_{-i})}' title='{x_i(.) = x_i(., b_{-i})}' class='latex' />, but we are inverting with respect to <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i}' title='{b_i}' class='latex' />. It is a simple exercise to prove that.</p>
<p>With this characterization, universal truthful auctions suddenly look very much like hat guessing games: we need to design a function that looks at everyone else&#8217;s bid but not on our own and in some sense, &#8220;guesses&#8221; what we probably have and with that calculated the price we offer. It would be great to be able to design a function that returns <img src='http://s.wordpress.com/latex.php?latex=%7Bf%28b_%7B-i%7D%29%20%3D%20b_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f(b_{-i}) = b_i}' title='{f(b_{-i}) = b_i}' class='latex' />. That is unfortunately impossible. But how to approximate <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i}' title='{b_i}' class='latex' /> nicely? Some papers, like the <a class="snap_noshots" href="www.ece.northwestern.edu/~hartline/papers/auctions-STOC-05.ps">Derandomization of Auctions</a> and <a class="snap_noshots" href="www.avglab.com/andrew/pub/soda03.ps">Competitiveness via Consensus</a> use this idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/218/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cayley-Hamilton Theorem and Jordan Canonical Form</title>
		<link>http://www.bigredbits.com/archives/215</link>
		<comments>http://www.bigredbits.com/archives/215#comments</comments>
		<pubDate>Thu, 29 Oct 2009 04:17:15 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[finite fields]]></category>
		<category><![CDATA[linear algebra]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=215</guid>
		<description><![CDATA[I was discussing last week with my officemates Hu Fu and Ashwin about the Cayley-Hamilton Theorem. The theorem is the following, given an matrix we can define its characteristic polynomial by . The Cayley-Hamilton Theorem says that . The polynomiale is something like: so we can just see it as a formal polynomial and think [...]]]></description>
			<content:encoded><![CDATA[<p>I was discussing last week with my officemates Hu Fu and Ashwin about the Cayley-Hamilton Theorem. The theorem is the following, given an <img src='http://s.wordpress.com/latex.php?latex=%7Bn%20%5Ctimes%20n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n \times n}' title='{n \times n}' class='latex' /> matrix <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> we can define its characteristic polynomial by <img src='http://s.wordpress.com/latex.php?latex=%7Bp_A%28%5Clambda%29%20%3D%20%5Cdet%28A%20-%20I%5Clambda%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_A(\lambda) = \det(A - I\lambda)}' title='{p_A(\lambda) = \det(A - I\lambda)}' class='latex' />. The Cayley-Hamilton Theorem says that <img src='http://s.wordpress.com/latex.php?latex=%7Bp_A%28A%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_A(A) = 0}' title='{p_A(A) = 0}' class='latex' />. The polynomiale is something like:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p_A%28x%29%20%3D%20a_k%20x%5Ek%20%2B%20a_%7Bk-1%7D%20x%5E%7Bk-1%7D%20%2B%20%5Chdots%20%2B%20a_1%20x%5E1%20%2B%20a_0&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p_A(x) = a_k x^k + a_{k-1} x^{k-1} + \hdots + a_1 x^1 + a_0' title='\displaystyle p_A(x) = a_k x^k + a_{k-1} x^{k-1} + \hdots + a_1 x^1 + a_0' class='latex' /></p>
<p>so we can just see it as a formal polynomial and think of:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p_A%28A%29%20%3D%20a_k%20A%5Ek%20%2B%20a_%7Bk-1%7D%20A%5E%7Bk-1%7D%20%2B%20%5Chdots%20%2B%20a_1%20A%20%2B%20a_0%20I&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p_A(A) = a_k A^k + a_{k-1} A^{k-1} + \hdots + a_1 A + a_0 I' title='\displaystyle p_A(A) = a_k A^k + a_{k-1} A^{k-1} + \hdots + a_1 A + a_0 I' class='latex' /></p>
<p>which is an <img src='http://s.wordpress.com/latex.php?latex=%7Bn%20%5Ctimes%20n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n \times n}' title='{n \times n}' class='latex' /> matrix. The theorem says it is the zero matrix. We thought for a while, looked in the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Cayley_hamilton_theorem">Wikipedia</a>, and there there were a few proofs, but not the one-line proof I was looking for. Later, I got this proof that I sent to Hu Fu:</p>
<p>Write the matrix <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> in the basis of its eigenvectors, then we can write <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%3D%20%5CGamma%5Et%20D%20%5CGamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A = \Gamma^t D \Gamma}' title='{A = \Gamma^t D \Gamma}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' /> is the diagonal matrix with the eigenvalues in the main diagonal.</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20A%5Ek%20%3D%20%28%5CGamma%5Et%20D%20%5CGamma%29%20%5Chdots%20%28%5CGamma%5Et%20D%20%5CGamma%29%20%3D%20%5CGamma%5Et%20D%5Ek%20%5CGamma&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle A^k = (\Gamma^t D \Gamma) \hdots (\Gamma^t D \Gamma) = \Gamma^t D^k \Gamma' title='\displaystyle A^k = (\Gamma^t D \Gamma) \hdots (\Gamma^t D \Gamma) = \Gamma^t D^k \Gamma' class='latex' /></p>
<p>and since <img src='http://s.wordpress.com/latex.php?latex=%7BD%20%3D%20%5Ctext%7Bdiag%7D%28%5Clambda_1%2C%20%5Chdots%2C%20%5Clambda_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D = \text{diag}(\lambda_1, \hdots, \lambda_n)}' title='{D = \text{diag}(\lambda_1, \hdots, \lambda_n)}' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%7BD%5Ek%20%3D%20%5Ctext%7Bdiag%7D%28%5Clambda_1%5Ek%2C%20%5Chdots%2C%20%5Clambda_n%5Ek%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D^k = \text{diag}(\lambda_1^k, \hdots, \lambda_n^k)}' title='{D^k = \text{diag}(\lambda_1^k, \hdots, \lambda_n^k)}' class='latex' />. Now, it is simple to see that:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p_A%28A%29%20%3D%20%5CGamma%5Et%20p%28D%29%20%5CGamma&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p_A(A) = \Gamma^t p(D) \Gamma' title='\displaystyle p_A(A) = \Gamma^t p(D) \Gamma' class='latex' /></p>
<p>and therefore:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p%28D%29%20%3D%20%5Cbegin%7Bbmatrix%7D%26%20p_A%28%5Clambda_1%29%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20p_A%28%5Clambda_2%29%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20%26%20%5Cddots%20%26%20%5C%5C%20%26%20%26%20%26%20%26%20p_A%28%5Clambda_n%29%20%5Cend%7Bbmatrix%7D%20%3D%20%5Cbegin%7Bbmatrix%7D%20%26%200%20%26%20%26%20%26%20%5C%5C%20%26%20%26%200%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20%26%20%5Cddots%20%26%20%5C%5C%20%26%20%26%20%26%20%26%200%20%5Cend%7Bbmatrix%7D%20%3D%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p(D) = \begin{bmatrix}&amp; p_A(\lambda_1) &amp; &amp; &amp; \\ &amp; &amp; p_A(\lambda_2) &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; p_A(\lambda_n) \end{bmatrix} = \begin{bmatrix} &amp; 0 &amp; &amp; &amp; \\ &amp; &amp; 0 &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; 0 \end{bmatrix} = 0' title='\displaystyle p(D) = \begin{bmatrix}&amp; p_A(\lambda_1) &amp; &amp; &amp; \\ &amp; &amp; p_A(\lambda_2) &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; p_A(\lambda_n) \end{bmatrix} = \begin{bmatrix} &amp; 0 &amp; &amp; &amp; \\ &amp; &amp; 0 &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; 0 \end{bmatrix} = 0' class='latex' /></p>
<p>And that was the one-line proof. One even simpler proof is: let <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> be the eigenvectors, then <img src='http://s.wordpress.com/latex.php?latex=%7Bp_A%28A%29v_i%20%3D%20p_A%28%5Clambda_i%29A%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_A(A)v_i = p_A(\lambda_i)A = 0}' title='{p_A(A)v_i = p_A(\lambda_i)A = 0}' class='latex' />, so <img src='http://s.wordpress.com/latex.php?latex=%7Bp_A%28A%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_A(A)}' title='{p_A(A)}' class='latex' /> must be <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> since it returns zero for all elements of a basis. Well, I sent that to Hu Fu and he told me the proof had a bug. Not really a bug, but I was proving only for symmetric matrices. More generally, I was proving for diagonalizable matrices. He showed me, for example, the matrix:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Bbmatrix%7D%200%20%26%201%20%5C%5C%200%20%26%200%20%5Cend%7Bbmatrix%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{bmatrix} 0 &amp; 1 \\ 0 &amp; 0 \end{bmatrix}' title='\displaystyle  \begin{bmatrix} 0 &amp; 1 \\ 0 &amp; 0 \end{bmatrix}' class='latex' /></p>
<p>which has only one eigenvalue <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> and the the eigenvectors are all of the form <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%2C0%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x,0)}' title='{(x,0)}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in {\mathbb R}}' title='{x \in {\mathbb R}}' class='latex' />. So, the dimension of the space spanned by the eigenvectors is <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />, less than the dimension of the matrix. This never happens for symmetric matrices, and I guess after some time as a computer scientist, I got used to work only with symmetric matrices for almost everything I use: metrics, quadratic forms, correlation matrices, &#8230; but there is more out there then only symmetric matrices. The good news is that this proof is not hard to fix for the general case.</p>
<p>First, it is easy to prove that for each root of the characteristic polynomial there is one eigenvector associated to it (just see that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cdet%28A%20-%20%5Clambda%20I%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\det(A - \lambda I) = 0}' title='{\det(A - \lambda I) = 0}' class='latex' /> and therefore there must be <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20%5Cker%28A%20-%20%5Clambda%20I%29%20%5Csetminus%20%5C%7B%200%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in \ker(A - \lambda I) \setminus \{ 0 \}}' title='{v \in \ker(A - \lambda I) \setminus \{ 0 \}}' class='latex' />, so if all the roots are distinct, then there is a basis of eigenvalues, and therefore the matrix is diagonalizable (notice that maybe we will need to use complex eigenvalues, but it is ok). The good thing is that a matrix having two identical eigenvalues is a &#8220;coincidence&#8221;. We can identify matrices with <img src='http://s.wordpress.com/latex.php?latex=%7B%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{\mathbb R}^{n^2}}' title='{{\mathbb R}^{n^2}}' class='latex' />. The matrices with identical eigenvalues form a zero measure subset of <img src='http://s.wordpress.com/latex.php?latex=%7B%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{\mathbb R}^{n^2}}' title='{{\mathbb R}^{n^2}}' class='latex' />, they are in fact the roots of a polynomial in <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bij%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{ij}}' title='{x_{ij}}' class='latex' />. This polynomial is the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Resultant">resultant polynomial</a> <img src='http://s.wordpress.com/latex.php?latex=%7BR_%7Bp%2Cp%27%7D%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{R_{p,p&#039;} = 0}' title='{R_{p,p&#039;} = 0}' class='latex' />. Therefore, we proved Cayley-Hamilton theorem in the complement of a zero-measure set in <img src='http://s.wordpress.com/latex.php?latex=%7B%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{\mathbb R}^{n^2}}' title='{{\mathbb R}^{n^2}}' class='latex' />. Since <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Cmapsto%20p_A%28A%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \mapsto p_A(A)}' title='{A \mapsto p_A(A)}' class='latex' /> is a continuous function, it extends naturally to all matrices <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Cin%20%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \in {\mathbb R}^{n^2}}' title='{A \in {\mathbb R}^{n^2}}' class='latex' />.</p>
<p>We can also interpret that probabilistically: get a matrix <img src='http://s.wordpress.com/latex.php?latex=%7BU%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{U}' title='{U}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7BU_%7Bij%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{U_{ij}}' title='{U_{ij}}' class='latex' /> is taken uniformly at random from <img src='http://s.wordpress.com/latex.php?latex=%7B%5B0%2C1%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{[0,1]}' title='{[0,1]}' class='latex' />. Then <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%2B%20%5Cepsilon%20U%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A + \epsilon U}' title='{A + \epsilon U}' class='latex' /> has with probability <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> all different eigenvalues. So, <img src='http://s.wordpress.com/latex.php?latex=%7Bp_%7BA%2B%5Cepsilon%20U%7D%20%28A%2B%5Cepsilon%20U%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_{A+\epsilon U} (A+\epsilon U) = 0}' title='{p_{A+\epsilon U} (A+\epsilon U) = 0}' class='latex' /> with probability <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. Now, just make <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cepsilon%20%5Crightarrow%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\epsilon \rightarrow 0}' title='{\epsilon \rightarrow 0}' class='latex' />.</p>
<p>Ok, this proves the Theorem for real and complex matrices, but what about a matrix defined over a general field where we can&#8217;t use those continuity arguments. A way to get around it is by using <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Jordan_canonical_form">Jordan Canonical Form</a>, which is a generalization of eigenvector decomposition. Not all matrices have eigenvector decomposition, but all matrices over an algebraic closed field can be written in Jordan Canonical Form. Given any <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Cin%20%5Coverline%7BK%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \in \overline{K}^{n^2}}' title='{A \in \overline{K}^{n^2}}' class='latex' /> there is a matrix <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%20%5Cin%20%5Coverline%7BK%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma \in \overline{K}^{n^2}}' title='{\Gamma \in \overline{K}^{n^2}}' class='latex' /> so that:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20A%20%3D%20%5CGamma%5E%7B-1%7D%20%5Cbegin%7Bbmatrix%7D%26%20B_1%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20B_2%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20%26%20%5Cddots%20%26%20%5C%5C%20%26%20%26%20%26%20%26%20B_p%20%5Cend%7Bbmatrix%7D%5CGamma&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle A = \Gamma^{-1} \begin{bmatrix}&amp; B_1 &amp; &amp; &amp; \\ &amp; &amp; B_2 &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; B_p \end{bmatrix}\Gamma' title='\displaystyle A = \Gamma^{-1} \begin{bmatrix}&amp; B_1 &amp; &amp; &amp; \\ &amp; &amp; B_2 &amp; &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; B_p \end{bmatrix}\Gamma' class='latex' /></p>
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7BB_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B_i}' title='{B_i}' class='latex' /> are blocks of the form:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20B_i%20%3D%20%5Cbegin%7Bbmatrix%7D%26%20%5Clambda%20%26%201%20%26%20%26%20%5C%5C%20%26%20%26%20%5Clambda%20%26%201%20%26%20%26%20%5C%5C%20%26%20%26%20%26%20%5Cddots%20%26%20%5C%5C%20%26%20%26%20%26%20%26%20%5Clambda%20%5Cend%7Bbmatrix%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle B_i = \begin{bmatrix}&amp; \lambda &amp; 1 &amp; &amp; \\ &amp; &amp; \lambda &amp; 1 &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; \lambda \end{bmatrix}' title='\displaystyle B_i = \begin{bmatrix}&amp; \lambda &amp; 1 &amp; &amp; \\ &amp; &amp; \lambda &amp; 1 &amp; &amp; \\ &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; \lambda \end{bmatrix}' class='latex' /></p>
<p>By the same argument as above, we just need to prove Cayley Hamilton for each block in separate. So we need to prove that <img src='http://s.wordpress.com/latex.php?latex=%7Bp_A%28B_i%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_A(B_i) = 0}' title='{p_A(B_i) = 0}' class='latex' />. If the block has size <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />, then it is exacly the proof above. If the block is bigger, then we need to look at how does <img src='http://s.wordpress.com/latex.php?latex=%7BB_i%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B_i^k}' title='{B_i^k}' class='latex' /> looks like. By inspection:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20B_i%5E2%20%3D%20%5Cbegin%7Bbmatrix%7D%26%20%5Clambda%5E2%20%26%202%20%5Clambda%20%26%201%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20%5Clambda%5E2%20%26%202%20%5Clambda%20%26%201%20%26%20%26%20%26%20%5C%5C%20%26%20%26%20%26%20%26%20%26%20%5Cddots%20%26%20%5C%5C%20%26%20%26%20%26%20%26%20%26%20%5Clambda%5E2%20%5Cend%7Bbmatrix%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle B_i^2 = \begin{bmatrix}&amp; \lambda^2 &amp; 2 \lambda &amp; 1 &amp; &amp; &amp; \\ &amp; &amp; \lambda^2 &amp; 2 \lambda &amp; 1 &amp; &amp; &amp; \\ &amp; &amp; &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; &amp; \lambda^2 \end{bmatrix}' title='\displaystyle B_i^2 = \begin{bmatrix}&amp; \lambda^2 &amp; 2 \lambda &amp; 1 &amp; &amp; &amp; \\ &amp; &amp; \lambda^2 &amp; 2 \lambda &amp; 1 &amp; &amp; &amp; \\ &amp; &amp; &amp; &amp; &amp; \ddots &amp; \\ &amp; &amp; &amp; &amp; &amp; \lambda^2 \end{bmatrix}' class='latex' /></p>
<p>Tipically, for <img src='http://s.wordpress.com/latex.php?latex=%7BB_i%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B_i^k}' title='{B_i^k}' class='latex' /> we have in each row, starting in column <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> the sequence <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda%5Ek%2C%20k%20%5Clambda%5E%7Bk-1%7D%2C%20k%28k-1%29%20%5Clambda%5E%7Bk-1%7D%2C%20%5Chdots%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda^k, k \lambda^{k-1}, k(k-1) \lambda^{k-1}, \hdots}' title='{\lambda^k, k \lambda^{k-1}, k(k-1) \lambda^{k-1}, \hdots}' class='latex' />, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bd%5E0%7D%7Bd%5Clambda%5E0%7D%20%5Clambda%5Ek%2C%20%5Cfrac%7Bd%5E1%7D%7Bd%5Clambda%5E1%7D%20%5Clambda%5Ek%20%2C%20%5Cfrac%7Bd%5E2%7D%7Bd%5Clambda%5E2%7D%20%5Clambda%5Ek%20%5Chdots%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{d^0}{d\lambda^0} \lambda^k, \frac{d^1}{d\lambda^1} \lambda^k , \frac{d^2}{d\lambda^2} \lambda^k \hdots}' title='{\frac{d^0}{d\lambda^0} \lambda^k, \frac{d^1}{d\lambda^1} \lambda^k , \frac{d^2}{d\lambda^2} \lambda^k \hdots}' class='latex' />. So, we have</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p%28B_i%29%20%3D%20%5Cbegin%7Bbmatrix%7D%20p%28%5Clambda%29%20%26%20p%27%28%5Clambda%29%20%26%20p%27%27%28%5Clambda%29%20%26%20p%27%27%27%28%5Clambda%29%20%26%20%5Chdots%20%5C%5C%20%26%20p%28%5Clambda%29%20%26%20p%27%28%5Clambda%29%20%26%20p%27%27%28%5Clambda%29%20%26%20%5Chdots%20%5C%5C%20%09%09%09%09%26%20%26%20p%28%5Clambda%29%20%26%20p%27%28%5Clambda%29%20%26%20%5Chdots%20%5C%5C%20%09%09%09%09%26%20%26%20%26%20p%28%5Clambda%29%20%26%20%5Chdots%20%5C%5C%20%09%09%09%09%26%20%26%20%26%20%26%20%5Cddots%20%5C%5C%20%5Cend%7Bbmatrix%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p(B_i) = \begin{bmatrix} p(\lambda) &amp; p&#039;(\lambda) &amp; p&#039;&#039;(\lambda) &amp; p&#039;&#039;&#039;(\lambda) &amp; \hdots \\ &amp; p(\lambda) &amp; p&#039;(\lambda) &amp; p&#039;&#039;(\lambda) &amp; \hdots \\ 				&amp; &amp; p(\lambda) &amp; p&#039;(\lambda) &amp; \hdots \\ 				&amp; &amp; &amp; p(\lambda) &amp; \hdots \\ 				&amp; &amp; &amp; &amp; \ddots \\ \end{bmatrix}' title='\displaystyle p(B_i) = \begin{bmatrix} p(\lambda) &amp; p&#039;(\lambda) &amp; p&#039;&#039;(\lambda) &amp; p&#039;&#039;&#039;(\lambda) &amp; \hdots \\ &amp; p(\lambda) &amp; p&#039;(\lambda) &amp; p&#039;&#039;(\lambda) &amp; \hdots \\ 				&amp; &amp; p(\lambda) &amp; p&#039;(\lambda) &amp; \hdots \\ 				&amp; &amp; &amp; p(\lambda) &amp; \hdots \\ 				&amp; &amp; &amp; &amp; \ddots \\ \end{bmatrix}' class='latex' /></p>
<p>If block <img src='http://s.wordpress.com/latex.php?latex=%7BB_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B_i}' title='{B_i}' class='latex' /> has size <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' />, then <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda_i}' title='{\lambda_i}' class='latex' /> has multiplicity <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7Bp%28.%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p(.)}' title='{p(.)}' class='latex' /> and therefore <img src='http://s.wordpress.com/latex.php?latex=%7Bp%28%5Clambda_i%29%20%3D%20p%27%28%5Clambda_i%29%20%3D%20%5Chdots%20%3D%20p%5E%7B%28k-1%29%7D%28%5Clambda_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p(\lambda_i) = p&#039;(\lambda_i) = \hdots = p^{(k-1)}(\lambda_i)}' title='{p(\lambda_i) = p&#039;(\lambda_i) = \hdots = p^{(k-1)}(\lambda_i)}' class='latex' /> and therefore, <img src='http://s.wordpress.com/latex.php?latex=%7Bp%28B_i%29%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p(B_i) = 0}' title='{p(B_i) = 0}' class='latex' /> as we wanted to prove.</p>
<p>It turned out not to be a very very short proof, but it is still short, since it uses mostly elementary stuff and the proof is really intuitive in some sense. I took some lessons from that: (i) first it reinforces my idea that, if I need to say something about a matrix, the first thing I do is to look at its eigenvectors decomposition. A lot of Linear Algebra problems are very simple when we consider things in the right basis. Normally the right basis is the eigenvector basis. (ii) not all matrices are diagonalizable. But in those cases, Jordan Canonical Form comes in our help and we can do almost the same as we did with eigenvalue decomposition.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/215/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Hats, codes and puzzles</title>
		<link>http://www.bigredbits.com/archives/161</link>
		<comments>http://www.bigredbits.com/archives/161#comments</comments>
		<pubDate>Sat, 03 Oct 2009 22:51:16 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[puzzles]]></category>
		<category><![CDATA[coding theory]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[mechanism design]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=161</guid>
		<description><![CDATA[When I was a child someone told me the following problem: A king promised to marry his daughter to the most intelligent man. Three princes came to claim her hand and he tryed the following logic experiment with them: The princes are gathered into a room and seated in a line, one behind the other, [...]]]></description>
			<content:encoded><![CDATA[<p>When I was a child someone told me the following problem:</p>
<blockquote><p>A king promised to marry his daughter to the most intelligent man. Three princes came to claim her hand and he tryed the following logic experiment with them: The princes are gathered into a room and seated in a line, one behind the other, and are shown 2 black hats and 3 white hats. They are blindfolded, and 1 hat is placed on each of their heads, with the remaining hats hidden in a different room. The first one to deduce his hat color will marry the princess. If some prince claims his hat color incorrectly he dies.</p>
<p>The prince who is seated behind removes his blindfold, sees the two hats in front of him and says nothing. Then the prince in the middle removes his blindfold after that and he can see the hat of the prince in front of him. He also says nothing. Noticing the other princes said nothing, the prince seated in the first whole, without even removing his blindfold, gives the correct answer? The question is: what is the color he said?</p></blockquote>
<p>This is a simple logical puzzle: we just write all the possibilities and start ruling them out given that the other princes didn&#8217;t answer and in the end we can find the color of his hat. I remember that this puzzle surprised me a lot as a kid. A found it extremly cool by then, what made me want to read books about logic problems. After that, I had a lot of fun reading the books by <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Raymond_Smullyan">Raymond Smullyan</a>. I usually would read the problems, think something like: there can&#8217;t ba a solution to that. Then go to school with the problem in mind and spend the day thinking about that. Here is a problem I liked a lot:</p>
<blockquote><p>There is one prisoner and there are two doors: each has one guardian. One door leads to an exit and one door leads to death. The prisioner can choose one door to open. One guardian speaks only the truth and one guardian always lies. But you don&#8217;t know which door is which, which guardian is which and who guards each door. You are allowed to choose one guardian and make him one Yes/No question, and then you need to choose a door. What is the right question to ask?</p></blockquote>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-163" title="hats1" src="http://www.bigredbits.com/wp-content/uploads/2009/10/hats1.png" alt="hats1" width="420" height="70" /></p>
<p>But my goal is not to talk about logic puzzles, but about Hat problems. There are a lot of variations of the problems above: in all of them a person is allowed to see the other hats but not his own hat and we need to &#8220;guess&#8221; which is the color of our hat. If we think carefully, we will see that this is a very general kind of problem in computer science: (i) the whole goal of learning theory is to predict one thing from a lot of other things you observe; (ii) in error correcting code, we should guess one bit from all the others, or from some subset of the others; (iii) in universal truthful mechanisms, we need to make a price offer to one player that just depends on all other players bids. I&#8217;ll come back to this example in a later post, since it is what made me interested in those kinds of problems, but for now, let&#8217;s look at one puzzle I was told about by <a class="snap_noshots" href="http://pages.cs.wisc.edu/~dmalec/">David Malec</a> at EC&#8217;09:</p>
<blockquote><p>There are black and white hats and <img src='http://s.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> people: for each of them we choose one color independently at random with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' />. Now, they can look at each others hats but not at their own hat. Then they need to write in a piece of paper either &#8220;PASS&#8221; or one color. If all pass or if someone has a wrong color, the whole team loses (this is a team game) and if at lest one person gets the color right and no one gets wrong, the whole team wins. Create a strategy for the team to win with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B3%7D%7B4%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{3}{4}}' title='{\frac{3}{4}}' class='latex' /> probability.</p></blockquote>
<p>To win with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' /> probability is easy: one person will always write &#8220;BLACK&#8221; and the others &#8220;PASS&#8221;. A better strategy is the following: if one person sees two hats of equal color, he writes the opposite color, otherwise, he passes. It is easy to see the team wins except in the case where all hats are the same color, what happens with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B4%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{4}}' title='{\frac{1}{4}}' class='latex' /> probability. We would like to extend this to a more general setting:</p>
<blockquote><p>There are black and white hats and <img src='http://s.wordpress.com/latex.php?latex=%7B2%5Ek%20-%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^k - 1}' title='{2^k - 1}' class='latex' /> people: for each of them we choose one color independently at random with probability <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2}}' title='{\frac{1}{2}}' class='latex' />. Now, they can look at each others hats but not at their own hat. Then they need to write in a piece of paper either &#8220;PASS&#8221; or one color. If all pass or if someone has a wrong color, the whole team loses (this is a team game) and if at lest one person gets the color right and no one gets wrong, the whole team wins. Create a strategy for the team to win with <img src='http://s.wordpress.com/latex.php?latex=%7B1-%5Cfrac%7B1%7D%7B2%5Ek%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1-\frac{1}{2^k}}' title='{1-\frac{1}{2^k}}' class='latex' /> probability.</p></blockquote>
<p>It is a tricky question on how to extend the above solution in that case. A detailed solution can be found <a class="snap_noshots" href="www.math.ksu.edu/~dbski/hats.pdf">here</a>. The idea is quite ingenious, so I&#8217;ll sketch here. It envolves Error Correcting Code, in that case, the Hamming Code. Let <img src='http://s.wordpress.com/latex.php?latex=%7BF%20%3D%20%5C%7B0%2C1%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{F = \{0,1\}}' title='{F = \{0,1\}}' class='latex' /> with sum and product modulo <img src='http://s.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bw_1%2C%20%5Chdots%2C%202%5E%7B2%5Ek-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w_1, \hdots, 2^{2^k-1}}' title='{w_1, \hdots, 2^{2^k-1}}' class='latex' /> be the non-zero vector of <img src='http://s.wordpress.com/latex.php?latex=%7BF%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{F^k}' title='{F^k}' class='latex' /> and the following linear map:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%5Cphi%3A%20F%5E%7B2%5Ek-1%7D%20%5Crightarrow%20F%5Ek%20%5C%5C%20%28a_1%2C%5Chdots%2C%20a_%7B2%5Ek-1%7D%29%20%5Cmapsto%20%5Csum_i%20a_i%20w_i%20%5Cend%7Baligned%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} \phi: F^{2^k-1} \rightarrow F^k \\ (a_1,\hdots, a_{2^k-1}) \mapsto \sum_i a_i w_i \end{aligned} ' title='\displaystyle \begin{aligned} \phi: F^{2^k-1} \rightarrow F^k \\ (a_1,\hdots, a_{2^k-1}) \mapsto \sum_i a_i w_i \end{aligned} ' class='latex' /></p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' /> be the kernel of that application. Then, it is not hard to see that <img src='http://s.wordpress.com/latex.php?latex=%7BH%2C%20H%2Be_1%2C%20%5Chdots%2C%20H%2Be_%7B2%5Ek-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H, H+e_1, \hdots, H+e_{2^k-1}}' title='{H, H+e_1, \hdots, H+e_{2^k-1}}' class='latex' /> is a partition of <img src='http://s.wordpress.com/latex.php?latex=%7BF%5E%7B2%5Ek-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{F^{2^k-1}}' title='{F^{2^k-1}}' class='latex' /> and also that because of that fact, for each <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20F%5E%7B2%5Ek-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in F^{2^k-1}}' title='{x \in F^{2^k-1}}' class='latex' /> either <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20H%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in H}' title='{x \in H}' class='latex' /> or exists a unique <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> s.t. <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%2B%20e_i%20%5Cin%20H%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x + e_i \in H}' title='{x + e_i \in H}' class='latex' />. This gives an algorithm for just one player to guess his correct color. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20F%5E%7B2%5Ek-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in F^{2^k-1}}' title='{x \in F^{2^k-1}}' class='latex' /> be the color vector of the hats. Player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> sees this vector as:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%28x_1%2C%20%5Chdots%2C%20x_%7Bi-1%7D%2C%20%3F%2C%20x_%7Bi%2B1%7D%2C%20%5Chdots%2C%20x_n%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle (x_1, \hdots, x_{i-1}, ?, x_{i+1}, \hdots, x_n)' title='\displaystyle (x_1, \hdots, x_{i-1}, ?, x_{i+1}, \hdots, x_n)' class='latex' /></p>
<p>which can be <img src='http://s.wordpress.com/latex.php?latex=%7B%28x_1%2C%20%5Chdots%2C%20x_%7Bi-1%7D%2C%200%2C%20x_%7Bi%2B1%7D%2C%20%5Chdots%2C%20x_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x_1, \hdots, x_{i-1}, 0, x_{i+1}, \hdots, x_n)}' title='{(x_1, \hdots, x_{i-1}, 0, x_{i+1}, \hdots, x_n)}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7B%28x_1%2C%20%5Chdots%2C%20x_%7Bi-1%7D%2C%201%2C%20x_%7Bi%2B1%7D%2C%20%5Chdots%2C%20x_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x_1, \hdots, x_{i-1}, 1, x_{i+1}, \hdots, x_n)}' title='{(x_1, \hdots, x_{i-1}, 1, x_{i+1}, \hdots, x_n)}' class='latex' />. The strategy is: if either one of those vector is in <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' />, write the color corresponding to the other vector. If both are out of <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' />, pass. The team wins iff <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cnotin%20H%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \notin H}' title='{x \notin H}' class='latex' />, what happens with <img src='http://s.wordpress.com/latex.php?latex=%7B1%20-%20%5Cfrac%7B1%7D%7B2%5Ek%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1 - \frac{1}{2^k}}' title='{1 - \frac{1}{2^k}}' class='latex' /> probability. Is is an easy and fun exercise to prove those facts. Or you can refer to the solution I just wrote.</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-164" title="hats2" src="http://www.bigredbits.com/wp-content/uploads/2009/10/hats2.png" alt="hats2" width="502" height="95" /></p>
<p>Now, we can complicate it a bit more: we can add other colors and other distributions. But I wanted to move to a different problem: the paper <a href="http://www.ece.northwestern.edu/~hartline/papers/auctions-STOC-05.ps">Derandomization of Auctions</a> showed me an impressive thing: we can use coding theory to derandomize algorithms. To illustrate their ideas, they propose the following problem:</p>
<blockquote><p><strong>Color guessing problem:</strong> There are <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> people wearing hats of <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> different colors. If each person can see everyone else&#8217;s hats but not his or her own. Each person needs to guess the color of his own hat. We want a <strong>deterministic</strong> guessing algorithm that <img src='http://s.wordpress.com/latex.php?latex=%7B1%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1/k}' title='{1/k}' class='latex' /> fraction of each color class is guessed correctly.</p></blockquote>
<p>The problem is very easy if we have a source of random bits. Each person guesses some color at random. It seems very complicated to do that without random bits. Surprisingly, we will solve that using a flow computation:</p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=%7Bc%20%3D%20%28c_1%2C%20%5Chdots%2C%20c_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c = (c_1, \hdots, c_n)}' title='{c = (c_1, \hdots, c_n)}' class='latex' /> be an array of colors <img src='http://s.wordpress.com/latex.php?latex=%7Bc_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_{-i}}' title='{c_{-i}}' class='latex' /> the array with color <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> removed. Consider the following flow network: nodes <img src='http://s.wordpress.com/latex.php?latex=%7Bs%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s}' title='{s}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> (source and sink), nodes <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> for each <img src='http://s.wordpress.com/latex.php?latex=%7Bc_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_{-i}}' title='{c_{-i}}' class='latex' />. There are <img src='http://s.wordpress.com/latex.php?latex=%7Bn%20%5Ccdot%20k%5E%7Bn-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n \cdot k^{n-1}}' title='{n \cdot k^{n-1}}' class='latex' /> such nodes. Consider also nodes in the form <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c})}' title='{u_{\gamma, c})}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\gamma}' title='{\gamma}' class='latex' /> is a color (<img src='http://s.wordpress.com/latex.php?latex=%7B1%2C%20%5Chdots%2C%20k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1, \hdots, k}' title='{1, \hdots, k}' class='latex' />) and <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> is a color vector. There are <img src='http://s.wordpress.com/latex.php?latex=%7Bk%5E%7Bn%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k^{n+1}}' title='{k^{n+1}}' class='latex' /> such nodes.</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-165" title="hats_net" src="http://www.bigredbits.com/wp-content/uploads/2009/10/hats_net.png" alt="hats_net" width="408" height="325" /></p>
<p>We have edges from <img src='http://s.wordpress.com/latex.php?latex=%7Bs%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s}' title='{s}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> for all nodes of that kind. And we have edges from <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c})}' title='{u_{\gamma, c})}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />. Now, if <img src='http://s.wordpress.com/latex.php?latex=%7Bc%20%3D%20%28%5Cgamma%2C%20c_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c = (\gamma, c_{-i})}' title='{c = (\gamma, c_{-i})}' class='latex' />, i.e., if <img src='http://s.wordpress.com/latex.php?latex=%7Bc_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_{-i}}' title='{c_{-i}}' class='latex' /> completed in the <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />-th coordinate with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\gamma}' title='{\gamma}' class='latex' /> generates <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' />, then add an edge from <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c})}' title='{u_{\gamma, c})}' class='latex' />.</p>
<p>Consider the following flow: add <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> unit of flow from <img src='http://s.wordpress.com/latex.php?latex=%7Bs%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s}' title='{s}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> and from <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> split that flow in pieces of size <img src='http://s.wordpress.com/latex.php?latex=%7B1%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1/k}' title='{1/k}' class='latex' /> and send each to <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c}}' title='{u_{\gamma, c}}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=%7Bc%20%3D%20%28%5Cgamma%2C%20c_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c = (\gamma, c_{-i})}' title='{c = (\gamma, c_{-i})}' class='latex' />. Now, each node <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c_{-i}}}' title='{u_{\gamma, c_{-i}}}' class='latex' /> receives <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bn_%5Cgamma%28c%29%7D%7B%5Cgamma%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{n_\gamma(c)}{\gamma}}' title='{\frac{n_\gamma(c)}{\gamma}}' class='latex' /> flow, where <img src='http://s.wordpress.com/latex.php?latex=%7Bn_%7B%5Cgamma%7D%28c%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n_{\gamma}(c)}' title='{n_{\gamma}(c)}' class='latex' /> is the number of occurencies of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\gamma}' title='{\gamma}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' />. Send all that flow to <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />.</p>
<p>We can think of that flow as the guessing procedure. When we see <img src='http://s.wordpress.com/latex.php?latex=%7Bc_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_{-i}}' title='{c_{-i}}' class='latex' /> we choose the guess independently at random and this way, each <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> receives in expectation <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bn_%5Cgamma%28c%29%7D%7B%5Cgamma%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{n_\gamma(c)}{\gamma}}' title='{\frac{n_\gamma(c)}{\gamma}}' class='latex' /> guesses <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cgamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\gamma}' title='{\gamma}' class='latex' />. Notice that an integral flow in that graph represents a deterministic guessing procedure: so all we need is an integral flow so that the flow from <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c}}' title='{u_{\gamma, c}}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clfloor%20n_%5Cgamma%20%28c%29%20%2F%20k%20%5Crfloor%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lfloor n_\gamma (c) / k \rfloor }' title='{\lfloor n_\gamma (c) / k \rfloor }' class='latex' />. The flow received is from nodes of the type: <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> and that means that bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> in <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' />, looking at the other hats will correctly choose <img src='http://s.wordpress.com/latex.php?latex=%7Bc_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_i}' title='{c_i}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clfloor%20n_%5Cgamma%20%28c%29%20%2F%20k%20%5Crfloor%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lfloor n_\gamma (c) / k \rfloor }' title='{\lfloor n_\gamma (c) / k \rfloor }' class='latex' /> times.</p>
<p>Now, define the capacities this way: for all edges from <img src='http://s.wordpress.com/latex.php?latex=%7Bs%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s}' title='{s}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> and from <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7Bc_%7B-i%7D%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{c_{-i}}}' title='{v_{c_{-i}}}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c}}' title='{u_{\gamma, c}}' class='latex' /> have capacity <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> and from <img src='http://s.wordpress.com/latex.php?latex=%7Bu_%7B%5Cgamma%2C%20c%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_{\gamma, c}}' title='{u_{\gamma, c}}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> capacity <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clfloor%20n_%5Cgamma%20%28c%29%20%2F%20k%20%5Crfloor%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lfloor n_\gamma (c) / k \rfloor }' title='{\lfloor n_\gamma (c) / k \rfloor }' class='latex' />. There is an integral flow that saturates all edges from <img src='http://s.wordpress.com/latex.php?latex=%7Bu%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u}' title='{u}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />, because of the fractional flow showed. So, the solution gives us a deterministic decision procedure.</p>
<p>In the next blog post, I&#8217;ll try to show the result in the <a href="http://www.ece.northwestern.edu/~hartline/papers/auctions-STOC-05.ps">Derandomization of Auctions</a> that relates that to competitive auctions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/161/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Using expanders to prove sum-product inequalities in finite fields</title>
		<link>http://www.bigredbits.com/archives/154</link>
		<comments>http://www.bigredbits.com/archives/154#comments</comments>
		<pubDate>Thu, 24 Sep 2009 03:47:28 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[combinatorics]]></category>
		<category><![CDATA[expanders]]></category>
		<category><![CDATA[finite fields]]></category>
		<category><![CDATA[sum-product]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=154</guid>
		<description><![CDATA[I am happy to have the first guest post of BigRedBits written by Igor Gorodezky about an elegant and exciting result in combinatorics. &#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;- I&#8217;m fortunate enough to be spending this semester in beautiful Los Angeles as a core participant in the 2009 long program on combinatorics at IPAM (an NSF-funded math institute on UCLA&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>I am happy to have the first guest post of BigRedBits written by <a href="http://jay.cam.cornell.edu/~igor/">Igor Gorodezky</a> about an elegant and exciting result in combinatorics.</p>
<p style="text-align: center;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-</p>
<p>I&#8217;m fortunate enough to be spending this semester in beautiful Los Angeles as a core participant in the 2009 long program on combinatorics at IPAM (an NSF-funded math institute on UCLA&#8217;s campus). We&#8217;ve recently wrapped up the first part of the program, which consisted of tutorial lectures on various topics in combinatorics. There was an abundance of gorgeous mathematics, and with Renato&#8217;s kind permission I&#8217;d like to hijack his blog and write about what to me was one of the most memorable lectures.</p>
<p>This was a lecture by Jozsef Solymosi of the University of British Columbia describing some of his recent work on the sum-product problem in finite fields. In particular, he outlined a spectral-graph-theoretic proof of a recent sum-product inequality due to Garaev. Solymosi&#8217;s proof is an extremely clever and elegant application of spectral graph theory to a classical problem in combinatorial number theory, and so I thought I&#8217;d present it here. Before stating the result and giving Solymosi&#8217;s proof, let us begin with a very brief introduction to the sum-product problem.</p>
<p><strong>1. Introduction </strong></p>
<p>Given a finite set of real numbers <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />, define the sum set</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20A%2BA%20%3D%20%5C%7Ba%2Bb%20%5Cmid%20a%2Cb%20%5Cin%20A%5C%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  A+A = \{a+b \mid a,b \in A\} ' title='\displaystyle  A+A = \{a+b \mid a,b \in A\} ' class='latex' /></p>
<p>and the product set</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20A%20%5Ccdot%20A%20%3D%20%5C%7Bab%20%5Cmid%20a%2Cb%20%5Cin%20A%5C%7D.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  A \cdot A = \{ab \mid a,b \in A\}. ' title='\displaystyle  A \cdot A = \{ab \mid a,b \in A\}. ' class='latex' /></p>
<p>Both the sum set and the product set clearly must have cardinality between <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A|}' title='{|A|}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%7C%5E2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A|^2}' title='{|A|^2}' class='latex' />. Observe that if <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is an arithmetic progression then <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%20%5Capprox%202%7CA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A| \approx 2|A|}' title='{|A+A| \approx 2|A|}' class='latex' /> while <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%20%5Capprox%20%7CA%7C%5E2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A| \approx |A|^2}' title='{|A \cdot A| \approx |A|^2}' class='latex' />, while if <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is a geometric progression then <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%20%5Capprox%202%7CA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A| \approx 2|A|}' title='{|A \cdot A| \approx 2|A|}' class='latex' /> while <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%2B%20A%7C%20%5Capprox%20%7CA%7C%5E2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A + A| \approx |A|^2}' title='{|A + A| \approx |A|^2}' class='latex' />. Intuition suggests that keeping <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A|}' title='{|A+A|}' class='latex' /> small by giving <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> lots of additive structure inevitably blows up <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A|}' title='{|A \cdot A|}' class='latex' />, while keep <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A|}' title='{|A \cdot A|}' class='latex' /> small by giving <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> lots of multiplicative structure in turn blows up <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A|}' title='{|A+A|}' class='latex' />. For an arbitrary <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />, one would expect at least one of these sets, if not both, to be fairly large.</p>
<p>Estimating the maximum of <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A|}' title='{|A+A|}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A|}' title='{|A \cdot A|}' class='latex' /> is the <em>sum-product problem</em>. It was posed in a paper by Erdos and Szemeredi, who proved the existence of a small constant <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> such that</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%3D%20%5COmega%5Cleft%28%20%7CA%7C%5E%7B1%2Bc%7D%20%5Cright%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{1+c} \right) ' title='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{1+c} \right) ' class='latex' /></p>
<p>for any finite <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. They conjecture that we actually have</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%3D%20%5COmega%5Cleft%28%20%7CA%7C%5E%7B2-%5Cdelta%7D%20%5Cright%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{2-\delta} \right) ' title='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{2-\delta} \right) ' class='latex' /></p>
<p>for any <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cdelta%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\delta &gt; 0}' title='{\delta &gt; 0}' class='latex' /> and sufficiently large <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A|}' title='{|A|}' class='latex' />. In other words, the value of <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> in their bound can be made arbitrarily close to 1.</p>
<p>Much ink has been spilled in attempts to push up the value of <img src='http://s.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' />. At present, the best sum-product bound is due to Solymosi and gives <img src='http://s.wordpress.com/latex.php?latex=%7Bc%20%5Capprox%203%2F11%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c \approx 3/11}' title='{c \approx 3/11}' class='latex' />. As an aside, I want to mention an extremely simple and elegant probabilistic proof of Elekes that gives <img src='http://s.wordpress.com/latex.php?latex=%7Bc%3D1%2F4%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c=1/4}' title='{c=1/4}' class='latex' />; it is detailed in Alon and Spencer&#8217;s classic text <em>The Probabilistic Method</em>, and is an absolute gem (look in the chapter on crossing numbers of graphs).</p>
<p><strong>2. The finite field variant </strong></p>
<p>Solymosi&#8217;s IPAM lecture was not, however, on this original sum-product problem, but rather on its natural finite field analogue: if <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is a subset of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathbb%20F_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathbb F_p}' title='{\mathbb F_p}' class='latex' />, the finite field of prime order <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' />, what can we say about the maximum of <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A|}' title='{|A+A|}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A|}' title='{|A \cdot A|}' class='latex' />? Observe that it is important to consider fields whose order is prime and not the power of a prime, for in the latter case we could take <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> to be a subring and end up with the degenerate case <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%7C%3D%7CA%2BA%7C%3D%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A|=|A+A|=|A \cdot A|}' title='{|A|=|A+A|=|A \cdot A|}' class='latex' />.</p>
<p>Bourgain, Katz and Tao got the party started in 2004 by proving the following sum-product bound.</p>
<blockquote><p><strong>Theorem 1</strong> <em> For all <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cepsilon%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\epsilon &gt; 0}' title='{\epsilon &gt; 0}' class='latex' /> there exists <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cdelta%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\delta &gt; 0}' title='{\delta &gt; 0}' class='latex' /> such that if <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Csubseteq%20%5Cmathbb%20F_p%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \subseteq \mathbb F_p }' title='{A \subseteq \mathbb F_p }' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bp%5E%7B%5Cepsilon%7D%20%3C%20%7CA%7C%20%3C%20p%5E%7B1%2B%5Cepsilon%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p^{\epsilon} &lt; |A| &lt; p^{1+\epsilon}}' title='{p^{\epsilon} &lt; |A| &lt; p^{1+\epsilon}}' class='latex' /> then<br />
</em></p>
<p align="center"><em><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%3D%20%5COmega%5Cleft%28%20%7CA%7C%5E%7B1%2B%5Cdelta%7D%20%5Cright%29.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{1+\delta} \right). ' title='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( |A|^{1+\delta} \right). ' class='latex' /></em></p>
<p><em> </em></p></blockquote>
<p>We note that the implied constant also depends on <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cepsilon%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\epsilon}' title='{\epsilon}' class='latex' />. The best known bound is the following, due to Garaev (2007).</p>
<blockquote><p><strong>Theorem 2</strong> <em> If <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Csubseteq%20%5Cmathbb%20F_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \subseteq \mathbb F_p}' title='{A \subseteq \mathbb F_p}' class='latex' /> then<br />
</em></p>
<p align="center"><em><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%3D%20%5COmega%5Cleft%28%20%5Cmin%5Cleft%5C%7B%20%7CA%7C%5E%7B1%2F2%7Dp%5E%7B1%2F2%7D%2C%7CA%7C%5E2p%5E%7B-1%2F2%7D%20%5Cright%5C%7D%20%5Cright%29.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( \min\left\{ |A|^{1/2}p^{1/2},|A|^2p^{-1/2} \right\} \right). ' title='\displaystyle  \max\{ |A+A|, |A \cdot A| \} = \Omega\left( \min\left\{ |A|^{1/2}p^{1/2},|A|^2p^{-1/2} \right\} \right). ' class='latex' /></em></p>
<p><em> </em></p></blockquote>
<p>To illustrate the theorem, observe that if <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%7C%20%5Capprox%20p%5E%7B2%2F3%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A| \approx p^{2/3}}' title='{|A| \approx p^{2/3}}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%3D%20%5COmega%28%20%7CA%7C%5E%7B5%2F4%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\max\{ |A+A|, |A \cdot A| \} = \Omega( |A|^{5/4})}' title='{\max\{ |A+A|, |A \cdot A| \} = \Omega( |A|^{5/4})}' class='latex' />. It is this theorem that Solymosi proves using spectral graph theory (Garaev&#8217;s original proof went by way of Fourier analysis and the estimation of exponential sums).</p>
<p><strong>3. Solymosi&#8217;s proof </strong></p>
<p>In this section we give Solymosi&#8217;s proof of Theorem 2 (we will assume familiarity with basic facts about eigenvalues and eigenvectors of adjacency matrices). Let us first establish some notation. Consider a <img src='http://s.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' />-regular graph <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> and let the eigenvalues of its adjacency matrix be</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20d%3D%5Clambda_1%20%5Cgeq%20%5Clambda_2%20%5Cgeq%20%5Cdots%20%5Cgeq%20%5Clambda_n.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  d=\lambda_1 \geq \lambda_2 \geq \dots \geq \lambda_n. ' title='\displaystyle  d=\lambda_1 \geq \lambda_2 \geq \dots \geq \lambda_n. ' class='latex' /></p>
<p>As usual, we define</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Clambda%28G%29%20%3D%20%5Cmax%5C%7B%20%5Clambda_2%2C%7C%5Clambda_n%7C%5C%7D.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \lambda(G) = \max\{ \lambda_2,|\lambda_n|\}. ' title='\displaystyle  \lambda(G) = \max\{ \lambda_2,|\lambda_n|\}. ' class='latex' /></p>
<p>Recall that if <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> is connected and non-bipartite then <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda%28G%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda(G)}' title='{\lambda(G)}' class='latex' /> is strictly smaller than <img src='http://s.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' />, and such a graph is an expander if <img src='http://s.wordpress.com/latex.php?latex=%7Bd-%5Clambda%28G%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d-\lambda(G)}' title='{d-\lambda(G)}' class='latex' /> is bounded below by some constant.</p>
<p>We will make use of a fundamental result in spectral graph theory: the expander mixing lemma.</p>
<blockquote><p><strong>Lemma 3</strong> <em> Let <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> be a <img src='http://s.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' />-regular graph with <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> vertices. For all <img src='http://s.wordpress.com/latex.php?latex=%7BS%2CT%20%5Csubseteq%20V%28G%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S,T \subseteq V(G)}' title='{S,T \subseteq V(G)}' class='latex' /> we have<br />
</em></p>
<p align="center"><em><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cleft%7C%20e%28S%2CT%29%20-%20%5Cfrac%7Bd%7D%7Bn%7D%7CS%7C%7CT%7C%20%5Cright%7C%20%5Cleq%20%5Clambda%28G%29%20%5Csqrt%7B%7CS%7C%7CT%7C%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \left| e(S,T) - \frac{d}{n}|S||T| \right| \leq \lambda(G) \sqrt{|S||T|} ' title='\displaystyle  \left| e(S,T) - \frac{d}{n}|S||T| \right| \leq \lambda(G) \sqrt{|S||T|} ' class='latex' /></em></p>
<p><em> where <img src='http://s.wordpress.com/latex.php?latex=%7Be%28S%2CT%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{e(S,T)}' title='{e(S,T)}' class='latex' /> is the number of edges with one endpoint in <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> and the other in <img src='http://s.wordpress.com/latex.php?latex=%7BT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{T}' title='{T}' class='latex' />. </em></p></blockquote>
<p>The proof of the lemma is straightforward; we omit it because of space considerations, but it can be found in the survey on expanders by Linial, Hoory, and Wigderson.</p>
<p>Now, back to Theorem 2: we have a finite field <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathbb%20F_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathbb F_p}' title='{\mathbb F_p}' class='latex' /> and a subset <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. Solymosi proves the desired lower bound on <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\max\{ |A+A|, |A \cdot A| \}}' title='{\max\{ |A+A|, |A \cdot A| \}}' class='latex' /> by constructing a <em>sum-product graph</em> <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> over <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathbb%20F_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathbb F_p}' title='{\mathbb F_p}' class='latex' /> and using its spectral properties to reason about <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%2BA%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A+A|}' title='{|A+A|}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%7CA%20%5Ccdot%20A%7C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|A \cdot A|}' title='{|A \cdot A|}' class='latex' />. So without further ado, let&#8217;s define <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' />.</p>
<p>The vertex set of <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleft%28%20%5Cmathbb%20F_p%20%5Csetminus%20%5C%7B0%5C%7D%20%5Cright%29%20%5Ctimes%20%5Cmathbb%20F_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\left( \mathbb F_p \setminus \{0\} \right) \times \mathbb F_p}' title='{\left( \mathbb F_p \setminus \{0\} \right) \times \mathbb F_p}' class='latex' />, and two vertices <img src='http://s.wordpress.com/latex.php?latex=%7B%28a%2Cb%29%2C%28c%2Cd%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(a,b),(c,d)}' title='{(a,b),(c,d)}' class='latex' /> have an edge between them if <img src='http://s.wordpress.com/latex.php?latex=%7Bac%3Db%2Bd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{ac=b+d}' title='{ac=b+d}' class='latex' />. It is easy to see that <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> has <img src='http://s.wordpress.com/latex.php?latex=%7Bp%28p-1%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p(p-1)}' title='{p(p-1)}' class='latex' /> vertices and is <img src='http://s.wordpress.com/latex.php?latex=%7B%28p-1%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(p-1)}' title='{(p-1)}' class='latex' />-regular (some edges are loops). We also have the following key fact.</p>
<blockquote><p><strong>Lemma 4</strong> <em><a name="neighbors"></a> Consider <img src='http://s.wordpress.com/latex.php?latex=%7B%28a%2Cb%29%2C%28c%2Cd%29%20%5Cin%20V%28G_%7BSP%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(a,b),(c,d) \in V(G_{SP})}' title='{(a,b),(c,d) \in V(G_{SP})}' class='latex' />. If <img src='http://s.wordpress.com/latex.php?latex=%7Ba%3Dc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a=c}' title='{a=c}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7Bb%3Dd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b=d}' title='{b=d}' class='latex' /> then these two vertices have no common neighbor, otherwise they have precisely one. </em></p></blockquote>
<p><em>Proof:</em> This follows from the fact that the unique solution of the system</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Barray%7D%7Bc%7D%20ax%3Db%2By%20%5C%5C%20cx%3Dd%2By%20%5Cend%7Barray%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{array}{c} ax=b+y \\ cx=d+y \end{array} ' title='\displaystyle  \begin{array}{c} ax=b+y \\ cx=d+y \end{array} ' class='latex' /></p>
<p>is given by</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Barray%7D%7Bc%7D%20x%20%3D%20%28b-d%29%28a-c%29%5E%7B-1%7D%5C%5C%202y%20%3D%20x%28a%2Bc%29-b-d%20%5Cend%7Barray%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{array}{c} x = (b-d)(a-c)^{-1}\\ 2y = x(a+c)-b-d \end{array} ' title='\displaystyle  \begin{array}{c} x = (b-d)(a-c)^{-1}\\ 2y = x(a+c)-b-d \end{array} ' class='latex' /></p>
<p>which is only defined when <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cneq%20c%2C%20b%20%5Cneq%20d%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \neq c, b \neq d}' title='{a \neq c, b \neq d}' class='latex' />. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>Lemma <a href="#neighbors">4</a> can be used to show that <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is in fact a very good expander (those more familiar with the literature on expanders will recognize that moreover, <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is almost a Ramanujan graph).</p>
<blockquote><p><strong>Lemma 5</strong> <em><a name="eig"></a> <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda%28G_%7BSP%7D%29%20%3C%20%5Csqrt%7B3p%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda(G_{SP}) &lt; \sqrt{3p}}' title='{\lambda(G_{SP}) &lt; \sqrt{3p}}' class='latex' />. </em></p></blockquote>
<p><em>Proof:</em> Let <img src='http://s.wordpress.com/latex.php?latex=%7BM%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M}' title='{M}' class='latex' /> be the adjacency matrix of <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' />. Recall that the <img src='http://s.wordpress.com/latex.php?latex=%7B%28u%2Cv%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(u,v)}' title='{(u,v)}' class='latex' />-entry of <img src='http://s.wordpress.com/latex.php?latex=%7BM%5E2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M^2}' title='{M^2}' class='latex' /> is the number of walks from <img src='http://s.wordpress.com/latex.php?latex=%7Bu%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u}' title='{u}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> of length 2. If <img src='http://s.wordpress.com/latex.php?latex=%7Bu%3Dv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u=v}' title='{u=v}' class='latex' />, this number is the degree <img src='http://s.wordpress.com/latex.php?latex=%7Bp-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p-1}' title='{p-1}' class='latex' />, while if <img src='http://s.wordpress.com/latex.php?latex=%7Bu%20%5Cneq%20v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u \neq v}' title='{u \neq v}' class='latex' />, with <img src='http://s.wordpress.com/latex.php?latex=%7Bu%3D%28a%2Cb%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u=(a,b)}' title='{u=(a,b)}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bv%3D%28c%2Cd%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v=(c,d)}' title='{v=(c,d)}' class='latex' />, Lemma <a href="#neighbors">4</a> tells us that this number is 1 if <img src='http://s.wordpress.com/latex.php?latex=%7Ba%5Cneq%20c%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a\neq c}' title='{a\neq c}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bb%20%5Cneq%20d%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b \neq d}' title='{b \neq d}' class='latex' /> and 0 otherwise. It follows that <a name="eqq"><br />
</a></p>
<p align="center"><a name="eqq"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20M%5E2%20%3D%20J%2B%28p-2%29I-E%20%5C%20%5C%20%5C%20%5C%20%5C%20%281%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  M^2 = J+(p-2)I-E \ \ \ \ \ (1)' title='\displaystyle  M^2 = J+(p-2)I-E \ \ \ \ \ (1)' class='latex' /></a></p>
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7BJ%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{J}' title='{J}' class='latex' /> is the all-1 matrix, <img src='http://s.wordpress.com/latex.php?latex=%7BI%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{I}' title='{I}' class='latex' /> is the identity matrix, and <img src='http://s.wordpress.com/latex.php?latex=%7BE%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E}' title='{E}' class='latex' /> is the adjacency matrix of the graph <img src='http://s.wordpress.com/latex.php?latex=%7BG_E%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_E}' title='{G_E}' class='latex' /> whose vertex set is the same as <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' />, and in which two vertices <img src='http://s.wordpress.com/latex.php?latex=%7B%28a%2Cb%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(a,b)}' title='{(a,b)}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%28c%2Cd%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(c,d)}' title='{(c,d)}' class='latex' /> are connected by an edge if <img src='http://s.wordpress.com/latex.php?latex=%7Ba%3Dc%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a=c}' title='{a=c}' class='latex' /> or <img src='http://s.wordpress.com/latex.php?latex=%7Bb%3Dd%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b=d}' title='{b=d}' class='latex' />. It is easy to see that <img src='http://s.wordpress.com/latex.php?latex=%7BG_E%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_E}' title='{G_E}' class='latex' /> is a <img src='http://s.wordpress.com/latex.php?latex=%7B%282p-3%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(2p-3)}' title='{(2p-3)}' class='latex' />-regular graph.</p>
<p>Since <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is regular and its adjacency matrix <img src='http://s.wordpress.com/latex.php?latex=%7BM%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M}' title='{M}' class='latex' /> is symmetric, we know that the all-1 vector is an eigenvector of <img src='http://s.wordpress.com/latex.php?latex=%7BM%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M}' title='{M}' class='latex' /> and all other eigenvectors are orthogonal to it. It is easy to check that <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is connected and not bipartite, so that the eigenvalue <img src='http://s.wordpress.com/latex.php?latex=%7Bp-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p-1}' title='{p-1}' class='latex' /> has multiplicity 1, and for any other eigenvalue <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta}' title='{\theta}' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%7B%7C%5Ctheta%7C%20%3C%20p-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|\theta| &lt; p-1}' title='{|\theta| &lt; p-1}' class='latex' />.</p>
<p>Given such an eigenvalue <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta}' title='{\theta}' class='latex' />, let <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cbf%20v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\bf v}' title='{\bf v}' class='latex' /> be a corresponding eigenvector. Then by equation~<a href="#eqq">(1)</a>,</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Ctheta%5E2%20%5Cmathbf%7Bv%7D%20%3D%20%28p-2%29%5Cmathbf%7Bv%7D%20-%20E%20%5Cmathbf%20v%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \theta^2 \mathbf{v} = (p-2)\mathbf{v} - E \mathbf v ' title='\displaystyle  \theta^2 \mathbf{v} = (p-2)\mathbf{v} - E \mathbf v ' class='latex' /></p>
<p>since <img src='http://s.wordpress.com/latex.php?latex=%7BJ%5Cmathbf%20v%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{J\mathbf v}' title='{J\mathbf v}' class='latex' /> is the all-0 vector. Therefore <img src='http://s.wordpress.com/latex.php?latex=%7Bp-2-%5Ctheta%5E2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p-2-\theta^2}' title='{p-2-\theta^2}' class='latex' /> is an eigenvalue of <img src='http://s.wordpress.com/latex.php?latex=%7BE%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E}' title='{E}' class='latex' />.</p>
<p>Now, the degree of <img src='http://s.wordpress.com/latex.php?latex=%7BG_E%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_E}' title='{G_E}' class='latex' /> is an upper bound on the absolute value of every eigenvalue of <img src='http://s.wordpress.com/latex.php?latex=%7BE%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E}' title='{E}' class='latex' />. It follows that</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20p-2-%5Ctheta%5E2%20%5Cgeq%20-2p%2B3%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  p-2-\theta^2 \geq -2p+3 ' title='\displaystyle  p-2-\theta^2 \geq -2p+3 ' class='latex' /></p>
<p>which implies <img src='http://s.wordpress.com/latex.php?latex=%7B%7C%5Ctheta%7C%20%3C%20%5Csqrt%7B3p%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{|\theta| &lt; \sqrt{3p}}' title='{|\theta| &lt; \sqrt{3p}}' class='latex' />, as desired. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>So <img src='http://s.wordpress.com/latex.php?latex=%7BG_%7BSP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G_{SP}}' title='{G_{SP}}' class='latex' /> is an expander; very good, but what about <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />? Solymosi introduces <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> into the proof through very clever use of the expander mixing lemma: if we define <img src='http://s.wordpress.com/latex.php?latex=%7BS%2CT%20%5Csubseteq%20V%28G_%7BSP%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S,T \subseteq V(G_{SP})}' title='{S,T \subseteq V(G_{SP})}' class='latex' /> by</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20S%3D%28A%20%5Ccdot%20A%29%20%5Ctimes%20%28-A%29%2C%20%5C%20%5Cquad%20T%3D%28A%5E%7B-1%7D%29%20%5Ctimes%20%28A%2BA%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  S=(A \cdot A) \times (-A), \ \quad T=(A^{-1}) \times (A+A) ' title='\displaystyle  S=(A \cdot A) \times (-A), \ \quad T=(A^{-1}) \times (A+A) ' class='latex' /></p>
<p>then that lemma tells us that</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20e%28S%2CT%29%20%5Cleq%20%5Cfrac%7B%7CS%7C%7CT%7C%7D%7Bp%7D%20%2B%20%5Clambda%28G_%7BSP%7D%29%20%5Csqrt%7B%7CS%7C%7CT%7C%7D%20%3C%20%5Cfrac%7B%7CA%5Ccdot%20A%7C%7CA%2BA%7C%7CA%7C%5E2%7D%7Bp%7D%20%2B%20%5Csqrt%7B3p%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7CA%7C%5E2%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  e(S,T) \leq \frac{|S||T|}{p} + \lambda(G_{SP}) \sqrt{|S||T|} &lt; \frac{|A\cdot A||A+A||A|^2}{p} + \sqrt{3p|A \cdot A||A+A||A|^2} ' title='\displaystyle  e(S,T) \leq \frac{|S||T|}{p} + \lambda(G_{SP}) \sqrt{|S||T|} &lt; \frac{|A\cdot A||A+A||A|^2}{p} + \sqrt{3p|A \cdot A||A+A||A|^2} ' class='latex' /></p>
<p>where the second inequality used Lemma <a href="#eig">5</a>.</p>
<p>But for every <img src='http://s.wordpress.com/latex.php?latex=%7Ba%2Cb%2Cc%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a,b,c \in A}' title='{a,b,c \in A}' class='latex' /> there is an edge between <img src='http://s.wordpress.com/latex.php?latex=%7B%28ab%2C-c%29%20%5Cin%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(ab,-c) \in S}' title='{(ab,-c) \in S}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%28b%5E%7B-1%7D%2Ca%2Bc%29%20%5Cin%20T%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(b^{-1},a+c) \in T}' title='{(b^{-1},a+c) \in T}' class='latex' />, so that <img src='http://s.wordpress.com/latex.php?latex=%7Be%28S%2CT%29%20%5Cgeq%20%7CA%7C%5E3%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{e(S,T) \geq |A|^3}' title='{e(S,T) \geq |A|^3}' class='latex' />. Using this observation and rearranging the resulting inequality gives</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%20%3E%20%5Cleft%28%20%5Cfrac%7B%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%7D%7Bp%7CA%7C%7D%2B%5Csqrt%7B3%7D%5Cfrac%7Bp%5E%7B1%2F2%7D%7D%7B%7CA%7C%5E2%7D%20%5Cright%29%5E%7B-1%7D.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \sqrt{|A \cdot A||A+A|} &gt; \left( \frac{\sqrt{|A \cdot A||A+A|}}{p|A|}+\sqrt{3}\frac{p^{1/2}}{|A|^2} \right)^{-1}. ' title='\displaystyle  \sqrt{|A \cdot A||A+A|} &gt; \left( \frac{\sqrt{|A \cdot A||A+A|}}{p|A|}+\sqrt{3}\frac{p^{1/2}}{|A|^2} \right)^{-1}. ' class='latex' /></p>
<p>Now, since <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%2By%29%5E%7B-1%7D%20%5Cgeq%20%5Cfrac%7B1%7D%7B2%7D%5Cmin%5C%7Bx%5E%7B-1%7D%2Cy%5E%7B-1%7D%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x+y)^{-1} \geq \frac{1}{2}\min\{x^{-1},y^{-1}\}}' title='{(x+y)^{-1} \geq \frac{1}{2}\min\{x^{-1},y^{-1}\}}' class='latex' /> for positive <img src='http://s.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7By%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y}' title='{y}' class='latex' />, we find that</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%20%3D%20%5COmega%5Cleft%28%20%5Cmin%5Cleft%5C%7B%20%5Cfrac%7Bp%7CA%7C%7D%7B%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%7D%2C%20%7CA%7C%5E2%20p%5E%7B-1%2F2%7D%20%5Cright%5C%7D%20%5Cright%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \sqrt{|A \cdot A||A+A|} = \Omega\left( \min\left\{ \frac{p|A|}{\sqrt{|A \cdot A||A+A|}}, |A|^2 p^{-1/2} \right\} \right) ' title='\displaystyle  \sqrt{|A \cdot A||A+A|} = \Omega\left( \min\left\{ \frac{p|A|}{\sqrt{|A \cdot A||A+A|}}, |A|^2 p^{-1/2} \right\} \right) ' class='latex' /></p>
<p>which in turn implies</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%20%3D%20%5COmega%5Cleft%28%20%5Cmin%5Cleft%5C%7B%20%7CA%7C%5E%7B1%2F2%7Dp%5E%7B1%2F2%7D%2C%20%7CA%7C%5E2%20p%5E%7B-1%2F2%7D%20%5Cright%5C%7D%20%5Cright%29.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \sqrt{|A \cdot A||A+A|} = \Omega\left( \min\left\{ |A|^{1/2}p^{1/2}, |A|^2 p^{-1/2} \right\} \right). ' title='\displaystyle  \sqrt{|A \cdot A||A+A|} = \Omega\left( \min\left\{ |A|^{1/2}p^{1/2}, |A|^2 p^{-1/2} \right\} \right). ' class='latex' /></p>
<p>To finish the proof, we need only cite the two-term AM-GM inequality:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cmax%5C%7B%20%7CA%2BA%7C%2C%20%7CA%20%5Ccdot%20A%7C%20%5C%7D%20%5Cgeq%20%5Cfrac%7B%7CA%20%5Ccdot%20A%7C%2B%7CA%2BA%7C%7D%7B2%7D%20%5Cgeq%20%5Csqrt%7B%7CA%20%5Ccdot%20A%7C%7CA%2BA%7C%7D%20.%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \max\{ |A+A|, |A \cdot A| \} \geq \frac{|A \cdot A|+|A+A|}{2} \geq \sqrt{|A \cdot A||A+A|} . ' title='\displaystyle  \max\{ |A+A|, |A \cdot A| \} \geq \frac{|A \cdot A|+|A+A|}{2} \geq \sqrt{|A \cdot A||A+A|} . ' class='latex' /></p>
<p><strong>4. A very terse bibliography </strong></p>
<p>Solymosi&#8217;s proof is from his paper <a class="snap_noshots" href="http://www.springerlink.com/content/n044057tw410178t/">&#8220;Incidences and the spectra of graphs&#8221;</a> (requires institutional access).</p>
<p>A more knowledgeable treatment of sum-product problems than I could ever provide can be found in <a class="snap_noshots" href="http://terrytao.wordpress.com/tag/sum-product-estimates/">these two entries from Terry Tao&#8217;s blog</a>. In these, Tao provides a detailed introduction to the problem, gives the probabilistic proof of Elekes&#8217; bound, discusses an interesting cryptographic application, and provides many references.</p>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px;"><span class="ik"><img id="upi" class="de" src="http://mail.google.com/mail/images/cleardot.gif" alt="" width="16" height="16" /></span></p>
<h3 class="gD" style="color: #00681c;"><span>Igor Gorodezky</span></h3>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/154/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Competitive Auctions</title>
		<link>http://www.bigredbits.com/archives/151</link>
		<comments>http://www.bigredbits.com/archives/151#comments</comments>
		<pubDate>Thu, 17 Sep 2009 19:18:38 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[mechanism design]]></category>
		<category><![CDATA[profit maximization]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=151</guid>
		<description><![CDATA[This week I will present the Theory Discussion Group about Competitive Auctions. It is mainly a serie of results in papers from Jason Hartline, Andrew Goldberg, Anna Karlin, Amos Fiat, &#8230; The first paper is Competitive Auctions and Digital Goods and the second is Competitive Generalized Auctions. My objective is to begin with a short [...]]]></description>
			<content:encoded><![CDATA[<p>This week I will present the Theory Discussion Group about Competitive Auctions. It is mainly a serie of results in papers from Jason Hartline, Andrew Goldberg, Anna Karlin, Amos Fiat, &#8230; The first paper is <a class="snap_noshots" href="http://www.ece.northwestern.edu/~hartline/papers/auctions-SODA-01.pdf">Competitive Auctions and Digital Goods</a> and the second is <a class="snap_noshots" href="http://www.ece.northwestern.edu/~hartline/papers/auctions-STOC-02.pdf">Competitive Generalized Auctions</a>. My objective is to begin with a short introduction about Mechanism Design, the concept of truthfulness and the characterization of Truthful Mechanisms for Single Parameter Agents. Then we describe the Random Sampling Auction for Digital Goods and in the end we discuss open questions. I thought writting a blog post was a good way of organizing my ideas to the talk.</p>
<p><strong>1. Mechanism Design and Truthfulness </strong></p>
<p>A mechanism is an algorithm augmented with economic incentives. They are usually applied in the following context: there is an algorithmic problem and the input is distributed among several agents that have some interest in the final outcome and therefore they may try manipulate the algorithm. Today we restrict our attention to a specific class of mechanisms called <strong>single parameter agents</strong>. In that setting, there is a set <img src='http://s.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> consisting of <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> agents and a service. Each agent <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20N%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in N}' title='{i \in N}' class='latex' /> has a value <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> for receiving the service and <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> otherwise. We can think of <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> as the maximum player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is willing to pay for that service. We call an environment <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%5Csubseteq%202%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} \subseteq 2^N}' title='{\mathcal{X} \subseteq 2^N}' class='latex' /> the subsets of the bidders that can be simultaneously served. For example:</p>
<ol>
<li> Single item auction: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D%5C%7BS%3B%20%5Cvert%20S%20%5Cvert%20%5Cleq%201%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} =\{S; \vert S \vert \leq 1\}}' title='{\mathcal{X} =\{S; \vert S \vert \leq 1\}}' class='latex' /></li>
<li> Multi item auction: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D%5C%7BS%3B%20%5Cvert%20S%20%5Cvert%20%5Cleq%20k%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} =\{S; \vert S \vert \leq k\}}' title='{\mathcal{X} =\{S; \vert S \vert \leq k\}}' class='latex' /></li>
<li> Digital goods auction: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D2%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} =2^N}' title='{\mathcal{X} =2^N}' class='latex' /></li>
<li> Matroid auctions: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X}}' title='{\mathcal{X}}' class='latex' /> is a matroid on <img src='http://s.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /></li>
<li> Path auctions: <img src='http://s.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> is the set of edges in a graph and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X}}' title='{\mathcal{X}}' class='latex' /> is the set of <img src='http://s.wordpress.com/latex.php?latex=%7Bs-t%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s-t}' title='{s-t}' class='latex' />-paths in the graph</li>
<li> Knapsack auctions: there is a size <img src='http://s.wordpress.com/latex.php?latex=%7Bs_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{s_i}' title='{s_i}' class='latex' /> for each <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20N%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in N}' title='{i \in N}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{X}}' title='{S \in \mathcal{X}}' class='latex' /> iff <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Bi%20%5Cin%20S%7D%20s_i%20%5Cleq%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{i \in S} s_i \leq C}' title='{\sum_{i \in S} s_i \leq C}' class='latex' /> for a fixed <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /></li>
</ol>
<p>Most mechanism design problems focus in maximizing (or approximating) the social welfare, i.e., finding <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{X}}' title='{S \in \mathcal{X}}' class='latex' /> maximizing <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Bi%20%5Cin%20S%7D%20v_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{i \in S} v_i}' title='{\sum_{i \in S} v_i}' class='latex' />. Our focus here will be maximizing the revenue of the auctioneer. Before we start searching for such a mechanism, we should first see which properties it is supposed to have, and maybe even first that that, define what we mean by a mechanism. In the first moment, the agents report their valuations <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> (which can be their true valuations or lies), then the mechanism decides on an allocation <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Csubseteq%20N%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \subseteq N}' title='{S \subseteq N}' class='latex' /> (in a possibly randomized way) and charges a payment <img src='http://s.wordpress.com/latex.php?latex=%7BP_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{P_i}' title='{P_i}' class='latex' /> for each allocated agents. The profit of the auctioneer is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7Bi%20%5Cin%20S%7D%20P_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{i \in S} P_i}' title='{\sum_{i \in S} P_i}' class='latex' /> and the utility of a bidder is:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20u_i%20%3D%20%5Cleft%5C%7B%20%5Cbegin%7Baligned%7D%20v_i%20-%20P_i%20%26%2C%20i%20%5Cin%20S%20%5C%5C%200%20%26%2C%20i%20%5Cnotin%20S%20%5Cend%7Baligned%7D%20%5Cright.&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle u_i = \left\{ \begin{aligned} v_i - P_i &amp;, i \in S \\ 0 &amp;, i \notin S \end{aligned} \right.' title='\displaystyle u_i = \left\{ \begin{aligned} v_i - P_i &amp;, i \in S \\ 0 &amp;, i \notin S \end{aligned} \right.' class='latex' />
<p>The agents will report valuations so to maximize their final utility. We could either consider a general mechanism e calculate the profit/social welfare in the game induced by this mechanism or we could design an algorithm that gives incentives for the bidders to report their true valuation. The revelation principle says there is no loss of generality to consider only mechanisms of the second type. The intuition is: the mechanisms of the first type can be simulated by mechanisms of the second type. So, we restrict our attention to mechanisms of the second type, which we call <strong>truthful mechanisms</strong>. This definnition is clear for deterministic mechanisms but not so clear for randomized mechanisms. There are two such definitions:</p>
<ul>
<li> <strong>Universal Truthful mechanisms</strong>: distribution over deterministic truthful mechanisms, i.e., some coins are tossed and based on those coins, we choose a deterministic mechanism and run it. Even if the players knew the random coins, the mechanism would still be truthful.</li>
<li> <strong>Truthful in Expectation mechanisms:</strong> Let <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28b_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(b_i)}' title='{u_i(b_i)}' class='latex' /> be the utility of agent <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> if he bids <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i}' title='{b_i}' class='latex' />. Since it is a randomized mechanism, then it is random variable. Truthful in expectation means that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Bu_i%28v_i%29%5D%20%5Cgeq%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bu_i%28b_i%29%5D%2C%20%5Cforall%20b_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[u_i(v_i)] \geq \mathop{\mathbb E}[u_i(b_i)], \forall b_i}' title='{\mathop{\mathbb E}[u_i(v_i)] \geq \mathop{\mathbb E}[u_i(b_i)], \forall b_i}' class='latex' />.</li>
</ul>
<p>Clearly all Universal Truthful mechanisms are Truthful in Expectation but the converse is not true. Now, before we proceed, we will redefine a mechanism in a more formal way so that it will be easier to reason about:</p>
<blockquote><p><strong>Definition 1</strong> <em> A mechanism <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BM%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{M}}' title='{\mathcal{M}}' class='latex' /> is a function that associated for each <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20%7B%5Cmathbb%20R%7D%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in {\mathbb R}^N}' title='{v \in {\mathbb R}^N}' class='latex' /> a distribution over elements of <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X}}' title='{\mathcal{X}}' class='latex' />. </em></p></blockquote>
<blockquote><p><strong>Theorem 2</strong> <em> Let <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28v%29%20%3D%20%5Csum_%7Bi%20%5Cin%20S%20%5Cin%20%5Cmathcal%7BX%7D%7D%20Pr_%7B%5Cmathcal%7BM%7D%28v%29%7D%5BS%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(v) = \sum_{i \in S \in \mathcal{X}} Pr_{\mathcal{M}(v)}[S]}' title='{x_i(v) = \sum_{i \in S \in \mathcal{X}} Pr_{\mathcal{M}(v)}[S]}' class='latex' /> be the probability that <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is allocated by the mechanism given <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> is reported. The mechanism is truthful iff <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(v)}' title='{x_i(v)}' class='latex' /> is monotone and each allocated bidder is charged payment: </em></p>
<p><em><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20P_i%20%3D%20v_i%20-%20%5Cfrac%7B1%7D%7Bx_i%28v_i%2C%20v_%7B-i%7D%29%7D%20%5Cint_0%5E%7Bv_i%7D%20x_i%28w%2C%20v_%7B-i%7D%29%20dw&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle P_i = v_i - \frac{1}{x_i(v_i, v_{-i})} \int_0^{v_i} x_i(w, v_{-i}) dw' title='\displaystyle P_i = v_i - \frac{1}{x_i(v_i, v_{-i})} \int_0^{v_i} x_i(w, v_{-i}) dw' class='latex' /></em></p>
<p><em> </em></p></blockquote>
<p>This is a classical theorem by Myerson about the characterization of truthful auctions. It is not hard to see that the auction define above is truthful. We just need to check that <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28v_i%2C%20v_%7B-i%7D%29%20%28v_i%20-%20P%28v_i%2C%20v_%7B-i%7D%29%29%20%5Cgeq%20x_i%28v%27_i%2C%20v_%7B-i%7D%29%20%28v_i%20-%20P%28v%27_i%2C%20v_%7B-i%7D%29%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(v_i, v_{-i}) (v_i - P(v_i, v_{-i})) \geq x_i(v&#039;_i, v_{-i}) (v_i - P(v&#039;_i, v_{-i}))}' title='{x_i(v_i, v_{-i}) (v_i - P(v_i, v_{-i})) \geq x_i(v&#039;_i, v_{-i}) (v_i - P(v&#039;_i, v_{-i}))}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bv%27_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v&#039;_i}' title='{v&#039;_i}' class='latex' />. The opposite is trickier but is also not hard to see.</p>
<p>Note that this characterization implies the following characterization of deterministic truthful auctions, i.e., auctions that map each <img src='http://s.wordpress.com/latex.php?latex=%7Bv%20%5Cin%20%7B%5Cmathbb%20R%7D%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v \in {\mathbb R}^N}' title='{v \in {\mathbb R}^N}' class='latex' /> to a set <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BX%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{X}}' title='{S \in \mathcal{X}}' class='latex' />, i.e., the probability distribution is concentrated in one set.</p>
<blockquote><p><strong>Theorem 3</strong> <em> A mechanism is a truthful deterministic auction iff there is a functions <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' /> such that for each we allocate to bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> iff <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%20%5Cgeq%20f_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i \geq f_i(b_{-i})}' title='{b_i \geq f_i(b_{-i})}' class='latex' /> and in case it is allocated, we charge payment <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' />. </em></p></blockquote>
<p>It is actually easy to generate this function. Given a mechanism, <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%20%5Cmapsto%20x_i%28b_i%2C%20b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i \mapsto x_i(b_i, b_{-i})}' title='{b_i \mapsto x_i(b_i, b_{-i})}' class='latex' /> is a monotone and is a <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B0%2C1%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{0,1\}}' title='{\{0,1\}}' class='latex' />-function. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28b_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(b_{-i})}' title='{f_i(b_{-i})}' class='latex' /> the point where it transitions from <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> to <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. Now, we can give a similar characterization for Universal Truthful Mechanism:</p>
<blockquote><p><strong>Theorem 4</strong> <em> A mechanism is a universal truthful randomized auction if there are functions <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28r%2Cb_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(r,b_{-i})}' title='{f_i(r,b_{-i})}' class='latex' /> such that for each we allocate to bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> iff <img src='http://s.wordpress.com/latex.php?latex=%7Bb_i%20%5Cgeq%20f_i%28r%2Cb_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_i \geq f_i(r,b_{-i})}' title='{b_i \geq f_i(r,b_{-i})}' class='latex' /> and in case it is allocated, we charge payment <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%28r%2Cb_%7B-i%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i(r,b_{-i})}' title='{f_i(r,b_{-i})}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7Br%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{r}' title='{r}' class='latex' /> are random bits. </em></p></blockquote>
<p><strong>2. Profit benchmarks </strong></p>
<p>Let&#8217;s consider a Digital Goods auction, where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D%202%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} = 2^N}' title='{\mathcal{X} = 2^N}' class='latex' />. Two natural goals for profit extraction would be <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BT%7D%28v%29%20%3D%20%5Csum_i%20v_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{T}(v) = \sum_i v_i}' title='{\mathcal{T}(v) = \sum_i v_i}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%28v%29%20%3D%20%5Cmax_i%20i%20v_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}(v) = \max_i i v_i}' title='{\mathcal{F}(v) = \max_i i v_i}' class='latex' /> where we can think of <img src='http://s.wordpress.com/latex.php?latex=%7Bv_1%20%5Cgeq%20v_2%20%5Cgeq%20%5Chdots%20%5Cgeq%20v_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_1 \geq v_2 \geq \hdots \geq v_n}' title='{v_1 \geq v_2 \geq \hdots \geq v_n}' class='latex' />, the first is the best profit you can extract charging different prices and the second is the best profit you can hope to extract by charging a fixed price. Unfortunately it is impossible to design a mechanism that even <img src='http://s.wordpress.com/latex.php?latex=%7BO%281%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{O(1)}' title='{O(1)}' class='latex' />-approximates both benchmarks on every input. The intuition is that <img src='http://s.wordpress.com/latex.php?latex=%7Bv_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_1}' title='{v_1}' class='latex' /> can be much larger then the rest, so there is no way of setting <img src='http://s.wordpress.com/latex.php?latex=%7Bf_1%28b_%7B-1%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_1(b_{-1})}' title='{f_1(b_{-1})}' class='latex' /> in a proper way. Under the assumption that the first value is not much larger than the second, we can do a good profit approximation, though. This motivates us to find an universal truthful mechanism that approximates the following profit benchmark:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cmathcal%7BF%7D%5E%7B%282%29%7D%28v%29%20%3D%20%5Cmax_%7Bi%20%5Cgeq%202%7D%20i%20v_i&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \mathcal{F}^{(2)}(v) = \max_{i \geq 2} i v_i' title='\displaystyle \mathcal{F}^{(2)}(v) = \max_{i \geq 2} i v_i' class='latex' />
<p>which is the highest single-price profit we can get selling to at least <img src='http://s.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> agents. We will show a truthful mechanism that <img src='http://s.wordpress.com/latex.php?latex=%7B4%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{4}' title='{4}' class='latex' />-approximates this benchmark.</p>
<p><strong>3. Profit Extractors </strong></p>
<p>Profit extractor are building blocks of many mechanisms. The goal of a profit extractor is, given a constant target profit <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' />, extract that profit from a set of agents if that is possible. In this first moment, let&#8217;s see <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> as an exogenous constant. Consider the following mechanism called CostShare<img src='http://s.wordpress.com/latex.php?latex=%7B_C%20%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{_C (v)}' title='{_C (v)}' class='latex' />: find the largest <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> s.t. <img src='http://s.wordpress.com/latex.php?latex=%7Bk%20%5Ccdot%20v_k%20%5Cgeq%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k \cdot v_k \geq C}' title='{k \cdot v_k \geq C}' class='latex' />. Then allocate to</p>
<blockquote><p><strong>Lemma 5</strong> <em> CostShare<img src='http://s.wordpress.com/latex.php?latex=%7B_C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{_C}' title='{_C}' class='latex' /> is a truthful profit-extractor that can extract profit <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> whenever <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%28v%29%20%3D%20%5Cmax_i%20i%20v_i%20%5Cgeq%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}(v) = \max_i i v_i \geq C}' title='{\mathcal{F}(v) = \max_i i v_i \geq C}' class='latex' />. </em></p></blockquote>
<p><em>Proof:</em> It is clear that it can extract profit at most <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%28v%29%20%5Cgeq%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}(v) \geq C}' title='{\mathcal{F}(v) \geq C}' class='latex' />. We just need to prove it is a truthful mechanism and this can be done by checking the characterization of truthful mechanisms. Suppose that under CostShare<img src='http://s.wordpress.com/latex.php?latex=%7B_C%20%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{_C (v)}' title='{_C (v)}' class='latex' /> exacly <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> bidders are getting the item, then let&#8217;s look at a bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />. If bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is not getting the item, then his value is smaller than <img src='http://s.wordpress.com/latex.php?latex=%7BC%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C/k}' title='{C/k}' class='latex' />, otherwise we could incluse all bidders up to <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> and sell for a price <img src='http://s.wordpress.com/latex.php?latex=%7BC%2Fk_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C/k_1}' title='{C/k_1}' class='latex' /> for some <img src='http://s.wordpress.com/latex.php?latex=%7Bk_1%20%3E%20k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k_1 &gt; k}' title='{k_1 &gt; k}' class='latex' />. It is easy to see that bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> will get the item just if he changes his value <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> to some value greater or equal than <img src='http://s.wordpress.com/latex.php?latex=%7BC%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C/k}' title='{C/k}' class='latex' />.</p>
<p>On the other hand, it <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> is currently getting the item under <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' />, then increasing his value won&#8217;t make it change. It is also clear that for any value <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%20%5Cgeq%20C%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i \geq C/k}' title='{v_i \geq C/k}' class='latex' />, he will still get the item. For <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%20%3C%20C%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i &lt; C/k}' title='{v_i &lt; C/k}' class='latex' /> he doesn&#8217;t get it. Suppose it got, then at least <img src='http://s.wordpress.com/latex.php?latex=%7Bk%2B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k+1}' title='{k+1}' class='latex' /> people get the item, because the price they sell it to <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> must be less than <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%20%3C%20C%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i &lt; C/k}' title='{v_i &lt; C/k}' class='latex' />. Thefore, increasing <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> back to its original value, we could still sell it to <img src='http://s.wordpress.com/latex.php?latex=%7Bk%2B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k+1}' title='{k+1}' class='latex' /> players, what is a contradiction, since we assumed we were selling to <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> players.</p>
<p>We checked monotonicity and we also need to check the payments, but it is straightforward to check they satisfy the second condition, since <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28v_i%29%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(v_i) = 1}' title='{x_i(v_i) = 1}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%20%5Cgeq%20C%2Fk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i \geq C/k}' title='{v_i \geq C/k}' class='latex' /> and zero instead. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p><strong>4. Random Sampling Auctions </strong></p>
<p>Now, using that profit extractor as a building block, the main idea is to estimate <img src='http://s.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> smaller than <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}(v)}' title='{\mathcal{F}(v)}' class='latex' /> for one subset of the agents and extract that profit from them using a profit extractor. First we partition <img src='http://s.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> is two sets <img src='http://s.wordpress.com/latex.php?latex=%7BN%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N&#039;}' title='{N&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BN%27%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N&#039;&#039;}' title='{N&#039;&#039;}' class='latex' /> tossing a coin for each agent to decide in which set we will place it, then we calculate <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%27%20%3D%20%5Cmathcal%7BF%7D%28v_%7BN%27%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}&#039; = \mathcal{F}(v_{N&#039;})}' title='{\mathcal{F}&#039; = \mathcal{F}(v_{N&#039;})}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%27%27%20%3D%20%5Cmathcal%7BF%7D%28v_%7BN%27%27%7D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}&#039;&#039; = \mathcal{F}(v_{N&#039;&#039;})}' title='{\mathcal{F}&#039;&#039; = \mathcal{F}(v_{N&#039;&#039;})}' class='latex' />. Now, we run CostShare<img src='http://s.wordpress.com/latex.php?latex=%7B_%7B%5Cmathcal%7BF%7D%27%7D%20%28v%27%27%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{_{\mathcal{F}&#039;} (v&#039;&#039;)}' title='{_{\mathcal{F}&#039;} (v&#039;&#039;)}' class='latex' /> and CostShare<img src='http://s.wordpress.com/latex.php?latex=%7B_%7B%5Cmathcal%7BF%7D%27%27%7D%20%28v%27%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{_{\mathcal{F}&#039;&#039;} (v&#039;)}' title='{_{\mathcal{F}&#039;&#039;} (v&#039;)}' class='latex' />. This is called Random Cost Sharing Auction.</p>
<blockquote><p><strong>Theorem 6</strong> <em> The Random Cost Sharing Auction is a truthful auction whose revenue <img src='http://s.wordpress.com/latex.php?latex=%7B4%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{4}' title='{4}' class='latex' />-approximates the benchmark <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}^{(2)}(v)}' title='{\mathcal{F}^{(2)}(v)}' class='latex' />. </em></p></blockquote>
<p><em>Proof:</em> Let <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BR%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{R}}' title='{\mathcal{R}}' class='latex' /> be a random variable associated with the revenue of the Sampling Auction mechanism. It is clear that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BR%7D%20%3D%20%5Cmin%20%5C%7B%20%5Cmathcal%7BF%7D%27%2C%20%5Cmathcal%7BF%7D%27%27%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{R} = \min \{ \mathcal{F}&#039;, \mathcal{F}&#039;&#039; \}}' title='{\mathcal{R} = \min \{ \mathcal{F}&#039;, \mathcal{F}&#039;&#039; \}}' class='latex' />. Let&#8217;s write <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%28v%29%20%3D%20kp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}^{(2)}(v) = kp}' title='{\mathcal{F}^{(2)}(v) = kp}' class='latex' /> meaning that we sell <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> items at price <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' />. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bk%20%3D%20k%27%20%2B%20k%27%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k = k&#039; + k&#039;&#039;}' title='{k = k&#039; + k&#039;&#039;}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7Bk%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k&#039;}' title='{k&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bk%27%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k&#039;&#039;}' title='{k&#039;&#039;}' class='latex' /> are the items among those <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> items that went to <img src='http://s.wordpress.com/latex.php?latex=%7BN%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N&#039;}' title='{N&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BN%27%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N&#039;&#039;}' title='{N&#039;&#039;}' class='latex' /> respectively. Then, clearly <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%27%20%5Cgeq%20p%20k%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}&#039; \geq p k&#039;}' title='{\mathcal{F}&#039; \geq p k&#039;}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%27%27%20%5Cgeq%20p%20k%27%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}&#039;&#039; \geq p k&#039;&#039;}' title='{\mathcal{F}&#039;&#039; \geq p k&#039;&#039;}' class='latex' />, what gives us:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cfrac%7B%5Cmathcal%7BR%7D%7D%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%7D%20%3D%20%5Cfrac%7B%5Cmin%5C%7B%5Cmathcal%7BF%7D%27%2C%20%5Cmathcal%7BF%7D%27%27%5C%7D%7D%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%7D%20%5Cgeq%20%5Cfrac%7B%5Cmin%5C%7Bk%27p%2C%20k%27%27p%5C%7D%7D%7Bkp%7D%20%3D%20%5Cfrac%7B%5Cmin%5C%7Bk%27%2C%20k%27%27%5C%7D%7D%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{\mathcal{R}}{\mathcal{F}^{(2)}} = \frac{\min\{\mathcal{F}&#039;, \mathcal{F}&#039;&#039;\}}{\mathcal{F}^{(2)}} \geq \frac{\min\{k&#039;p, k&#039;&#039;p\}}{kp} = \frac{\min\{k&#039;, k&#039;&#039;\}}{k}' title='\displaystyle \frac{\mathcal{R}}{\mathcal{F}^{(2)}} = \frac{\min\{\mathcal{F}&#039;, \mathcal{F}&#039;&#039;\}}{\mathcal{F}^{(2)}} \geq \frac{\min\{k&#039;p, k&#039;&#039;p\}}{kp} = \frac{\min\{k&#039;, k&#039;&#039;\}}{k}' class='latex' />
<p>and from there, it is a straighforward probability exercise:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cfrac%7B%5Cmathop%7B%5Cmathbb%20E%7D%5B%7B%5Cmathcal%7BR%7D%7D%5D%7D%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%7D%20%3D%20%5Cmathop%7B%5Cmathbb%20E%7D%5Cleft%5B%7B%20%5Cfrac%7B%5Cmin%5C%7Bk%27%2C%20k%27%27%5C%7D%7D%7Bk%7D%20%7D%5Cright%5D%20%3D%20%5Cfrac%7B1%7D%7Bk%7D%20%5Csum_%7Bi%3D1%7D%5E%7Bk-1%7D%20%5Cmin%5C%7B%20i%2C%20k-i%20%5C%7D%20%5Cbegin%7Bpmatrix%7D%20k%20%5C%5C%20i%20%5Cend%7Bpmatrix%7D%202%5E%7B-k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{\mathop{\mathbb E}[{\mathcal{R}}]}{\mathcal{F}^{(2)}} = \mathop{\mathbb E}\left[{ \frac{\min\{k&#039;, k&#039;&#039;\}}{k} }\right] = \frac{1}{k} \sum_{i=1}^{k-1} \min\{ i, k-i \} \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k}' title='\displaystyle \frac{\mathop{\mathbb E}[{\mathcal{R}}]}{\mathcal{F}^{(2)}} = \mathop{\mathbb E}\left[{ \frac{\min\{k&#039;, k&#039;&#039;\}}{k} }\right] = \frac{1}{k} \sum_{i=1}^{k-1} \min\{ i, k-i \} \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k}' class='latex' />
<p>since:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cfrac%7Bk%7D%7B2%7D%20%3D%20%5Csum_%7Bi%20%3D%200%7D%5Ek%20i%20%5Cbegin%7Bpmatrix%7D%20k%20%5C%5C%20i%20%5Cend%7Bpmatrix%7D%202%5E%7B-k%7D%20%5Cleq%20%5Cfrac%7Bk%7D%7B4%7D%20%2B%202%20%5Csum_%7Bi%20%3D%200%7D%5E%7Bk%2F2%7D%20i%20%5Cbegin%7Bpmatrix%7D%20k%20%5C%5C%20i%20%5Cend%7Bpmatrix%7D%202%5E%7B-k%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{k}{2} = \sum_{i = 0}^k i \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k} \leq \frac{k}{4} + 2 \sum_{i = 0}^{k/2} i \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k} ' title='\displaystyle \frac{k}{2} = \sum_{i = 0}^k i \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k} \leq \frac{k}{4} + 2 \sum_{i = 0}^{k/2} i \begin{pmatrix} k \\ i \end{pmatrix} 2^{-k} ' class='latex' />
<p>and therefore:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cfrac%7B%5Cmathop%7B%5Cmathbb%20E%7D%5B%7B%5Cmathcal%7BR%7D%7D%5D%7D%7B%5Cmathcal%7BF%7D%5E%7B%282%29%7D%7D%20%5Cgeq%20%5Cfrac%7B1%7D%7B4%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \frac{\mathop{\mathbb E}[{\mathcal{R}}]}{\mathcal{F}^{(2)}} \geq \frac{1}{4} ' title='\displaystyle  \frac{\mathop{\mathbb E}[{\mathcal{R}}]}{\mathcal{F}^{(2)}} \geq \frac{1}{4} ' class='latex' />
<img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' />
<p>This similar approximations can be extended to more general environments with very little change. For example, for multi-unit auctions, where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BX%7D%20%3D%20%5C%7B%20S%3B%20%5Cvert%20S%20%5Cvert%20%5Cleq%20k%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{X} = \{ S; \vert S \vert \leq k \}}' title='{\mathcal{X} = \{ S; \vert S \vert \leq k \}}' class='latex' /> we use the benchmark <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%5E%7B%282%2Ck%29%7D%20%3D%20%5Cmax_%7B2%20%5Cleq%20i%20%5Cleq%20k%7D%20i%20v_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}^{(2,k)} = \max_{2 \leq i \leq k} i v_i}' title='{\mathcal{F}^{(2,k)} = \max_{2 \leq i \leq k} i v_i}' class='latex' /> and we can be <img src='http://s.wordpress.com/latex.php?latex=%7BO%281%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{O(1)}' title='{O(1)}' class='latex' />-competitive against it, by random-sampling, evaluating <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%5E%7B%281%2Ck%29%7D%20%3D%20%5Cmax_%7B%5Cleq%20i%20%5Cleq%20k%7D%20i%20v_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{F}^{(1,k)} = \max_{\leq i \leq k} i v_i}' title='{\mathcal{F}^{(1,k)} = \max_{\leq i \leq k} i v_i}' class='latex' /> on both sets and running a profit extractor on both. The profit extractor is a simple generalization of the previous one.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/151/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Minimum average cost cycle and TSP</title>
		<link>http://www.bigredbits.com/archives/148</link>
		<comments>http://www.bigredbits.com/archives/148#comments</comments>
		<pubDate>Thu, 03 Sep 2009 04:01:34 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[approximation algorithms]]></category>
		<category><![CDATA[TSP]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=148</guid>
		<description><![CDATA[After some time, I did again Code Jam &#8211; well, not again, this is the first time I do Code Jam, but there is a while I don&#8217;t do Programming Competitions. Back in my undergrad I remember all the fun I had with my ACM-ICPC team solving problems and discussing algorithms problems. Actually, ICPC was [...]]]></description>
			<content:encoded><![CDATA[<p>After some time, I did again <a class="snap_noshots" href="http://code.google.com/codejam/">Code Jam</a> &#8211; well, not again, this is the first time I do Code Jam, but there is a while I don&#8217;t do Programming Competitions. Back in my undergrad I remember all the fun I had with <a class="snap_noshots" href="http://picasaweb.google.com/renatoppl/BanffICPC2008#5191786378005370066">my ACM-ICPC team</a> solving problems and discussing algorithms problems. Actually, <a class="snap_noshots" href="http://cm2prod.baylor.edu/welcome.icpc">ICPC</a> was what made me interested in Algorithms and Theory of Computing for the first time. I was remembering that not only because Code Jam because I came across a nice problem whose solution I learned in programming competitions, specifically a technique I learned to solve <a class="snap_noshots" href="http://acm.pku.edu.cn/JudgeOnline/problem?id=2728">this problem</a>.</p>
<p>Let&#8217;s formulate the problem in a more abstract way: Given a graph <img src='http://s.wordpress.com/latex.php?latex=%7BG%20%3D%20%28V%2CE%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G = (V,E)}' title='{G = (V,E)}' class='latex' /> and two functions: a cost function <img src='http://s.wordpress.com/latex.php?latex=%7Bc%3AE%20%5Crightarrow%20%7B%5Cmathbb%20R%7D_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c:E \rightarrow {\mathbb R}_+}' title='{c:E \rightarrow {\mathbb R}_+}' class='latex' /> and a benefit function <img src='http://s.wordpress.com/latex.php?latex=%7Bb%3AE%20%5Crightarrow%20%7B%5Cmathbb%20R%7D_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b:E \rightarrow {\mathbb R}_+}' title='{b:E \rightarrow {\mathbb R}_+}' class='latex' />, we define the cost-benefit of a set of edges as <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7Bb%28S%29%7D%7Bc%28S%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{b(S)}{c(S)}}' title='{\frac{b(S)}{c(S)}}' class='latex' />. Now, consider those two questions:</p>
<p><strong>Question I</strong>: Find the spanning tree of maximum (minimum) cost-benefit.</p>
<p><strong>Question II</strong>: Find the cycle of maximum (minimum) cost-benefit.</p>
<p>The solution of those uses binary search. If we can answer the following query: given <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cbeta%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\beta &gt; 0}' title='{\beta &gt; 0}' class='latex' />, is there a cycle (spanning tree) of cost-benefit smaller (larger) than <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cbeta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\beta}' title='{\beta}' class='latex' />? We either state there is no such tree (cycle) or exhibit that. How can we solve this? It is simple: consider the graph <img src='http://s.wordpress.com/latex.php?latex=%7BG%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{G}' title='{G}' class='latex' /> with edge weights given by <img src='http://s.wordpress.com/latex.php?latex=%7Bb_e%20-%20%5Cbeta%20c_e%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b_e - \beta c_e}' title='{b_e - \beta c_e}' class='latex' />. Then there is a cycle (spanning tree) of cost benefit <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20%5Cbeta%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq \beta}' title='{\leq \beta}' class='latex' /> if and only if there is a cycle (spanning tree) in this graph with transformed weights with negative total weight. Finding a cycle with negative weight is easy and can be done, for example, using Bellman Ford&#8217;s algorithm. Finding a spanning tree with negative weights can be done using any minimal spanning tree algorithm, as Kruskal, Prim or Boruvka.</p>
<p>Taking <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28e%29%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(e) = 1}' title='{c(e) = 1}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Be%20%5Cin%20E%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{e \in E}' title='{e \in E}' class='latex' /> we can find using binary search, the cycle with smallest average length, i.e., smallest <img src='http://s.wordpress.com/latex.php?latex=%7Bb%28C%29%20%2F%20%5Cvert%20C%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{b(C) / \vert C \vert}' title='{b(C) / \vert C \vert}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cvert%20C%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\vert C \vert}' title='{\vert C \vert}' class='latex' /> is the number of edges in the cycle.</p>
<p><strong> Asymmetric Travelling Salesman Problem </strong></p>
<p>We can use this trick just described to design an <img src='http://s.wordpress.com/latex.php?latex=%7BO%28%5Clog%20n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{O(\log n)}' title='{O(\log n)}' class='latex' />-approximation to the asymmetric TSP problem. Consider we have <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> nodes in <img src='http://s.wordpress.com/latex.php?latex=%7BV%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V}' title='{V}' class='latex' /> and a function <img src='http://s.wordpress.com/latex.php?latex=%7Bc%3A%20V%20%5Ctimes%20V%20%5Crightarrow%20%7B%5Cmathbb%20R%7D_%2B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c: V \times V \rightarrow {\mathbb R}_+}' title='{c: V \times V \rightarrow {\mathbb R}_+}' class='latex' />, not necessarily symmetric, such that the triangular inequality holds, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28i%2Cj%29%20%5Cleq%20c%28i%2Ck%29%20%2B%20c%28k%2Cj%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(i,j) \leq c(i,k) + c(k,j)}' title='{c(i,j) \leq c(i,k) + c(k,j)}' class='latex' />. A TSP tour is an ordering <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpi%3A%20%5C%7B1%2C%20%5Chdots%2C%20n%5C%7D%20%5Crightarrow%20V%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\pi: \{1, \hdots, n\} \rightarrow V}' title='{\pi: \{1, \hdots, n\} \rightarrow V}' class='latex' /> and has total cost:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20c%28%5Cpi%29%20%3D%20%5Csum_%7Bj%3D1%7D%5En%20c%28%5Cpi_j%2C%20%5Cpi_%7Bj%2B1%7D%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle c(\pi) = \sum_{j=1}^n c(\pi_j, \pi_{j+1})' title='\displaystyle c(\pi) = \sum_{j=1}^n c(\pi_j, \pi_{j+1})' class='latex' />
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cpi_%7Bn%2B1%7D%20%3D%20%5Cpi_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\pi_{n+1} = \pi_1}' title='{\pi_{n+1} = \pi_1}' class='latex' />. Let OPT be the cost of the optimal tour. It is NP-complete to calculate the optimal, but consider the following approximation algorithm: find the cycle with smallest average cost. Then remove all the nodes in that cycle except one, in the remaining graph find again the cycle of smallest average cost and remove all nodes except one. Continue doing that until there is just one node left. Taking all those cycles together, we have a strongly connected Eulerian graph (in-degrees are equal to out-degrees) for each node). I claim that the total weight of edges <img src='http://s.wordpress.com/latex.php?latex=%7BE%27%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{E&#039;}' title='{E&#039;}' class='latex' /> in this Eulerian graph is:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20c%28E%27%29%20%5Cleq%202%20%5Cmathcal%7BH%7D_n%20%5Ccdot%20OPT&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle c(E&#039;) \leq 2 \mathcal{H}_n \cdot OPT' title='\displaystyle c(E&#039;) \leq 2 \mathcal{H}_n \cdot OPT' class='latex' />
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BH%7D_n%20%3D%20%5Csum_%7Bj%3D1%7D%5En%20%5Cfrac%7B1%7D%7Bj%7D%20%3D%20O%28%5Clog%20n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{H}_n = \sum_{j=1}^n \frac{1}{j} = O(\log n)}' title='{\mathcal{H}_n = \sum_{j=1}^n \frac{1}{j} = O(\log n)}' class='latex' /> is the harmonic number. Now, since we have this graph we can find an Eulerian tour and transform it into a TSP tour shortcutting when necessary (triangle inequality guarantees that shortcutting doesn&#8217;t decrease the cost of the tour). So, we just need to prove the claim.</p>
<p>In fact, it is not hard to see that after removing some nodes, the optimal tour is still <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20OPT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq OPT}' title='{\leq OPT}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7BOPT%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{OPT}' title='{OPT}' class='latex' /> is the tour of smallest cost for all nodes. To see this, just take the original tour and shortcut it, for example, if the original tour passed through a sequence of nodes <img src='http://s.wordpress.com/latex.php?latex=%7Bi_1%2C%20%5Chdots%2C%20i_p%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i_1, \hdots, i_p}' title='{i_1, \hdots, i_p}' class='latex' /> but nodes <img src='http://s.wordpress.com/latex.php?latex=%7Bi_2%2C%20%5Chdots%2C%20i_%7Bp-1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i_2, \hdots, i_{p-1}}' title='{i_2, \hdots, i_{p-1}}' class='latex' /> then by triangle inequality:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20c%28i_1%2C%20i_p%29%20%5Cleq%20%5Csum_%7Bj%3D1%7D%5E%7Bp-1%7D%20c%28i_j%2C%20i_%7Bj%2B1%7D%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle c(i_1, i_p) \leq \sum_{j=1}^{p-1} c(i_j, i_{j+1})' title='\displaystyle c(i_1, i_p) \leq \sum_{j=1}^{p-1} c(i_j, i_{j+1})' class='latex' />
<p>so we can just substitute the edges <img src='http://s.wordpress.com/latex.php?latex=%7B%28i_1%2C%20i_2%29%2C%20%5Chdots%2C%20%28i_%7Bp-1%7D%2C%20i_p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(i_1, i_2), \hdots, (i_{p-1}, i_p)}' title='{(i_1, i_2), \hdots, (i_{p-1}, i_p)}' class='latex' /> by <img src='http://s.wordpress.com/latex.php?latex=%7B%28i_1%2C%20i_p%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(i_1, i_p)}' title='{(i_1, i_p)}' class='latex' />. Now, suppose we do <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> iterations and in the beginning of the <img src='http://s.wordpress.com/latex.php?latex=%7Bj%5E%7Bth%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j^{th}}' title='{j^{th}}' class='latex' /> iteration there are <img src='http://s.wordpress.com/latex.php?latex=%7Bn_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n_j}' title='{n_j}' class='latex' /> nodes left. So, clearly the average length of the cycle we picked in the algorithm is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20%5Cfrac%7BOPT%7D%7Bn_j%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq \frac{OPT}{n_j}}' title='{\leq \frac{OPT}{n_j}}' class='latex' /> and therefore, if <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20%5Chdots%20C_k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, \hdots C_k}' title='{C_1, \hdots C_k}' class='latex' /> are the cycles chosen, we have:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20c%28E%27%29%20%3D%20%5Csum_%7Bj%3D1%7D%5Ek%20c%28C_j%29%20%5Cleq%20%5Csum_%7Bj%3D1%7D%5Ek%20%5Cfrac%7BOPT%7D%7Bn_j%7D%20%28n_j%20-%20n_%7Bj%2B1%7D%20%2B%201%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle c(E&#039;) = \sum_{j=1}^k c(C_j) \leq \sum_{j=1}^k \frac{OPT}{n_j} (n_j - n_{j+1} + 1)' title='\displaystyle c(E&#039;) = \sum_{j=1}^k c(C_j) \leq \sum_{j=1}^k \frac{OPT}{n_j} (n_j - n_{j+1} + 1)' class='latex' />
<p>since:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cfrac%7Bn_j%20-%20n_%7Bj%2B1%7D%20%2B%201%7D%7Bn_j%7D%20%5Cleq%20%5Cfrac%7B1%7D%7Bn_j%7D%20%2B%20%5Cfrac%7B1%7D%7Bn_j%20-%201%7D%20%2B%20%5Chdots%20%2B%20%5Cfrac%7B1%7D%7Bn_%7Bj%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{n_j - n_{j+1} + 1}{n_j} \leq \frac{1}{n_j} + \frac{1}{n_j - 1} + \hdots + \frac{1}{n_{j+1}}' title='\displaystyle \frac{n_j - n_{j+1} + 1}{n_j} \leq \frac{1}{n_j} + \frac{1}{n_j - 1} + \hdots + \frac{1}{n_{j+1}}' class='latex' />
<p>we plug those two expressions together and we get the claim.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/148/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Entropy</title>
		<link>http://www.bigredbits.com/archives/145</link>
		<comments>http://www.bigredbits.com/archives/145#comments</comments>
		<pubDate>Fri, 28 Aug 2009 01:28:54 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[entropy]]></category>
		<category><![CDATA[information theory]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=145</guid>
		<description><![CDATA[Today was the first day of classes here at Cornell and as usual, I attend to a lot of different classes to try to decide which ones to take. I usually feel like I wanted to take them all, but there is this constant struggle: if I take too many classes I have no time [...]]]></description>
			<content:encoded><![CDATA[<p>Today was the first day of classes here at Cornell and as usual, I attend to a lot of different classes to try to decide which ones to take. I usually feel like I wanted to take them all, but there is this constant struggle: if I take too many classes I have no time to do research and to read random things that happen to catch my attention at that moment, and if I don&#8217;t take many classes I feel like not learning a lot of interesting stuff I wanted to be learning. The solution in the middle of the way is to audit a lot of classes and start dropping them as a start needing more time: what happens usually quickly. This particular fall I decided that I need to build a stronger background in probability &#8211; since I am finding a lot of probabilistic stuff in my way and I have nothing more than my undergrad course and things I learned on demand. I attended at least three probability classes with different flavours today and I decided to blog about a simple, yet very impressive result I saw in one of them.</p>
<p>Since I took a class on &#8220;Principles of Telecommunications&#8221; in my undergrad, I became impressed by Shannon&#8217;s <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Information_Theory">Information Theory</a> and the concept of entropy. There was one theorem that I always heard about but never saw the proof. I thought it was a somewhat complicated proof, but it turned out not to be that much.</p>
<p>Consder an alphabet <img src='http://s.wordpress.com/latex.php?latex=%7B%5COmega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Omega}' title='{\Omega}' class='latex' /> and a probability distribution over it. I want to associate to each <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega%20%5Cin%20%5COmega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega \in \Omega}' title='{\omega \in \Omega}' class='latex' /> a string <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(\omega)}' title='{c(\omega)}' class='latex' /> of <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega)}' title='{k(\omega)}' class='latex' /> <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B0%2C1%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{0,1\}}' title='{\{0,1\}}' class='latex' />-digits to represent each simbol of the alphabet. One way of allowing the code to be decodable is to make them a proper code. A proper code is a code such that given any <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_1}' title='{\omega_1}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_2}' title='{\omega_2}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28%5Comega_1%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(\omega_1)}' title='{c(\omega_1)}' class='latex' /> is not a prefix of <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28%5Comega_2%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(\omega_2)}' title='{c(\omega_2)}' class='latex' />. There are several codes like this, but some are more efficient then others. Since the letters have different frequencies, it makes sense to code a frequent letter (say &#8216;e&#8217; in English) with few bits and a letter that doesn&#8217;t appear much, say &#8216;q&#8217; with more bits. We want to find a proper code to minimize:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bk%28%5Comega%29%5D%20%3D%20%5Csum_%7B%5Comega%20%5Cin%20%5COmega%7D%20k%28%5Comega%29%20p%28%5Comega%29%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \mathop{\mathbb E}[k(\omega)] = \sum_{\omega \in \Omega} k(\omega) p(\omega) ' title='\displaystyle \mathop{\mathbb E}[k(\omega)] = \sum_{\omega \in \Omega} k(\omega) p(\omega) ' class='latex' />
<p>The celebrated theorem by Shannon shows that for any proper code (actually it holds more generally for any decodable code), we have <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Bk%28%5Comega%29%5D%20%5Cgeq%20H%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[k(\omega)] \geq H}' title='{\mathop{\mathbb E}[k(\omega)] \geq H}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7BH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H}' title='{H}' class='latex' /> is the entropy of the alphabet, defined as:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20H%20%3D%20-%20%5Csum_%7B%5Comega%7D%20p%28%5Comega%29%20%5Clog_2%20p%28%5Comega%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle H = - \sum_{\omega} p(\omega) \log_2 p(\omega)' title='\displaystyle H = - \sum_{\omega} p(\omega) \log_2 p(\omega)' class='latex' />
<p>even more impressive is that we can achieve something very close to it:</p>
<blockquote><p><strong>Theorem 1</strong> <em> There is a code such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Bk%28%5Comega%29%5D%20%5Cleq%20H%20%2B%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[k(\omega)] \leq H + 1}' title='{\mathop{\mathbb E}[k(\omega)] \leq H + 1}' class='latex' />. </em></p></blockquote>
<p>With an additional trick we can get <img src='http://s.wordpress.com/latex.php?latex=%7BH%20%2B%20%5Cepsilon%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H + \epsilon}' title='{H + \epsilon}' class='latex' /> for any <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cepsilon%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\epsilon &gt; 0}' title='{\epsilon &gt; 0}' class='latex' />. The first part is trickier and I won&#8217;t do here (but again, it is not as hard as I thought it would be). For proving that there is a code with average length <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20H%20%2B%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq H + 1}' title='{\leq H + 1}' class='latex' /> we use the following lemma:</p>
<blockquote><p><strong>Lemma 2</strong> <em> There is a proper code for <img src='http://s.wordpress.com/latex.php?latex=%7B%5COmega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Omega}' title='{\Omega}' class='latex' /> with code-lengths <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega)}' title='{k(\omega)}' class='latex' /> if and only if <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%5Comega%202%5E%7B-k%28%5Comega%29%7D%20%5Cleq%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_\omega 2^{-k(\omega)} \leq 1}' title='{\sum_\omega 2^{-k(\omega)} \leq 1}' class='latex' /> </em></p></blockquote>
<p><em>Proof:</em> Let <img src='http://s.wordpress.com/latex.php?latex=%7BN%20%3D%20%5Cmax_%5Comega%20k%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N = \max_\omega k(\omega)}' title='{N = \max_\omega k(\omega)}' class='latex' /> and imagine all the possible codewords of length <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleq%20N%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\leq N}' title='{\leq N}' class='latex' /> as a complete binary tree. Since it is a proper code, no two codes <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28%5Comega_1%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(\omega_1)}' title='{c(\omega_1)}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bc%28%5Comega_2%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c(\omega_2)}' title='{c(\omega_2)}' class='latex' /> are in the same path to the root. So, picking one node as a codeword means that we can&#8217;t pick any node in the subtree from it. Also, for each leave, the is at most one codeword in its path to the root. Therefore we can assign each leaf of the tree to a single codeword or to no codeword at all. It is easy to see that a codeword with size <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega)}' title='{k(\omega)}' class='latex' /> has associated with it <img src='http://s.wordpress.com/latex.php?latex=%7B2%5E%7BN%20-%20k%28%5Comega%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^{N - k(\omega)}}' title='{2^{N - k(\omega)}}' class='latex' /> leaves. Since there are <img src='http://s.wordpress.com/latex.php?latex=%7B2%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^N}' title='{2^N}' class='latex' /> leaves in total, we have that:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Csum_%5Comega%202%5E%7BN-k%28%5Comega%29%7D%20%5Cleq%202%5EN&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \sum_\omega 2^{N-k(\omega)} \leq 2^N' title='\displaystyle \sum_\omega 2^{N-k(\omega)} \leq 2^N' class='latex' />
<p>what proves one direction of the result. Now, to prove the converse direction, we can propose a greedy algorithm: given <img src='http://s.wordpress.com/latex.php?latex=%7B%5COmega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Omega}' title='{\Omega}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega)}' title='{k(\omega)}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%5Comega%202%5E%7B-k%28%5Comega%29%7D%20%5Cleq%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_\omega 2^{-k(\omega)} \leq 1}' title='{\sum_\omega 2^{-k(\omega)} \leq 1}' class='latex' />, let <img src='http://s.wordpress.com/latex.php?latex=%7BN%20%3D%20%5Cmax_%5Comega%20k%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{N = \max_\omega k(\omega)}' title='{N = \max_\omega k(\omega)}' class='latex' />. Now, suppose <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega_1%29%20%5Cleq%20k%28%5Comega_2%29%20%5Cleq%20k%28%5Comega_3%29%20%5Cleq%20%5Chdots%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega_1) \leq k(\omega_2) \leq k(\omega_3) \leq \hdots}' title='{k(\omega_1) \leq k(\omega_2) \leq k(\omega_3) \leq \hdots}' class='latex' />. Start with <img src='http://s.wordpress.com/latex.php?latex=%7B2%5EN%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^N}' title='{2^N}' class='latex' /> leaves in a whole block. Start dividing them in <img src='http://s.wordpress.com/latex.php?latex=%7B2%5E%7Bk%28%5Comega_1%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^{k(\omega_1)}}' title='{2^{k(\omega_1)}}' class='latex' /> blocks and assign one to <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_1}' title='{\omega_1}' class='latex' />. Now we define the recursive step: when we analyze <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_j}' title='{\omega_j}' class='latex' />, the leaves are divided in <img src='http://s.wordpress.com/latex.php?latex=%7B2%5E%7Bk%28%5Comega_j-1%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^{k(\omega_j-1)}}' title='{2^{k(\omega_j-1)}}' class='latex' /> blocks, some occupied, some not. Divide each free block in <img src='http://s.wordpress.com/latex.php?latex=%7B2%5E%7Bk%28%5Comega_j%29%20-%20k%28%5Comega_j-1%29%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2^{k(\omega_j) - k(\omega_j-1)}}' title='{2^{k(\omega_j) - k(\omega_j-1)}}' class='latex' /> blocks and assign one of them to <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega_j}' title='{\omega_j}' class='latex' />. It is not hard to see that each block corresponds to one node in the tree (the common ancestor of all the leaves in that block) and that it corresponds to a proper code. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>Now, using this we show how to find a code with with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Bk%28%5Comega%29%5D%20%5Cleq%20H%20%2B%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[k(\omega)] \leq H + 1}' title='{\mathop{\mathbb E}[k(\omega)] \leq H + 1}' class='latex' />. For each <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega}' title='{\omega}' class='latex' />, since <img src='http://s.wordpress.com/latex.php?latex=%7Bp%28%5Comega%29%20%5Cin%20%280%2C1%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p(\omega) \in (0,1]}' title='{p(\omega) \in (0,1]}' class='latex' /> we can always find <img src='http://s.wordpress.com/latex.php?latex=%7Bk%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k(\omega)}' title='{k(\omega)}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cfrac%7B1%7D%7B2%7D%20p%28%5Comega%29%20%5Cleq%202%5E%7B-k%28%5Comega%29%7D%20%5Cleq%20p%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\frac{1}{2} p(\omega) \leq 2^{-k(\omega)} \leq p(\omega)}' title='{\frac{1}{2} p(\omega) \leq 2^{-k(\omega)} \leq p(\omega)}' class='latex' />. Now, clearly:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Csum_%5Comega%202%5E%7B-k%28%5Comega%29%7D%20%5Cleq%20%5Csum_%5Comega%20p%28%5Comega%29%20%3D%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \sum_\omega 2^{-k(\omega)} \leq \sum_\omega p(\omega) = 1' title='\displaystyle \sum_\omega 2^{-k(\omega)} \leq \sum_\omega p(\omega) = 1' class='latex' />
<p>and:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bk%28%5Comega%29%5D%20%3D%20%5Csum_%5Comega%20k%28%5Comega%29%20p%28%5Comega%29%20%5Cleq%20%5Csum_%5Comega%20%5B1%20-%20%5Clog_2%20p%28%5Comega%29%5D%20p%28%5Comega%29%20%3D%20H%20%2B%201&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \mathop{\mathbb E}[k(\omega)] = \sum_\omega k(\omega) p(\omega) \leq \sum_\omega [1 - \log_2 p(\omega)] p(\omega) = H + 1' title='\displaystyle \mathop{\mathbb E}[k(\omega)] = \sum_\omega k(\omega) p(\omega) \leq \sum_\omega [1 - \log_2 p(\omega)] p(\omega) = H + 1' class='latex' />
<p>Cool, but now how to bring it to <img src='http://s.wordpress.com/latex.php?latex=%7BH%20%2B%20%5Cepsilon%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{H + \epsilon}' title='{H + \epsilon}' class='latex' /> ? The idea is to code multiple blocks at the same time (even if they are independent, we are not taking advantage of correlation between the blocks). Consider <img src='http://s.wordpress.com/latex.php?latex=%7B%5COmega%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Omega^k}' title='{\Omega^k}' class='latex' /> and the probability function induced on it, i.e.:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p_k%20%28%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%29%20%3D%20%5Cprod_%7Bi%3D1%7D%5Ek%20p%28%5Comega_i%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p_k (\omega_1, \hdots, \omega_k) = \prod_{i=1}^k p(\omega_i)' title='\displaystyle p_k (\omega_1, \hdots, \omega_k) = \prod_{i=1}^k p(\omega_i)' class='latex' />
<p>It is not hard ot see that <img src='http://s.wordpress.com/latex.php?latex=%7B%5COmega%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Omega^k}' title='{\Omega^k}' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bp_k%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_k}' title='{p_k}' class='latex' /> has entropy <img src='http://s.wordpress.com/latex.php?latex=%7BkH%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{kH}' title='{kH}' class='latex' /> because:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%5Csum_%7B%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%7D%20p_k%28%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%29%20%5Clog_2%20p_k%28%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%29%20%3D%5C%5C%20%3D%20%5Csum_%7B%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%7D%20%5Cprod_i%20p%28%5Comega_i%29%20%5Csum_i%20%5Clog_2%20p%28%5Comega_i%29%20%3D%5C%5C%20%3D%20%5Csum_i%20%5Csum_%5Comega%20p%28%5Comega%29%20%5Clog_2%20p%28%5Comega%29%20%3D%20kH%20%5Cend%7Baligned%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} \sum_{\omega_1, \hdots, \omega_k} p_k(\omega_1, \hdots, \omega_k) \log_2 p_k(\omega_1, \hdots, \omega_k) =\\ = \sum_{\omega_1, \hdots, \omega_k} \prod_i p(\omega_i) \sum_i \log_2 p(\omega_i) =\\ = \sum_i \sum_\omega p(\omega) \log_2 p(\omega) = kH \end{aligned} ' title='\displaystyle \begin{aligned} \sum_{\omega_1, \hdots, \omega_k} p_k(\omega_1, \hdots, \omega_k) \log_2 p_k(\omega_1, \hdots, \omega_k) =\\ = \sum_{\omega_1, \hdots, \omega_k} \prod_i p(\omega_i) \sum_i \log_2 p(\omega_i) =\\ = \sum_i \sum_\omega p(\omega) \log_2 p(\omega) = kH \end{aligned} ' class='latex' />
<p>and then we can just apply the last theorem to that: we can find a function that codifies <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> symbols <img src='http://s.wordpress.com/latex.php?latex=%7B%5Comega%20%3D%20%28%5Comega_1%2C%20%5Chdots%2C%20%5Comega_k%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\omega = (\omega_1, \hdots, \omega_k)}' title='{\omega = (\omega_1, \hdots, \omega_k)}' class='latex' /> with <img src='http://s.wordpress.com/latex.php?latex=%7Bl%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{l(\omega)}' title='{l(\omega)}' class='latex' /> symbols such that:</p>
<p><img src='http://s.wordpress.com/latex.php?latex=%7BkH%20%5Cleq%20%5Cmathop%7B%5Cmathbb%20E%7D%5Bl%28%5Comega%29%5D%20%5Cleq%20kH%20%2B%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{kH \leq \mathop{\mathbb E}[l(\omega)] \leq kH + 1}' title='{kH \leq \mathop{\mathbb E}[l(\omega)] \leq kH + 1}' class='latex' /> since <img src='http://s.wordpress.com/latex.php?latex=%7Bl%28%5Comega%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{l(\omega)}' title='{l(\omega)}' class='latex' /> codifies <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> symbols, we are actually interested in <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathop%7B%5Cmathbb%20E%7D%5Bl%28%5Comega%29%2Fk%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathop{\mathbb E}[l(\omega)/k]}' title='{\mathop{\mathbb E}[l(\omega)/k]}' class='latex' /> and therefore we get:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20H%20%5Cleq%20%5Cmathop%7B%5Cmathbb%20E%7D%5Cleft%5B%5Cfrac%7Bl%28%5Comega%29%7D%7Bk%7D%5Cright%5D%20%5Cleq%20H%20%2B%20%5Cfrac%7B1%7D%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle H \leq \mathop{\mathbb E}\left[\frac{l(\omega)}{k}\right] \leq H + \frac{1}{k}' title='\displaystyle H \leq \mathop{\mathbb E}\left[\frac{l(\omega)}{k}\right] \leq H + \frac{1}{k}' class='latex' />
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/145/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Programming and Storytelling</title>
		<link>http://www.bigredbits.com/archives/139</link>
		<comments>http://www.bigredbits.com/archives/139#comments</comments>
		<pubDate>Sat, 22 Aug 2009 00:35:25 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[stotytelling]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=139</guid>
		<description><![CDATA[Recently I looked in Papadimitriou&#8217;s website looking for something else and found this great article called: &#8220;MythematiCS: In Praise of Storytelling in the Teaching of Computer Science and Math&#8221;. He begins by pointing out the in early times knowledge was transferred mostly by storytelling &#8211; and there is much more place in contemporary technical teaching [...]]]></description>
			<content:encoded><![CDATA[<p>Recently I looked in <a href="http://www.cs.berkeley.edu/~christos/">Papadimitriou&#8217;s website</a> looking for something else and found <a href="http://www.cs.berkeley.edu/~christos/papers/MYTHEMATICS.doc">this great article</a> called: &#8220;MythematiCS: In Praise of Storytelling in the Teaching of Computer Science and Math&#8221;. He begins by pointing out the in early times knowledge was transferred mostly by storytelling &#8211; and there is much more place in contemporary technical teaching to storytelling than most of people realize. He has several interesting points: one of them is that we can think of <a href="http://www.cs.berkeley.edu/~christos/narrative.ppt">writting a computer program as telling a story</a>. For example, the variables are the characters: they have characteristics (data types) and the whole story (program) is about playing around with them. Sometimes they have multiple faces and behaviors depending on the circumstance (<a href="http://en.wikipedia.org/wiki/Polymorphism_in_object-oriented_programming">polymorphism</a>). Iteration and recursion are common literary tools, used for example in fairy tales &#8220;in the first day, this happens, in the second day, that happens, then&#8230;&#8221; or &#8220;he might be able to do that just if he does that&#8230;&#8221;. He mentions one of my favourite books: <a href="http://en.wikipedia.org/wiki/If_on_a_Winter%27s_Night_a_Traveler">&#8220;If on a Winter&#8217;s Night a Traveler&#8221;</a> as a great example of recursion. This made me think that maybe Italo Calvino is my favourite author because his stories are so beautifully constructed in an almost mathematical fashion &#8211; like an <a href="http://en.wikipedia.org/wiki/M._C._Escher">Escher paiting</a> or  <a href="http://en.wikipedia.org/wiki/Bach">Bach music</a>. They went very far in showing the beauty of math and showing it is really one art. For example, this beautiful representation of the hyperbolic plane:</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-140" style="border: 2px solid black;" title="Escher_Circle_Limit_III" src="http://www.bigredbits.com/wp-content/uploads/2009/08/Escher_Circle_Limit_III.jpg" alt="Escher_Circle_Limit_III" width="383" height="383" /></p>
<p style="text-align: left;">Back to programming there are still a lot of interesting relations: several novels are multi-threaded. We look at the novels from perspectives of multiple characters. Stories also need to &#8220;compile and run&#8221;, which in this case mean, make sense and be accepted by people. I was thinking that there are a lot of books which everyone knows about but very few people have ever read (Ulisses, for example). Are those NP-complete problems?</p>
<p style="text-align: left;">Back to Papadimitriou&#8217;s article, he talks about a few interesting books that do a good job in mixing together math and stories. One that he mentions I read a long time ago, still when I was in high-school and it did a great job in further stimulating me on math. The book was <a href="http://www.amazon.com/Parrots-Theorem-Novel-Denis-Guedj/dp/0312303025/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1250900284&amp;sr=1-1">The Parrot&#8217;s Theorem</a>. Recently I also read one other book that he mentioned: <a href="http://www.amazon.com/Surreal-Numbers-Donald-E-Knuth/dp/0201038129/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1250900330&amp;sr=1-1">Surreal Numbers</a>, by Don Knuth. Although I am a great fan of almost everything Knuth writes, this book didn&#8217;t caught me much. I think it may be because I am not the right audience. If I read it a couple of years back I might have enjoyed it much more.</p>
<p style="text-align: left;">When I was in Greece last year, I came across this very interesting comic book: <a href="http://www.amazon.com/Logicomix-Search-Truth-Apostolos-Doxiadis/dp/0747597200/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1250900446&amp;sr=1-1">Logicomix</a>. It was in Greek but just by looking into it I figured out it was something about math and it seemed pretty cool. Later I found out this was written by Papadimitriou and Doxiadis, which made me even more curious to read it. Now I am waiting the English translation of it. One last pointer: Doxiadis has a webpage with some <a href="http://www.apostolosdoxiadis.com/en/index.php?option=com_content&amp;view=article&amp;id=189&amp;Itemid=70">interesting essays</a> about the relations of mathematical proofs, computer programming and storytelling.</p>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px;"><!-- 		@page { margin: 0.79in } 		P { margin-bottom: 0.08in } -->as t</p>
<p style="margin-bottom: 0in; font-style: normal; text-decoration: none;" align="left"><strong><span style="text-decoration: none;"><span style="font-size: medium;"><em>Myth</em></span></span><span style="text-decoration: none;"><span style="font-size: medium;"><span style="font-style: normal;">ematiCS</span></span></span></strong><span style="text-decoration: none;"><span style="font-size: medium;"><span style="font-style: normal;"><span style="font-weight: normal;">: </span></span></span></span><span style="text-decoration: none;"><sup><span style="font-size: xx-small;"><span style="font-style: normal;"><span style="font-weight: normal;">(1)</span></span></span></sup></span></p>
<p style="margin-bottom: 0in; font-style: normal; text-decoration: none;" align="center"><span style="font-size: medium;"><strong>In Praise of Storytelling in the Teaching of<br />
Computer Science and Math</strong></span></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/139/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Duality Theorem for Semidefinite Programming</title>
		<link>http://www.bigredbits.com/archives/128</link>
		<comments>http://www.bigredbits.com/archives/128#comments</comments>
		<pubDate>Mon, 10 Aug 2009 04:44:28 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[linear algebra]]></category>
		<category><![CDATA[linear programming]]></category>
		<category><![CDATA[semidefinite programming]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=128</guid>
		<description><![CDATA[I&#8217;ve been reading through Luca Trevisan&#8217;s blog posts about relaxations of the Sparsest Cut problem. He discusses three relaxations: (i) spectral relaxation; (ii) Leighton-Rao and (iii) Arora-Rao-Vazirani. (i) and (iii) are semi-definite programs (SDP), which is an incredibly beautiful and useful generalization of Linear Programs. This technique is the core of the famous MAX-CUT algorithm [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been reading through <a class="snap_noshots" href="http://lucatrevisan.wordpress.com/2008/05/03/relaxations-of-edge-expansion-and-their-duals/">Luca Trevisan&#8217;s blog posts</a> about relaxations of the Sparsest Cut problem. He discusses three relaxations: (i) spectral relaxation; (ii) Leighton-Rao and (iii) Arora-Rao-Vazirani. (i) and (iii) are <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Semidefinite_programming">semi-definite programs</a> (SDP), which is an incredibly beautiful and useful generalization of Linear Programs. This technique is the core of the famous MAX-CUT algorithm from Goemans and Williamson and was also applied to design approximation algorithms for a large variety of combinatorial problems. As for Linear Programming, there is a Duality Theorem that allows us to give certificates to lower and upper bounds. I was curious to learn more about it and my friend Ashwin suggested me <a class="snap_noshots" href="http://www.cs.elte.hu/~lovasz/semidef.ps">those great lecture notes by Lazlo Lovasz</a> about semi-definite programming. I was surprised that the proof of the duality theorem for SDP is almost the same as for LP, just changing a few steps. We begin by using the Convex Separation Theorem to derive a Semi-definite version of <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Farkas_lemma">Farkas&#8217; Lemma</a>. We use this version of Farkas&#8217; Lemma to prove the Duality Theorem in a similar manner that is normally done for LP. Let&#8217;s go through the proof, but first, let&#8217;s define positive semidefinite matrices:</p>
<blockquote><p><strong>Definition 1</strong> <em> A matrix <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is said to be positive semidefinite (denoted by <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \succeq 0}' title='{A \succeq 0}' class='latex' />) if for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%7B%5Cmathbb%20R%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in {\mathbb R}^n}' title='{x \in {\mathbb R}^n}' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%7Bx%5Et%20A%20x%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x^t A x \geq 0}' title='{x^t A x \geq 0}' class='latex' />. </em></p></blockquote>
<p>It is the same as saying that all eigenvalues of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> are non-negative, since the smallest eigenvalue of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is given by <img src='http://s.wordpress.com/latex.php?latex=%7Bmin_%7B%5CVert%20x%20%5CVert%20%3D%201%7D%20x%5Et%20A%20x%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{min_{\Vert x \Vert = 1} x^t A x}' title='{min_{\Vert x \Vert = 1} x^t A x}' class='latex' />. We denote <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Csucc%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \succ 0}' title='{A \succ 0}' class='latex' /> when all eigenvalues of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> are stricly positive. It is the same as <img src='http://s.wordpress.com/latex.php?latex=%7Bx%5Et%20A%20x%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x^t A x &gt; 0}' title='{x^t A x &gt; 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cneq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \neq 0}' title='{x \neq 0}' class='latex' />. Also, given two matrices <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BB%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B}' title='{B}' class='latex' /> we use the following notation: <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Cbullet%20B%20%3D%20%5Csum_%7Bij%7D%20A_%7Bij%7D%20B_%7Bij%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \bullet B = \sum_{ij} A_{ij} B_{ij}}' title='{A \bullet B = \sum_{ij} A_{ij} B_{ij}}' class='latex' /> what is nothing more than the dot-product when we see those matrices as vectors in <img src='http://s.wordpress.com/latex.php?latex=%7B%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{\mathbb R}^{n^2}}' title='{{\mathbb R}^{n^2}}' class='latex' />. Therefore:</p>
<blockquote><p><strong>Definition 2</strong> <em> A semi-definite program (SDP) is an optimization problem where the variable is a matrix <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> and has the form:</em></p>
<p><em><a name="primal"><br />
</a></em></p>
<p align="center"><em><a name="primal"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20%5Cmin%20C%20%5Cbullet%20X%20%5Ctext%7B%20s.t.%20%7D%20%5C%5C%20%26%20%5Cqquad%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20D_i%20%5Cbullet%20X%20%3D%20d_i%20%26%20%5Cforall%20i%20%5C%5C%20%26%20X%20%5Csucceq%200%20%5C%5C%20%5Cend%7Baligned%7D%20%5Cright.%20%5Cend%7Baligned%7D%20%5Cright.%20%5C%20%5C%20%5C%20%5C%20%5C%20%281%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \left. \begin{aligned} &amp; \min C \bullet X \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; D_i \bullet X = d_i &amp; \forall i \\ &amp; X \succeq 0 \\ \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (1)' title='\displaystyle  \left. \begin{aligned} &amp; \min C \bullet X \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; D_i \bullet X = d_i &amp; \forall i \\ &amp; X \succeq 0 \\ \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (1)' class='latex' /></a></em></p>
<p><em> </em></p>
<p><em>that is, it is a linear program and the restriction that the variable, viewed as a matrix, must be positive semi-definite. </em></p></blockquote>
<p>The interesting thing is to use that the set <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BP%7D%20%5Csubseteq%20%7B%5Cmathbb%20R%7D%5E%7Bn%5E2%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{P} \subseteq {\mathbb R}^{n^2}}' title='{\mathcal{P} \subseteq {\mathbb R}^{n^2}}' class='latex' /> of semi-definite matrices is a convex cone, i.e., if <img src='http://s.wordpress.com/latex.php?latex=%7BA%2C%20B%20%5Cin%20%5Cmathcal%7BP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A, B \in \mathcal{P}}' title='{A, B \in \mathcal{P}}' class='latex' /> then <img src='http://s.wordpress.com/latex.php?latex=%7Bc_1%20A%20%2B%20c_2%20B%20%5Cin%20%5Cmathcal%7BP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_1 A + c_2 B \in \mathcal{P}}' title='{c_1 A + c_2 B \in \mathcal{P}}' class='latex' /> for any <img src='http://s.wordpress.com/latex.php?latex=%7Bc_1%2C%20c_2%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{c_1, c_2 \geq 0}' title='{c_1, c_2 \geq 0}' class='latex' />. It is easy to see this is a convex set. Now we use the following theorem to prove Farkas&#8217; Lemma:</p>
<blockquote><p><strong>Theorem 3 (convex separation)</strong> <em> Given two convex sets <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%2C%20C_2%20%5Csubseteq%20%7B%5Cmathbb%20R%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1, C_2 \subseteq {\mathbb R}^n}' title='{C_1, C_2 \subseteq {\mathbb R}^n}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7BC_1%20%5Ccap%20C_2%20%3D%20%5Cemptyset%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C_1 \cap C_2 = \emptyset}' title='{C_1 \cap C_2 = \emptyset}' class='latex' /> then there is <img src='http://s.wordpress.com/latex.php?latex=%7Bp%2C%20v%20%5Cin%20%7B%5Cmathbb%20R%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p, v \in {\mathbb R}^n}' title='{p, v \in {\mathbb R}^n}' class='latex' /> such that <img src='http://s.wordpress.com/latex.php?latex=%7Bv%5Et%20%28x%20-%20p%29%20%5Cleq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v^t (x - p) \leq 0}' title='{v^t (x - p) \leq 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20C_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in C_1}' title='{x \in C_1}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bv%5Et%20%28x%20-%20p%29%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v^t (x - p) \geq 0}' title='{v^t (x - p) \geq 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20C_2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in C_2}' title='{x \in C_2}' class='latex' />. Besides, if one of them is a cone, it holds with <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p = 0}' title='{p = 0}' class='latex' />. </em></p></blockquote>
<blockquote><p><strong>Theorem 4 (Semi-definite Farkas&#8217; Lemma)</strong> <em> <a name="farkas"></a> Exacly one of the following problems has a solution: </em></p>
<ol>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bx_1%20A_1%20%2B%20%5Chdots%20%2B%20x_k%20A_k%20%5Csucc%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_1 A_1 + \hdots + x_k A_k \succ 0}' title='{x_1 A_1 + \hdots + x_k A_k \succ 0}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%5Cin%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i \in {\mathbb R}}' title='{x_i \in {\mathbb R}}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7BA_1%20%5Cbullet%20Y%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_1 \bullet Y = 0}' title='{A_1 \bullet Y = 0}' class='latex' />, &#8230;, <img src='http://s.wordpress.com/latex.php?latex=%7BA_k%20%5Cbullet%20Y%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_k \bullet Y = 0}' title='{A_k \bullet Y = 0}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \succeq 0}' title='{Y \succeq 0}' class='latex' /> </em></li>
</ol>
<p><em> </em></p></blockquote>
<p><em>Proof:</em> If problem <img src='http://s.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> doesn&#8217;t have a solution then the cone <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{P}}' title='{\mathcal{P}}' class='latex' /> is disjoint from the convex cone <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B%20%5Csum_i%20x_i%20A_i%20%3B%20x_i%20%5Cin%20%7B%5Cmathbb%20R%7D%20%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{ \sum_i x_i A_i ; x_i \in {\mathbb R} \}}' title='{\{ \sum_i x_i A_i ; x_i \in {\mathbb R} \}}' class='latex' /> and therefore they can be separated. This means that there is <img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> (<img src='http://s.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> plays the role of <img src='http://s.wordpress.com/latex.php?latex=%7Bv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v}' title='{v}' class='latex' /> in the convex separation theorem), such that: <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cleft%28%20%5Csum_i%20x_i%20A_i%20%5Cright%29%20%5Cbullet%20Y%20%5Cleq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\left( \sum_i x_i A_i \right) \bullet Y \leq 0}' title='{\left( \sum_i x_i A_i \right) \bullet Y \leq 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx_1%2C%20%5Chdots%2C%20x_k%20%5Cin%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_1, \hdots, x_k \in {\mathbb R}}' title='{x_1, \hdots, x_k \in {\mathbb R}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Cbullet%20M%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \bullet M \geq 0}' title='{Y \bullet M \geq 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7BM%20%5Cin%20%5Cmathcal%7BP%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M \in \mathcal{P}}' title='{M \in \mathcal{P}}' class='latex' />. Now, taking <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%3D%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i = 1}' title='{x_i = 1}' class='latex' /> and all others <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> and then later <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%3D%20-1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i = -1}' title='{x_i = -1}' class='latex' /> and all other zero, we can easily prove that <img src='http://s.wordpress.com/latex.php?latex=%7BA_i%20%5Cbullet%20Y%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_i \bullet Y = 0}' title='{A_i \bullet Y = 0}' class='latex' />.</p>
<p>It remains to prove that <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \succeq 0}' title='{Y \succeq 0}' class='latex' />. Just take <img src='http://s.wordpress.com/latex.php?latex=%7BM%20%3D%20x%20x%5Et%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{M = x x^t}' title='{M = x x^t}' class='latex' /> for <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%7B%5Cmathbb%20R%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in {\mathbb R}^n}' title='{x \in {\mathbb R}^n}' class='latex' /> and then we have that <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Cbullet%20M%20%3D%20x%5Et%20Y%20x%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \bullet M = x^t Y x \geq 0}' title='{Y \bullet M = x^t Y x \geq 0}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%7B%5Cmathbb%20R%7D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in {\mathbb R}^n}' title='{x \in {\mathbb R}^n}' class='latex' /> and therefore, <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \succeq 0}' title='{Y \succeq 0}' class='latex' />. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>Now, we can use this to prove the duality theorem. We will use a slighly different version of Farkas&#8217; Lemma together with an elementary result in Linear Algebra. The following version comes from applying the last theorem to the matrices:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Bbmatrix%7D%20A_1%20%26%20%5C%5C%20%26%200%20%5Cend%7Bbmatrix%7D%2C%20%5Chdots%2C%20%5Cbegin%7Bbmatrix%7D%20A_k%20%26%20%5C%5C%20%26%200%20%5Cend%7Bbmatrix%7D%2C%20%5Cbegin%7Bbmatrix%7D%20-B%20%26%20%5C%5C%20%26%201%20%5Cend%7Bbmatrix%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{bmatrix} A_1 &amp; \\ &amp; 0 \end{bmatrix}, \hdots, \begin{bmatrix} A_k &amp; \\ &amp; 0 \end{bmatrix}, \begin{bmatrix} -B &amp; \\ &amp; 1 \end{bmatrix} ' title='\displaystyle \begin{bmatrix} A_1 &amp; \\ &amp; 0 \end{bmatrix}, \hdots, \begin{bmatrix} A_k &amp; \\ &amp; 0 \end{bmatrix}, \begin{bmatrix} -B &amp; \\ &amp; 1 \end{bmatrix} ' class='latex' /></p>
<p>instead of <img src='http://s.wordpress.com/latex.php?latex=%7BA_1%2C%20%5Chdots%2C%20A_%7Bk%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_1, \hdots, A_{k+1}}' title='{A_1, \hdots, A_{k+1}}' class='latex' />.</p>
<blockquote><p><strong>Theorem 5 (Semi-definite Farkas&#8217; Lemma: Inomogeneous Version)</strong> <em> <a name="farkas"></a> Exacly one of the following problems has a solution: </em></p>
<ol>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bx_1%20A_1%20%2B%20%5Chdots%20%2B%20x_k%20A_k%20%5Csucc%20B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_1 A_1 + \hdots + x_k A_k \succ B}' title='{x_1 A_1 + \hdots + x_k A_k \succ B}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%20%5Cin%20%7B%5Cmathbb%20R%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i \in {\mathbb R}}' title='{x_i \in {\mathbb R}}' class='latex' /> </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7BA_1%20%5Cbullet%20Y%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_1 \bullet Y = 0}' title='{A_1 \bullet Y = 0}' class='latex' />, &#8230;, <img src='http://s.wordpress.com/latex.php?latex=%7BA_k%20%5Cbullet%20Y%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A_k \bullet Y = 0}' title='{A_k \bullet Y = 0}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7BB%20%5Cbullet%20Y%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B \bullet Y \geq 0}' title='{B \bullet Y \geq 0}' class='latex' /> , <img src='http://s.wordpress.com/latex.php?latex=%7BY%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{Y \succeq 0}' title='{Y \succeq 0}' class='latex' /> </em></li>
</ol>
<p><em> </em></p></blockquote>
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%5Csucc%20B%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A \succ B}' title='{A \succ B}' class='latex' /> means <img src='http://s.wordpress.com/latex.php?latex=%7BA%20-%20B%20%5Csucc%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A - B \succ 0}' title='{A - B \succ 0}' class='latex' />. Now, an elementary Linear Algebra result before we can proceed to the proof of the duality theorem:</p>
<blockquote><p><strong>Theorem 6</strong> <em> Given two semi-definite matrices <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BB%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{B}' title='{B}' class='latex' /> we have <img src='http://s.wordpress.com/latex.php?latex=%7Btr%28AB%29%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{tr(AB) \geq 0}' title='{tr(AB) \geq 0}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7Btr%28M%29%20%3D%20%5Csum_i%20M_%7Bii%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{tr(M) = \sum_i M_{ii}}' title='{tr(M) = \sum_i M_{ii}}' class='latex' /> </em></p></blockquote>
<p><em>Proof:</em> The trace of a matrix is invariant under change of basis, i.e. for any non-singular matrix <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma}' title='{\Gamma}' class='latex' />, we have that <img src='http://s.wordpress.com/latex.php?latex=%7Btr%28A%29%20%3D%20tr%28%5CGamma%5E%7B-1%7D%20A%20%5CGamma%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{tr(A) = tr(\Gamma^{-1} A \Gamma)}' title='{tr(A) = tr(\Gamma^{-1} A \Gamma)}' class='latex' />. This is very easy to see, since:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Baligned%7D%20tr%28%5CGamma%5E%7B-1%7D%20A%20%5CGamma%29%20%26%20%3D%20%5Csum_i%20%28%5CGamma%5E%7B-1%7D%20A%20%5CGamma%29_%7Bii%7D%20%3D%20%5Csum_i%20%5Csum_j%20%28%5CGamma%5E%7B-1%7D%29_%7Bij%7D%20%28A%20%5CGamma%29_%7Bji%7D%20%3D%20%5C%5C%20%26%20%3D%20%5Csum_i%20%5Csum_j%20%5Csum_k%20%28%5CGamma%5E%7B-1%7D%29_%7Bij%7D%20A_%7Bjk%7D%20%28%5CGamma%29_%7Bki%7D%20%3D%20%5C%5C%20%26%20%3D%20%5Csum_j%20%5Csum_k%20A_%7Bjk%7D%20%5Cleft%28%20%5Csum_i%20%28%5CGamma%29_%7Bki%7D%20%28%5CGamma%5E%7B-1%7D%29_%7Bij%7D%20%5Cright%29%20%3D%20%5C%5C%20%09%09%09%20%26%20%3D%20%5Csum_j%20%5Csum_k%20A_%7Bjk%7D%20%5Cdelta_%7Bjk%7D%20%3D%20tr%28A%29%20%5C%5C%20%5Cend%7Baligned%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{aligned} tr(\Gamma^{-1} A \Gamma) &amp; = \sum_i (\Gamma^{-1} A \Gamma)_{ii} = \sum_i \sum_j (\Gamma^{-1})_{ij} (A \Gamma)_{ji} = \\ &amp; = \sum_i \sum_j \sum_k (\Gamma^{-1})_{ij} A_{jk} (\Gamma)_{ki} = \\ &amp; = \sum_j \sum_k A_{jk} \left( \sum_i (\Gamma)_{ki} (\Gamma^{-1})_{ij} \right) = \\ 			 &amp; = \sum_j \sum_k A_{jk} \delta_{jk} = tr(A) \\ \end{aligned} ' title='\displaystyle  \begin{aligned} tr(\Gamma^{-1} A \Gamma) &amp; = \sum_i (\Gamma^{-1} A \Gamma)_{ii} = \sum_i \sum_j (\Gamma^{-1})_{ij} (A \Gamma)_{ji} = \\ &amp; = \sum_i \sum_j \sum_k (\Gamma^{-1})_{ij} A_{jk} (\Gamma)_{ki} = \\ &amp; = \sum_j \sum_k A_{jk} \left( \sum_i (\Gamma)_{ki} (\Gamma^{-1})_{ij} \right) = \\ 			 &amp; = \sum_j \sum_k A_{jk} \delta_{jk} = tr(A) \\ \end{aligned} ' class='latex' /></p>
<p>Now, we can write in the basis of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />&#8216;s eigenvectors. Let <img src='http://s.wordpress.com/latex.php?latex=%7B%5CGamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Gamma}' title='{\Gamma}' class='latex' /> be a matrix s.t. <img src='http://s.wordpress.com/latex.php?latex=%7BA%20%3D%20%5CGamma%5E%7Bt%7D%20D%20%5CGamma%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A = \Gamma^{t} D \Gamma}' title='{A = \Gamma^{t} D \Gamma}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7BD%20%3D%20diag%28%5Clambda_1%2C%20%5Chdots%2C%20%5Clambda_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D = diag(\lambda_1, \hdots, \lambda_n)}' title='{D = diag(\lambda_1, \hdots, \lambda_n)}' class='latex' /> Now:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Baligned%7D%20tr%28AB%29%20%26%20%3D%20tr%28%5CGamma%5E%7Bt%7D%20AB%20%5CGamma%29%20%3D%20tr%28%5CGamma%5E%7Bt%7D%20A%20%5CGamma%20%5CGamma%5E%7Bt%7D%20B%20%5CGamma%29%20%3D%20tr%28D%20%5CGamma%5E%7Bt%7D%20B%20%5CGamma%29%20%3D%20%5C%5C%20%26%20%3D%20%5Csum_%7Bii%7D%20%5Clambda_i%20%28%5CGamma%5E%7Bt%7D%20B%20%5CGamma%29_%7Bii%7D%20%5Cgeq%200%20%5Cend%7Baligned%7D%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{aligned} tr(AB) &amp; = tr(\Gamma^{t} AB \Gamma) = tr(\Gamma^{t} A \Gamma \Gamma^{t} B \Gamma) = tr(D \Gamma^{t} B \Gamma) = \\ &amp; = \sum_{ii} \lambda_i (\Gamma^{t} B \Gamma)_{ii} \geq 0 \end{aligned} ' title='\displaystyle  \begin{aligned} tr(AB) &amp; = tr(\Gamma^{t} AB \Gamma) = tr(\Gamma^{t} A \Gamma \Gamma^{t} B \Gamma) = tr(D \Gamma^{t} B \Gamma) = \\ &amp; = \sum_{ii} \lambda_i (\Gamma^{t} B \Gamma)_{ii} \geq 0 \end{aligned} ' class='latex' /></p>
<p>since <img src='http://s.wordpress.com/latex.php?latex=%7B%5Clambda_i%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\lambda_i \geq 0}' title='{\lambda_i \geq 0}' class='latex' /> because the matrix is positive semi-definite and:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%28%5CGamma%5E%7Bt%7D%20B%20%5CGamma%29_%7Bii%7D%20%3D%20e_i%5Et%20%5CGamma%5E%7Bt%7D%20B%20%5CGamma%20e_i%20%3D%20%28%5CGamma%20e_i%29%5Et%20B%20%28%5CGamma%20e_i%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle (\Gamma^{t} B \Gamma)_{ii} = e_i^t \Gamma^{t} B \Gamma e_i = (\Gamma e_i)^t B (\Gamma e_i)' title='\displaystyle (\Gamma^{t} B \Gamma)_{ii} = e_i^t \Gamma^{t} B \Gamma e_i = (\Gamma e_i)^t B (\Gamma e_i)' class='latex' /></p>
<img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' />
<p>Now, we are ready for the Duality Theorem:</p>
<blockquote><p><strong>Theorem 7 (SDP Duality)</strong> <em> The dual of the program <a href="#primal">1</a> is given by:</em></p>
<p><em><a name="dual"><br />
</a></em></p>
<p align="center"><em><a name="dual"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20%5Cmax%20d%5Et%20y%20%5Ctext%7B%20s.t.%20%7D%20%5C%5C%20%26%20%5Cqquad%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20y_1%20D_1%20%2B%20%5Chdots%20%2B%20y_k%20D_k%20%5Cpreceq%20C%20%5Cend%7Baligned%7D%20%5Cright.%20%5Cend%7Baligned%7D%20%5Cright.%20%5C%20%5C%20%5C%20%5C%20%5C%20%282%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \left. \begin{aligned} &amp; \max d^t y \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; y_1 D_1 + \hdots + y_k D_k \preceq C \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (2)' title='\displaystyle  \left. \begin{aligned} &amp; \max d^t y \text{ s.t. } \\ &amp; \qquad \left. \begin{aligned} &amp; y_1 D_1 + \hdots + y_k D_k \preceq C \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (2)' class='latex' /></a></em></p>
<p><em> </em></p>
<p><em>We call <a href="#primal">1</a> the primal problem and <a href="#dual">2</a> the dual. If <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is feasible for <a href="#primal">1</a> and <img src='http://s.wordpress.com/latex.php?latex=%7By%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y}' title='{y}' class='latex' /> is feasible for <a href="#dual">2</a>, then <img src='http://s.wordpress.com/latex.php?latex=%7Bd%5Et%20y%20%5Cleq%20C%20%5Cbullet%20X%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d^t y \leq C \bullet X}' title='{d^t y \leq C \bullet X}' class='latex' />. Besides, if the dual has a strict feasible solution (i.e. with <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20y_i%20D_i%20%5Cprec%20C%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i y_i D_i \prec C}' title='{\sum_i y_i D_i \prec C}' class='latex' />) then the dual optimum is attained and both have the the same optimal value. </em></p></blockquote>
<p>As we can see in the theorem above, the main difference between this duality theorem and the duality theorem of linear programming is this &#8220;stictly feasible&#8221; condition in the theorem. Let&#8217;s proceed to the proof. As in the the LP proof the outline is the following: first we prove weak duality (<img src='http://s.wordpress.com/latex.php?latex=%7Bd%5Et%20y%20%5Cleq%20C%20%5Cbullet%20X%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d^t y \leq C \bullet X}' title='{d^t y \leq C \bullet X}' class='latex' /> for feasible <img src='http://s.wordpress.com/latex.php?latex=%7BX%2Cy%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X,y}' title='{X,y}' class='latex' />) and then we consider the dual restrictions with <img src='http://s.wordpress.com/latex.php?latex=%7Bd%5Et%20y%20%3E%20v%5E%2A_%7Bdual%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{d^t y &gt; v^*_{dual}}' title='{d^t y &gt; v^*_{dual}}' class='latex' /> and we apply Farkas&#8217; Lemma:</p>
<p><em>Proof:</em> <strong>Weak duality:</strong> If <img src='http://s.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is feasible for <a href="#primal">1</a> and <img src='http://s.wordpress.com/latex.php?latex=%7By%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y}' title='{y}' class='latex' /> is feasible for <a href="#dual">2</a> then:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20C%5Cbullet%20X%20%5Cgeq%20%5Cleft%28%20%5Csum_i%20y_i%20D_i%20%5Cright%29%20%5Cbullet%20X%20%3D%20%5Csum_i%20y_i%20D_i%5Cbullet%20X%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle C\bullet X \geq \left( \sum_i y_i D_i \right) \bullet X = \sum_i y_i D_i\bullet X ' title='\displaystyle C\bullet X \geq \left( \sum_i y_i D_i \right) \bullet X = \sum_i y_i D_i\bullet X ' class='latex' /></p>
<p>because <img src='http://s.wordpress.com/latex.php?latex=%7BC%20-%20%5Csum_i%20y_i%20D_i%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C - \sum_i y_i D_i \succeq 0}' title='{C - \sum_i y_i D_i \succeq 0}' class='latex' /> and the <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cbullet%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\bullet}' title='{\bullet}' class='latex' /> of two positive semi-definite matrices is non-negative. Now:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20d%5Et%20y%20%3D%20%5Csum_i%20y_i%20d_i%20%3D%20%5Csum_i%20y_i%20D_i%5Cbullet%20X%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle d^t y = \sum_i y_i d_i = \sum_i y_i D_i\bullet X ' title='\displaystyle d^t y = \sum_i y_i d_i = \sum_i y_i D_i\bullet X ' class='latex' /></p>
<p>by <a href="#primal">1</a>. Therefore, <img src='http://s.wordpress.com/latex.php?latex=%7BC%5Cbullet%20X%20%5Cgeq%20d%5Et%20y%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C\bullet X \geq d^t y}' title='{C\bullet X \geq d^t y}' class='latex' /></p>
<p><strong>Strong duality:</strong> Clearly, the following problem has no solution:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%20y_1%20D_1%20%2B%20%5Chdots%20%2B%20y_k%20D_k%20%5Cprec%20C&#038;bg=T&#038;fg=000000&#038;s=0' alt=' y_1 D_1 + \hdots + y_k D_k \prec C' title=' y_1 D_1 + \hdots + y_k D_k \prec C' class='latex' /></p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=y%5Et%20d%20%3E%20v%5E%2A_%7Bdual%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='y^t d &gt; v^*_{dual}' title='y^t d &gt; v^*_{dual}' class='latex' /></p>
<p>where <img src='http://s.wordpress.com/latex.php?latex=%7Bv%5E%2A_%7Bdual%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v^*_{dual}}' title='{v^*_{dual}}' class='latex' /> is the optimal value of <a href="#dual">2</a>. Now we apply the inomogeneous Farkas&#8217; Lemma to:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20y_1%20%5Cbegin%7Bbmatrix%7D%20-D_1%20%26%20%5C%5C%20%26%20d_1%20%5Cend%7Bbmatrix%7D%20%2B%20%5Chdots%20%2B%20y_k%20%5Cbegin%7Bbmatrix%7D%20-D_k%20%26%20%5C%5C%20%26%20d_k%20%5Cend%7Bbmatrix%7D%20%5Csucc%20%5Cbegin%7Bbmatrix%7D%20-C%20%26%20%5C%5C%20%26%20v%5E%2A_%7Bdual%7D%20%5Cend%7Bbmatrix%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  y_1 \begin{bmatrix} -D_1 &amp; \\ &amp; d_1 \end{bmatrix} + \hdots + y_k \begin{bmatrix} -D_k &amp; \\ &amp; d_k \end{bmatrix} \succ \begin{bmatrix} -C &amp; \\ &amp; v^*_{dual} \end{bmatrix}' title='\displaystyle  y_1 \begin{bmatrix} -D_1 &amp; \\ &amp; d_1 \end{bmatrix} + \hdots + y_k \begin{bmatrix} -D_k &amp; \\ &amp; d_k \end{bmatrix} \succ \begin{bmatrix} -C &amp; \\ &amp; v^*_{dual} \end{bmatrix}' class='latex' /></p>
<p>and we get a matrix <img src='http://s.wordpress.com/latex.php?latex=%7BX%27%20%3D%20%5Cbegin%7Bbmatrix%7D%20X%20%26%20x%5Et%20%5C%5C%20x%20%26%20x_%7Bn%2B1%7D%20%5Cend%7Bbmatrix%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X&#039; = \begin{bmatrix} X &amp; x^t \\ x &amp; x_{n+1} \end{bmatrix}}' title='{X&#039; = \begin{bmatrix} X &amp; x^t \\ x &amp; x_{n+1} \end{bmatrix}}' class='latex' /> such that:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cbegin%7Bbmatrix%7D%20-D_i%20%26%20%5C%5C%20%26%20d_i%20%5Cend%7Bbmatrix%7D%20%5Cbullet%20X%27%20%3D%200%2C%20%5Cbegin%7Bbmatrix%7D%20-C%20%26%20%5C%5C%20%26%20v%5E%2A_%7Bdual%7D%20%5Cend%7Bbmatrix%7D%20%5Cbullet%20X%27%20%3D%200%2C%20X%27%20%5Csucceq%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{bmatrix} -D_i &amp; \\ &amp; d_i \end{bmatrix} \bullet X&#039; = 0, \begin{bmatrix} -C &amp; \\ &amp; v^*_{dual} \end{bmatrix} \bullet X&#039; = 0, X&#039; \succeq 0' title='\displaystyle  \begin{bmatrix} -D_i &amp; \\ &amp; d_i \end{bmatrix} \bullet X&#039; = 0, \begin{bmatrix} -C &amp; \\ &amp; v^*_{dual} \end{bmatrix} \bullet X&#039; = 0, X&#039; \succeq 0' class='latex' /></p>
<p>which means: <img src='http://s.wordpress.com/latex.php?latex=%7BD_i%20%5Cbullet%20X%20%3D%20d_i%20x_%7Bn%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{D_i \bullet X = d_i x_{n+1}}' title='{D_i \bullet X = d_i x_{n+1}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7BC%20%5Cbullet%20X%20%5Cleq%20v%5E%2A_%7Bdual%7D%20x_%7Bn%2B1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{C \bullet X \leq v^*_{dual} x_{n+1}}' title='{C \bullet X \leq v^*_{dual} x_{n+1}}' class='latex' />. We know that <img src='http://s.wordpress.com/latex.php?latex=%7BX%20%5Csucceq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{X \succeq 0}' title='{X \succeq 0}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bn%2B1%7D%20%5Cgeq%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{n+1} \geq 0}' title='{x_{n+1} \geq 0}' class='latex' />. We just need to prove that <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bn%2B1%7D%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{n+1} &gt; 0}' title='{x_{n+1} &gt; 0}' class='latex' />. We already know it is not negative, so we need to prove it is not zero and here we will need the hypothesis that the dual has a strict solution. If <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bn%2B1%7D%20%3D%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{n+1} = 0}' title='{x_{n+1} = 0}' class='latex' /> then there would be solution to:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20X%5Cbullet%20D_i%20%3D%200%2C%20X%5Cbullet%20C%20%3D%200%2C%20X%20%5Csucceq%200&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle X\bullet D_i = 0, X\bullet C = 0, X \succeq 0' title='\displaystyle X\bullet D_i = 0, X\bullet C = 0, X \succeq 0' class='latex' /></p>
<p>and by Farkas&#8217; Lemma, the problem:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Csum_i%20y_i%20D_i%20%5Cprec%20C&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \sum_i y_i D_i \prec C' title='\displaystyle \sum_i y_i D_i \prec C' class='latex' /></p>
<p>would have no solution and we know it does have a solution. Therefore, <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bn%2B1%7D%20%3E%200%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{n+1} &gt; 0}' title='{x_{n+1} &gt; 0}' class='latex' />.</p>
<p>Everything else comes from combining the weak and strong version described above.</p>
<img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' />
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/128/feed</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Mechanism Design</title>
		<link>http://www.bigredbits.com/archives/120</link>
		<comments>http://www.bigredbits.com/archives/120#comments</comments>
		<pubDate>Wed, 05 Aug 2009 16:36:39 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[mechanism design]]></category>
		<category><![CDATA[profit maximization]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=120</guid>
		<description><![CDATA[In the early stage of algorithm design, the goal was to solve exacly and efficiently the combinatorial problems. Cook&#8217;s Theorem and the NP-completeness theory showed us that some problems are inehently hard. How can we solve this problem? By trying to find approximated solutions or by trying to look at restricted instances of those problems [...]]]></description>
			<content:encoded><![CDATA[<p>In the early stage of algorithm design, the goal was to solve exacly and efficiently the combinatorial problems. <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Cook%27s_theorem">Cook&#8217;s Theorem</a> and the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/NP-hard">NP-completeness theory</a> showed us that some problems are inehently hard. How can we solve this problem? By trying to find approximated solutions or by trying to look at restricted instances of those problems that are tractable. It turns out that the lack of computational power (NP-hardness in some sense) is not the only obstracle to solving problems optimally. <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Online_algorithm">Online algorithms</a> propose a model where the difficulty is due to lack of information. The algorithm must take some decisions before seeing the entire input. Again, in most of the cases it is hopeless to get an optimal solution (the solution we could get if we knew the input ahead of time) and our goal is not to be far from it. Other example of natural limitation are <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Streaming_algorithms">streaming algorithms</a>, where you should solve a certain problem with limited memory. Imagine for example one algorithm that runs in a router, that received gigabits each second. It is impossible to store all the information, and yet, we want to process this very very large input and give an anwer at some point.</p>
<p>An additional model is inspired in economy: there are a bunch of agents which have part of the input to the algorithm and they are interested in the solution, say, they have a certain value for each final outcome. Now, they will release their part of the input. The may lie about it to manipulate the final result according to their own interest. How to prevent that? We need to augment the algorithm with some economic incentives to make sure they don&#8217;t harm the final solution too much. We need to care about two things now: we still want a solution not to far from the optimal, but we also need to provide incentives. Such algorithm with incentives is called a mechanism and this represents one important field in Algorithmic Game Theory called Mechanism Design.</p>
<p>The simplest setting where this can occur is in a matching problem. Suppose there are <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> people and one item. I want to decide which person will get the item. Person <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> has a value of <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> of the item. Then I ask people their values and give the item to the person that has a higher value and this way I maximize the total cost of the matching. But wait, this is a not a mechanism yet. People don&#8217;t have incentives to play truthfully in this game. They may want to report higher valuations to get the item. The natural solution is to charge some payment <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> from whoever gets the item. But how much to charge? The solution is the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Vickrey_auction">Vickrey auction</a>, where we charge the second highest bid to the winner. More about Vickrey auction and some other examples of truthful mechanisms can be found in the <a class="snap_noshots" href="http://www.cambridge.org/us/catalogue/catalogue.asp?isbn=0521872820">Algorithmic Game Theory</a> book, which can be found in <a class="snap_noshots" href="http://theory.stanford.edu/~tim/">Tim Roughgarden&#8217;s website</a>.</p>
<p>One of the most famous early papers in this are is the <a class="snap_noshots" href="http://www.eecs.harvard.edu/~parkes/cs286r/spring07/papers/myerson.pdf">&#8220;Optimal Design Auction&#8221;</a> by Myerson where he discusses auctions mechanisms that maximize the profit for the seller. After reading some papers in this area, I thought I should return and read the original paper by Myerson, and it was indeed a good idea, since the paper formalizes (or makes very explicit) a lot of central concept in this area. I wanted to blog about the two main tools in mechanism design discussed in the paper: the revelation principle and the revenue equivalence theorem.</p>
<p><strong> Revelation Principle </strong></p>
<p>We could imagine thousands of possible ways of providing economic incentives. It seems a difficult task to look at all possible mechanisms and choose the best one. The good thing is: we don&#8217;t need to look at all kinds of mechanisms: we can look only at <em>truthful revelation mechanisms</em>. Let&#8217;s formalize that: consider <img src='http://s.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> bidders and each has value <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> for getting what they want (suppose they are single parameter agents, i.e., they get <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> if they get what they want or <img src='http://s.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> if they don&#8217;t). We can represent the final outcome as a vector <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%5Cin%20%5B0%2C1%5D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x \in [0,1]^n}' title='{x \in [0,1]^n}' class='latex' />. Here we consider also randomizes mechanisms, where the final outcome can be, for example, allocate the item to bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> with probability <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i}' title='{x_i}' class='latex' />. This vector has some restrictions imposed by the structure of the problem, i.e., if there is only one item, then we must have <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20x_i%20%5Cleq%201%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i x_i \leq 1}' title='{\sum_i x_i \leq 1}' class='latex' />. In the end of the mechanism, each player will pay some amount to the mechanism, say <img src='http://s.wordpress.com/latex.php?latex=%7Bp_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_i}' title='{p_i}' class='latex' />. Then at the end, player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> has utility: <img src='http://s.wordpress.com/latex.php?latex=%7Bu_i%28x%2Cp%29%20%3D%20x_i%20v_i%20-%20p_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{u_i(x,p) = x_i v_i - p_i}' title='{u_i(x,p) = x_i v_i - p_i}' class='latex' /> and the total profit of the auctioneer is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_i%20p_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_i p_i}' title='{\sum_i p_i}' class='latex' />.</p>
<p>Let&#8217;s describe a general mechanism: in this mechanism each player has a set of strategies, say <img src='http://s.wordpress.com/latex.php?latex=%7B%5CTheta_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Theta_i}' title='{\Theta_i}' class='latex' />. Each player chooses one strategy <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta_i%20%5Cin%20%5CTheta_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta_i \in \Theta_i}' title='{\theta_i \in \Theta_i}' class='latex' /> and the mechanism chooses one allocation and one payment function based on the strategies of the players. <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%3D%20x%28%5Ctheta_1%2C%20%5Chdots%2C%5Ctheta_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x = x(\theta_1, \hdots,\theta_n)}' title='{x = x(\theta_1, \hdots,\theta_n)}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%3D%20p%28%5Ctheta_1%2C%20%5Chdots%2C%5Ctheta_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p = p(\theta_1, \hdots,\theta_n)}' title='{p = p(\theta_1, \hdots,\theta_n)}' class='latex' />. Notice it includes even complicated multi-round strategies: in this case, the strategy space <img src='http://s.wordpress.com/latex.php?latex=%7B%5CTheta_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\Theta_i}' title='{\Theta_i}' class='latex' /> would be a very complicated description of what a player would do for each outcome of each round.</p>
<p>Let <img src='http://s.wordpress.com/latex.php?latex=%7BV_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V_i}' title='{V_i}' class='latex' /> be the set of the possible valuations a player would have. So, given the mechanism, each player would pick a function <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta_i%3A%20V_i%20%5Crightarrow%20%5CTheta_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta_i: V_i \rightarrow \Theta_i}' title='{\theta_i: V_i \rightarrow \Theta_i}' class='latex' />, i.e., <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta_i%28v_i%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta_i(v_i)}' title='{\theta_i(v_i)}' class='latex' /> is the strategy he would pick if he observed his value was <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' />. This mechanism has an equilibrium if there is a set of such functions that are in an equilibrium, i.e., no one would change his function and be better off. If those exist, we can implement a this as a direct revelation mechanism. A direct revelation mechanism is a mechanism where the strategies are simply to reveal a value in <img src='http://s.wordpress.com/latex.php?latex=%7BV_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{V_i}' title='{V_i}' class='latex' />. So, I just ask each player his valuation.</p>
<p>Given a complicated mechanism <img src='http://s.wordpress.com/latex.php?latex=%7Bx%3A%20%5CTheta%20%5Crightarrow%20%5B0%2C1%5D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x: \Theta \rightarrow [0,1]^n}' title='{x: \Theta \rightarrow [0,1]^n}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bx%3A%20%5CTheta%20%5Crightarrow%20%7B%5Cmathbb%20R%7D_%2B%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x: \Theta \rightarrow {\mathbb R}_+^n}' title='{x: \Theta \rightarrow {\mathbb R}_+^n}' class='latex' /> and equilibrium strategies <img src='http://s.wordpress.com/latex.php?latex=%7B%5Ctheta_i%3A%20V_i%20%5Crightarrow%20%5CTheta_i%20%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\theta_i: V_i \rightarrow \Theta_i }' title='{\theta_i: V_i \rightarrow \Theta_i }' class='latex' />, I can implement this as a direct revelation mechanism just by taking:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20x%27%28v_1%2C%20%5Chdots%2C%20v_n%29%20%3D%20x%28%5Ctheta_1%28v_1%29%2C%20%5Chdots%2C%20%5Ctheta_n%28v_n%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle x&#039;(v_1, \hdots, v_n) = x(\theta_1(v_1), \hdots, \theta_n(v_n))' title='\displaystyle x&#039;(v_1, \hdots, v_n) = x(\theta_1(v_1), \hdots, \theta_n(v_n))' class='latex' /></p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p%27%28v_1%2C%20%5Chdots%2C%20v_n%29%20%3D%20p%28%5Ctheta_1%28v_1%29%2C%20%5Chdots%2C%20%5Ctheta_n%28v_n%29%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p&#039;(v_1, \hdots, v_n) = p(\theta_1(v_1), \hdots, \theta_n(v_n))' title='\displaystyle p&#039;(v_1, \hdots, v_n) = p(\theta_1(v_1), \hdots, \theta_n(v_n))' class='latex' /></p>
<p>It is not hard to see that if the mechanism is <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%27%2C%20v%27%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x&#039;, v&#039;)}' title='{(x&#039;, v&#039;)}' class='latex' />, for the players the best thing to do is to reveal directly their true valuation, eliminating all the complicated steps in between. One practical example of a direct revelation mechanism is the Vickrey auction. Consider the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/English_auction">English auction</a>, which is the famous auction that happens in most of movied depicting auctions (or <a class="snap_noshots" href="http://www.sothebys.com/">Sotheby&#8217;s</a> for a clear example): there is one item and people keep raising their bids untill everyone else drops and the last person still biddings gets the item. The clear strategy in those auctions is to raise your bid as long as the current value is below your valuation <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> and there still other bidders that haven&#8217;t dropped the auction. Clearly the person with highest value <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> will get the item. Let <img src='http://s.wordpress.com/latex.php?latex=%7Bv_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_j}' title='{v_j}' class='latex' /> be the second highest value. It is not hard to see that all but one will drop the auction after <img src='http://s.wordpress.com/latex.php?latex=%7Bv_j%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_j}' title='{v_j}' class='latex' />, so the highest bidders is likely to pay <img src='http://s.wordpress.com/latex.php?latex=%7Bv_j%20%2B%20%5Cepsilon%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_j + \epsilon}' title='{v_j + \epsilon}' class='latex' />. This is exacly the Vickrey auction, where we just emulate the English as a direct revelation mechanism. There are of course other issues. The following quote I got from <a class="snap_noshots" href="http://www.cs.tau.ac.il/~fiat/stocauc.ps">this paper</a>:</p>
<blockquote><p>&#8220;God help us if we ever take the theater out of the auction business or anything else. It would be an awfully boring world.&#8221; (A. Alfred Taubman, Chairman, Sotheby&#8217;s Galleries)</p></blockquote>
<p>So, we can restrict our attention to mechanims in the form <img src='http://s.wordpress.com/latex.php?latex=%7Bx%20%3A%20V%20%5Crightarrow%20%5B0%2C1%5D%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x : V \rightarrow [0,1]^n}' title='{x : V \rightarrow [0,1]^n}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%3A%20V%20%5Crightarrow%20%7B%5Cmathbb%20R%7D_%2B%5En%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p : V \rightarrow {\mathbb R}_+^n}' title='{p : V \rightarrow {\mathbb R}_+^n}' class='latex' /> that are truthful, i.e., where the players have no incentives not to report their true valuation. We can characterize truthful auctions using the next theorem. Just a bit of notation before: let <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7B-i%7D%20%3D%20%28v_1%2C%20%5Chdots%2C%20v_%7Bi-1%7D%2C%20v_%7Bi%2B1%7D%2C%20%5Chdots%2C%20v_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{-i} = (v_1, \hdots, v_{i-1}, v_{i+1}, \hdots, v_n)}' title='{v_{-i} = (v_1, \hdots, v_{i-1}, v_{i+1}, \hdots, v_n)}' class='latex' />, <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7B-i%7D%20%3D%20%28x_1%2C%20%5Chdots%2C%20x_%7Bi-1%7D%2C%20x_%7Bi%2B1%7D%2C%20%5Chdots%2C%20x_n%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{-i} = (x_1, \hdots, x_{i-1}, x_{i+1}, \hdots, x_n)}' title='{x_{-i} = (x_1, \hdots, x_{i-1}, x_{i+1}, \hdots, x_n)}' class='latex' />, &#8230; and for <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' />, let <img src='http://s.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' /> be the joint probability distribution and <img src='http://s.wordpress.com/latex.php?latex=%7Bf_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_{-i}}' title='{f_{-i}}' class='latex' /> be the joint probability distribution over <img src='http://s.wordpress.com/latex.php?latex=%7Bv_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_{-i}}' title='{v_{-i}}' class='latex' />:</p>
<blockquote><p><strong>Theorem 1</strong> <em> An auction is truthful if and only if, for all possible probability distributions over values given by <img src='http://s.wordpress.com/latex.php?latex=%7Bf_1%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_1}' title='{f_1}' class='latex' />, &#8230;, <img src='http://s.wordpress.com/latex.php?latex=%7Bf_n%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_n}' title='{f_n}' class='latex' /> we have </em></p>
<ol>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bx_i%28v_i%29%20%3D%20%5Cint%20x_i%28v_i%2C%20v_%7B-1%7D%29%20f_%7B-i%7D%28v_%7B-i%7D%29%20dv_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_i(v_i) = \int x_i(v_i, v_{-1}) f_{-i}(v_{-i}) dv_{-i}}' title='{x_i(v_i) = \int x_i(v_i, v_{-1}) f_{-i}(v_{-i}) dv_{-i}}' class='latex' /> is monotone non-decreasing </em></li>
<li><em> <img src='http://s.wordpress.com/latex.php?latex=%7Bp_i%28v_i%29%20%3D%20v_i%20x_i%28v_i%29%20-%20%5Cint_0%5E%7Bv_i%7D%20x_i%28z%29%20dz%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_i(v_i) = v_i x_i(v_i) - \int_0^{v_i} x_i(z) dz}' title='{p_i(v_i) = v_i x_i(v_i) - \int_0^{v_i} x_i(z) dz}' class='latex' /> where <img src='http://s.wordpress.com/latex.php?latex=%7Bp_i%28v_i%29%20%3D%20%5Cint%20p_i%28v_i%2C%20v_%7B-1%7D%29%20f_%7B-i%7D%28v_%7B-i%7D%29%20dv_%7B-i%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_i(v_i) = \int p_i(v_i, v_{-1}) f_{-i}(v_{-i}) dv_{-i}}' title='{p_i(v_i) = \int p_i(v_i, v_{-1}) f_{-i}(v_{-i}) dv_{-i}}' class='latex' /> </em></li>
</ol>
<p><em> </em></p></blockquote>
<p><strong> Revenue Equivalence </strong></p>
<p>The second main tool to reason about mechanisms concerns the revenue of the mechanism: it is Myerson&#8217;s Revnue Equivalence Principle, which roughly says that the revenue under a truthful mechanism depends only on the allocation and not on the payment function. This is somehow expected by the last theorem, since we showed that when a mechanism is truthtful, the payments are totally dependent on <img src='http://s.wordpress.com/latex.php?latex=%7B%28x%2Cf%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{(x,f)}' title='{(x,f)}' class='latex' />.</p>
<p>The profit of the auctioneer is given by <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cint%20%5Csum_%7Bi%3D0%7D%5En%20p_i%28v%29%20f%28v%29%20dv%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\int \sum_{i=0}^n p_i(v) f(v) dv}' title='{\int \sum_{i=0}^n p_i(v) f(v) dv}' class='latex' />. We can substitute <img src='http://s.wordpress.com/latex.php?latex=%7Bp_i%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p_i(v)}' title='{p_i(v)}' class='latex' /> by the payment formula in last theorem obtaining:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%5Ctext%7Bprofit%7D%20%26%3D%20%5Csum_%7Bi%3D0%7D%5En%20%5Cint_%7BV_i%7D%20p_i%28v_i%29%20f_i%28v_i%29%20dv_i%20%3D%20%5C%5C%26%3D%20%5Csum_%7Bi%3D0%7D%5En%20%5Cint_%7BV_i%7D%20%5Cleft%28%20v_i%20x_i%28v_i%29%20-%20%5Cint_0%5E%7Bv_i%7D%20x_i%28z%29%20dz%20%5Cright%29%20f_i%28v_i%29%20dv_i%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} \text{profit} &amp;= \sum_{i=0}^n \int_{V_i} p_i(v_i) f_i(v_i) dv_i = \\&amp;= \sum_{i=0}^n \int_{V_i} \left( v_i x_i(v_i) - \int_0^{v_i} x_i(z) dz \right) f_i(v_i) dv_i \end{aligned}' title='\displaystyle \begin{aligned} \text{profit} &amp;= \sum_{i=0}^n \int_{V_i} p_i(v_i) f_i(v_i) dv_i = \\&amp;= \sum_{i=0}^n \int_{V_i} \left( v_i x_i(v_i) - \int_0^{v_i} x_i(z) dz \right) f_i(v_i) dv_i \end{aligned}' class='latex' /></p>
<p>We can invert the order of the integration in the second part, getting:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cbegin%7Baligned%7D%20%5Cint_%7BV_i%7D%20%5Cint_0%5E%7Bv_i%7D%20x_i%28z%29%20f_i%28v_i%29%20dz%20dv_i%20%26%3D%20%5Cint_%7BV_i%7D%20%5Cint_%7Bv_i%7D%5E%7B%5Cinfty%7D%20x_i%28z%29%20f_i%28v_i%29%20dv_i%20dz%20%3D%20%5C%5C%20%26%3D%20%5Cint_%7BV_i%7D%20x_i%28z%29%20%281%20-%20F_i%28z%29%29%20dz%20%5C%5C%20%5Cend%7Baligned%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \begin{aligned} \int_{V_i} \int_0^{v_i} x_i(z) f_i(v_i) dz dv_i &amp;= \int_{V_i} \int_{v_i}^{\infty} x_i(z) f_i(v_i) dv_i dz = \\ &amp;= \int_{V_i} x_i(z) (1 - F_i(z)) dz \\ \end{aligned}' title='\displaystyle \begin{aligned} \int_{V_i} \int_0^{v_i} x_i(z) f_i(v_i) dz dv_i &amp;= \int_{V_i} \int_{v_i}^{\infty} x_i(z) f_i(v_i) dv_i dz = \\ &amp;= \int_{V_i} x_i(z) (1 - F_i(z)) dz \\ \end{aligned}' class='latex' /></p>
<p>So, we can rewrite profit as:</p>
<p align="center"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Ctext%7Bprofit%7D%20%3D%20%5Csum_%7Bi%3D0%7D%5En%20%5Cint_V%20%5Cleft%5B%201%20-%20%5Cfrac%7B1%20-%20F_i%28v_i%29%7D%7Bf_i%28v_i%29%7D%20%5Cright%5D%20x_i%28v_i%29%20f%28v%29%20dv%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \text{profit} = \sum_{i=0}^n \int_V \left[ 1 - \frac{1 - F_i(v_i)}{f_i(v_i)} \right] x_i(v_i) f(v) dv ' title='\displaystyle  \text{profit} = \sum_{i=0}^n \int_V \left[ 1 - \frac{1 - F_i(v_i)}{f_i(v_i)} \right] x_i(v_i) f(v) dv ' class='latex' /></p>
<p>And that proves the following result:</p>
<blockquote><p><strong>Theorem 2</strong> <em> The seller&#8217;s expected utility from a truthful direct revelation mechanism depends only on the assignment function <img src='http://s.wordpress.com/latex.php?latex=%7Bx%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x(v)}' title='{x(v)}' class='latex' />. </em></p></blockquote>
<p>Now, to implement a revenue-maximizing mechanism we just need to find <img src='http://s.wordpress.com/latex.php?latex=%7Bx%28v%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x(v)}' title='{x(v)}' class='latex' /> that optimize the profit functional above still meeting the truthfulness constraints in the first theorem. This is discussed by Myerson in his paper. There is just one issue here: our analysis is dependent on the probabilities <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i}' title='{f_i}' class='latex' />. There are various approaches to that:</p>
<ul>
<li> Assume that the values of the bidders are drawn from distribution and <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i}' title='{f_i}' class='latex' /> and given to them. The distributions are public knowledge but the realization of <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i}' title='{f_i}' class='latex' /> is just known to bidder <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />.</li>
<li> Bidders have fixed values <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> (i.e., <img src='http://s.wordpress.com/latex.php?latex=%7Bf_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_i}' title='{f_i}' class='latex' /> are the <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Dirac_distribution">Dirac distribution</a> concentrated on <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' />) and in this case, the revenue maximizing problem becomes trivial. It is still an interesting problem in the point of view of truthfulness. But in this case, we should assume that <img src='http://s.wordpress.com/latex.php?latex=%7Bv_i%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{v_i}' title='{v_i}' class='latex' /> is fixed but just player <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> knows its value.</li>
<li> The distributions exist but they are unknown by the mechanism designer. In this case, he wants to design a mechanism that provided good profit guaranteed against all possible distributions. The profit guarantees need to be established accourding to some benchmark. This is called <em>prior-free mechanism design</em>.</li>
</ul>
<p>More references about Mechanism Design can be found in these <a class="snap_noshots" href="http://en.wikipedia.org/wiki/Dirac_distribution">lectures by Jason Harline</a>, in the <a class="snap_noshots" href="http://www.eecs.harvard.edu/~parkes/cs286r/spring07/papers/myerson.pdf">original Myerson paper</a> or in the <a class="snap_noshots" href="http://www.cambridge.org/us/catalogue/catalogue.asp?isbn=0521872820">Algorithmic Game Theory</a> book.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/120/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Consistent Labeling</title>
		<link>http://www.bigredbits.com/archives/109</link>
		<comments>http://www.bigredbits.com/archives/109#comments</comments>
		<pubDate>Tue, 04 Aug 2009 04:14:48 +0000</pubDate>
		<dc:creator>renatoppl</dc:creator>
				<category><![CDATA[theory]]></category>
		<category><![CDATA[approximation algorithms]]></category>
		<category><![CDATA[linear programming]]></category>
		<category><![CDATA[metric embeddings]]></category>

		<guid isPermaLink="false">http://www.bigredbits.com/?p=109</guid>
		<description><![CDATA[This week, me and Igor are presenting the paper &#8220;Metric clustering via consistent labeling&#8221; by Krauthgamer and Roughgarden in our Algorithms Reading Group. To prepare for the presentation, I thought that writting a blog post about it was a nice idea. Consistent Labeling is a framework that allows us to represent a variety of metric [...]]]></description>
			<content:encoded><![CDATA[<p>This week, me and Igor are presenting the paper <a class="snap_noshots" href="http://portal.acm.org/citation.cfm?id=1347171">&#8220;Metric clustering via consistent labeling&#8221;</a> by Krauthgamer and Roughgarden in our Algorithms Reading Group. To prepare for the presentation, I thought that writting a blog post about it was a nice idea. Consistent Labeling is a framework that allows us to represent a variety of metric problems, as computing a separating decomposition of a metric space, a padded decomposition (both which are the main ingredient of <a class="snap_noshots" href="http://www.cs.berkeley.edu/~satishr/logn.ps">embedding metric spaces into dominating trees</a>), sparse cover, metric triangulation and so on. Here, I&#8217;ll define the Consistent Labeling Problem, formulate it as an Integer Program and show how we can get an approximation algorithm to it by rounding its linear relaxation.</p>
<p>First, consider a base set <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. We want to attribute labels in <img src='http://s.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> to the elements of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> respecting some constraints. First, each element <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \in A}' title='{a \in A}' class='latex' /> has associated with it a subset <img src='http://s.wordpress.com/latex.php?latex=%7BL_a%20%5Csubseteq%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{L_a \subseteq L}' title='{L_a \subseteq L}' class='latex' /> of labels it can be assigned. Each element can receive at most <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> labels. Second, there is a collection <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\mathcal{C}}' title='{\mathcal{C}}' class='latex' /> of subsets of <img src='http://s.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />, so that for each <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{C}}' title='{S \in \mathcal{C}}' class='latex' />, there must be one label <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in L}' title='{i \in L}' class='latex' /> that is assigned to all elements in <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' />. For each element in <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{C}}' title='{S \in \mathcal{C}}' class='latex' />, there is a penalty <img src='http://s.wordpress.com/latex.php?latex=%7Bw_S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{w_S}' title='{w_S}' class='latex' /> to violate this constraint.</p>
<p>Our goal is to find a probability distribution over labelings that minimizes the total penalty <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D%20w_S%20Pr%5BS%20%5Ctext%7B%20is%20violated%7D%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{S \in \mathcal{C}} w_S Pr[S \text{ is violated}]}' title='{\sum_{S \in \mathcal{C}} w_S Pr[S \text{ is violated}]}' class='latex' />. Let&#8217;s formulate this is an integer program. For that, the first thing we need are decision variables: let <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bai%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{ai}}' title='{x_{ai}}' class='latex' /> be a <img src='http://s.wordpress.com/latex.php?latex=%7B%7B0%2C1%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{{0,1}}' title='{{0,1}}' class='latex' />-variable indicating if label <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in L}' title='{i \in L}' class='latex' /> is assigned to element <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cin%20A%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \in A}' title='{a \in A}' class='latex' />. Let the variable <img src='http://s.wordpress.com/latex.php?latex=%7By_%7BiS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{y_{iS}}' title='{y_{iS}}' class='latex' /> mean that the label <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> was assigned to all elements in <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> and let <img src='http://s.wordpress.com/latex.php?latex=%7Bz_S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{z_S}' title='{z_S}' class='latex' /> mean that set <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is consistently labeled. A linear programming formulation can be therefore expressed as:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%20%5Cleft.%20%5Cbegin%7Baligned%7D%20%26%20%5Cmin%20%5Csum_%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D%20w_S%20z_S%20%5Ctext%7B%20s.t.%20%7D%20%5C%5C%20%26%20%5Cqquad%20%5Cleft%5Clbrace%20%5Cbegin%7Baligned%7D%20%26%20%5Csum_%7Bi%20%5Cin%20L%7D%20x_%7Bia%7D%20%5Cleq%20k%20%26%20%5Cforall%20a%20%5Cin%20A%20%5C%5C%20%26%20y_%7BiS%7D%20%5Cleq%20x_%7Bia%7D%20%26%20%5Cforall%20S%20%5Cin%20%5Cmathcal%7BC%7D%2C%20a%20%5Cin%20S%2C%20i%20%5Cin%20L%5C%5C%20%26%20z_S%20%5Cleq%20%5Csum_%7Bi%20%5Cin%20L%7D%20y_%7BiS%7D%20%26%20%5Cforall%20S%20%5Cin%20%5Cmathcal%7BC%7D%20%5C%5C%20%26%20z_S%20%5Cleq%201%20%26%20%5Cforall%20S%20%5Cin%20%5Cmathcal%7BC%7D%20%5C%5C%20%26%20x_%7Bia%7D%20%3D%200%20%26%20%5Cforall%20i%20%5Cnotin%20L_a%20%5C%5C%20%5Cend%7Baligned%7D%20%5Cright.%20%5Cend%7Baligned%7D%20%5Cright.%20%5C%20%5C%20%5C%20%5C%20%5C%20%281%29&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle  \left. \begin{aligned} &amp; \min \sum_{S \in \mathcal{C}} w_S z_S \text{ s.t. } \\ &amp; \qquad \left\lbrace \begin{aligned} &amp; \sum_{i \in L} x_{ia} \leq k &amp; \forall a \in A \\ &amp; y_{iS} \leq x_{ia} &amp; \forall S \in \mathcal{C}, a \in S, i \in L\\ &amp; z_S \leq \sum_{i \in L} y_{iS} &amp; \forall S \in \mathcal{C} \\ &amp; z_S \leq 1 &amp; \forall S \in \mathcal{C} \\ &amp; x_{ia} = 0 &amp; \forall i \notin L_a \\ \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (1)' title='\displaystyle  \left. \begin{aligned} &amp; \min \sum_{S \in \mathcal{C}} w_S z_S \text{ s.t. } \\ &amp; \qquad \left\lbrace \begin{aligned} &amp; \sum_{i \in L} x_{ia} \leq k &amp; \forall a \in A \\ &amp; y_{iS} \leq x_{ia} &amp; \forall S \in \mathcal{C}, a \in S, i \in L\\ &amp; z_S \leq \sum_{i \in L} y_{iS} &amp; \forall S \in \mathcal{C} \\ &amp; z_S \leq 1 &amp; \forall S \in \mathcal{C} \\ &amp; x_{ia} = 0 &amp; \forall i \notin L_a \\ \end{aligned} \right. \end{aligned} \right. \ \ \ \ \ (1)' class='latex' /></p>
<p>It is not hard to see that if <img src='http://s.wordpress.com/latex.php?latex=%7Bx%2Cy%2Cx%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x,y,x}' title='{x,y,x}' class='latex' /> are <img src='http://s.wordpress.com/latex.php?latex=%7B%5C%7B0%2C1%5C%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\{0,1\}}' title='{\{0,1\}}' class='latex' />-variables then the formulation corresponds to the original problem. Now, let&#8217;s relax it to a Linear Program and interpret those as probabilities. The rounding procedure we use is a generalization of the one in <a class="snap_noshots" href="http://www.cs.cornell.edu/home/kleinber/focs99-mrf.ps">&#8220;Approximation algorithms for classification problems&#8221;</a> of Kleinberg and Tardos: until every object has <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> labels, repeat the following procedure: pick <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Csim%20%5Ctext%7BUniform%7D%28L%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \sim \text{Uniform}(L)}' title='{i \sim \text{Uniform}(L)}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bt%20%5Csim%20%5Ctext%7BUniform%7D%28%5B0%2C1%5D%29%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t \sim \text{Uniform}([0,1])}' title='{t \sim \text{Uniform}([0,1])}' class='latex' /> and assign label <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> to all objects with <img src='http://s.wordpress.com/latex.php?latex=%7Bx_%7Bai%7D%20%3E%20t%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{x_{ai} &gt; t}' title='{x_{ai} &gt; t}' class='latex' />. In the end, pick the <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> first labels assigned to each object.</p>
<p>What we just described is a procedure that, given a solution to the relaxed LP, produces a randomized labeling of the objects. Now, we need to prove that the solution produced by this randomized labeling is good in expectation, that is, that <img src='http://s.wordpress.com/latex.php?latex=%7B%5Csum_%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D%20w_S%20Pr%5BS%20%5Ctext%7B%20is%20violated%7D%5D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\sum_{S \in \mathcal{C}} w_S Pr[S \text{ is violated}]}' title='{\sum_{S \in \mathcal{C}} w_S Pr[S \text{ is violated}]}' class='latex' /> is good compared to the optimal single deterministic assignment. The authors prove that they are within a factor of <img src='http://s.wordpress.com/latex.php?latex=%7B2%20f_%7Bmax%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2 f_{max}}' title='{2 f_{max}}' class='latex' />, where <img src='http://s.wordpress.com/latex.php?latex=%7Bf_%7Bmax%7D%20%3D%20%5Cmax_%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D%20%5Cvert%20S%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{f_{max} = \max_{S \in \mathcal{C}} \vert S \vert}' title='{f_{max} = \max_{S \in \mathcal{C}} \vert S \vert}' class='latex' />.</p>
<blockquote><p><strong>Theorem 1</strong> <em> For each <img src='http://s.wordpress.com/latex.php?latex=%7BS%20%5Cin%20%5Cmathcal%7BC%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S \in \mathcal{C}}' title='{S \in \mathcal{C}}' class='latex' />, the probability that <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is consistently labeled is lower bounded by <img src='http://s.wordpress.com/latex.php?latex=%7B1%20-%20%5Cleft%28%201%20-%20%5Cfrac%7Bz_S%7D%7Bk%20%5Cvert%20S%20%5Cvert%7D%20%5Cright%29%5Ek%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{1 - \left( 1 - \frac{z_S}{k \vert S \vert} \right)^k}' title='{1 - \left( 1 - \frac{z_S}{k \vert S \vert} \right)^k}' class='latex' />. </em></p></blockquote>
<p><em>Proof:</em> Since we are trying to lower bound the probability of getting consistently labeled, we can consider just the probability of all the elements in <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> to be consistently labeled in the same iteration &#8211; let&#8217;s estimate this probability. This is:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20q%20%3D%20%5Csum_%7Bi%20%5Cin%20L%7D%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20Pr%5BS%20%5Ctext%7Bis%20all%20labeled%20with%20%7D%20i%20%5Ctext%7B%20in%20iteration%20%7D%20j%5D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle q = \sum_{i \in L} \frac{1}{\vert L \vert} Pr[S \text{is all labeled with } i \text{ in iteration } j]' title='\displaystyle q = \sum_{i \in L} \frac{1}{\vert L \vert} Pr[S \text{is all labeled with } i \text{ in iteration } j]' class='latex' /></p>
<p>If <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in L}' title='{i \in L}' class='latex' /> is chosen in iteration <img src='http://s.wordpress.com/latex.php?latex=%7Bj%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j}' title='{j}' class='latex' />, all elements are labeled with <img src='http://s.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' /> if <img src='http://s.wordpress.com/latex.php?latex=%7Bt%20%3C%20x_%7Bia%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t &lt; x_{ia}}' title='{t &lt; x_{ia}}' class='latex' /> for all <img src='http://s.wordpress.com/latex.php?latex=%7Ba%20%5Cin%20S%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{a \in S}' title='{a \in S}' class='latex' />, so the probability is <img src='http://s.wordpress.com/latex.php?latex=%7B%5Cmin_%7Ba%20%5Cin%20S%7D%20x_%7Bia%7D%20%5Cgeq%20y_%7BiS%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{\min_{a \in S} x_{ia} \geq y_{iS}}' title='{\min_{a \in S} x_{ia} \geq y_{iS}}' class='latex' />. So, we have:</p>
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20q%20%3D%20%5Csum_%7Bi%20%5Cin%20L%7D%5Csum_%7Bi%20%5Cin%20L%7D%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20y_%7BiS%7D%20%3D%20%5Cfrac%7Bz_S%7D%7B%5Cvert%20L%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle q = \sum_{i \in L}\sum_{i \in L} \frac{1}{\vert L \vert} y_{iS} = \frac{z_S}{\vert L \vert}' title='\displaystyle q = \sum_{i \in L}\sum_{i \in L} \frac{1}{\vert L \vert} y_{iS} = \frac{z_S}{\vert L \vert}' class='latex' />
<p>Now, let <img src='http://s.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> be the probability that set <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is hit by the labeling in phase <img src='http://s.wordpress.com/latex.php?latex=%7Bj%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{j}' title='{j}' class='latex' />. If label <img src='http://s.wordpress.com/latex.php?latex=%7Bi%20%5Cin%20L%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{i \in L}' title='{i \in L}' class='latex' /> is chosen, the set <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is hit by the labeling if <img src='http://s.wordpress.com/latex.php?latex=%7Bt%20%3C%20%5Cmax_%7Ba%20%5Cin%20S%7D%20x_%7Bia%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{t &lt; \max_{a \in S} x_{ia}}' title='{t &lt; \max_{a \in S} x_{ia}}' class='latex' />, therefore:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p%20%3D%20%5Csum_%7Bi%20%5Cin%20L%7D%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20%5Cmax_%7Ba%20%5Cin%20S%7D%20x_%7Bia%7D%20%5Cleq%20%5Csum_%7Bi%20%5Cin%20L%7D%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20%5Csum_%7Ba%20%5Cin%20S%7D%20x_%7Bia%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p = \sum_{i \in L} \frac{1}{\vert L \vert} \max_{a \in S} x_{ia} \leq \sum_{i \in L} \frac{1}{\vert L \vert} \sum_{a \in S} x_{ia}' title='\displaystyle p = \sum_{i \in L} \frac{1}{\vert L \vert} \max_{a \in S} x_{ia} \leq \sum_{i \in L} \frac{1}{\vert L \vert} \sum_{a \in S} x_{ia}' class='latex' /></p>
<p>inverting the order of the summation, we get:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20p%20%5Cleq%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20%5Csum_%7Ba%20%5Cin%20S%7D%20%5Csum_%7Bi%20%5Cin%20L%7D%20x_%7Bia%7D%20%5Cleq%20%5Cfrac%7B1%7D%7B%5Cvert%20L%20%5Cvert%7D%20%5Csum_%7Ba%20%5Cin%20S%7D%20k%20%3D%20%5Cfrac%7Bk%20%5Cvert%20S%20%5Cvert%7D%7B%5Cvert%20L%20%5Cvert%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle p \leq \frac{1}{\vert L \vert} \sum_{a \in S} \sum_{i \in L} x_{ia} \leq \frac{1}{\vert L \vert} \sum_{a \in S} k = \frac{k \vert S \vert}{\vert L \vert}' title='\displaystyle p \leq \frac{1}{\vert L \vert} \sum_{a \in S} \sum_{i \in L} x_{ia} \leq \frac{1}{\vert L \vert} \sum_{a \in S} k = \frac{k \vert S \vert}{\vert L \vert}' class='latex' /></p>
<p>The probability that <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is consistently labeled is smaller greater than the probability that it is consistently labeled in the same iteration before the set is hit <img src='http://s.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> times. In one iteration three things may happen: either the set is not hit, or it is hit but it is not consistently labeled or it is consistently labeled. The figure below measures how many times the set is hit. The down arrows represent the event that the set <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is consistently labeled:</p>
<p><img class="aligncenter size-full wp-image-110" title="consist_lab" src="http://www.bigredbits.com/wp-content/uploads/2009/08/consist_lab.png" alt="consist_lab" width="420" height="369" /></p>
<p>A standard trick in this cases is to disregard the self-loops and normalize the probabilities. This way the probability that <img src='http://s.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is consistently labeled is:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cfrac%7Bq%7D%7Bp%7D%20%5Csum_%7Bj%3D0%7D%5E%7Bk-1%7D%20%5Cleft%28%20%5Cfrac%7Bp-q%7D%7Bp%7D%20%5Cright%29%5Ej%20%3D%20%5Cfrac%7Bq%7D%7Bp%7D%20%5Ccdot%20%5Cfrac%7B%201%20-%20%5Cleft%28%20%5Cfrac%7Bp-q%7D%7Bp%7D%20%5Cright%29%5Ek%20%7D%7B1%20-%20%5Cfrac%7Bp-q%7D%7Bp%7D%20%7D%20%3D%201%20-%20%5Cleft%28%201%20-%20%5Cfrac%7Bq%7D%7Bp%7D%20%5Cright%29%5Ek%20&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{q}{p} \sum_{j=0}^{k-1} \left( \frac{p-q}{p} \right)^j = \frac{q}{p} \cdot \frac{ 1 - \left( \frac{p-q}{p} \right)^k }{1 - \frac{p-q}{p} } = 1 - \left( 1 - \frac{q}{p} \right)^k ' title='\displaystyle \frac{q}{p} \sum_{j=0}^{k-1} \left( \frac{p-q}{p} \right)^j = \frac{q}{p} \cdot \frac{ 1 - \left( \frac{p-q}{p} \right)^k }{1 - \frac{p-q}{p} } = 1 - \left( 1 - \frac{q}{p} \right)^k ' class='latex' /></p>
<p>Now, we just use that <img src='http://s.wordpress.com/latex.php?latex=%7Bp%20%5Cleq%20%5Cfrac%7Bk%20%5Cvert%20S%20%5Cvert%7D%7B%5Cvert%20L%20%5Cvert%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{p \leq \frac{k \vert S \vert}{\vert L \vert}}' title='{p \leq \frac{k \vert S \vert}{\vert L \vert}}' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%7Bq%20%3D%20%5Cfrac%7Bz_S%7D%7B%5Cvert%20L%20%5Cvert%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{q = \frac{z_S}{\vert L \vert}}' title='{q = \frac{z_S}{\vert L \vert}}' class='latex' /> to obtain the desired result. <img src='http://s.wordpress.com/latex.php?latex=%5CBox&#038;bg=T&#038;fg=000000&#038;s=0' alt='\Box' title='\Box' class='latex' /></p>
<p>The approximation factor of <img src='http://s.wordpress.com/latex.php?latex=%7B2%20f_%7Bmax%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2 f_{max}}' title='{2 f_{max}}' class='latex' /> follows straight from the previous theorem by considering the following inequalities:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%20%5Cleft%28%201%20-%20%5Cfrac%7Ba%7D%7Bk%7D%5Cright%29%5Ek%20%5Cleq%20e%5E%7B-a%7D%20%5Cleq%201%20-%20%5Cfrac%7Ba%7D%7B2%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='\displaystyle \left( 1 - \frac{a}{k}\right)^k \leq e^{-a} \leq 1 - \frac{a}{2}' title='\displaystyle \left( 1 - \frac{a}{k}\right)^k \leq e^{-a} \leq 1 - \frac{a}{2}' class='latex' /></p>
<p>and noting that the LP solution is a lower bound of the optimal value</p>
<blockquote><p><strong>Theorem 2</strong> <em> The rounding procedure gives a randomized algorithm that, in expectation, achieves a <img src='http://s.wordpress.com/latex.php?latex=%7B2%20f_%7Bmax%7D%7D&#038;bg=T&#038;fg=000000&#038;s=0' alt='{2 f_{max}}' title='{2 f_{max}}' class='latex' /> approximation to the optimal consistent labeling. </em></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.bigredbits.com/archives/109/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
