Math – Quirky Quintet

July 31, 2022July 31, 2022

Putnam 2020 A2

For $k$ a non-negative integer, evaluate
\begin{align*}
\sum_{j=0}^k 2^{k-j} \binom{k +j}{j}.
\end{align*}

Apparently, I only know how to do induction anymore now but this problem is straightforward using it. We suppose the inductive hypothesis, and assume that for a fixed $k$ that
\begin{align*}
\sum_{j=0}^k 2^{k-j} \binom{k +j}{j} = 2^{2k}.
\end{align*}
We will proceed to show
\begin{align*}
\sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k + 1 +j}{j} = 2^{2k + 2}.
\end{align*}

By a simple binomial identity, we have
\begin{align*}
\sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k + 1 +j}{j} &= \sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k+j}{j} + \sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k +j}{j – 1}.
\end{align*}
Let $A$, $B$ be the first and second term respectively on the right hand side, then
\begin{align*}
A &= 2\sum_{j=0}^{k} 2^{k-j} \binom{k+j}{j} + \binom{2k + 1}{k+1} \\
&= 2^{2k + 1} + \binom{2k + 1}{k+1}.
\end{align*}
For the other term,
\begin{align*}
B &= \sum_{j=1}^{k + 1} 2^{k + 1-j} \binom{k +j}{j – 1} \\
&= \sum_{j=0}^{k} 2^{k-j} \binom{k +1 +j}{j} \\
&= \frac{1}{2} \sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k +1 +j}{j} – \frac{1}{2} \binom{2k + 2}{k + 1}.
\end{align*}
Thus, we rearranging, we have our result
\begin{align*}
\frac{1}{2}\sum_{j=0}^{k + 1} 2^{k + 1-j} \binom{k + 1 +j}{j} &= 2^{2k+1} + \binom{2k + 1}{k+1} – \frac{1}{2} \binom{2k + 2}{k + 1} \\
&= 2^{2k+1}.
\end{align*}

August 14, 2018August 16, 2018

Scaling Arguments

This is a pretty important concept in PDEs and its numerical approximations. Specifically, tt shows up in Bramble-Hilbert lemma, and domain decomposition analysis.
Most of this post is pretty much written right after reading Toselli and Widlund’s book, so there are a lot of resemblance.

Let $\Omega$ be a bounded domain in $\mathbb{R}^n$ which is ‘nice’ (say Lipschitz boundary) with radius $h$. Now let $u, v \in H^1(\Omega)$ such that
\begin{align*}
|v|_{H^1(\Omega)} \le C||u||_{H^1(\Omega)}
\end{align*}
and we wish to obtain the $h$ dependence from $C$.

What we do is to first consider a scaled domain $\hat \Omega$ which is just $\Omega$ scaled to be of radius 1, with the change of basis $x = h\hat x$.
If we find the corresponding inequality on $\hat \Omega$, then the constant $C$ will not depend on $h$.
Let $\hat v(\hat x) := v(h\hat x)$, then we note that $\hat \nabla \hat v(\hat x) = h\hat \nabla v(h\hat x)$ where $\hat \nabla $ is the gradient with respect to $\hat x$.
Then,
\begin{align*}
|v|^2_{H^1(\Omega)} &= \int_\Omega |\nabla v(x)|^2 \, dx \\
&= \int_{\hat \Omega} |\hat\nabla v(h \hat x)|^2 h^n \, d\hat x \\
&= \int_{\hat \Omega} |\hat\nabla \hat v(\hat x)|^2 h^{-2} h^n \, d\hat x = h^{n-2}|\hat v|_{H^1(\hat \Omega)}^2
\end{align*}

But for $L^2$ norm, there is no $h^2$ scaling, hence
\begin{align*}
||u||_{L^2(\Omega)}^2 &= \int_\Omega |u(x)|^2 \, dx \\
&= \int_{\hat \Omega} |u(h \hat x)|^2 h^n \, d\hat x = h^n ||\hat u||_{L^2(\hat \Omega)}^2.
\end{align*}
This is why derivatives mixing causes scaling issues.

June 4, 2018August 16, 2018

Putnam 2003 A2

Let $a_1, \ldots, a_n$ and $b_1, \ldots, b_n$ be non-negative real numbers. Show that $$(a_1\ldots a_n)^{1/n} + (b_1\ldots b_n)^{1/n} \le [(a_1 + b_1) \cdots (a_n + b_n)]^{1/n}.$$

Solution: we will use the generalized Holder’s inequality which states that
$$||f_1\ldots f_n ||_1 \le ||f_1||_{\lambda_1} \cdots ||f_n||_{\lambda_n}$$
for lambda weights $\lambda_1^{-1} + \cdots + \lambda_n^{-1} = 1$ all greater than 1.

Assuming this is true, let $f_i = (a_i^{1/n}, b_i^{1/n})$ and the norms be the discrete $l^p$ norm. This will give us $||f_1 \ldots f_n||_1 = (a_1\ldots a_n)^{1/n} + (b_1\ldots b_n)^{1/n}$ as everything is non-negative. The weight will be uniform $\lambda_i = n$, then the right hand side will be
$$||f_i||_{n} = (a_i + b_i)^{1/n}$$
and we have our inequality.

The sole remaining thing to prove is the generalized Holder’s inequality. We will assume the famous base case of the two element case. In the inductive case, we have
\begin{align*}
||f_1\cdots f_{n+1}||_1 &\le ||f_1 \cdots f_n||_{\lambda_{n+1}/(\lambda_{n+1} – 1)} ||f_{n+1}||_{\lambda_{n+1}} \\
&= ||(f_1 \cdots f_n)^{\lambda_{n+1}/(\lambda_{n+1} – 1)}||_1^{(\lambda_{n+1} – 1)/\lambda_{n+1}} ||f_{n+1}||_{\lambda_{n+1}}.
\end{align*}
From here, just change the weights and use the inductive case and we are done.

May 31, 2018August 16, 2018

Putnam 2003 A1

Let $n$ be a fixed positive integer. How many ways are there to write $n$ as a sum of positive integers, $K_n$ to be the set of tuples of We are not done here, as we would need to show that there exists no other tuples in $K </div> </article> <article id="post-1057" class="post-1057 post type-post status-publish format-standard hentry category-math category-programming"> <header class="entry-header"> <div class="entry-meta"><span class="screen-reader-text">Posted on</span> <a href="https://marshalljiang.com/vectorization-over-c/" rel="bookmark"><time class="entry-date published updated" datetime="2018-02-28T12:35:41-05:00">February 28, 2018</time></a></div><h2 class="entry-title"><a href="https://marshalljiang.com/vectorization-over-c/" rel="bookmark">Vectorization over C</a></h2> </header> <div class="entry-content"> <p>The title is probably misleading, but this is a lesson I needed to talk about.</p> <p>I wrote out some simple code for the quadrature over the reference triangle last time, which involves a double loop. To my chagrin, my immediate reaction to speeding up the code was to put it into Cython, and give it some type declaration.</p> <p>This did speed up my integrals, but not as much as vectorization. By simply condensing one of the loops into a dot product, and using vector-function evaluation, I sped up my code a substantial amount, especially with higher order integration of “hard” functions.</p> <div class="code-embed-wrapper"> <pre class="language-python code-embed-pre line-numbers" data-start="1" data-line-offset="0"><code class="language-python code-embed-code">def quadtriangle_vector(f, w00, w10, x00, x10):<br/> total = 0<br/> for i in range(len(w00)):<br/> total = w00[i] * np.dot(w10 / 2, f([(1 x00[i]) * (1 - x10) / 2 - 1, x10]))<br/> return total</code></pre> <div class="code-embed-infos"> </div> </div> <p>To see what I mean, consider the following function</p> <div class="code-embed-wrapper"> <pre class="language-python code-embed-pre line-numbers" data-start="1" data-line-offset="0"><code class="language-python code-embed-code">from scipy.special import eval_jacobi as jac<br/>def f(x):<br/> return jac(2, 1, 1, np.sin(x[0] - x[1]))<br/>p = 20<br/>x00, w00 = rj(p 1, 0, 0)<br/>x10, w10 = rj(p 1, 1, 0)</code></pre> <div class="code-embed-infos"> </div> </div> <p>The speedup I get is staggering.</p> <div class="code-embed-wrapper"> <pre class="language-python code-embed-pre line-numbers" data-start="1" data-line-offset="0"><code class="language-python code-embed-code" <p>Also, I tried to fully vectorize by removing the outer-loop. This actually slowed down the code a bit. Maybe I did it wrong? But for now, I’m decently happy with the speed.</p> </div> </article> <article id="post-1023" class="post-1023 post type-post status-publish format-standard hentry category-blargh category-math"> <header class="entry-header"> <div class="entry-meta"><span class="screen-reader-text">Posted on</span> <a href="https://marshalljiang.com/spd-matrices-and-a/" rel="bookmark"><time class="entry-date published" datetime="2018-01-15T17:38:34-05:00">January 15, 2018</time><time class="updated" datetime="2018-01-15T17:40:04-05:00">January 15, 2018</time></a></div><h2 class="entry-title"><a href="https://marshalljiang.com/spd-matrices-and-a/" rel="bookmark">SPD Matrices and a False Inequality</a></h2> </header> <div class="entry-content"> <p>This one is from my research, and it’s a doozy. Given two vectors <img src=$ After spending a good amount of timing trying to prove this, I realized that this is in general not true (in fact, the result I was suppose to be chasing would’ve been disproven if the above statement was true). As a counter example, consider the following counterexample from a Bernstein basis application:

Let $x <pre>[2/7, 1/7, 2/35; 1/7, 6/35, 9/70 ; 2/35, 9/70, 6/35].</pre> <p>Then the quadratic forms will be 4/15 and 1/7 respectively.</p> </div> </article> <article id="post-1014" class="post-1014 post type-post status-publish format-standard hentry category-blargh category-math"> <header class="entry-header"> <div class="entry-meta"><span class="screen-reader-text">Posted on</span> <a href="https://marshalljiang.com/a-matrix-inequality/" rel="bookmark"><time class="entry-date published" datetime="2018-01-11T17:10:03-05:00">January 11, 2018</time><time class="updated" datetime="2018-01-11T17:16:33-05:00">January 11, 2018</time></a></div><h2 class="entry-title"><a href="https://marshalljiang.com/a-matrix-inequality/" rel="bookmark">A Matrix Inequality</a></h2> </header> <div class="entry-content"> <p>An exercise from Braess FEM book: for <img src=$ The book actually gave quite a lot of hints for this one.

For example, in Python it would be literally one line:

return optimize.fsolve(lambda u: u - self.u0(x - u*t), 1)

February 25, 2017

Spectacular Books

Mathematicians suck at writing. This is part of the reasons why they went into math in the first place, because they suck at writing. Sucking at writing doesn’t mean that mathematicians don’t write books though; in fact there are tons of math books written by mathematicians.

The problem is most of them suck.

Half of them are for geniuses written by geniuses.

The other half are for geniuses written by borderline geniuses.

By far my two favorite books are William’s Probability with Martingales and Trefethen’s NLA book. Both British authors. Both written in a highly colloquial style.

It’s really too bad academics have this ego-stroking urge to write to the highest denominator rather than to, say grad students or undergrad.

God damn.