Vectorization over C

The title is probably misleading, but this is a lesson I needed to talk about.

I wrote out some simple code for the quadrature over the reference triangle last time, which involves a double loop. To my chagrin, my first instinct for speeding up the code was to port it to Cython and give it some type declarations.

This did speed up my integrals, but not as much as vectorization did. By simply condensing one of the loops into a dot product and using vectorized function evaluation, I sped up my code substantially, especially for higher-order integration of “hard” functions.

import numpy as np

def quadtriangle_vector(f, w00, w10, x00, x10):
    # Outer loop over the nodes in one direction; the inner loop is collapsed
    # into a dot product against the vectorized evaluations of f.
    total = 0
    for i in range(len(w00)):
        total += w00[i] * np.dot(w10 / 2, f([(1 + x00[i]) * (1 - x10) / 2 - 1, x10]))
    return total
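
For comparison, the plain double-loop version would look something like this (a sketch reconstructed from the vectorized code above, not my exact original):

def quadtriangle(f, w00, w10, x00, x10):
    # Same quadrature rule, but with an explicit inner loop instead of a dot product.
    total = 0
    for i in range(len(w00)):
        for j in range(len(w10)):
            total += w00[i] * (w10[j] / 2) * f([(1 + x00[i]) * (1 - x10[j]) / 2 - 1, x10[j]])
    return total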

To see what I mean, consider the following example:

from scipy.special import eval_jacobi as jac
from scipy.special import roots_jacobi as rj  # assuming rj is roots_jacobi (Gauss-Jacobi nodes/weights)

def f(x):
    return jac(2, 1, 1, np.sin(x[0] - x[1]))

p = 20
x00, w00 = rj(p + 1, 0, 0)
x10, w10 = rj(p + 1, 1, 0)

The speedup I get is staggering.
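
If you want to reproduce the comparison, a quick harness along these lines does it (quadtriangle being the double-loop sketch from above):

import timeit

for quad in (quadtriangle, quadtriangle_vector):
    t = timeit.timeit(lambda: quad(f, w00, w10, x00, x10), number=10)
    print(quad.__name__, t / 10, "seconds per call")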


Also, I tried to fully vectorize by removing the outer loop. This actually slowed down the code a bit. Maybe I did it wrong? But for now, I’m decently happy with the speed.
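
For reference, one way to remove the outer loop entirely (not necessarily what I did) is to build the tensor-product nodes and weights up front and evaluate f once on 2-D arrays:

def quadtriangle_full(f, w00, w10, x00, x10):
    # Tensor-product grid of collapsed coordinates and weights; a single call to f.
    X0 = np.outer(1 + x00, 1 - x10) / 2 - 1
    X1 = np.broadcast_to(x10, X0.shape)
    W = np.outer(w00, w10 / 2)
    return np.sum(W * f([X0, X1]))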

SPD Matrices and a False Inequality

This one is from my research, and it’s a doozy. Given two vectors x, y such that each entry of x is bounded in absolute value by the corresponding entry of y, show that x^TAx \le y^T A y for any SPD matrix A.

After spending a good amount of time trying to prove this, I realized that it is not true in general (in fact, the result I was supposed to be chasing would have been disproven if the statement above were true). Here is a counterexample that comes from a Bernstein basis application:

Let x = [1, 0, -1/3], y = [1, -1, 1/3]. Let the matrix be

A = \begin{pmatrix} 2/7 & 1/7 & 2/35 \\ 1/7 & 6/35 & 9/70 \\ 2/35 & 9/70 & 6/35 \end{pmatrix}.

Then the quadratic forms are x^T A x = 4/15 and y^T A y = 1/7, so the inequality fails.
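
A quick numpy check reproduces the numbers (and confirms that A really is SPD):

import numpy as np

x = np.array([1, 0, -1/3])
y = np.array([1, -1, 1/3])
A = np.array([[2/7, 1/7, 2/35],
              [1/7, 6/35, 9/70],
              [2/35, 9/70, 6/35]])

print(np.all(np.abs(x) <= np.abs(y)))     # True: |x_i| <= |y_i| componentwise
print(np.all(np.linalg.eigvalsh(A) > 0))  # True: A is symmetric positive definite
print(x @ A @ x, y @ A @ y)               # 0.2666... (= 4/15) vs 0.1428... (= 1/7)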

A Matrix Inequality

An exercise from Braess’s FEM book: for A, B symmetric, positive definite matrices, let A \le B (meaning x^T A x \le x^T B x for all x). We want to show that the inverses satisfy the reversed property B^{-1} \le A^{-1}.

The book actually gave quite a lot of hints for this one.

\begin{aligned}x^TB^{-1}x &= x^T A^{-1/2} A^{1/2} B^{-1} x \\&\le \sqrt{x^T A^{-1} x} \sqrt{x^T B^{-1} A B^{-1} x},\end{aligned}

where the second line is Cauchy-Schwarz applied to the vectors A^{-1/2}x and A^{1/2}B^{-1}x.
Then from the hypothesis, A \le B \implies x^T(B - A)x \ge 0 for every x; taking x = B^{-1}y gives y^T(B^{-1} - B^{-1}AB^{-1})y \ge 0, i.e. y^T B^{-1} A B^{-1} y \le y^T B^{-1} y.
So we can plug this into the estimate above to find that x^TB^{-1}x \le \sqrt{x^T A^{-1} x}\sqrt{x^T B^{-1}x}. Dividing by \sqrt{x^T B^{-1} x} (the case x^T B^{-1} x = 0 is trivial) and squaring gives x^T B^{-1} x \le x^T A^{-1} x, which is exactly B^{-1} \le A^{-1}.
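
And a quick numerical sanity check of the statement itself (random SPD matrices, so an illustration rather than a proof):

import numpy as np

rng = np.random.default_rng(0)
n = 5
C = rng.standard_normal((n, n))
A = C @ C.T + n * np.eye(n)   # SPD
D = rng.standard_normal((n, n))
B = A + D @ D.T               # B - A is positive semidefinite, so A <= B

# B^{-1} <= A^{-1} iff A^{-1} - B^{-1} is positive semidefinite
diff = np.linalg.inv(A) - np.linalg.inv(B)
print(np.linalg.eigvalsh(diff).min() >= -1e-12)   # True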

The (lack of a) Matrix

I think I finally understand why software packages like PETSc have an option to pass an operator (rather than an assembled matrix) when doing something like conjugate gradient. Why isn’t having a matrix good enough for everyone?

Well, it turns out that while every linear operator can be written as a matrix, a matrix may not be the best way to represent it. As an example, consider a basis transformation from Bernstein polynomials to Jacobi polynomials (or vice versa). It’s certainly possible to construct a matrix which does the job, but it’s ugly.

On the other hand, it’s not that bad to write code which exploits the properties of the polynomials and converts between the bases in O(n^2) time. The key is that a Jacobi polynomial is a sum of Bernstein polynomials, and Bernstein polynomials can be degree raised or lowered at will (the degree-raising step is sketched below).
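
For instance, degree raising a Bernstein coefficient vector is only a few lines (a generic sketch of the standard formula, not my research code):

import numpy as np

def degree_raise(c):
    # Re-express degree-n Bernstein coefficients c in the degree-(n+1) basis:
    # c'_j = (j/(n+1)) c_{j-1} + ((n+1-j)/(n+1)) c_j
    n = len(c) - 1
    out = np.zeros(n + 2)
    for j in range(n + 2):
        if j >= 1:
            out[j] += j / (n + 1) * c[j - 1]
        if j <= n:
            out[j] += (n + 1 - j) / (n + 1) * c[j]
    return out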

This function will outperform the matrix in several ways. For one, there’s no need to construct the matrix, which takes n^2 operations in the first place. Next, matrix multiplication is an n^3 operation, so if we optimize enough, we will always beat it. Finally, it’s really less painful to code, because each line of the function serves a visible purpose.
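
In scipy terms (the analogue of PETSc’s shell matrices), a matrix-free operator looks like the sketch below; the 1-D Laplacian stencil here is just a stand-in for a real conversion routine:

import numpy as np
from scipy.sparse.linalg import LinearOperator, cg

n = 100

def apply_laplacian(u):
    # Matrix-free 1-D Dirichlet Laplacian: O(n) per application, no matrix stored.
    v = 2 * u.copy()
    v[:-1] -= u[1:]
    v[1:] -= u[:-1]
    return v

A = LinearOperator((n, n), matvec=apply_laplacian)
b = np.ones(n)
u, info = cg(A, b)
print(info, np.linalg.norm(apply_laplacian(u) - b))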

Anyways, I’m sold.

(I’ll eventually publish the code in the summer)

Burgers Equation

For scalar conservation laws, the method of characteristics gives the implicit solution u = u_0(x - f'(u)t), valid before shocks form. It’s surprisingly useful for quick and dirty “exact” solutions to a lot of problems. In particular, the inviscid Burgers’ equation with initial data u_0 becomes trivial to code up, since there u = u_0(x - ut).

For example, in Python it would be literally one line:

return optimize.fsolve(lambda u: u - self.u0(x - u*t), 1)
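
Spelled out as a standalone function (the names here are mine, and this only makes sense before the shock forms):

import numpy as np
from scipy import optimize

def burgers_exact(u0, x, t, guess=1.0):
    # Solve u = u0(x - u*t) for u by root finding.
    return optimize.fsolve(lambda u: u - u0(x - u * t), guess)[0]

u0 = lambda s: np.exp(-s**2)
print(burgers_exact(u0, 0.5, 0.3))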

Spectacular Books

Mathematicians suck at writing. That’s part of why they went into math in the first place. Sucking at writing doesn’t stop mathematicians from writing books, though; in fact there are tons of math books written by mathematicians.

The problem is most of them suck.

Half of them are for geniuses written by geniuses.

The other half are for geniuses written by borderline geniuses.

By far my two favorite books are Williams’s Probability with Martingales and Trefethen’s NLA book. Both British authors. Both written in a highly colloquial style.

It’s really too bad academics have this ego-stroking urge to write to the highest denominator rather than to, say, grad students or undergrads.

God damn.

Schur Complement and Minimal Energy Extension

(Note: this post is mainly for me to consolidate my thoughts)

In the framework of domain decomposition, consider forming the Schur complement which orthogonalizes the interior nodes against the edge/vertex nodes. It turns out that the edge/vertex functions which are orthogonal to the interior functions are minimal-energy extensions (energy here being the L2 norm, since we work with the mass matrix).

This can be seen in both a Hilbert space way and an optimization way. For the optimization way, write out the quadratic form for the mass matrix in block form, and note that we can take a derivative with respect to the interior coefficients to minimize… now the Schur complement pops up naturally! (Details spelled out below.)
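
Spelling that out, with u_I and u_B denoting the interior and edge/vertex coefficient blocks of the mass matrix M (my notation):

\begin{aligned}E(u_I) &= \begin{pmatrix} u_I \\ u_B \end{pmatrix}^T \begin{pmatrix} M_{II} & M_{IB} \\ M_{BI} & M_{BB} \end{pmatrix} \begin{pmatrix} u_I \\ u_B \end{pmatrix} = u_I^T M_{II} u_I + 2\,u_I^T M_{IB} u_B + u_B^T M_{BB} u_B, \\ \nabla_{u_I} E = 0 &\implies u_I = -M_{II}^{-1} M_{IB} u_B, \\ E_{\min} &= u_B^T \big( M_{BB} - M_{BI} M_{II}^{-1} M_{IB} \big)\, u_B = u_B^T S\, u_B.\end{aligned}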

Referencing Copies

A lot of times in numerical methods, I need a temporary variable as a time stepping tool or as an “incrementing” device, without altering the original variable. A code snippet like the following seems natural:

copy_u = u                  # looks like a copy, but is just another reference to u
for i in range(len(copy_u)):
    copy_u[i] = f(u[i])     # this also overwrites u[i]!

But I gotta be more careful. There’s a deep difference between making a copy of a variable and just creating a reference to it. Any change to copy_u changes u itself! Use np.copy or the copy module in Python!
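
A tiny demonstration of the difference (a generic example, not from my code):

import numpy as np

u = np.arange(4.0)
alias = u              # same underlying array
alias[0] = -1.0        # u[0] is now -1.0 as well
fresh = np.copy(u)     # independent copy
fresh[1] = 99.0        # u is untouched
print(u)               # [-1.  1.  2.  3.]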

Bernstein Polynomials

This post won’t have too much math, but mainly musings.

After a four month hiatus from research, it seems my dive back into the world of unknown math is being squandered by my ineptitude. My documentation of the progress made and the comments in my code seem to be greatly lacking, and I’ve spent a good chunk of time just trying to recall where I was.

I think one of the cooler things Bernstein polynomials can prove is that polynomials are dense (uniformly too, IIRC) in the continuous functions on a closed, bounded interval. Durrett has a proof in his book using a probabilistic view, which seems much cleaner than an algebraic/analysis proof such as the one here.
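
As a quick illustration of that density statement (a generic sketch; B_n f(x) = E[f(S_n/n)] with S_n ~ Binomial(n, x) is the probabilistic view):

import numpy as np
from scipy.special import comb

def bernstein_approx(f, n, x):
    # B_n f(x) = sum_k f(k/n) * C(n, k) * x^k * (1 - x)^(n - k)
    k = np.arange(n + 1)
    xs = np.atleast_1d(x)[:, None]
    basis = comb(n, k) * xs**k * (1 - xs)**(n - k)
    return basis @ f(k / n)

f = lambda t: np.abs(t - 0.5)
grid = np.linspace(0, 1, 201)
for n in (4, 16, 64, 256):
    # Sup-norm error shrinks as n grows, in line with uniform convergence.
    print(n, np.max(np.abs(bernstein_approx(f, n, grid) - f(grid))))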

Harder Every Decade

There’s a problem I solved a while back which I was quite proud of:

Determine, with proof, the largest number that is the product of positive integers whose sum is 1976. (IMO 1976)

Note that this was at a high school olympiad level. Two days ago, my little brother brought me a state Mathcounts preparation packet. One of the problems was

Write 33 as the sum of two or more distinct prime numbers so that the product of these prime numbers is largest. What is the largest possible product?

While a proof is not needed, the move from a high school international level to a middle school state level in 40 years is pretty impressive.