Introductory course on random matrices from October 10 to November 23, 2017 at IST Austria. The course instructor is László Erdős and the teaching assistant is Dominik Schröder.
Random matrices were first introduced in statistics in the 1920’s, but they were made famous by Eugene Wigner’s revolutionary vision. He predicted that spectral lines of heavy nuclei can be modelled by the eigenvalues of random symmetric matrices with independent entries (Wigner matrices). In particular, he conjectured that the statistics of energy gaps is given by a universal distribution that is independent of the detailed physical parameters. While the proof of this conjecture for realistic physical models is still beyond reach, it has recently been shown that the gap statistics of Wigner matrices is independent of the distribution of the matrix elements. Students will be introduced to the fascinating world of random matrices and presented with some of the basic tools for their mathematical analysis in this course.
Students with orientation in mathematics, theoretical physics, statistics and computer science. No physics background is necessary. Calculus, linear algebra and some basic familiarity with probability theory is expected.
The final grade will be obtained as a combination of the student’s performance on the example sheets and an oral exam.
3 ECTS
Related notes from the recent PCMI summer school on random matrices.
The course lasts from October 10 – November 23, 2017.
Oct 12 | Thu | 11.20am–12.35pm |
Lecture | Mondi 3 |
Oct 17 | Tue | 10.15am-11.30am |
Lecture | Mondi 3 |
Oct 17 | Tue | 11.45am-12.35pm |
Recitation | Mondi 3 |
Oct 18 | Wed | 11.30am-12.45pm |
Lecture | Mondi 1 |
Oct 24 | Tue | 10.15am-11.30am |
Lecture | Mondi 3 |
Oct 24 | Tue | 11.45am-12.35pm |
Recitation | Mondi 3 |
Oct 25 | Wed | 11.30am-12.45pm |
Lecture | Mondi 1 |
Nov 2 | Thu | 11.20am–12.35pm |
Lecture | Mondi 3 |
Nov 7 | Tue | 10.15am-11.30am |
Lecture | Mondi 3 |
Nov 7 | Tue | 11.45am-12.35pm |
Recitation | Mondi 3 |
Nov 9 | Thu | 11.20am–12.35pm |
Lecture | Mondi 3 |
Nov 14 | Tue | 10.15am-11.30am |
Lecture | Mondi 3 |
Nov 14 | Tue | 11.45am-12.35pm |
Recitation | Mondi 3 |
Nov 16 | Thu | 11.20am–12.35pm |
Lecture | Mondi 3 |
Nov 21 | Tue | 10.15am-11.30am |
Lecture | Mondi 3 |
Nov 21 | Tue | 11.45am-12.35pm |
Recitation | Mondi 3 |
Nov 23 | Thu | 11.20am–12.35pm |
Lecture | Mondi 3 |
Basic facts from probability theory. Law of large numbers (LLN) and the central limit theorem (CLT), viewed as universality statements. In the LLN the limit is deterministic, while in CLT the limit is a random variable, namely the Gaussian (normal) one. No matter which distribution the initial random variables had, their appropriately normalized sums always converge to the same distribution — in other words the limiting Gaussian distribution is universal.
Wigner random matrices. Real symmetric and complex hermitian. GUE and GOE. Wishart matrices and their relation to Wigner-type matrices. Scaling so that eigenvalues remain bounded. Statement on the concentration of the largest eigenvalue. Introducing the semicircle law as a law of large numbers for the empirical density of the eigenvalues.
Linear statistics of eigenvalues (with a smooth function as observable) leads to CLT but with an unusual scaling — indicating very strong correlation among eigenvalues.
Statement of the gap universality, Wigner surmise. The limit behavior of the gap is a new universal distribution; in this sense this is the analogue of the CLT.
Reading. PCMI lecture notes up to middle of Section 1.2.3.
Main questions in random matrix theory:
Definition of \(k\)-point correlation functions. Relation of the gap distribution to the local correlation functions on scale of the eigenvalue spacing (inclusion-exclusion formula)
Rescaled (local) correlation functions. Determinant structure. Sine kernel for complex Hermitian Wigner matrices. Statement of the main universality result in the bulk spectrum (for energies away from the edges of the semicircle law).
Reading. PCMI lecture notes up to the end of Section 1.2.3.
Main motivations for random matrices:
Quantum Mechanics in nutshell:
Random Schrödinger operator describes a single electron in an ionic (metallic) lattice. \(S = \mathbb Z^d\) or a subset of that. \(H\) is the sum of the discrete (lattice) Laplace operator and a random potential.
Anderson phase transition: depending on the strength of the disorder, the system is either in delocalized (conductor) or localized (insulator) phase. Localized phase is characterized by
In the delocalized phase, we have delocalized eigenfunctions (“almost” \(\ell^2\)-normalizable solutions to the eigenvalue equation), quantum transport, absolutely continuous spectrum and random matrix eigenvalue statistics, in particular level repulsion.
Reading. PCMI lecture notes Sections 5.1 — 5.3
Phase diagram for the Anderson model (= random Schrödinger operator on the \(\mathbb Z^d\) lattice) in \(d\ge 3\) dimensions. Localized regime can be proven, delocalized regime is conjectured to exist but no mathematical result.
In \(d=1\) dimension the Anderson model is always localized (transfer matrix method helps). In \(d=2\) nothing is known, even there is no clear agreement in the physics whether it behaves more like \(d=1\) (localization) or more like \(d=3\) (delocalization); majority believes in localization.
Delocalized regime, at least for small disorder, sounds easier to prove because it looks like a perturbative problem (zero disorder corresponds to the pure Laplacian which is perfectly understood). Resolvent perturbation formulas were discussed; major problem: lack of convergence.
We gave some explanation why the localization regime is easier to handle mathematically: off-diagonal resolvent matrix elements decay exponentially. This fact provides an effective decoupling and makes localized resolvents almost independent.
Random band matrices: naturally interpolate between \(d=1\) dimensional random Schrödinger operators (bandwidth \(W=O(1)\)) and mean field Wigner matrices (bandwidth \(W = N\), where \(N\) is the linear size of the system). Phase transition is expected at \(W = \sqrt{N}\); this is a major open question. There are similar conjectures in higher dimensional band matrices, but we did not discuss them.
Finally, we discussed a mysterious connection between the Dyson sine kernel statistics and the location of the zeros of of the zeta function on the critical line. There is only one mathematical result in this direction, Montgomery proved that the two point function of the (appropriately rescaled) zeros follows the sine kernel behavior, but only for test functions with Fourier support in \([-1,1]\). No progress has been made in the last 40 years to relax this condition.
Reading. PCMI lecture notes Section 5.3 and the entertaining article “Tea Time in Princeton” by Paul Bourgade about Montgomery’s theorem.
There are two natural ways to put a measure on the space of (hermitian) matrices, hence defining two major classes of random matrix ensembles:
Only Gaussian matrices belong to both families.
For invariant ensembles, the joint probability density function of the eigenvalues can be computed explicitly and it consists of the Vandermonde determinant (to the first or second power, \(\beta=1,2\), depending on the symmetry class). We sketched of the proof by change of variables.
Invariant ensembles can also be represented as Gibbs measure of N points on the real line with a one-body potential \(V\) and a logarithmic two-body interaction. This interpretation allows for choosing any \(\beta>0\), yielding the beta-ensembles, even though there is no matrix or eigenvalues behind them. There are analogous universality statements for beta-ensembles, which assert that the local statistics depend only on the parameter beta and are independent of the potential \(V\).
Reading. PCMI lecture notes Section 1.1.2
Precise statement of the Wigner semicircle law (for i.i.d. case) in the form of weak convergence in probability. In general, there are two methods to prove the semicircle law:
The resolvent method in general is more powerful, it works well inside as well as neat the edge of the spectrum. The moment method is powerful only at the extreme edges.
Proof of the Wigner semicircle law by moment method: Compute \[\frac{1}{N} \mathbb E \text{Tr}\, H^k=\frac{1}{N}\mathbb E\sum_{i_1,\dots,i_k} h_{i_1i_2}h_{i_2i_3}\dots h_{i_{k-1}i_k}h_{i_ki_1}\] in terms of the number of backtracking paths (only those path give a relevant contribution where every edge is travelled exactly twice and the skeleton of the graph is a tree). We reduced the problem to counting such path — it is an \(N\) independent problem.
We completed the proof of the Wigner semicircle law by moment method. Last time we showed that to evaluate \(\mathbb E \text{Tr}\, H^{2k}\) is sufficient to count the number of backtracking path of total length \(2k\). This number has many other combinatorial interpretations. It is the same as the number of rooted, oriented trees on \(k+1\) vertices by a simple one to one correspondance. It is also the same as the number of Dyck paths of length \(2k\), where a Dyck path is a random walk on the nonnegative integers starting and ending at \(0\). Finally, we counted the Dyck paths by deriving the recursion \[C_k = C_{k-1} C_0 + C_{k-2} C_1 + … + C_0 C_{k-1}\] with \(C_0=1\) for their number \(C_k\). This recursion can be solved by considering the generating function \[f(x) = \sum_{k=0}^\infty C_k x^k\] and observe that \[xf^2(x) = f(x) - 1.\] Thus f(x) can be explicitly computed by the solution formula for the quadratic equation and Taylor expanding around \(x=0\). After some calculation with the fractional binomial coefficients, we obtain that \(C_k = 1/(k+1) {2k \choose k}\), i.e. the Catalan numbers.
Since the Catalan numbers are the moments of the semicircle law (calculus exercise), and these moments do not grow too fast, they identify the measure.
This proved that the expectation of the empirical eigenvalue density converges to the semicircle in the sense of moments. Using compact support of the measures (for the empirical density we know it from the homework problem since the norm of \(H\) is bounded), by Weierstrass theorem we can extend the convergence for any bounded continuous functions.
Finally, the expectation can be removed, by computing the variance of \(N^{-1} \text{Tr}\, H^k\), again by the graphical representation (now we have two cycles and studied which edge-coincidences give rise to nonzero contribution). We showed that the variance vanishes in the large \(N\) limit and then a Chebyshev inequality converts it into a high probability bound.
Reading. PCMI lecture notes Section 2.3
We found yet another combinatorial description of Catalan numbers. \(C_k\) is the number of non-crossing pair partitions of the set \(\{1,\dots,2k\}\). Indeed, denote the number in question by \(N_k\). Then there exists some \(j\) such that \(1\) is paired with \(2j\) since due to the absence of crossings there has to be an even number of other integers between \(1\) and its partner. The number of non-crossing pairings of the integers \(\{2,\dots,2j-1\}\) and \(\{2j+1,\dots,2k\}\) are given by \(N_{j-1}\) and \(N_{k-j}\) respectively and it follows that \[N_{k}=\sum_{j=1}^k N_{j-1}N_{k-j}, \qquad N_1=1\] and thus \(N_k=C_k\) since they satisfy the same recursion.
We defined a commonly used notion of stochastic domination^{2} \(X\prec Y\) and stated the following large deviation estimates for families of random variables \(X_i,Y_i\) of zero mean \(\mathbf E X_i=\mathbf E Y_i=0\) and unit variance \(\mathbf E \lvert X_i\rvert^2=\mathbf E \lvert Y_i\rvert^2=1\) and deterministic coefficients \(b_i\), \(a_{ij}\), \begin{align}\nonumber\left\lvert\sum_{i} b_i X_i\right\rvert&\prec \left(\sum_i\lvert b_i\rvert^2\right)^{1/2},\\\label{LDE} \left\lvert\sum_{i,j} a_{ij} X_i Y_j\right\rvert&\prec \left(\sum_{i,j}\lvert a_{ij}\rvert^2\right)^{1/2},\\ \left\lvert\sum_{i\not=j} a_{ij} X_i X_j\right\rvert&\prec \left(\sum_{i\not=j}\lvert a_{ij}\rvert^2\right)^{1/2}\nonumber. \end{align} We proved \eqref{LDE} only for uniformly subgaussian families of random variables but not that uniformly finite moments of all orders are also sufficient for \eqref{LDE} to hold.
Reading. PCMI lecture notes Section 3.1.1.
We presented a cumulant approach to proving local laws for correlated random matrices. Specifically, we gave a heuristic argument that the resolvent \(G\) should be well apprixmated by the unique solution \(M=M(z)\) to the matrix Dyson equation (MDE) \[0=1+zM+\mathcal S[M]M, \quad \Im M>0,\qquad \mathcal S[R]:= \sum_{\alpha,\beta}\text{Cov}(h_\alpha,h_\beta) \Delta^\alpha R\Delta^\beta.\] We furthermore proved that the error matrix \[D=1+zG+\mathcal S[G]G=HG+\mathcal S[G]G\] satisfies \[\mathbf E\lvert\langle x,Dy \rangle\rvert^2 \lesssim \left(\frac{\lVert x\rVert \lVert y\rVert}{\sqrt{N\eta}}\right)^2,\qquad \mathbf E\lvert\langle BD \rangle\rvert^2 \lesssim \left(\frac{\lVert B\rVert}{N\eta}\right)^2\] in the case of Gaussian entries \(h_\alpha\).
Reading. PCMI lecture notes Sections 4.1–4.3
We summarized the three step strategy to prove local spectral universality of Wigner matrices. We discussed the second step: fast convergence to equilibrium of the Dyson Brownian Motion. Relation between SDE and PDE: introduction of the generator. Laplacian is the generator of the standard Brownian motion.
Basics of large dimensional analysis: Gibbs measure, entropy, Dirichlet form, generator. The total mass of a probability measure is preserved under the dynamics. Relation between various concepts of closeness to equilibrium. Entropy inequality (total variation norm is bounded by the entropy). Logarithmic Sobolev inequality. Spectral gap inequality. Bakry-Emery theory: (i) the Gibbs measure with a convex Hamiltonian satisfies LSI, (ii) entropy and Dirichlet form decays exponentially fast.
The problem sheets can either be handed in during the lecture or put in the letter box of Dominik Schröder in LBW, 3rd floor.
published | due | ||
---|---|---|---|
Problem sheet I | Solutions | Oct 18 | Oct 25 |
Problem sheet II | Solutions | Oct 25 | Nov 7 |
Problem sheet III | Solutions | Nov 9 | Nov 21 |
This should be compared to the standard Lévy continuity theorem from classical probability theory. ↩
See, for example, Definition 2.1 ↩