16c. Spherical coordinates
In order to solve now the hydrogen atom, the idea will be that of reformulating the Schrödinger equation in spherical coordinates. And for this purpose, we will need:
The Laplace operator in spherical coordinates is
There are several proofs here, a short, elementary one being as follows:
(1) Let us first see how [math]\Delta[/math] behaves under a change of coordinates [math]\{x_i\}\to\{y_i\}[/math], in arbitrary [math]N[/math] dimensions. Our starting point is the chain rule for derivatives:
By using this rule, then Leibnitz for products, then again this rule, we obtain:
(2) Now by summing over [math]i[/math], we obtain the following formula, with [math]A[/math] being the derivative of [math]x\to y[/math], that is to say, the matrix of partial derivatives [math]dy_i/dx_j[/math]:
(3) So, this will be the formula that we will need. Observe that this formula can be further compacted as follows, with all the notations being self-explanatory:
(4) Getting now to spherical coordinates, [math](x,y,z)\to(r,s,t)[/math], the derivative of the inverse, obtained by differentiating [math]x,y,z[/math] with respect to [math]r,s,t[/math], is given by:
The product [math](A^{-1})^tA^{-1}[/math] of the transpose of this matrix with itself is then:
But everything simplifies here, and we have the following remarkable formula, which by the way is something very useful, worth to be memorized:
Now by inverting, we obtain the following formula, in relation with the above:
(5) Let us compute now the Laplacian of [math]r,s,t[/math]. We first have the following formula, that we will use many times in what follows, and is worth to be memorized:
Of course the same computation works for [math]y,z[/math] too, and we therefore have:
(6) By using the above formulae, twice, we can compute the Laplacian of [math]r[/math]:
(7) In what regards now [math]s[/math], the computation here goes as follows:
(8) Finally, in what regards [math]t[/math], the computation here goes as follows:
(9) We can now plug the data from (4) and (6,7,8) in the general formula that we found in (2) above, and we obtain in this way:
Thus, we are led to the formula in the statement.
Still with me, I hope, and do not worry, one day you will have such computations for breakfast. We can now reformulate the Schrödinger equation in spherical coordinates, and separate the variables, which leads to a radial and angular equation, as follows:
The time-independent Schrödinger equation in spherical coordinates separates, for solutions of type [math]\phi=\rho(r)\alpha(s,t)[/math], into two equations, as follows,
By using the formula in Theorem 16.14, the time-independent Schrödinger equation reformulates in spherical coordinates as follows:
Let us look now for separable solutions for this latter equation, consisting of a radial part and an angular part, as in the statement, namely:
By plugging this function into our equation, we obtain:
In order to solve this equation, we will do two manipulations. First, by multiplying everything by [math]2mr^2/(h^2\rho\alpha)[/math], this equation takes the following more convenient form:
Now observe that by moving the radial terms to the left, and the angular terms to the right, this latter equation can be written as follows:
Since this latter equation is now separated between radial and angular variables, both sides must be equal to a certain constant [math]-K[/math], as follows:
But this leads to the conclusion in the statement.
Let us first study the angular equation. We first have the following result:
The angular equation that we found before, namely
This is something elementary, the idea being as follows:
(1) Let us first recall that [math]r\in[0,\infty)[/math] is the radius, [math]s\in[0,\pi][/math] is the polar angle, and [math]t\in[0,2\pi][/math] is the azimuthal angle. Be said in passing, there are several conventions and notations here, and the above ones, that we use here, come from the general ones in [math]N[/math] dimensions, because further coordinates can be easily added, in the obvious way.
(2) Getting back now to our question, by plugging [math]\alpha=\sigma(s)\theta(t)[/math] into the angular equation, we obtain:
By dividing everything by [math]\sigma\theta[/math], this equation can be written as follows:
Since the variables are separated, we must have, for a certain constant [math]m[/math]:
Thus, we are led to the conclusion in the statement.
Regarding the azimuthal equation, things here are quickly settled, as follows:
The solutions of the azimuthal equation, namely
The first assertion is clear, because we have a second order equation, and two obvious solutions for it, [math]e^{\pm imt}[/math], and then their linear combinations, and that's all. Regarding the last assertion, the point here is that by using [math]\theta(t)=\theta(t+2\pi)[/math], which is a natural physical assumption on the wave function, we are led to [math]m\in\mathbb Z[/math], as stated.
We are now about to solve the angular equation, with only the polar equation remaining to be studied. However, in practice, this polar equation is 10 times more difficult that everything what we did so far, so be patient. We first have:
The polar equation that we found before, namely
Let us first do a number of manipulations on our equation, before making the change of variables. By multiplying by [math]\sigma[/math], our equation becomes:
By differentiating at left, this equation becomes:
Finally, by dividing everything by [math]\sin^2s[/math], our equation becomes:
Now let us set [math]\sigma(s)=f(\cos s)[/math]. With this change of variables, we have:
By plugging this data, our radial equation becomes:
Now with [math]x=\cos s[/math], which is our new variable, this equation reads:
But this is the Legendre equation, as stated.
Here comes now the difficult point. We have the following non-trivial result:
The solutions of the Legendre equation, namely
The first part is something quite complicated, involving the hypergeometric functions [math]\!\!{\ }_2F_1[/math], that you don't want to hear about, believe me. As for the second part, analysis and physics, this is something not trivial either. See Griffiths [1].
In order to construct the solutions, we will need:
The orthonormal basis of [math]L^2[-1,1][/math] obtained by starting with the Weierstrass basis [math]\{x^l\}[/math], and doing Gram-Schmidt, is the family of polynomials [math]\{P_l\}[/math], with each [math]P_l[/math] being of degree [math]l[/math], and with positive leading coefficient, subject to:
As a first observation, we are not lost somewhere in abstract math, because of the occurrence of the Legendre equation. As for the proof, this goes as follows:
(1) The first assertion is clear, because the Gram-Schmidt procedure applied to the Weierstrass basis [math]\{x^l\}[/math] can only lead to a certain family of polynomials [math]\{P_l\}[/math], with each [math]P_l[/math] being of degree [math]l[/math], and also unique, if we assume that it has positive leading coefficient, with this [math]\pm[/math] choice being needed, as usual, at each step of Gram-Schmidt.
(2) In order to have now an idea about these beasts, here are the first few of them, which can be obtained say via a straightforward application of Gram-Schmidt:
(3) Now thinking about what Gram-Schmidt does, this is certainly something by recurrence. And examining the recurrence leads to the Legendre equation, as stated.
(4) As for the Rodrigues formula, by uniqueness no need to try to understand where this formula comes from, and we have two choices here, either by verifying that [math]\{P_l\}[/math] is orthonormal, or by verifying the Legendre equation. And both methods work.
Going ahead now, we can solve in fact the Legendre equation at any [math]m[/math], as follows:
The general Legendre equation, with parameters [math]m\in\mathbb N[/math] and [math]K=l(l+1)[/math] with [math]l\in\mathbb N[/math], namely
The first assertion is something elementary, coming by differentiating [math]m[/math] times the Legendre equation, which leads to the general Legendre equation. As for the second assertion, this follows from the Rodrigues formula for Legendre polynomials.
And this is the end of our study. Eventually. By putting together all the above results, we are led to the following conclusion:
The separated solutions [math]\alpha=\sigma(s)\theta(t)[/math] of the angular equation,
This follows indeed from all the above, and with the comment that everything is taken up to linear combinations. We will normalize the wave function later.
General references
Banica, Teo (2024). "Calculus and applications". arXiv:2401.00911 [math.CO].