Recall that in the previous lecture, we wanted to find solutions to the following congruence
Using Newton’s method, if we have a solution \(x_1\) such that
and if
then the Newton’s update
is a solution modulo \(p^{2n}\)
In this lecture, we want to know what happens if \(f'(x_1) \equiv 0 \pmod{p^n}\)? For example, consider
We can see here that \(f'(x) = 2x\) and that \(f'(x) \equiv 0 \pmod{2}\) for any \(x\). So it looks like we won’t be able to use Newton Method or Hensel’s Lemma to solve this. However, we could if we are a little more careful. Suppose we have a solution \(x\) such that
and suppose
Then, this implies that \(f'(x)\) is divisible by \(p\). Now, let \(d\) be the highest power such that \(f'(x)\) is divisible by \(p^d\) and not divisible by \(p^{d+1}\). Now recall that the new update / improved solution is \(x - \frac{f(x)}{f'(x)}\). Then using Taylor’s Theorem:
The first two terms will cancel out because we’ll get \(f(x) - f(x)\) when we multiply the terms. Now, consider the next term in the series:
From last lecture, the first term is an integer. The second term is divisible by \(\frac{p^{2n}}{p^{2d}}\). Why? We know that \(f(x)\) is divisible by \(p^n\) by assumption and we know that \(f'(x)\) is divisible by \(p^d\). Therefore,
where \(C\) is some constant. So this term is divisible by \(p^{2(n-d)}\). Now, we want this improved solution or update \(x_2 = x - \frac{f(x)}{f'(x)}\) to be a solution to
This implies that we want \(p^{2(n-d)}\) to be greater than \(n+1\). So \(2n - 2d \geq n + 1\). In other words
To summarize what we did
Observe here that when \(d = 0\), then we get Hansel’s Lemma
Example
Let \(f(x) = x^2 - a\). Then \(f'(x) = 2x\). Suppose we have a solution \(x_1\) such that
Then \(x\) must be odd. But if \(x\) is odd, then \(f'(x_1) = 2x\) is divisible by \(2^1\) but not by \(2^2\). Then \(d = 1\). So the condition becomes
So now if \(n = 3\), then we can lift the solution \(x^2 \equiv a \pmod{2^3}\) to for example \(x^2 \equiv a \pmod{2^4},\pmod{2^5},\cdots\)
So now when can we solve \(x^2 \equiv a \pmod{2^3}\)? We can check all the cases. We know that \(x\) is odd. So we need to check the cases
Squaring both sides gets us
This means that we can only solve \(x^2 \equiv a \pmod{2^3}\) when
This illustrates how we reduced the problem from modulo a large power to condition modulo a small prime power. We started with wanting to solve \(x^2 \equiv a \pmod{2^n}\) where \(n\) is large and then we saw that \(n\) must be at least \(3\) to be able to lift the solution to higher powers. So if we can prove that it works for \(n=3\), then it means we can lift the solution to any large \(n\). That’s when we saw that \(a\) must be congruent to \(1\) modulo \(8\).
p-adic Numbers
Recall the example we did in the previous lecture when we solved
We first solved this modulo \(3\). We got
This implied that \(x = 3k+1\). Then we plugged this \(x\) into the next congruence modulo \(3^2\)
So \(x = 3 \cdot 1 + 1\). If we repeat the process for the next few powers, we will see that
This sum is called a 3-adic number. This sum diverges in \(\mathbb{R}\) as these numbers get larger and larger. but in \(\mathbb{Z}_3\), it actually converges.
As \(n\) goes to infinity, this sum obviously diverges in \(\mathbf{R}\) but in the 3-adic world, a number is small if it’s divisible by a large power of \(3\). So \(3^1\) is small, \(3^{50}\) is a lot smaller and so on. Therefore,
We can add, subtract and multiply p-adic numbers. We can also divide given that the element has an inverse. Moreover, anything we can do for the real numbers, there is an analog of it in the \(p\)-adics….
Some Intuition of p-adic Numbers
We’ve encountered infinite series before like
To find the sum, we can use the geometric series \(\frac{a}{1-r}\) where \(a\) is the first term and \(r\) is the rate so
We’ve also encountered the following series
This series clearly doesn’t converge and if we try to apply the geometric series formula we will get
This isn’t true and in fact, \(r\) is required to satisfy \(|r| < 1\) when we typically use this formula. However, in the p-adic world, this sum is really \(-1\). In the \(p\)-adic world, higher powers of \(p\) means smaller and smaller numbers. Why? Take the following example
\(10^0\) | \(1\) | |||||
\(10^1\) | \(1\) | \(0\) | ||||
\(10^2\) | \(1\) | \(0\) | \(0\) | |||
\(10^3\) | \(1\) | \(0\) | \(0\) | \(0\) | ||
\(10^4\) | \(1\) | \(0\) | \(0\) | \(0\) | \(0\) | |
\(10^5\) | \(1\) | \(0\) | \(0\) | \(0\) | \(0\) | \(0\) |
As powers of \(10\) increase, the \(1\) is shifted to the left. We align the digits in the \(p\)-adic system such that the right most digit is the least significant one and the left most digits are the most significant digits. In this sense, \(10^5\) is a lot smaller than \(10^1\).