Metric spaces

The first part of the course is about metric spaces. A metric space is a set equipped with a metric, which is a notion of distance between its elements. Metrics generalise the usual distance \(|x - y|\) on the reals to more general spaces, and allow us to generalise the notions of convergent sequences and continuity. However, we will later see that the same function can be continuous with respect to many different metrics. It will turn out that the underlying structure that determines continuity is the collection of open sets corresponding to the metric, and that many different metrics define the same open sets. This gives rise to the notion of a topology, which abstracts away the metric altogether and instead specifies the open sets directly.

Metric spaces

We begin with the definition of a metric space: a set equipped with a notion of distance between its elements, called the metric.

Definition 76 (Metric space)

A metric space is a pair \((X, d_X)\) of a set \(X,\) called the space, and a function \(d_X: X \times X \to \mathbb{R},\) called the metric, which for all \(x, y, z \in X\) satisfies

\(d_X(x, y) \geq 0,\)
\(d_X(x, y) = 0\) if and only if \(x = y,\)
\(d_X(x, y) = d_X(y, x),\)
\(d_X(x, z) \leq d_X(x, y) + d_X(y, z).\)

Two common examples of metrics, which we will use later on, are the familiar Euclidean metric and the discrete metric.

Example 13 (Examples of metric spaces)

Euclidean metric: Let \(X = \mathbb{R}^N.\) The Euclidean metric \(d_X\) on \(X\) is defined as

\[ d_X(a, b) = \sqrt{\sum_{n = 1}^N (a_n - b_n)^2}, \text{ where } a, b \in X. \]

Discrete metric: Let \(X\) be any set. The discrete metric \(d_X\) on \(X\) is defined as

\[\begin{split} d_X(a, b) = \begin{cases} 0 & \text{ if } a = b, \\ 1 & \text{ if } a \neq b, \end{cases} \text{ where } a, b \in X. \end{split}\]
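As a rough illustration, the two metrics above can be written as short Python functions and the four axioms of Definition 76 spot-checked on a handful of points. The function names, the sample points and the `squared_gap` counterexample are assumptions made for this sketch, and a check on finitely many points is evidence rather than a proof.

```python
import math
from itertools import product


def euclidean(a, b):
    """Euclidean metric on R^N, for points given as equal-length tuples."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def discrete(a, b):
    """Discrete metric: 0 for equal points, 1 otherwise."""
    return 0 if a == b else 1


def squared_gap(a, b):
    """Squared difference on the reals (hypothetical non-example)."""
    return (a - b) ** 2


def satisfies_axioms(d, points, tol=1e-12):
    """Spot-check the four metric axioms on every triple drawn from `points`."""
    for x, y, z in product(points, repeat=3):
        if d(x, y) < 0:                          # non-negativity
            return False
        if (d(x, y) == 0) != (x == y):           # zero distance iff equal
            return False
        if abs(d(x, y) - d(y, x)) > tol:         # symmetry
            return False
        if d(x, z) > d(x, y) + d(y, z) + tol:    # triangle inequality
            return False
    return True


sample = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (3.0, 4.0)]
print(satisfies_axioms(euclidean, sample))             # True
print(satisfies_axioms(discrete, sample))              # True
print(euclidean((0.0, 0.0), (3.0, 4.0)))               # 5.0
print(satisfies_axioms(squared_gap, [0.0, 1.0, 2.0]))  # False
```

The last line shows why the triangle inequality is a real restriction: the squared difference gives distance 4 between 0 and 2, but only 1 + 1 when going through 1.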

Definition 77 (Metric subspace)

Let \((X, d_X)\) be a metric space and \(Y \subseteq X.\) We call \((Y, d_Y)\) a metric subspace of \((X, d_X),\) where \(d_Y: Y \times Y \to \mathbb{R}\) is defined by \(d_Y(a, b) = d_X(a, b)\) for all \(a, b \in Y.\)

With the definition of metric spaces in place, we are ready to define convergent sequences. This is a generalisation of convergence from the familiar definition in the context of the real numbers to more general metric spaces.

Definition 78 (Convergent sequence)

Let \((x_n)\) be a sequence in a metric space \((X, d_X).\) We say that \((x_n)\) converges to \(x \in X,\) written \(x_n \to x,\) if for every \(\epsilon > 0\) there exists \(N \in \mathbb{N}\) such that \(d_X(x_n, x) < \epsilon\) for all \(n > N.\)
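Whether a sequence converges depends on the metric. The following sketch, whose helper names are assumptions made for illustration, tests the condition \(d_X(x_n, x) < \epsilon\) on a finite stretch of the sequence \(x_n = 1/n\) with candidate limit \(0,\) once under the absolute-value metric and once under the discrete metric. A finite check can only suggest convergence, not prove it.

```python
def absolute(a, b):
    """Usual metric on the reals."""
    return abs(a - b)


def discrete(a, b):
    """Discrete metric: 0 for equal points, 1 otherwise."""
    return 0 if a == b else 1


def seq(n):
    """The sequence x_n = 1/n for n >= 1."""
    return 1.0 / n


def tail_within(d, seq, limit, eps, start, stop):
    """True if d(seq(n), limit) < eps for every n with start < n <= stop."""
    return all(d(seq(n), limit) < eps for n in range(start + 1, stop + 1))


eps = 0.01
# Under the absolute-value metric, N = 100 works for eps = 0.01.
print(tail_within(absolute, seq, 0.0, eps, start=100, stop=10_000))  # True
# Under the discrete metric, d(1/n, 0) = 1 for every n, so no N works.
print(tail_within(discrete, seq, 0.0, eps, start=100, stop=10_000))  # False
```

In the discrete metric a sequence converges only if it is eventually constant, which is why \(1/n\) has no limit there.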

As with the analogous result in real analysis, limits in a metric space are unique.

Lemma 7 (Limits in metric spaces are unique)

Suppose \((X, d_X)\) is a metric space and \((x_n)\) is a sequence in \(X\) such that \(x_n \to x\) and \(x_n \to x'\) for some \(x, x' \in X.\) Then \(x = x'.\)
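A sketch of the standard argument, using only the axioms of Definition 76: given \(\epsilon > 0,\) choose \(N \in \mathbb{N}\) large enough that both \(d_X(x_n, x) < \epsilon\) and \(d_X(x_n, x') < \epsilon\) for all \(n > N.\) Then, for any such \(n,\) symmetry and the triangle inequality give

\[ d_X(x, x') \leq d_X(x, x_n) + d_X(x_n, x') < 2\epsilon. \]

Since \(\epsilon > 0\) was arbitrary and \(d_X(x, x') \geq 0,\) it follows that \(d_X(x, x') = 0,\) and therefore \(x = x'.\)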

With convergent sequences in place, we can define continuous functions. This definition appears slightly different from the \(\epsilon\)-\(\delta\) definition in analysis, but we will see in Theorem 103 that the two are equivalent.

Definition 79 (Continuous function)

Let \((X, d_X)\) and \((Y, d_Y)\) be metric spaces. A function \(f: X \to Y\) is continuous if, for every sequence \((x_n)\) in \(X\) and every \(x \in X,\) we have \(f(x_n) \to f(x)\) in \(Y\) whenever \(x_n \to x\) in \(X.\)

Norms

We now introduce the idea of a norm, which is a notion of the length of a vector in a vector space.

Definition 80 (Norm)

Let \(V\) be a real vector space. A norm on \(V\) is a function \(||\cdot||: V \to \mathbb{R}\) which satisfies the following properties.

\(||v|| \geq 0\) for all \(v \in V,\)
\(||v|| = 0\) if and only if \(v = 0,\)
\(||\lambda v|| = |\lambda| \, ||v||\) for all \(\lambda \in \mathbb{R}\) and \(v \in V,\)
\(||v + w|| \leq ||v|| + ||w||\) for all \(v, w \in V.\)

A norm can be used to define a metric on a vector space.

Lemma 8 (Norms induce metrics)

Let \(V\) be a vector space with a norm \(||\cdot||.\) The function \(d_V: V \times V \to \mathbb{R}\) defined as \(d_V(v, w) = ||v - w||\) is a metric on \(V.\)
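One way to see this, sketched here using only the properties in Definition 80: non-negativity of \(d_V,\) and the fact that \(d_V(v, w) = 0\) if and only if \(v = w,\) follow from properties 1 and 2 applied to the vector \(v - w.\) Symmetry follows from property 3 with \(\lambda = -1,\) and the triangle inequality from property 4. For any \(u, v, w \in V,\)

\[\begin{split}\begin{align} d_V(v, w) &= ||v - w|| = ||(-1)(w - v)|| = |-1| \, ||w - v|| = d_V(w, v), \\ d_V(u, w) &= ||(u - v) + (v - w)|| \leq ||u - v|| + ||v - w|| = d_V(u, v) + d_V(v, w). \end{align}\end{split}\]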

Example 14 (Examples of norms)

The following are examples of norms on \(C[0, 1],\) the vector space of continuous real-valued functions on \([0, 1].\)

\[\begin{split}\begin{align} ||f||_1 &= \int_0^1 |f(x)| dx, \\ ||f||_2 &= \left(\int_0^1 |f(x)|^2 dx\right)^{1/2}, \\ ||f||_\infty &= \max_{x \in [0, 1]} |f(x)|, \end{align}\end{split}\]

where \(f \in C[0, 1].\)
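To get a feel for these three norms, they can be approximated numerically. The sketch below is an illustration under stated assumptions: the integrals are replaced by a midpoint rule, the maximum by a maximum over a finite grid, and the function names are hypothetical. The 2-norm is taken with a square root around the integral, which is the form for which scaling \(f\) by \(\lambda\) scales the norm by \(|\lambda|.\)

```python
import math


def norm_1(f, n=10_000):
    """Midpoint-rule approximation of the integral of |f| over [0, 1]."""
    h = 1.0 / n
    return sum(abs(f((k + 0.5) * h)) for k in range(n)) * h


def norm_2(f, n=10_000):
    """Square root of a midpoint-rule approximation of the integral of f^2."""
    h = 1.0 / n
    return math.sqrt(sum(f((k + 0.5) * h) ** 2 for k in range(n)) * h)


def norm_inf(f, n=10_000):
    """Maximum of |f| over n + 1 equally spaced grid points in [0, 1]."""
    return max(abs(f(k / n)) for k in range(n + 1))


def identity(x):
    """The function f(x) = x, used as a test case."""
    return x


# Exact values for f(x) = x are 1/2, 1/sqrt(3) and 1 respectively.
print(norm_1(identity), norm_2(identity), norm_inf(identity))
```

For \(f(x) = x\) the three norms take different values, which is a reminder that they measure the size of a function in different ways.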

Proof: Examples above are norms

Most of the properties of norms in Definition 80 follow directly from the definitions of the examples, but property 2 (\(||v|| = 0\) if and only if \(v = 0\)) is a little more involved. For this, we need the following intermediate result.

Lemma 9 (Non-zero non-negative continuous function has positive integral)

Let \(f \in C[0, 1]\) satisfy \(f(x) \geq 0\) for all \(x \in [0, 1].\) If \(f\) is not constantly \(0,\) then \(\int_0^1 f(x)dx > 0.\)

Now we can show that each of the examples is indeed a norm. In the following, assume that \(f, g \in C[0, 1]\) and \(\lambda \in \mathbb{R}.\)

Example 1: First, \(||f||_1 \geq 0\) because the integrand \(|f|\) is non-negative, so the first property is satisfied. Second, \(||0||_1 = 0,\) and conversely, applying Lemma 9 to \(|f|\) shows that \(||f||_1 = 0\) only if \(f = 0,\) so the second property is satisfied. Third, \(||\lambda f||_1 = \int_0^1 |\lambda| \, |f(x)| dx = |\lambda| \, ||f||_1.\) Fourth, we have that

\[\begin{split}\begin{align} ||f + g||_1 &= \int_0^1 |f(x) + g(x)| dx \\ &\leq \int_0^1 (|f(x)| + |g(x)|) dx \\ &= \int_0^1 |f(x)|dx + \int_0^1|g(x)| dx \\ &= ||f||_1 + ||g||_1, \end{align}\end{split}\]

where going from the first to the second line holds by the triangle inequality of absolute values.

Example 2: The arguments for the first three properties carry over from example 1 to example 2, with Lemma 9 applied to \(f^2.\) For the fourth property, we have that

\[\begin{split}\begin{align} ||f + g||_2^2 &= \int_0^1 (f(x) + g(x))^2 dx \\ &= \int_0^1 \left(f(x)^2 + 2f(x)g(x) + g(x)^2\right) dx \\ &= \int_0^1 f(x)^2 dx + 2\int_0^1 f(x)g(x)dx + \int_0^1 g(x)^2 dx \\ &\leq ||f||_2^2 + 2||f||_2||g||_2 + ||g||_2^2 \\ &= (||f||_2 + ||g||_2)^2, \end{align}\end{split}\]

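where going from the third to the fourth line we have used the Cauchy-Schwarz inequality (Lemma 10 below) for the inner product \(\langle f, g \rangle = \int_0^1 f(x)g(x)dx.\) Taking square roots of both sides gives \(||f + g||_2 \leq ||f||_2 + ||g||_2.\)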

Example 3: Again, the first three parts of the argument for example 1 also hold for example 3. For the fourth part, we have

\[\begin{split}\begin{align} ||f + g||_\infty &= \max_{x \in [0, 1]} |f(x) + g(x)| \\ &\leq \max_{x \in [0, 1]} \left(|f(x)| + |g(x)|\right) \\ &\leq \max_{x \in [0, 1]} |f(x)| + \max_{x \in [0, 1]} |g(x)| \\ &= ||f||_\infty + ||g||_\infty, \end{align}\end{split}\]

where from the first to the second line we have used the triangle inequality of the absolute value, and from the second to the third we have used the fact that the maximum of a sum of two functions is no greater than the sum of their maxima.

This concludes the proof showing that all three examples are norms.

Inner products

Now we turn to inner products. Inner products generalise the dot product, and with it the notion of the angle between vectors, to general vector spaces.

Definition 81 (Inner product)

Let \(V\) be a real vector space. An inner product on \(V\) is a function \(\langle \cdot, \cdot \rangle: V \times V \to \mathbb{R}\) which satisfies:

\(\langle v, v \rangle \geq 0\) for all \(v \in V,\)
\(\langle v, v \rangle = 0\) if and only if \(v = 0,\)
\(\langle v, w \rangle = \langle w, v \rangle\) for all \(v, w \in V,\)
\(\langle v_1 + \lambda v_2, w \rangle = \langle v_1, w \rangle + \lambda \langle v_2, w \rangle \) for all \(v_1, v_2, w \in V\) and \(\lambda \in \mathbb{R}.\)

The properties of an inner product look similar to those of a norm. In fact, an inner product can be used to define a norm on a vector space. To show this, we first need an intermediate result for inner products.

Lemma 10 (Cauchy-Schwarz inequality)

If \(\langle \cdot, \cdot \rangle\) is an inner product on a vector space \(V,\) then for all \(v, w \in V,\) we have

\[\langle v, w \rangle^2 \leq \langle v, v \rangle \langle w, w \rangle.\]
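As a sanity check rather than a proof, the inequality can be tested numerically for one particular inner product. The sketch below assumes the dot product on \(\mathbb{R}^3\) and uses randomly drawn vectors; the helper names and the small floating-point tolerance are choices made for this illustration.

```python
import random


def dot(v, w):
    """Dot product on R^N, one example of an inner product."""
    return sum(vi * wi for vi, wi in zip(v, w))


def cauchy_schwarz_holds(v, w, tol=1e-9):
    """Check <v, w>^2 <= <v, v><w, w>, up to a small floating-point tolerance."""
    return dot(v, w) ** 2 <= dot(v, v) * dot(w, w) + tol


random.seed(0)
vectors = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(100)]
print(all(cauchy_schwarz_holds(v, w) for v in vectors for w in vectors))  # True

# Equality is attained when one vector is a multiple of the other.
v, w = [1, 2, 3], [2, 4, 6]
print(dot(v, w) ** 2, dot(v, v) * dot(w, w))  # 784 784
```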

We can now show that an inner product induces a norm.

Lemma 11 (Inner products induce norms)

Let \(V\) be a vector space with an inner product \(\langle \cdot, \cdot \rangle.\) The function \(||\cdot||: V \to \mathbb{R}\) defined as

\[||v|| = \sqrt{\langle v, v \rangle}\]

is a norm on \(V.\)
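The first three norm properties follow fairly directly from the properties in Definition 81, so the step that needs the Cauchy-Schwarz inequality is the triangle inequality. A sketch of that step, for \(v, w \in V\):

\[\begin{split}\begin{align} ||v + w||^2 &= \langle v + w, v + w \rangle \\ &= \langle v, v \rangle + 2\langle v, w \rangle + \langle w, w \rangle \\ &\leq ||v||^2 + 2||v|| \, ||w|| + ||w||^2 \\ &= (||v|| + ||w||)^2, \end{align}\end{split}\]

where the second line uses symmetry and linearity, and the third uses \(\langle v, w \rangle \leq |\langle v, w \rangle| \leq \sqrt{\langle v, v \rangle \langle w, w \rangle}\) from Lemma 10. Taking square roots gives \(||v + w|| \leq ||v|| + ||w||.\)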

Open and closed sets

We now turn to open and closed sets in metric spaces. These will turn out to be the key objects that determine continuity of functions in metric spaces. We first define open and closed balls.

Definition 82 (Open and closed balls)

Let \((X, d_X)\) be a metric space. For any \(x \in X\) and \(r > 0,\) we define the open ball to be the set

\[B_r(x) = \{ x' \in X : d_X(x, x') < r \},\]

and the closed ball to be the set

\[\overline{B}_r(x) = \{ x' \in X : d_X(x, x') \leq r \}.\]
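The difference between \(<\) and \(\leq\) in these two definitions can matter a great deal. The following sketch computes balls in a three-point set under the discrete metric; the set `X` and the function names are assumptions made for illustration.

```python
def discrete(a, b):
    """Discrete metric: 0 for equal points, 1 otherwise."""
    return 0 if a == b else 1


def open_ball(d, space, centre, r):
    """All points of `space` at distance strictly less than r from `centre`."""
    return {p for p in space if d(centre, p) < r}


def closed_ball(d, space, centre, r):
    """All points of `space` at distance at most r from `centre`."""
    return {p for p in space if d(centre, p) <= r}


X = {"a", "b", "c"}
print(open_ball(discrete, X, "a", 1))         # {'a'}
print(closed_ball(discrete, X, "a", 1) == X)  # True
print(open_ball(discrete, X, "a", 1.5) == X)  # True
```

With radius 1, the open ball around a point contains only that point, while the closed ball is the whole space.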

With open and closed balls defined, we can now define open and closed sets.

Definition 83 (Open and closed subsets)

Let \((X, d_X)\) be a metric space. A subset \(U \subseteq X\) is open if for every \(x \in U,\) there exists \(r > 0\) such that \(B_r(x) \subseteq U.\) A subset \(C \subseteq X\) is closed if its complement \(X \setminus C\) is open.

We can now show that the terms open ball and closed ball are in fact justified, by showing that open balls are open subsets and closed balls are closed subsets.

Lemma 12 (Open (closed) balls are open (closed))

Let \((X, d_X)\) be a metric space. Then, for any \(x \in X\) and \(r > 0,\) the open ball \(B_r(x)\) is an open subset of \(X\) and the closed ball \(\overline{B}_r(x)\) is a closed subset of \(X.\)
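A sketch of the usual argument for the open ball: take any \(y \in B_r(x)\) and set \(s = r - d_X(x, y) > 0.\) If \(z \in B_s(y),\) then by the triangle inequality

\[ d_X(x, z) \leq d_X(x, y) + d_X(y, z) < d_X(x, y) + s = r, \]

so \(B_s(y) \subseteq B_r(x).\) For the closed ball, take any \(y\) with \(d_X(x, y) > r\) and set \(s = d_X(x, y) - r > 0.\) If \(z \in B_s(y),\) then

\[ d_X(x, z) \geq d_X(x, y) - d_X(y, z) > d_X(x, y) - s = r, \]

so \(B_s(y)\) lies in the complement of \(\overline{B}_r(x),\) which is therefore open.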

Sometimes it’s handy to have the following shorthand when talking about open sets.

Definition 84 (Open neighbourhood)

Let \((X, d_X)\) be a metric space. If \(x \in X,\) an open neighbourhood of \(x\) is an open set \(U \subseteq X\) such that \(x \in U.\)

We can re-express convergence of sequences in terms of this shorthand.

Lemma 13 (Convergence implies sequence eventually in open neighbourhood)

Let \((X, d_X)\) be a metric space and \((x_n)\) be a sequence in \(X\) that converges to \(x \in X.\) Then, for every open neighbourhood \(U\) of \(x,\) there exists \(N \in \mathbb{N}\) such that \(x_n \in U\) for all \(n > N.\)

Now we define limit points of sets. Intuitively, a limit point of a set is a point that is the limit of some sequence in the set. Note that a limit point of a set need not itself be in the set.

Definition 85 (Limit point)

Let \((X, d_X)\) be a metric space and \(A \subseteq X.\) A point \(x \in X\) is a limit point of \(A\) if there exists a sequence \((x_n)\) in \(A\) such that \(x_n \to x.\)

Limit points allow an equivalent definition of closed sets, as stated in the following lemma.

Lemma 14 (Closed set \(\iff\) set contains all its limit points)

Let \((X, d_X)\) be a metric space and \(A \subseteq X.\) The set \(A\) is closed if and only if \(A\) contains all its limit points.

Now we turn to the central result of the first part of the course. This result shows that what determines whether a function is continuous is not the metric itself, but rather the collection of sets that are open under the metric. In particular, even if two metrics are different, if they define the same open sets, then the same functions will be continuous under both of them.

Theorem 103 (Characterisation of continuity)

Let \((X, d_X)\) and \((Y, d_Y)\) be metric spaces and \(f: X \to Y\) be a function. Then, the following are equivalent:

\(f\) is continuous,
\(f(x_n) \to f(x)\) in \(Y\) whenever \(x_n \to x\) in \(X,\)
For every open set \(U \subseteq Y,\) the preimage \(f^{-1}(U)\) is open in \(X,\)
For every closed set \(C \subseteq Y,\) the preimage \(f^{-1}(C)\) is closed in \(X,\)
For every \(x \in X\) and \(\epsilon > 0,\) there exists \(\delta > 0\) such that \(f(B_\delta(x)) \subseteq B_\epsilon(f(x)).\)
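For a simple instance of two different metrics with the same open sets, consider the rescaled metric \(d'_X = 2 d_X\) (an illustrative choice, not an example from the course). It satisfies the axioms of Definition 76, and its open balls \(B'_r(x)\) are related to those of \(d_X\) by

\[ B'_r(x) = \{ x' \in X : 2 d_X(x, x') < r \} = B_{r/2}(x). \]

Every ball of one metric is therefore a ball of the other, so a subset of \(X\) is open under \(d'_X\) if and only if it is open under \(d_X.\) By the third condition of the theorem, a function \(f: X \to Y\) is continuous with respect to \(d'_X\) exactly when it is continuous with respect to \(d_X.\)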

We conclude with three properties of open sets that we will use to define topologies in the next section.

Lemma 15 (Properties of open sets)

Let \((X, d_X)\) be a metric space. Then

The empty set \(\emptyset\) and \(X\) are open,
If \(\{U_i\}_{i \in I}\) is a collection of open sets, then \(\bigcup_{i \in I} U_i\) is open,
If \(U_1, \ldots, U_N\) are open sets, then \(\bigcap_{n = 1}^N U_n\) is open.
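The restriction to finitely many sets in the last property cannot be dropped. A standard counterexample in \(\mathbb{R}\) with the usual metric is

\[ \bigcap_{n = 1}^\infty \left(-\tfrac{1}{n}, \tfrac{1}{n}\right) = \{0\}. \]

Each interval \((-1/n, 1/n)\) is open, but \(\{0\}\) is not, because every ball \(B_r(0) = (-r, r)\) contains points other than \(0.\)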
