Excercises

Excercises#

The following are my solutions to exercises from the book Sipser, Introduction to the Theory of Computation. Here I repeat some exercise statements for completeness. Any mistakes in the exercise statements or the solutions are my own.

Chapter 1#

Exercise 1.1

Consider the DFAs \(M_1\) and \(M_2\) (see book). For each DFA

What is the start state?
What is the set of accept states?
What sequence of steps does the machine go through on the input \(aabb\)?
Does the machine accept the input \(aabb\)?
Does the machine accept the empty string \(\epsilon\)?

Exercise 1.2

Give formal descriptions for the FSAs in Excercise 1.1.

Exercise 1.11

Prove that any NFA can be converted to an equivalent NFA with a singla accept state.

Exercise 1.20

Let \(\Sigma = \{a, b\}\). For each of the regular expressions below give two example strings which are in the language, and two which are not:

\(a^* b^*\)
\(a(ba)^*b\)
\(a^* \cup b^*\)
\((aaa)^*\)
\(\Sigma^*a\Sigma^*b\Sigma^*a\Sigma^*\)
\(aba \cup bab\)
\((\epsilon \cup a)b\)
\((a \cup ba \cup bb)\Sigma^*\)

Exercise 1.31

For any string \(w = w_1 w_2 \dots w_n\), the reverse of \(w\), written \(w^R\), is the string \(w\) in reverse order, \(w_n \dots w_2 w_1\). For any language \(A\), let \(A^R = \{w^E | w \in A\}\). Show that if \(A\) is regular, so is \(A^R\).

Exercise 1.32

Let

\[\begin{split} \Sigma_3 = \left\{ \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}, \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix}, \dots, \begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix} \right\} \end{split}\]

\(\Sigma_3\) contains all size \(3\) columns of zeros and ones. A string of symbols in \(\Sigma_3\) gives three rows of zeros and ones. Consider each row to be a binary number and let

\[ B = \{w \in \Sigma_3^* | \text{ the bottom row of } w \text{ is the sum of the top two rows}\}. \]

Show that \(B\) is regular.

Exercise 1.33

Let

\[\begin{split} \Sigma_2 = \left\{ \begin{bmatrix} 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 0 \\ 1 \end{bmatrix}, \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \dots, \begin{bmatrix} 1 \\ 1 \end{bmatrix} \right\} \end{split}\]

\(\Sigma_2\) contains all size \(2\) columns of zeros and ones. A string of symbols in \(\Sigma_2\) gives three rows of zeros and ones. Consider each row to be a binary number and let

\[ C = \{w \in \Sigma_2^* | \text{ the bottom row of } w \text{ is three times the bottom row}\}. \]

Show that \(C\) is regular.

Exercise 1.34

Let \(\Sigma_2\) be the same as in the previous problem. A string of symbols in \(\Sigma_2\) gives three rows of zeros and ones. Consider each row to be a binary number and let

\[ D = \{w \in \Sigma_2^* | \text{ the bottom row of } w \text{ is a larger number than the bottom row}\}. \]

Show that \(D\) is regular.

Exercise 1.34

Let \(\Sigma_2\) be the same as in the previous problem. A string of symbols in \(\Sigma_2\) gives three rows of zeros and ones. Consider each row to be a binary number and let

\[ D = \{w \in \Sigma_2^* | \text{ the bottom row of } w \text{ is a larger number than the bottom row}\}. \]

Show that \(D\) is regular.

Exercise 1.41

For languages \(A\) and \(B\), let the perfect shuffle of \(A\) and \(B\) be the language

\[ \{w | w = a_1 b_1 \dots a_k b_k, \text{ where } a_1, \dots, a_k \in A \text{ and } b_1, \dots, b_k \in B, \text{ where } a_i, b_i \in \Sigma\}.\]

Show that the class of regular languages is closed under the perfect shuffle.

Exercise 1.43

Let \(A\) be any language. Define \(\texttt{DROPOUT}(A)\) to be the language containing all strings that can be obtain by removing one symbol from a string in \(A\). Thus

\[\texttt{DROPOUT}(A) = \{xz | xyz \in A \text{ where } x, z \in \Sigma^*, y \in \Sigma\}.\]

Show that the class of regular languages is closed under the \(\texttt{DROPOUT}\) operation.

Exercise 1.44

Let \(B\) and \(C\) be languages over \(\Sigma = \{0, 1\}\). Define

\[ B \overset{1}{\gets} C = \{w \in B | \text{ for some } y \in C, \text{ strings } w \text{ and } y \text{ contain equal numbers of } 1\text{s}\}\]

Show that the class of regular languages is closed under the \(\overset{1}{\gets}\) operation.

Exercise 1.45

Let \(A / B = \{w | wx \in A, x \in B\}\). If \(A\) is regular and \(B\) is any language, show that \(A / B\) is regular.

Exercise 1.46

Prove that the following languages are not regular.

\(\{0^n 1^m 0^n | m, n \geq 0\}\)
\(\{0^m 1^n | m \neq n\}\)
\(\{w | w \in \{0, 1\}^* \text{ is not a palindrome}\}\)
\(\{wtw | w, t \in \{0, 1\}^+\}\)

Solution

Part 1: Suppose \(A_1 = \{0^n 1^m 0^n | m, n \geq 0\}\) is regular. Then by the pumping lemma, it has a pumping length \(p\). Consider the string \(s = 0^p 1^p 0^p\). Since \(s \in A_1\), by the pumping lemma, it can be written as \(s = xyz\) with \(|xy| < p\) and \(|y| \geq q\), and \(xy^nz \in A_1\) for all \(n = 1, 2, \dots\). Since \(|xy| < p\), both \(x\) and \(y\) are made up entirely of zeros. So the string \(xy^nz\) contains an an unequal number of leading and trailing zeros, so it is not in \(A_1\). This is a contradiction, so \(A_1\) cannot be regular.

Part 2: Suppose \(A_2 = \{0^m 1^n | m \neq n\}\) is regular. Then, its complement \(A_2'\) is also regular. From here on, the solution is essentially the same as for part 1. By the pumping lemma, \(A_2'\) has a pumping length \(p\). Consider the string \(s = 0^p 1^p\). Since \(s \in A_2\), by the pumping lemma, it can be written as \(s = xyz\) with \(|xy| < p\) and \(|y| \geq q\), and \(xy^nz \in A_2\) for all \(n = 1, 2, \dots\). Since \(|xy| < p\), both \(x\) and \(y\) are made up entirely of zeros. So the string \(xy^nz\) is of the form \(0^k 1^m\) with \(k \neq m\), so it is not in \(A_2'\). Therefore \(A_2'\) is not regular, and neither is \(A_2\).

Part 3: Suppose \(A_3 = \{w | w \in \{0, 1\}^* \text{ is not a palindrome}\}\) is regular. Then, its complement \(A_3'\) is also regular. By the pumping lemma, \(A_3'\) has a pumping length \(p\). Consider the string \(s = 0^p 1^p 0^p\). Since \(s \in A_3\), by the pumping lemma, it can be written as \(s = xyz\) with \(|xy| < p\) and \(|y| \geq q\), and \(xy^nz \in A_2\) for all \(n = 1, 2, \dots\). Since \(|xy| < p\), both \(x\) and \(y\) are made up entirely of zeros. So the string \(xy^nz\) contains an an unequal number of leading and trailing zeros, so it is not a palindrome, and therefore cannot be in \(A_3'\). Therefore, \(A_3'\) is not regular and neither is \(A_3\).

Part 4: Suppose \(A_4 = \{wtw | w, t \in \{0, 1\}^+\}\) is regular. By the pumping lemma, \(A_4\) has a pumping length \(p\). Fix some \(t = 1\) and consider the string \(s = 0^p 1^p t 0^p 1^p\). Since \(s \in A_4\), by the pumping lemma, it can be written as \(s = xyz\) with \(|xy| < p\) and \(|y| \geq q\), and \(xy^nz \in A_2\) for all \(n = 1, 2, \dots\). Since \(|xy| < p\), both \(x\) and \(y\) are made up entirely of zeros. So if \(|y| = k\) the string \(s_n = xy^{n+1}z\) has the form \(s_n = 0^{p + nk} 1^p t 0^p 1^p\). However, \(s_n\) cannot be written in the form \(wt'w\) for any \(w, t' \in \{0, 1\}^+\), so \(s_n \not \in A_4\) for any \(n \geq 1\). This is a contradiction, so \(A_4\) cannot be regular.

Exercise 1.47

Let \(\Sigma = \{0, \#\}\) and let

\[ Y = \{w | w = x_1 \# x_2 \# \dots \# x_k \text{ for } k \geq 0, \text{ each } x_i \in 1^*, \text{ and } x_i \neq x_j \text{ for } i \neq j\}.\]

Prove that \(Y\) is not regular.

Exercise 1.48

Let \(\Sigma = \{0, 1\}\) and let

\[ D = \{w | w \text{ contains an equal number of occurences of the substrings } 01 \text{ and } 10\}.\]

Prove that \(D\) is regular.

Exercise 1.51

Let \(x\) and \(y\) be strings and \(L\) be any language. We say that \(x\) and \(y\) are distinguishable by \(L\) if some string \(z\) exists whereby exactly one of the strings \(xz\) and \(yz\) is a member of \(L\); otherwise, for every string \(z\), we have \(xz \in L\) whenever \(yz \in L\) and we say that \(x\) and \(y\) are indistinguishable by \(L\). If \(x\) and \(y\) are indistinguishable by \(L\) we write \(x \equiv_L y\). Show that \(\equiv_L\) is an equivalence relation.

Exercise 1.52 (Myhill-Nerode theorem)

Let \(L\) be a language and let \(X\) be a set of strings. We say that \(X\) is pairwise distinguishable by \(L\) if every two distinct strings in \(X\) are distinguishable by \(L\). Define the index of \(L\) to be the maximum number of elements in a set that is pairwise distinguishable by \(L\). The index of \(L\) may be infinite or finite.

Show that, if \(L\) is recognised by a DFA with \(k\) states, \(L\) has index at most \(k\).
Show that, if the index of \(L\) is a finite number \(k\), it is recognised by a DFA with \(k\) states.
Conclude that \(L\) is regular if and only if it has finite index. Moreover, its index is the size of the smallest DFA recognising it.

Solution

Part 1: Let \(L\) be recognised by a DFA, \(M\), with \(k\) states. Suppose that the index of \(L\) is greater than \(k\). Then, there exists a set \(S\) containing \(k + 1\) distinct strings which are all pairwise distinguishable by \(L\). Consider passing each of the strings in \(S\) as input to \(M\). At the end of reading each of the strings, \(M\) will be in some state. By the pigeonhole principle, since there are more than \(k\) strings and only \(k\) states, two strings, say \(s_1 \in S\) and \(s_1 \in S\) must end up in the same final state. Then, for any string \(z\), including the empty string, the strings \(s_1 z\) and \(s_2 z\) will end up in the same final states, so they are either both accepted or both rejected by \(M\), which means they are indistinguishable. This is a contradiction, so \(L\) cannot have an index greater than \(k\).

Part 2: Suppose that \(L\) has a finite index \(k\). Then, there exists a finite set of strings of size \(k\), say \(S = \{s_1, \dots, s_k\}\), which is pairwise distinguishable by \(L\). We know, from Exercise 1.51, that indistinguishability under \(L\) is an equivalence relation \(\equiv_L\). Now note that any string \(z\) is indistiguishable from at least one string in \(S\) (because if not, then the index of \(L\) would be larger than \(k\)) and at most one string in \(S\) (beacuse if \(z \equiv_L s_i, s_j \in S\), then \(s_i \equiv_L s_j\)), so any string \(z\) is indistinguishable from exactly one string in \(L\). Therefore the equivalence relation \(\equiv_L\) forms exactly \(k\) equivalence classes over the set of all finite strings, and \(s_1, \dots, s_k\) are representatives of these classes. Let \(\pi(x)\) denote the equivalence class of a string \(x\). Now, define a DFA \(M = (Q, \Sigma, q_0, \delta, F)\) as follows. Let \(\Sigma\) be the alphabet over which \(L\) is defined. Then, let \(Q = \{\pi(s_1), \dots, \pi(s_n)\}\) be the states, \(q_0 = \pi(\epsilon)\), \(\delta(q, a) = \pi(xa)\) where \(x\) is any string that satisfies \(\pi(x) = q\) be the transition function, and \(F = \{\pi(x) \in Q | x \in L\}\) be the set of initial states. We need to show that this \(M\) is well defined, that is we need to show that \(\delta(q, a) = \pi(xa)\) gives the same answer regardless of which representative \(x\) of \(q\) we pick. To show this, let \(x, y\) be strings such that \(\pi(x) = \pi(y) = q\). Then \(x\) and \(y\) are in the same equivalence class so they are indistinguishable, so \(xa\) and \(ya\) are also indistinguishable, so \(\pi(xa) = \pi(ya)\) as required and \(\delta\) is well defined. This DFA accepts exactly the strings in \(L\) and no strings outside \(L\), so it recognises \(L\).

Part 3: If \(L\) is regular, then it is recognised by a DFA with a finite number of states so, by part 1, the index of \(L\) is finite. Conversely, if the index of \(L\) if finite then, by part 2, it is recognised by a DFA, so it is regular. Therefore \(L\) is regular if and only if it has a finite index. In addition, the index of \(L\) is the size of the smallest DFA recognising it: part 2 implies that if the index of \(L\) is \(k,\) then it is recognised by a DFA with \(k\) states and part 1 implies that \(L\) cannot be recognised by a DFA with fewer than \(k\) states.

Exercise 1.59

Let \(M = (Q, \Sigma, \delta, q_0, F)\) be a DFA and let \(h\) be a state of \(M\) called its home. A synchronising sequence for \(M\) and \(h\) is a string \(s \in \Sigma^*\) where \(\delta(q, s) = h\) for every \(q \in Q\). (Here we have extended \(\delta\) to strings, so that \(\delta(q, s)\) equals the state where \(M\) ends up when \(M\) starts at state \(q\) and reads input \(s\).) Say that \(M\) is synchronisable if it has a synchronising sequence for some state \(h\). Prove that if \(M\) is a \(k\)-state synchronisable DFA, then it has a synchronising sequence of length at most \(k^3\). Can you improve on this bound?

Exercise 1.63

Let \(L\) be an infinite regular language. Prove that \(L\) can be split into two disjoint infinite regular languages.
Let \(B\) and \(D\) be two languages. Write \(B \Subset D\) if \(B \subset D\) and \(D\) contains infinitely many strings not contained in \(B\). Show that if \(B\) and \(D\) are two regular languages where \(B \Subset D\), then we can find a regular language \(C\) where \(B \Subset C \Subset D\).

Exercise 1.67

Let the rotational closure of language \(A\) be \(\text{RC}(A) = \{yx | xy \in A\}\).

Show that for any language \(A\), we have \(\text{RC}(A) = \text{RC}(\text{RC}(A))\).
Show that the class of regular languages is closed under rotational closure.

Solution

Part 1: Let \(A\) be a language. If \(x \in A\), then the image of \(\{x\}\) is the set of all strings which are cyclic permutations of the symbols in \(x\). Since the set of all cyclic permutations of a string includes the string itself, \(\text{RC}(A) \subseteq \text{RC}(\text{RC}(A)).\) Conversely, applying two cyclic permutations to a string gives another cyclic permutation of the same string, so \(\text{RC}(\text{RC}(A))\subseteq \text{RC}(A),\) concluding that \(\text{RC}(A) = \text{RC}(\text{RC}(A))\).

Part 2: Suppose \(A\) is a regular language, so there exists a DFA \(M = (Q, \Sigma, \delta, q_0, F)\) which recognises it. Suppose \(|Q| = N.\) Let \(q_0'\) be a new initial state, and \(Q_{n, 1}, Q_{n, 2}\) be copies of \(Q\) for \(n = 1, \dots, N.\) Let the \(m^{th}\) state in \(Q\) be denoted \(q_m,\) and let the corresponding copies in \(Q_{n, 1}\) and \(Q_{n, 2}\) be \(q_{n, m, 1}\) and \(q_{n, m, 2}\) respectively. Similarly, let the \(k^{th}\) final state in \(F\) be \(f_k\) and let the corresponding copies in \(Q_{n, 1}\) and \(Q_{n, 2}\) be \(f_{n, k, 1}\) and \(f_{n, k, 2}\) respectively. Finally, let \(\delta_{n, 1}, \delta_{n, 2}\) be copies of \(\delta\) for \(n = 1, \dots, N.\)

Now, \(\epsilon\) transitions from \(q_0'\) to each \(q_{n, n, 1}.\) Also, add \(\epsilon\) transitions from each \(f_{n, k, 1}\) to \(q_{n, 0, 2}.\) Let the states \(q_{n, n, 2}\) for \(n = 1, \dots, N\) be the final states, and let the transition function be the collection of the \(\epsilon\) transitions described above, together with the individual transition functions \(\delta_{n, 1}\) and \(\delta_{n, 2}\) for \(n = 1, \dots, N.\) Let us determine the language recognised by this NFA, by breaking down the transition into two stages.

First stage: In the first stage, the NFA can first make a transition to any \(q_{n, n, 1},\) without reading in a symbol. Then, in the first stage, the NFA can make any sequence of transitions in \(Q_{n, 1},\) according to \(\delta_{n, 1}\) until it reaches a state \(f_{n, k, 1}\) which is a copy of the final state \(f_k\) from \(F.\) At this point, the NFA has read a sequence of symbols \(x_1\dots x_p\) that is a suffix of a string in \(A,\) and it cannot have reached a terminal state.

Second stage: Then, in the second stage, the NFA can make a transition to \(q_{n, 0, 2},\) which is a copy of the initial state \(q_0\) from \(Q,\) without reading a symbol. Then, the NFA can make a sequence of transitions in \(Q_{n, 2}\) according to \(\delta_{n, 2}\) until it reaches a final state \(q_{n, n, 2},\) which is a copy of the state \(q_n\) from \(Q.\) During this second stage, the NFA reads a sequence of symbols \(y_1, \dots, y_q\) if and only if it is a prefix of a string in \(A,\) and also if after reading it, the original DFA \(M\) would end up in state \(q_n.\)

Conclusion: Threfore, the NFA reads a sequence of symbols \(x_1\dots x_p y_1 \dots y_q\) if and only if: (1) starting from state \(q_n,\) the DFA \(M\) would end up in a final state after reading \(x_1\dots x_p\) and also (2) starting from \(q_0,\) the DFA \(M\) would end up in state \(q_n\) after reading \(y_1 \dots y_q.\) Therefore, the NFA accepts a string \(s\) if and only if it can be written as \(s = xy\) where \(yx \in A,\) so it recognises \(\text{RC}(A).\)

Excercises

Contents

Excercises#

Chapter 1#