Mock Exam 3 — Medium

Cover

CSE2315 Automata, Languages and Computability. 10 open questions, 45 points total. Closed-book — one double-sided A4 handwritten cheatsheet allowed. No calculator. Assume P ≠ NP and NP ≠ EXP. Karp reductions denoted ≤_P. Irrelevant information costs points.

Comparable to the actual Endterm 24-25. Requires solid understanding; some questions have non-obvious angles.

Q1 — 5 pts — DFA Product Construction

Consider the following two DFAs over Σ = {a, b}.

DFA D₁: States {q₀, q₁}, start state q₀, accept states F₁ (to be determined).

δ(q₀, a) = q₁, δ(q₀, b) = q₀
δ(q₁, a) = q₀, δ(q₁, b) = q₁

L(D₁) = {w ∈ {a, b}* | w contains an even number of a’s}.

DFA D₂: States {p₀, p₁}, start state p₀, accept states F₂ = {p₁}.

δ(p₀, a) = p₀, δ(p₀, b) = p₁
δ(p₁, a) = p₁, δ(p₁, b) = p₀

L(D₂) = {w ∈ {a, b}* | w contains an odd number of b’s}.

(a) (1 pt) Give the set F₁ for D₁.

(b) (4 pts) Construct a DFA D such that L(D) = L(D₁) ⊕ L(D₂) (symmetric difference). Leave out unreachable states.

Solution

Part (a)

q₀ tracks "even number of a’s seen." The start state q₀ has seen zero a’s (even), so:

F₁ = {q₀}

Part (b) — Product construction

Symmetric difference: accept when exactly one of D₁, D₂ accepts (XOR).

Product states: (q_i, p_j). Start: (q₀, p₀).

Accept states F = {(q_i, p_j) | exactly one of q_i ∈ F₁, p_j ∈ F₂}.

(q₀, p₀): q₀ ∈ F₁ ✓, p₀ ∉ F₂ → exactly one → accept
(q₀, p₁): q₀ ∈ F₁ ✓, p₁ ∈ F₂ ✓ → both → reject
(q₁, p₀): q₁ ∉ F₁, p₀ ∉ F₂ → neither → reject
(q₁, p₁): q₁ ∉ F₁, p₁ ∈ F₂ ✓ → exactly one → accept

F = {(q₀, p₀), (q₁, p₁)}

Transition table:

State	a	b
(q₀, p₀) ☆	(q₁, p₀)	(q₀, p₁)
(q₁, p₀)	(q₀, p₀)	(q₁, p₁)
(q₀, p₁)	(q₁, p₁)	(q₀, p₀)
(q₁, p₁) ☆	(q₀, p₁)	(q₁, p₀)

All 4 states are reachable from the start state. L(D) = "even #a’s XOR odd #b’s."
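
The product construction above can be sanity-checked by brute force. The following is a minimal sketch, not part of the expected exam answer: the state names are written in ASCII, and the helper names `product_accepts`, `spec`, and `check_up_to` are made up for this illustration.

```python
from itertools import product

# Transition tables of D1 (parity of a's) and D2 (parity of b's), copied from the question.
D1 = {("q0", "a"): "q1", ("q0", "b"): "q0", ("q1", "a"): "q0", ("q1", "b"): "q1"}
D2 = {("p0", "a"): "p0", ("p0", "b"): "p1", ("p1", "a"): "p1", ("p1", "b"): "p0"}
F1, F2 = {"q0"}, {"p1"}


def product_accepts(word):
    """Run both components in lockstep; accept iff exactly one of them accepts."""
    q, p = "q0", "p0"
    for ch in word:
        q, p = D1[(q, ch)], D2[(p, ch)]
    return (q in F1) != (p in F2)


def spec(word):
    """Direct definition: (even number of a's) XOR (odd number of b's)."""
    return (word.count("a") % 2 == 0) != (word.count("b") % 2 == 1)


def check_up_to(max_len):
    """Compare the product DFA with the specification on every word up to max_len."""
    return all(
        product_accepts("".join(w)) == spec("".join(w))
        for n in range(max_len + 1)
        for w in product("ab", repeat=n)
    )
```

Calling `check_up_to(8)` compares the two definitions on all 511 words of length at most 8. Agreement on short words is evidence, not a proof, but it catches a wrong accept set or a mistyped transition quickly.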

Q2 — 3 pts — GNFA State Elimination

Consider the NFA N over Σ = {0, 1} with states {q₀, q₁, q₂}, start state q₀, accept state q₂.

δ(q₀, 0) = q₀, δ(q₀, 1) = q₁
δ(q₁, 0) = q₂, δ(q₁, 1) = q₁
δ(q₂, 0) = q₂, δ(q₂, 1) = q₀

Convert N to a GNFA G with L(G) = L(N) using Sipser’s procedure, then remove state q₁. Show the resulting GNFA (all transitions with their regex labels).

Solution

Step 1 — Build GNFA

Add new start state q_s with ε-transition to q₀, and new accept state q_a with ε-transition from q₂.

GNFA transitions (before elimination):

q_s → q₀: ε
q₀ → q₀: 0
q₀ → q₁: 1
q₁ → q₁: 1
q₁ → q₂: 0
q₂ → q₂: 0
q₂ → q₀: 1
q₂ → q_a: ε

Step 2 — Eliminate q₁

For each pair (q_i, q_j) where q_i has an edge to q₁ and q₁ has an edge to q_j:

Formula: R(q_i, q_j) = R_old(q_i, q_j) ∪ R(q_i, q₁) · R(q₁, q₁)* · R(q₁, q_j)

q₁’s self-loop: 1
Into q₁: q₀ → q₁ via "1"
Out of q₁: q₁ → q₂ via "0"

Only affected pair: (q₀, q₂)

Old q₀ → q₂: ∅ (no direct edge)

New q₀ → q₂: ∅ ∪ 1 · 1* · 0 = 1·1*·0 = 1⁺0

Resulting GNFA

States: {q_s, q₀, q₂, q_a}. Transitions:

From	To	Label
q_s	q₀	ε
q₀	q₀	0
q₀	q₂	1⁺0
q₂	q₂	0
q₂	q₀	1
q₂	q_a	ε

q₀ → q₂ gets label 1⁺0 (= 1·1*·0). All other transitions unchanged.

The path through q₁ means: read one or more 1’s, then a 0 to reach q₂.
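
As a rough check of the new label, one can enumerate short strings and compare "leads from q₀ to q₂ with only q₁ in between" against the regex 1⁺0. This is a sketch under the assumption that Python's `re` syntax `1+0` stands for 1⁺0; the function names are hypothetical.

```python
import re
from itertools import product

# Transition function of N, copied from the question (ASCII state names).
DELTA = {("q0", "0"): "q0", ("q0", "1"): "q1",
         ("q1", "0"): "q2", ("q1", "1"): "q1",
         ("q2", "0"): "q2", ("q2", "1"): "q0"}


def goes_via_q1_only(word):
    """True iff word leads q0 to q2 and every intermediate state is q1."""
    state, visited = "q0", []
    for ch in word:
        state = DELTA[(state, ch)]
        visited.append(state)
    return (bool(visited) and visited[-1] == "q2"
            and all(s == "q1" for s in visited[:-1]))


def label_matches(max_len):
    """Compare the path condition with the label 1+0 on all short strings."""
    for n in range(max_len + 1):
        for t in product("01", repeat=n):
            w = "".join(t)
            if goes_via_q1_only(w) != (re.fullmatch(r"1+0", w) is not None):
                return False
    return True
```

The check only covers the single rerouted edge q₀ → q₂, which is the only label that changes when q₁ is removed.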

Q3 — 2 pts — Regular Expression

Give a regular expression R over Σ = {0, 1} such that:

$$L(R) = \{w \in \{0,1\}^* \mid w \text{ does not contain } 110 \text{ as a substring}\}$$

Solution

Key insight

The substring "110" is forbidden. After seeing "11", the next character cannot be 0 — it must be another 1 (or the string ends). So once a run of two or more 1’s starts, only more 1’s can follow until the string ends.

Building the regex

The string consists of two phases:

Phase 1: Any sequence of "0" and "10" blocks — these can never create "110"
Phase 2: A trailing run of 1’s (possibly empty) — once we start a run of ≥2 ones, we can’t go back to 0

$$R = (0 \mid 10)^* \cdot 1^*$$

Verification

String	Contains "110"?	Matches R?	✓?
ε	No	Yes: (0|10)* and 1* are both taken zero times	✓
110	Yes	No: the leading 1 is followed by 1, so it must come from 1*, after which no 0 can appear	✓
111	No	Yes: empty first part, then 1* = 111	✓
0101	No	Yes: (0)(10) from (0|10)*, then 1 from 1*	✓
1100	Yes	No: same reason as 110; once 1* has started, the remaining 00 cannot be produced	✓
10100	No	Yes: (10)(10)(0) from (0|10)*, empty 1*	✓

The (0|10)* part ensures every 1 in phase 1 is immediately followed by 0 — so no two consecutive 1’s can occur before the final trailing block.
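
The hand verification can be extended to all short strings. Below is a minimal sketch that assumes Python's `re` syntax for R; `agrees_up_to` is a name invented for this example.

```python
import re
from itertools import product

# R = (0|10)* 1*, written in Python regex syntax.
R = re.compile(r"(0|10)*1*")


def agrees_up_to(max_len):
    """True iff R matches exactly the strings without substring 110, up to max_len."""
    for n in range(max_len + 1):
        for t in product("01", repeat=n):
            w = "".join(t)
            if (R.fullmatch(w) is not None) != ("110" not in w):
                return False
    return True
```

`agrees_up_to(10)` tests all 2047 binary strings of length at most 10. A mismatch would return False at the first offending string length, which makes it a quick way to test a candidate regex before committing to it.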

Q4 — 3 pts — Pumping Lemma

Let $L = \{w \in \{a,b\}^* \mid w = w^R\}$ (the language of palindromes).

Use the pumping lemma to prove that L is not regular. Use the word $w = a^p b a^p$.

Solution

Setup

Assume for contradiction that L is regular with pumping length p.

Choose $w = a^p b a^p$. Then $w \in L$ since $w^R = a^p b a^p = w$, and $|w| = 2p + 1 \ge p$.

Pumping

By the pumping lemma, $w = xyz$ where:

$|xy| \le p$
$|y| \ge 1$
$xy^i z \in L$ for all $i \ge 0$

Since $|xy| \le p$, both x and y lie entirely within the first block of a’s. So $y = a^k$ for some $k \ge 1$.

Pump down: i = 0

$$xy^0z = xz = a^{p-k} b a^p$$

Left side has $p - k$ a’s before b. Right side has $p$ a’s after b.

Since $k \ge 1$, we have $p - k < p$, so the string is not a palindrome.

$a^{p-k}ba^p \notin L$ since $p-k \ne p$. Contradiction with condition 3. ∴ L is not regular.
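
The case analysis can be illustrated for small p by enumerating every legal split. This is only a sketch: it checks concrete values of p and does not replace the proof, and the helper names are hypothetical.

```python
def is_palindrome(s):
    return s == s[::-1]


def pumped_down_words(p):
    """Yield xz for every split w = xyz of a^p b a^p with |xy| <= p and |y| >= 1."""
    w = "a" * p + "b" + "a" * p
    for xy_len in range(1, p + 1):
        for y_len in range(1, xy_len + 1):
            x = w[:xy_len - y_len]   # prefix before y
            z = w[xy_len:]           # everything after y
            yield x + z


def no_split_survives(p):
    """True iff pumping down (i = 0) leaves the language for every legal split."""
    return not any(is_palindrome(v) for v in pumped_down_words(p))
```

For p = 1 the only split is x = ε, y = a, z = ba, and pumping down gives "ba", which is not a palindrome.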

Q5 — 5 pts — CFG + PDA

Consider the CFG G = ({S}, {a, b, c}, R, S) with rules:

$$S \to aSa \mid bSb \mid c$$

(a) (2 pts) Is G ambiguous? Prove your answer.

(b) (3 pts) Give a PDA P with at most 6 states that recognizes L = $\{wcw^R \mid w \in \{a, b\}^*\}$.

Solution

Part (a) — Ambiguity

G is not ambiguous.

Every word in L(G) has the form $wcw^R$ where $w \in \{a,b\}^*$. The derivation is uniquely forced:

The outermost symbols of the string still to be derived determine which rule applies
If the string starts with a (and ends with a), we must use S → aSa
If the string starts with b (and ends with b), we must use S → bSb
If the remaining string is just "c", we must use S → c

G is unambiguous. Each $wcw^R$ has exactly one leftmost derivation — the first/last character at each step uniquely determines the production.

Part (b) — PDA

Strategy: push symbols until c is read, then pop matching symbols.

States: q_start, q_push, q_pop, q_acc.

State	Input	Stack top	Push	Next state
q_start	ε	ε	$	q_push
q_push	a	ε	a	q_push
q_push	b	ε	b	q_push
q_push	c	ε	ε	q_pop
q_pop	a	a	ε	q_pop
q_pop	b	b	ε	q_pop
q_pop	ε	$	ε	q_acc

4 states: q_start, q_push, q_pop, q_acc. Push phase reads w and pushes each symbol. On reading c, switch to pop phase. Pop phase matches w^R against the stack. Accept when stack marker $ is reached.
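
Because this PDA never has a choice between two moves, it can be simulated directly with a list as the stack. The sketch below is one possible rendering; `pda_accepts` and `in_language` are names chosen for this example, and the ε-moves into q_push and q_acc are folded into the initialisation and the final check.

```python
from itertools import product


def pda_accepts(word):
    """Simulate the 4-state PDA from the table above."""
    stack = ["$"]            # q_start: push the bottom marker, move to q_push
    state = "q_push"
    for ch in word:
        if state == "q_push" and ch in ("a", "b"):
            stack.append(ch)                      # push phase
        elif state == "q_push" and ch == "c":
            state = "q_pop"                       # middle marker: switch phase
        elif state == "q_pop" and ch in ("a", "b") and stack[-1] == ch:
            stack.pop()                           # pop phase: symbol must match
        else:
            return False                          # no transition defined
    # q_pop --(eps, pop $)--> q_acc is possible only if just the marker is left
    return state == "q_pop" and stack == ["$"]


def in_language(word):
    """Direct definition of { w c w^R | w in {a,b}* }."""
    if len(word) % 2 == 0:
        return False
    mid = len(word) // 2
    w = word[:mid]
    return word[mid] == "c" and "c" not in w and word[mid + 1:] == w[::-1]


def agrees_up_to(max_len):
    return all(
        pda_accepts("".join(t)) == in_language("".join(t))
        for n in range(max_len + 1)
        for t in product("abc", repeat=n)
    )
```

For example, `pda_accepts("abcba")` holds, while `pda_accepts("abcab")` fails at the first pop because the stack top is b and the input symbol is a.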

Q6 — 4 pts — Turing Machine

Consider TM M with input alphabet Σ = {a}, tape alphabet Γ = {a, ␣, X}, and transition function:

State	Read	Write	Move	Next
q₀	a	X	R	q₁
q₀	␣	␣	R	q_acc
q₁	a	a	R	q₁
q₁	␣	␣	L	q₂
q₂	a	␣	L	q₃
q₂	X	X	R	q_rej
q₃	a	a	L	q₃
q₃	X	X	R	q₀

(a) (2 pts) Find a word w with |w| = 4 on which M accepts. Show the full sequence of TM configurations.

(b) (2 pts) Is L(M) decidable? What language does M recognize?

Solution

Understanding M

M operates in rounds. Each round:

q₀: Mark the leftmost a with X, go right
q₁: Scan right past all a’s to find the right end
q₂: Erase the rightmost a (replace with ␣), go left
q₃: Scan left past all a’s back to the X marker, restart

Each round removes a pair: one from the left (marked X) and one from the right (erased). Accepts when all a’s are consumed. Rejects if in q₂ no a is left (only X), meaning odd number of a’s.

Part (a) — w = aaaa

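Configurations are written u q v: the state q stands immediately left of the scanned cell, and trailing blanks are omitted.

#	Configuration	State
1	q₀ aaaa	q₀
2	X q₁ aaa	q₁
3	Xa q₁ aa	q₁
4	Xaa q₁ a	q₁
5	Xaaa q₁ ␣	q₁
6	Xaa q₂ a	q₂
7	Xa q₃ a	q₃
8	X q₃ aa	q₃
9	q₃ Xaa	q₃
10	X q₀ aa	q₀
11	XX q₁ a	q₁
12	XXa q₁ ␣	q₁
13	XX q₂ a	q₂
14	X q₃ X	q₃
15	XX q₀ ␣	q₀
16	XX␣ q_acc	q_acc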

w = aaaa. M accepts after removing pairs (a₁,a₄) and (a₂,a₃).

Part (b)

L(M) = $\{a^{2n} \mid n \ge 0\}$ = strings of a’s with even length (including ε).
Yes, L(M) is decidable. In fact, L(M) is regular — recognized by a 2-state DFA that counts a’s mod 2.
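
The transition table is small enough to simulate. The following sketch assumes `_` as a stand-in for the blank symbol ␣ and a step limit as a safety net; `run` is a hypothetical helper, and transitions that the table leaves undefined would raise a `KeyError`, which does not happen on inputs of the form aⁿ.

```python
BLANK = "_"  # stands in for the blank symbol

# (state, read) -> (write, head move, next state), copied from the table in the question.
DELTA = {
    ("q0", "a"): ("X", +1, "q1"),
    ("q0", BLANK): (BLANK, +1, "q_acc"),
    ("q1", "a"): ("a", +1, "q1"),
    ("q1", BLANK): (BLANK, -1, "q2"),
    ("q2", "a"): (BLANK, -1, "q3"),
    ("q2", "X"): ("X", +1, "q_rej"),
    ("q3", "a"): ("a", -1, "q3"),
    ("q3", "X"): ("X", +1, "q0"),
}


def run(word, max_steps=10_000):
    """Return (accepted, number of steps) for M started on word."""
    tape = dict(enumerate(word))      # sparse tape: position -> symbol
    state, head, steps = "q0", 0, 0
    while state not in ("q_acc", "q_rej"):
        if steps >= max_steps:
            raise RuntimeError("step limit reached")
        write, move, state = DELTA[(state, tape.get(head, BLANK))]
        tape[head] = write
        head += move
        steps += 1
    return state == "q_acc", steps
```

On `"aaaa"` the simulation halts in the accept state after 15 steps, matching the 16 configurations of the trace, and on aⁿ for small n it accepts exactly when n is even.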

Q7 — 7 pts — Undecidability

Let $L = \{\langle M \rangle \mid M \text{ is a TM and } L(M) \neq \emptyset\}$.

(a) (2 pts) Prove that L is Turing-recognizable.

(b) (4 pts) Prove that L is undecidable.

(c) (1 pt) Prove that L is not co-Turing-recognizable.

Solution

Part (a) — L is Turing-recognizable (2 pts)

Build a TM T that recognizes L using dovetailing. Let s₁, s₂, s₃, … be a fixed enumeration of Σ* (for example, in shortlex order).

T = “On input ⟨M⟩:

For n = 1, 2, 3, …
Run M on each of the strings s₁, s₂, …, s_n for n steps
If any of these runs accepts, accept.”

If L(M) ≠ ∅, then M accepts some string s_i within some number t of steps, so T accepts in round n = max(i, t) at the latest. If L(M) = ∅, no simulated run ever accepts, so T loops (never accepts).

T accepts ⟨M⟩ iff L(M) ≠ ∅. T may loop if L(M) = ∅. ∴ L is TR.

Part (b) — L is undecidable (4 pts)

Reduce from A_TM = {⟨M, w⟩ | M accepts w}.

Assume for contradiction that decider R decides L. Build decider S for A_TM:

S = “On input ⟨M, w⟩:

Construct TM M′ = “On input x: ignore x. Run M on w. If M accepts w, accept.”
Run R on ⟨M′⟩.
If R accepts, accept. If R rejects, reject.”

Correctness:

If M accepts w: M′ accepts every input → L(M′) = Σ* (infinite) → ⟨M′⟩ ∈ L → R accepts → S accepts.
If M does not accept w: M′ accepts nothing → L(M′) = ∅ (finite) → ⟨M′⟩ ∉ L → R rejects → S rejects.

So S decides A_TM. Contradiction since A_TM is undecidable.

A_TM ≤_m L, so L is undecidable.

Part (c) — L is not co-TR (1 pt)

From (a), L is Turing-recognizable. If L were also co-Turing-recognizable, then L would be both TR and co-TR, which means L would be decidable (by running both recognizers in parallel). But (b) shows L is undecidable. Contradiction.

TR + undecidable → not co-TR.

Q8 — 3 pts — Fill-in Reduction

Consider the language:

$$L = \{\langle M \rangle \mid M \text{ accepts all strings of length } \le 5\}$$

We want a mapping reduction from A_TM to L. The computable function F is:

F = “On input ⟨M, w⟩:

Construct M′ = “On input x: (a) …”
Output ⟨M′⟩.”

(a) (1 pt) What should M′ do on input x? Complete the description.

(b) (2 pts) What does this reduction prove about L? Which recognizability class does L belong to (TR, co-TR, neither)?

Solution

Part (a)

M′ = “On input x:
1. If |x| > 5, reject.
2. Run M on w.
3. If M accepts, accept.”

Correctness of the reduction

M accepts w → M′ accepts all x with |x| ≤ 5 (step 2 always reaches step 3, which accepts) → ⟨M′⟩ ∈ L
M does not accept w (rejects or loops) → M′ rejects or loops on all x with |x| ≤ 5 → M′ does not accept any string of length ≤ 5 → ⟨M′⟩ ∉ L

Part (b)

A_TM ≤_m L, so L is undecidable.

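For the recognizability class: L is Turing-recognizable.

To see this, build a recognizer for L directly:

“On input ⟨M⟩: Run M on each of the finitely many strings of length 0, 1, …, 5, one after another. If a run rejects, reject. If every run accepts, accept.”

If M accepts all strings of length ≤ 5, every run halts and accepts, so the recognizer accepts. Otherwise some run rejects or loops, and the recognizer rejects or loops. So L is TR.

L is not co-TR: the function F also satisfies ⟨M, w⟩ ∉ A_TM iff F(⟨M, w⟩) ∉ L, so it is a mapping reduction from $\overline{A_{TM}}$ to $\overline{L}$. Since $\overline{A_{TM}}$ is not Turing-recognizable, neither is $\overline{L}$.

L is undecidable and TR, so L is not co-Turing-recognizable.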

Q9 — 7 pts — NP Scaffold

COLORFUL MATCHING

Instance: A bipartite graph G = (A ∪ B, E), a coloring c: E → {1, …, k}, and an integer m.

Question: Is there a matching M ⊆ E with |M| ≥ m such that no two edges in M have the same color?

(a) (2 pts) Prove that COLORFUL MATCHING ∈ NP.

(b) (1 pt) Give a yes-instance of Vertex Cover with |V| = 4, |E| = 4, K = 2.

(c) (2 pts) A student proposes the following reduction from Vertex Cover to COLORFUL MATCHING:

Given VC instance (G = (V, E), K): set A = V, B = E. Add edge (v, e) iff v is an endpoint of e. Color all edges color 1. Set m = K.

Apply this reduction to your VC instance from (b).

(d) (1 pt) Is the resulting COLORFUL MATCHING instance a yes-instance?

(e) (1 pt) Explain what is wrong with the reduction and suggest a fix.

Solution

Part (a) — COLORFUL MATCHING ∈ NP

Certificate: a set of edges M.

Verifier checks in polynomial time:

M ⊆ E and |M| ≥ m
M is a valid matching: no two edges in M share a vertex
All edges in M have distinct colors: no two edges in M have the same color

Certificate = matching M. Verification is O(|M|²) for pairwise checks. ∴ COLORFUL MATCHING ∈ NP.
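
A verifier along these lines might look as follows. This is a sketch under assumed input conventions (bipartite edges as (a, b) pairs, the coloring as a dictionary); it uses set sizes instead of the pairwise comparisons mentioned above, which is also polynomial. The function name is hypothetical.

```python
def verify_colorful_matching(edges, color, m, certificate):
    """Check a certificate for COLORFUL MATCHING.

    edges: iterable of (a, b) pairs with a in A and b in B
    color: dict mapping each edge to its color
    m: required matching size
    certificate: iterable of edges claimed to form a colorful matching
    """
    chosen = set(certificate)
    if len(chosen) < m or not chosen <= set(edges):
        return False                      # too small, or uses edges outside E
    left = {a for a, _ in chosen}         # endpoints in A
    right = {b for _, b in chosen}        # endpoints in B
    colors = {color[e] for e in chosen}   # colors used
    # A valid colorful matching has no repeated endpoint and no repeated color.
    return len(left) == len(right) == len(colors) == len(chosen)
```

Each check touches every chosen edge a constant number of times, so the running time is polynomial in the size of the instance and the certificate.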

Part (b) — VC yes-instance

V = {v₁, v₂, v₃, v₄}, E = {(v₁,v₂), (v₂,v₃), (v₃,v₄), (v₁,v₃)}, K = 2.

Vertex cover: {v₂, v₃}. Every edge has at least one endpoint in {v₂, v₃}. ✓

Part (c) — Apply the reduction

Bipartite graph: Left side A = {v₁, v₂, v₃, v₄}, right side B = {e₁, e₂, e₃, e₄}.

Edges in the bipartite graph (vertex v connected to edge e iff v is an endpoint):

v₁ — e₁ (v₁v₂), v₁ — e₄ (v₁v₃)
v₂ — e₁ (v₁v₂), v₂ — e₂ (v₂v₃)
v₃ — e₂ (v₂v₃), v₃ — e₃ (v₃v₄), v₃ — e₄ (v₁v₃)
v₄ — e₃ (v₃v₄)

All bipartite edges are colored 1. m = K = 2.

Part (d)

No. Since all edges have color 1, any matching of size ≥ 2 contains two edges of the same color. A colorful matching can have at most 1 edge, but m = 2 requires 2.
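
Parts (b) to (d) can be replayed mechanically. The sketch below rebuilds the student's reduction for the instance from (b) and searches for a colorful matching by brute force; the edge names e1..e4 follow part (c), and all function names are made up for this illustration.

```python
from itertools import combinations

# Vertex Cover instance from part (b); edge names follow part (c).
NAMED_EDGES = {"e1": ("v1", "v2"), "e2": ("v2", "v3"),
               "e3": ("v3", "v4"), "e4": ("v1", "v3")}
K = 2


def is_vertex_cover(cover, named_edges):
    """True iff every edge has at least one endpoint in cover."""
    return all(u in cover or v in cover for u, v in named_edges.values())


def student_reduction(named_edges, k):
    """A = V, B = E, edge (v, e) iff v is an endpoint of e, all colors 1, m = k."""
    bip_edges = [(v, name) for name, ends in named_edges.items() for v in ends]
    color = {edge: 1 for edge in bip_edges}
    return bip_edges, color, k


def has_colorful_matching(bip_edges, color, m):
    """Brute force: try every set of exactly m bipartite edges."""
    for subset in combinations(bip_edges, m):
        left = {v for v, _ in subset}
        right = {e for _, e in subset}
        colors = {color[edge] for edge in subset}
        if len(left) == len(right) == len(colors) == m:
            return True
    return False


BIP_EDGES, COLOR, M_TARGET = student_reduction(NAMED_EDGES, K)
```

Here `is_vertex_cover({"v2", "v3"}, NAMED_EDGES)` holds while `has_colorful_matching(BIP_EDGES, COLOR, M_TARGET)` does not, so the reduction maps a yes-instance to a no-instance. Checking subsets of size exactly m suffices because every subset of a colorful matching is again a colorful matching.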

Part (e) — The flaw and fix

Flaw: All edges are assigned the same color (color 1). This means the "colorful" constraint forbids any matching of size ≥ 2, regardless of the VC instance. The reduction always produces a no-instance (for K ≥ 2), so it cannot be correct.

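Fix: The coloring must not be constant. A natural first repair gives each original edge e_i its own color i, so that the bipartite edges (v, e_i) and (v′, e_i) both get color i. A colorful matching then picks at most one bipartite edge per original edge and at most one per vertex.

This removes the trivial failure, but on its own it is not yet a correct reduction: a colorful matching of size K only selects K original edges with distinct chosen endpoints and says nothing about the remaining edges. For the graph from (b) with K = 1 there is no vertex cover of size 1 (v₃ misses the edge (v₁,v₂), and every other vertex misses at least two edges), yet any single bipartite edge is a colorful matching of size m = 1. A correct reduction must also force every original edge to be covered while bounding the number of vertices used, which needs more than setting m = K.

Dropping the color constraint entirely does not help either: the target then becomes standard bipartite matching, which is in P, so under the assumption P ≠ NP it cannot be the target of a Karp reduction from Vertex Cover.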

Q10 — 6 pts — True / False

For each statement, determine whether it is true or false and provide a brief justification.

(a) (2 pts) If L₁ and L₂ are both Turing-recognizable, then L₁ ∩ L₂ is Turing-recognizable.

(b) (1 pt) Every regular language is in P.

(c) (2 pts) If A ≤_P B and B ∈ NP, then A ∈ NP.

(d) (1 pt) A_TM ≤_m HALT_TM.

Solution

(a) True

Given recognizers T₁ for L₁ and T₂ for L₂, build recognizer T for L₁ ∩ L₂:

T = “On input w: Run T₁ and T₂ on w in parallel (dovetail). If both accept, accept.”

If w ∈ L₁ ∩ L₂: both eventually accept → T accepts. If w ∉ L₁ ∩ L₂: at least one never accepts → T never accepts.

True. TR is closed under intersection.

(b) True

Every regular language is recognized by some DFA. A DFA processes its input in a single left-to-right pass, reading each symbol once, so it runs in O(n) time where n = |w|. Since O(n) = O(n^k) for k = 1, this is polynomial time.

True. REG ⊂ P (DFA runs in O(n) time).

(c) True

A ≤_P B means there exists a poly-time computable function f such that x ∈ A iff f(x) ∈ B.

Since B ∈ NP, there is a poly-time verifier V_B for B.

Build verifier V_A for A: “On input (x, c): compute f(x) in poly time. Run V_B(f(x), c). Accept iff V_B accepts.”

Total time: poly(|x|) for f + poly(|f(x)|) for V_B = polynomial.

True. NP is closed under polynomial-time reductions.

(d) True

Construct mapping f: on input ⟨M, w⟩, output ⟨M′, w⟩ where:

M′ = “On input x: simulate M on x. If M accepts, accept. If M rejects, loop forever.”

M accepts w → M′ halts on w (accepts) → ⟨M′, w⟩ ∈ HALT_TM ✓
M rejects w → M′ loops on w → ⟨M′, w⟩ ∉ HALT_TM ✓
M loops on w → M′ loops on w → ⟨M′, w⟩ ∉ HALT_TM ✓

True. A_TM ≤_m HALT_TM via the reduction that converts rejections into loops.

Score Yourself

Question	Topic	Points
Q1a	DFA Product — accept states	1
Q1b	DFA Product — symmetric difference	4
Q2	GNFA State Elimination	3
Q3	Regular Expression	2
Q4	Pumping Lemma	3
Q5a	CFG Ambiguity	2
Q5b	PDA Construction	3
Q6a	TM Trace	2
Q6b	TM Language	2
Q7a	TR proof	2
Q7b	Undecidability reduction	4
Q7c	Not co-TR	1
Q8a	Fill-in — M′ description	1
Q8b	Fill-in — conclusion	2
Q9a	NP membership	2
Q9b	VC instance	1
Q9c	Apply reduction	2
Q9d	Yes/No instance	1
Q9e	Fix the flaw	1
Q10a	TR ∩ closure	2
Q10b	REG ⊂ P	1
Q10c	NP closed under ≤_P	2
Q10d	A_TM ≤_m HALT_TM	1
Total		/45