Unit 5 - Notes

CSE408 8 min read

Unit 5: Backtracking and Approximation Algorithms

1. Introduction to Backtracking

Backtracking is a systematic method to iterate through all the possible configurations of a search space. It is a refinement of the brute force approach, which systematically searches for a solution to a problem among all available options.

State-Space Tree: The search space is represented as a tree. The root represents the initial state, and leaves represent the final states (solutions or dead ends).
Depth-First Search (DFS): Backtracking traverses the state-space tree using DFS.
Bounding Function / Pruning: As the tree is explored, the algorithm checks if the current node can lead to a valid solution. If it cannot, the algorithm "backtracks" (returns to the parent node) and explores other branches, pruning the dead-end subtree.

1.1 The n-Queens Problem

Problem Statement: Place $n$ non-attacking queens on an $n \times n$ chessboard. Two queens are attacking each other if they share the same row, column, or diagonal.

Backtracking Approach:

Place queens column by column (or row by row).
For column $i$ , try placing a queen in each row $j$ (from 1 to $n$ ).
Constraint Check: Check if placing a queen at $(j, i)$ is safe (no queen in the same row, column, or diagonals).
If safe, mark the position and recursively place the queen in the $(i+1)^{th}$ column.
If placing the queen at $(j, i)$ does not lead to a solution, unmark the position (backtrack) and try the next row.

Time Complexity: $O(n!)$ in the worst case, though pruning significantly reduces actual execution time.

1.2 Hamiltonian Circuit Problem

Problem Statement: Given an undirected graph $G = (V, E)$ , find a cycle that visits every vertex exactly once and returns to the starting vertex.

Backtracking Approach:

Start at an arbitrary vertex (e.g., vertex 0).
Maintain an array path[] to store the sequence of vertices.
At each step, try adding a new vertex to path[].
Constraint Check:
- The new vertex must have an edge connecting it to the previously added vertex.
- The new vertex must not already be in path[].
If valid, add the vertex and recurse.
If the path contains all $V$ vertices, check if there is an edge from the last vertex back to the first vertex. If yes, a Hamiltonian circuit is found.
If not, remove the vertex (backtrack) and try another adjacent vertex.

1.3 Subset-Sum Problem

Problem Statement: Given a set of non-negative integers $S = \{w_1, w_2, \dots, w_n\}$ and a target weight $W$ , find a subset of $S$ whose elements sum to exactly $W$ .

Backtracking Approach:

Sort the set $S$ in ascending order (optional but highly recommended for efficient pruning).
Traverse the state-space tree where each node represents the inclusion or exclusion of the current element $w_i$ .
Constraint Check / Pruning:
- If current sum + $w_i > W$ , prune the branch (since the array is sorted, subsequent elements will also exceed $W$ ).
- If current sum + sum of all remaining elements $< W$ , prune the branch (it's impossible to reach $W$ ).
If current sum $== W$ , a solution is found.

2. Introduction to Branch-and-Bound

Branch-and-Bound (B&B) is an algorithm design paradigm generally used for solving combinatorial optimization problems. Like backtracking, it uses a state-space tree, but with key differences:

Search Strategy: B&B typically uses Breadth-First Search (BFS) or Best-First Search (using a Priority Queue), rather than strictly DFS.
Optimization: It is used for optimization problems (finding minimum or maximum cost), whereas backtracking is usually for decision/enumeration problems.
Bounds: It calculates a bound (lower bound for minimization, upper bound for maximization) at each node to determine whether the node can lead to a better solution than the current best. If the bound is worse than the current best solution, the node is pruned.

2.1 Assignment Problem

Problem Statement: Assign $n$ jobs to $n$ workers such that the total cost of assignment is minimized. Given an $n \times n$ cost matrix where $C[i][j]$ is the cost of assigning worker $i$ to job $j$ .

Branch-and-Bound Approach:

State Space Tree: Level $i$ represents assigning a job to worker $i$ .
Lower Bound Calculation: The minimum possible cost at a node can be estimated by taking the cost of the current assignments plus the minimum element of each unassigned worker's row.
Exploration: Use Best-First Search. Generate all possible job assignments for the next worker. Calculate the lower bound for each.
Expand the node with the minimum lower bound.
If a leaf node is reached and its cost is less than the current optimal cost, update the optimal cost.

2.2 0/1 Knapsack Problem

Problem Statement: Given $n$ items with specific weights and values, and a knapsack of capacity $W$ , maximize the total value without exceeding the capacity. Fractional items are not allowed.

Branch-and-Bound Approach:

Sort items by value-to-weight ratio in descending order.
Upper Bound Calculation: To calculate the maximum possible profit for a node, take the current profit and add the profit of the remaining items. If an item doesn't completely fit, take a fraction of it (this is the fractional knapsack greedy approach, which provides a strict upper bound).
Exploration: Generate two children for each item (Include item, Exclude item).
If the upper bound of a node is less than the current maximum profit (max_profit), prune the node.
Update max_profit whenever a valid combination with a higher profit is found.

2.3 Traveling Salesman Problem (TSP)

Problem Statement: Given a set of cities and the distances between them, find the shortest possible route that visits every city exactly once and returns to the origin city.

Branch-and-Bound Approach (Cost Matrix Reduction):

Row & Column Reduction: Reduce the cost matrix so that every row and column has at least one zero. The sum of the subtracted minimums forms the initial lower bound.
Branching: From city $i$ , consider all unvisited cities $j$ .
Bounding: The cost of the child node $(i \to j)$ is:
Cost(Parent) + Cost[i][j] (from reduced matrix of parent) + Cost of reducing the new child matrix.
When transitioning from $i \to j$ , set row $i$ and column $j$ to infinity, and set $Cost[j][1]$ (return path) to infinity before reducing the matrix.
Use Best-First Search, always expanding the node with the lowest lower bound.

3. Approximation Algorithms

Many optimization problems (like TSP, Vertex Cover, Set Cover, Bin Packing) are NP-Hard, meaning there is no known polynomial-time algorithm to solve them exactly. Approximation Algorithms are used to find near-optimal solutions in polynomial time.

Approximation Ratio ( $\rho$ ):
An algorithm has an approximation ratio of $\rho(n)$ if for any input of size $n$ , the cost $C$ of the solution produced by the algorithm is within a factor of $\rho(n)$ of the optimal cost $C^*$ :

For minimization: $C / C^* \le \rho(n)$
For maximization: $C^* / C \le \rho(n)$

3.1 Vertex-Cover Problem

Problem Statement: Given an undirected graph $G = (V, E)$ , find a minimum-size subset of vertices $V' \subseteq V$ such that every edge in $E$ has at least one endpoint in $V'$ .

2-Approximation Algorithm:

Initialize $C = \emptyset$ .
Initialize $E' = E$ .
While is not empty:
- Pick an arbitrary edge $(u, v) \in E'$ .
- Add both $u$ and $v$ to $C$ .
- Remove from $E'$ all edges incident on either $u$ or $v$ .
Return $C$ .

Analysis: This guarantees a vertex cover that is at most twice the size of the optimal vertex cover (Approximation ratio $\rho = 2$ ). This is because no two edges picked in step 3 share an endpoint, meaning the optimal solution must pick at least one vertex from every edge we picked. We picked two, so $|C| \le 2 |C^*|$ .

3.2 Set-Covering Problem

Problem Statement: Given a universe of elements $U$ and a family of subsets $S = \{S_1, S_2, \dots, S_m\}$ of $U$ . Find the minimum number of subsets whose union equals $U$ .

Greedy Approximation Algorithm:

Initialize $C = \emptyset$ (the set of picked subsets).
Initialize $U' = U$ (uncovered elements).
While is not empty:
- Pick the subset $S_i \in S$ that maximizes $|S_i \cap U'|$ (the subset that covers the most uncovered elements).
- Add $S_i$ to $C$ .
- Remove the elements of $S_i$ from $U'$ .
Return $C$ .

Analysis: The approximation ratio is $O(\ln|U|)$ . Specifically, it is bounded by $H_n = 1 + 1/2 + 1/3 + \dots + 1/n$ , where $n$ is the size of the largest set.

3.3 Bin Packing Problems

Problem Statement: Given $n$ items with sizes $s_1, s_2, \dots, s_n$ where $0 < s_i \le 1$ , pack them into the minimum number of bins, each with a capacity of 1.

Approximation Algorithms:

Next Fit (NF):
- Keep one bin open.
- If the current item fits in the open bin, put it there.
- Otherwise, close the bin, open a new one, and put the item in.
- Approximation Ratio: $\le 2 \times OPT$ .
First Fit (FF):
- Keep all used bins open.
- When considering an item, place it in the first bin that has enough space.
- If it doesn't fit in any open bin, open a new bin.
- Approximation Ratio: $\le 1.7 \times OPT$ .
First Fit Decreasing (FFD):
- Sort the items in decreasing order of size.
- Apply the First Fit algorithm.
- Approximation Ratio: $\le 11/9 \times OPT + 1$ (Approx 1.22). Sorting heavily reduces the number of wasted gaps in bins.

Unit 4

Unit 6

Unit 5 - Notes

Table of Contents

Unit 5: Backtracking and Approximation Algorithms

1. Introduction to Backtracking

1.1 The n-Queens Problem

1.2 Hamiltonian Circuit Problem

1.3 Subset-Sum Problem

2. Introduction to Branch-and-Bound

2.1 Assignment Problem

2.2 0/1 Knapsack Problem

2.3 Traveling Salesman Problem (TSP)

3. Approximation Algorithms

3.1 Vertex-Cover Problem

3.2 Set-Covering Problem

3.3 Bin Packing Problems