Unit 2 - Practice Quiz

CSE275 60 Questions
0 Correct 0 Wrong 60 Left
0/60

1 What is the core inspiration behind evolutionary computation and algorithms like genetic algorithms?

Introduction to evolutionary computation Easy
A. Natural biological evolution
B. Chemical reactions
C. Classical physics
D. Quantum mechanics

2 Which of the following is a key characteristic of an evolutionary algorithm?

Introduction to evolutionary computation Easy
A. It guarantees finding the global optimal solution.
B. It uses a population of candidate solutions.
C. It relies on calculating gradients of a function.
D. It operates on a single solution at a time.

3 In a genetic algorithm, what is a 'chromosome'?

Genetic algorithms representation Easy
A. A representation of a single candidate solution to the problem.
B. The entire collection of all possible solutions.
C. The process of creating a new generation.
D. The function used to evaluate solutions.

4 For a problem where you need to select a subset of items, what is the most common way to represent a solution (chromosome)?

Genetic algorithms representation Easy
A. A single integer
B. A binary string (e.g., 10110)
C. A real-valued vector (e.g., [0.2, 3.1, -1.5])
D. A permutation (e.g., [3, 1, 2, 4])

5 What is the primary purpose of a fitness function in a genetic algorithm?

fitness function Easy
A. To introduce random changes into the population.
B. To select the initial population.
C. To create new solutions from existing ones.
D. To quantify how good a candidate solution is.

6 In a genetic algorithm designed to minimize a cost function, a solution with a lower fitness value is considered...

fitness function Easy
A. Better
B. Invalid
C. Ready for mutation
D. Worse

7 Which selection strategy involves choosing a few individuals at random and selecting the best one from that small group?

selection strategies Easy
A. Truncation Selection
B. Rank Selection
C. Tournament Selection
D. Roulette Wheel Selection

8 What is the main idea behind 'elitism' in a genetic algorithm?

selection strategies Easy
A. Applying mutation to every single individual.
B. Selecting only the worst solutions to be parents.
C. Copying the best solution(s) from the current generation directly to the next.
D. Using a very large population size.

9 What is the role of the 'crossover' operator in a genetic algorithm?

crossover and mutation operators Easy
A. To introduce small, random changes in a single solution.
B. To combine information from two parent solutions to create offspring.
C. To evaluate the quality of a solution.
D. To remove weak individuals from the population.

10 The 'mutation' operator is primarily responsible for which of the following?

crossover and mutation operators Easy
A. Combining the best traits of two strong parent solutions.
B. Ranking the solutions based on their fitness.
C. Exploring new areas of the search space and maintaining diversity.
D. Ensuring the best solution is always preserved.

11 When a genetic algorithm has 'converged', what does this typically mean?

Convergence behaviour Easy
A. The mutation rate has dropped to zero.
B. The population size has reached its maximum limit.
C. The solutions in the population have become very similar to each other.
D. The algorithm has found the certified global optimum.

12 What is the main problem with 'premature convergence'?

Premature convergence and diversity preservation Easy
A. The fitness function becomes too easy to calculate.
B. The algorithm runs for too many generations without stopping.
C. The population becomes too diverse to manage.
D. The algorithm gets stuck in a local optimum instead of finding the global optimum.

13 Which operator is most crucial for preventing premature convergence by maintaining genetic diversity?

Premature convergence and diversity preservation Easy
A. Elitism
B. Selection
C. Crossover
D. Mutation

14 What does a 'peak' or 'hill' on a fitness landscape represent?

Fitness landscape intuition and search difficulty Easy
A. The starting point of the search.
B. An optimal or near-optimal solution.
C. A region of very poor solutions.
D. An area that the algorithm cannot explore.

15 A problem with a 'rugged' fitness landscape containing many local optima is generally...

Fitness landscape intuition and search difficulty Easy
A. More difficult for a genetic algorithm to solve.
B. Easier for a genetic algorithm to solve.
C. Guaranteed to be solved quickly.
D. Unsolvable by any optimization method.

16 Which of the following is a well-known application of genetic algorithms in the field of neural networks?

Applications of genetic algorithms in machine learning Easy
A. Calculating the gradient during backpropagation.
B. Optimizing network weights and architecture (Neuroevolution).
C. Normalizing input data before training.
D. Serving a trained model over an API.

17 Why are GAs suitable for hyperparameter tuning in machine learning?

Applications of genetic algorithms in machine learning Easy
A. They are a type of supervised learning algorithm.
B. They only work for a small, predefined set of parameters.
C. They are always faster than grid search or random search.
D. They can effectively search large, complex spaces without needing gradient information.

18 When using a GA for feature selection, what does a single 'gene' in the chromosome typically represent?

Feature selection using genetic algorithms Easy
A. A machine learning model.
B. The model's accuracy.
C. A single feature.
D. The entire dataset.

19 What is a common objective for the fitness function in a GA used for feature selection?

Feature selection using genetic algorithms Easy
A. To minimize the time it takes to train the model.
B. To select all available features.
C. To select the features with the longest names.
D. To maximize model accuracy while minimizing the number of selected features.

20 If a chromosome is represented by the binary string 11111 and mutation flips a single bit, which of the following could be a possible result?

crossover and mutation operators Easy
A. 111
B. 11111 (no change)
C. 11011
D. 00000

21 Compared to traditional gradient-based optimization methods, what is a key advantage of Evolutionary Algorithms (EAs) when dealing with the optimization of a machine learning model?

Introduction to evolutionary computation Medium
A. EAs are guaranteed to find the global optimum in polynomial time.
B. EAs are more effective for problems with non-differentiable or discontinuous objective functions.
C. EAs always converge faster than methods like Stochastic Gradient Descent.
D. EAs require a convex search space to function correctly.

22 For the Traveling Salesperson Problem (TSP), which chromosome representation is most suitable for a standard Genetic Algorithm to ensure the creation of valid tours?

Genetic algorithms representation Medium
A. A tree structure where nodes are cities.
B. A real-valued vector representing the coordinates of the cities.
C. A binary string where each bit represents a connection between two cities.
D. A permutation of integers, where each integer represents a city and the order represents the tour.

23 A researcher is using a GA to find the optimal set of weights for a fixed-architecture neural network. The network has 50 weights in total, which can be any real number. What is the most appropriate chromosome representation?

Genetic algorithms representation Medium
A. A permutation of 50 integers.
B. A binary string of length 50.
C. A single integer representing the sum of weights.
D. A vector of 50 real-valued numbers.

24 When using a Genetic Algorithm to tune the hyperparameters of a regression model (e.g., a Support Vector Regressor), what would be a suitable fitness function to minimize?

fitness function Medium
A. The coefficient of determination () on the training set.
B. The number of support vectors.
C. The accuracy on the training set.
D. The Root Mean Squared Error (RMSE) evaluated on a validation set.

25 A GA is used for feature selection, aiming to maximize classification accuracy while minimizing the number of features. The proposed fitness function is . What does the weight parameter control?

fitness function Medium
A. The trade-off between model performance and model complexity.
B. The population size of the GA.
C. The selection pressure.
D. The rate of mutation.

26 In a constrained optimization problem solved by a GA, a penalty function is often added to the fitness calculation. How does a typical penalty function work?

fitness function Medium
A. It increases the fitness of solutions that satisfy the constraints.
B. It decreases the fitness of solutions that violate constraints, making them less likely to be selected.
C. It removes any solution that violates a constraint from the population immediately.
D. It modifies the crossover operator to avoid creating infeasible solutions.

27 How does Tournament Selection with a tournament size generally compare to Roulette Wheel Selection in terms of selection pressure?

selection strategies Medium
A. Tournament selection typically exerts lower selection pressure.
B. Both methods always have identical selection pressure.
C. Tournament selection typically exerts higher selection pressure.
D. Selection pressure is only determined by the mutation rate, not the selection strategy.

28 What is the primary purpose of using an elitism strategy in a Genetic Algorithm?

selection strategies Medium
A. To ensure the best solution(s) found so far are not lost in subsequent generations.
B. To guarantee that every individual in the population gets a chance to reproduce.
C. To increase the mutation rate for the best individuals.
D. To randomly re-initialize a portion of the population to increase diversity.

29 Consider a population where one individual has a fitness of 1000, and all others have a fitness of around 10. Which selection method is most susceptible to being dominated by this single "super" individual, potentially leading to premature convergence?

selection strategies Medium
A. Rank Selection
B. Tournament Selection (with )
C. Fitness Proportional Selection (Roulette Wheel)
D. Random Selection

30 What is the most likely outcome in a Genetic Algorithm if the crossover probability is set to 0.0 and the mutation probability is set to a small positive value?

crossover and mutation operators Medium
A. The algorithm will behave similarly to a set of parallel random walks or hill climbers.
B. The algorithm will perform a broad search by combining existing solutions.
C. The population will converge to the global optimum very quickly.
D. No evolution will occur, and the initial population will remain unchanged.

31 You have two parent chromosomes for a feature selection problem: P1 = [1, 1, 1, 0, 0, 0] and P2 = [0, 0, 0, 1, 1, 1]. If you apply a single-point crossover after the 3rd gene, what are the resulting offspring?

crossover and mutation operators Medium
A. O1 = [1, 1, 0, 1, 0, 1] and O2 = [0, 0, 1, 0, 1, 0] (Uniform Crossover)
B. O1 = [1, 1, 1, 1, 1, 1] and O2 = [0, 0, 0, 0, 0, 0] (This is an error in calculation, check again)
C. O1 = [0, 0, 0, 1, 1, 1] and O2 = [1, 1, 1, 0, 0, 0] (Parents are swapped)
D. O1 = [1, 1, 1, 1, 1, 1] and O2 = [0, 0, 0, 0, 0, 0]

32 Why are standard operators like one-point and two-point crossover unsuitable for permutation-based representations like those used in the Traveling Salesperson Problem?

crossover and mutation operators Medium
A. They do not allow for any exploration of the search space.
B. They often produce invalid offspring where cities are repeated or omitted.
C. They are computationally too expensive for permutations.
D. They can only be applied to binary strings.

33 You are observing the convergence plot of a Genetic Algorithm (Average Fitness vs. Generation). The plot shows a rapid increase in fitness for the first 20 generations, followed by a long plateau where the average fitness barely changes. What is the most likely interpretation?

Convergence behaviour Medium
A. The mutation rate is too high, preventing the algorithm from settling on a solution.
B. The algorithm has likely converged, possibly prematurely, to a local optimum.
C. The population size is too large, slowing down progress.
D. The algorithm has successfully found the global optimum.

34 A key cause of premature convergence in a Genetic Algorithm is the loss of genetic diversity. Which combination of parameters is most likely to cause this?

Premature convergence and diversity preservation Medium
A. High selection pressure, low mutation rate, small population size.
B. Low selection pressure, high mutation rate, small population size.
C. High selection pressure, high mutation rate, large population size.
D. Low selection pressure, low mutation rate, large population size.

35 Which of the following techniques is specifically designed to counteract premature convergence by maintaining multiple subpopulations, each exploring a different area of the search space?

Premature convergence and diversity preservation Medium
A. Elitism
B. Fitness Scaling
C. Uniform Crossover
D. Island Model (or Coarse-Grained) GA

36 A fitness landscape for a problem is described as "deceptive." What does this imply for a Genetic Algorithm?

Fitness landscape intuition and search difficulty Medium
A. The GA is guided towards local optima that are far from the global optimum.
B. The fitness of a solution is completely random and has no correlation with its neighbors.
C. The fitness calculation is computationally very expensive.
D. The landscape is smooth and unimodal, making it easy for the GA to find the optimum.

37 How does the performance of a crossover operator relate to the fitness landscape, according to the Building Block Hypothesis?

Fitness landscape intuition and search difficulty Medium
A. Crossover's effectiveness is independent of the landscape's structure.
B. Crossover is most effective on rugged, random landscapes.
C. Crossover works best when good solutions can be constructed by combining short, low-order, high-fitness schemata (building blocks).
D. Crossover is only useful for landscapes with a single peak (unimodal).

38 A data scientist is using a GA to find the optimal hyperparameters for a Support Vector Machine (SVM), specifically the kernel type, the regularization parameter , and gamma . What role does the GA play in this context?

Applications of genetic algorithms in machine learning Medium
A. The GA is used to generate synthetic training data for the SVM.
B. The GA performs feature selection on the input data before it reaches the SVM.
C. The GA trains the SVM model by adjusting its support vectors directly.
D. The GA acts as a meta-optimizer, searching the space of possible hyperparameter configurations to find the one that yields the best model performance.

39 In the context of feature selection using a Genetic Algorithm, what does the "wrapper" approach entail?

Feature selection using genetic algorithms Medium
A. Embedding the feature selection process directly into the training algorithm of the model itself.
B. Training and evaluating a specific machine learning model for every feature subset (chromosome) to calculate its fitness.
C. Using a binary chromosome representation.
D. Using a statistical filter (like correlation) as the fitness function to evaluate feature subsets.

40 A GA for feature selection uses a chromosome [1, 0, 1, 0, 1] for a dataset with 5 features. The fitness is calculated by training a logistic regression model. What does this process evaluate?

Feature selection using genetic algorithms Medium
A. The performance of a model using all 5 features.
B. The performance of a model using only the 1st, 3rd, and 5th features.
C. The performance of a model using only the 2nd and 4th features.
D. The individual importance of each of the 5 features separately.

41 In a fitness landscape characterized as a 'needle-in-a-haystack' (a single, narrow, high peak in an otherwise flat landscape), which combination of genetic algorithm operators and parameters would be the most ineffective and why?

Fitness landscape intuition and search difficulty Hard
A. Low crossover rate, high mutation rate, and tournament selection.
B. High crossover rate, very low mutation rate, and elitist selection.
C. No crossover, high mutation rate (essentially a parallel random search).
D. Fitness sharing, uniform crossover, and a moderate mutation rate.

42 Consider a multi-modal optimization problem where a GA using Roulette Wheel selection is consistently converging to a suboptimal peak. Which of the following diversity preservation techniques fundamentally alters the selection probabilities by modifying the fitness landscape itself, rather than just the replacement strategy?

Premature convergence and diversity preservation Hard
A. Elitism
B. Increasing the mutation rate
C. Crowding
D. Fitness Sharing

43 Analyze the concept of 'selection pressure'. Which statement correctly contrasts Tournament Selection and Rank Selection in terms of their susceptibility to premature convergence due to a single 'super-individual' in the population?

selection strategies Hard
A. Tournament selection's pressure is independent of the fitness distribution and depends only on tournament size 'k', making it less susceptible to a super-individual than Rank Selection, where the top-ranked individual always has the highest selection probability.
B. Rank Selection is less susceptible because it only considers relative ranks, thus capping the maximum selection probability for a super-individual and preventing it from dominating the selection process as it would in proportional selection methods.
C. Tournament selection is more susceptible because a super-individual has a high chance of being selected for multiple tournaments, whereas Rank Selection gives a chance to lower-ranked individuals.
D. Both are equally susceptible, as a super-individual will win any tournament it enters and will always be ranked first, leading to identical selection pressures.

44 When using a GA for feature selection with a binary chromosome representing the feature subset, what is the primary implication of the 'Building Block Hypothesis' being violated due to high dimensionality and feature interactions?

Feature selection using genetic algorithms Hard
A. The mutation operator becomes the primary driver of search, rendering the GA no better than a random search.
B. Standard crossover operators (like one-point or two-point) are likely to be destructive, breaking apart co-adapted sets of features (schemata) that are located far from each other on the chromosome.
C. The binary representation is insufficient, and a real-valued representation for feature weights is required.
D. The fitness function (e.g., model accuracy) becomes too noisy to provide a reliable selection gradient.

45 Consider two parent chromosomes P1 = 11110000 and P2 = 00001111. Which crossover operator is guaranteed to produce offspring that are maximally different from both parents in Hamming distance?

crossover and mutation operators Hard
A. Two-Point Crossover with cut points at positions 2 and 6.
B. Uniform Crossover with a 0.5 probability for each bit.
C. Single-Point Crossover at the midpoint (position 4).
D. None of the above; crossover always produces offspring that share traits with parents.

46 The Schema Theorem provides a lower bound on the expected number of instances of a schema in the next generation, . If a schema has a defining length , an order , and an average fitness that is times the average population fitness , what is the most significant threat to its propagation, assuming a high-performance GA (i.e., )?

Convergence behaviour Hard
A. The selection method being too weak to recognize its above-average fitness.
B. Disruption by crossover, especially if is large relative to the chromosome length .
C. Stochastic noise and sampling errors in a small population.
D. Disruption by mutation, especially if is large.

47 You are designing a GA to solve the Traveling Salesperson Problem (TSP). Which representation and crossover operator pair is most suitable to ensure that all offspring are valid tours (i.e., permutations of cities)?

Genetic algorithms representation Hard
A. Representation: Adjacency (e.g., city i is followed by city j); Crossover: Uniform Crossover.
B. Representation: Path (e.g., [3, 1, 4, 2]); Crossover: Partially Mapped Crossover (PMX).
C. Representation: Binary string where each block of bits represents a city; Crossover: Two-Point Crossover.
D. Representation: Path (e.g., [3, 1, 4, 2]); Crossover: Single-Point Crossover.

48 When using a GA to evolve the architecture of a neural network (Neuroevolution), what is a primary advantage of using a direct encoding scheme (e.g., a chromosome explicitly defining every connection weight) compared to an indirect encoding scheme (e.g., a chromosome defining rules for network construction)?

Applications of genetic algorithms in machine learning Hard
A. Indirect encoding is unable to produce regular or symmetrical network structures.
B. Direct encoding results in much shorter chromosomes, making the search space smaller and easier for the GA to explore.
C. Direct encoding is inherently more scalable to very deep and large networks.
D. Direct encoding allows for finer-grained control and optimization of individual connections, potentially finding highly specialized, non-intuitive architectures.

49 In designing a fitness function for a GA that optimizes a machine learning model's hyperparameters, what is the most critical trade-off to manage when the function is defined as ?

fitness function Hard
A. Ensuring the fitness function is convex to guarantee convergence to a global optimum.
B. Choosing between classification accuracy and F1-score, as this choice has a larger impact than the training time penalty.
C. Normalizing the accuracy and training time to the same scale (e.g., [0, 1]) to prevent one term from dominating the other purely due to the magnitude of its units.
D. Balancing exploration vs. exploitation by carefully tuning the weights and to avoid prematurely favoring either very fast, simple models or very slow, complex models.

50 The 'No Free Lunch' (NFL) theorem for optimization states that, averaged over all possible problems, no single optimization algorithm is superior to any other. What is the most profound implication of the NFL theorem for the application of Genetic Algorithms?

Introduction to evolutionary computation Hard
A. A GA's effectiveness is entirely dependent on how well its operators (representation, crossover, mutation) are aligned with the structure of the specific problem's fitness landscape.
B. For a GA to be effective, it must incorporate problem-specific knowledge (hybridization) to escape the 'average' case described by the theorem.
C. Genetic Algorithms are theoretically no better than random search for any given problem.
D. The NFL theorem proves that GAs will always outperform gradient-based methods on non-differentiable problems.

51 In the context of multi-objective optimization using GAs (e.g., NSGA-II), how does the concept of 'crowding distance' fundamentally differ from 'fitness sharing' in its role and calculation?

Premature convergence and diversity preservation Hard
A. Crowding distance is calculated in the objective space and is used as a secondary sorting criterion to promote spread along the Pareto front, whereas fitness sharing is calculated in the genotype/phenotype space to create niches across the entire search space.
B. Crowding distance aims to penalize solutions in dense regions, while fitness sharing aims to reward solutions in sparse regions.
C. Fitness sharing is a pre-selection mechanism that modifies fitness values, while crowding distance is a post-selection mechanism used only to prune the archive of non-dominated solutions.
D. Both mechanisms serve the identical purpose of maintaining diversity, but crowding distance has a computational complexity of while fitness sharing is .

52 What is the primary theoretical justification for using Uniform Crossover over Single-Point or Two-Point Crossover in a problem where the linkage between beneficial genes is unknown or their chromosomal positions are not adjacent?

crossover and mutation operators Hard
A. Uniform crossover introduces more diversity than any other crossover operator, effectively acting as a blend of crossover and high-rate mutation.
B. Uniform crossover is computationally less expensive than multi-point crossover operators.
C. Uniform crossover is less positionally biased; it treats all genes equally regardless of their location, making it more robust when the ordering of genes on the chromosome does not correspond to logical linkage.
D. According to the Schema Theorem, uniform crossover has a higher probability of preserving long defining-length schemata.

53 A fitness landscape is described as 'deceptive' if the low-order building blocks (schemata) that guide the search actually lead away from the global optimum. Which GA modification would be most effective at overcoming a moderately deceptive problem?

Fitness landscape intuition and search difficulty Hard
A. Increasing the population size significantly to maintain diversity and allow higher-order schemata to form and compete.
B. Using a very high mutation rate to 'jump' out of the deceptive basins of attraction.
C. Implementing elitism to ensure the best-found (but potentially deceptive) solutions are preserved.
D. Switching from binary to Gray coding for parameter representation.

54 Comparing Stochastic Universal Sampling (SUS) to Roulette Wheel Selection, what is the key advantage of SUS that addresses a major sampling error issue in Roulette Wheel?

selection strategies Hard
A. SUS guarantees that the number of times an individual is selected is bounded by and , reducing the stochastic noise and ensuring fitter individuals are not missed by chance.
B. SUS introduces a higher selection pressure, leading to faster convergence.
C. SUS has a much lower computational complexity, making it more suitable for large populations.
D. SUS is the only proportional selection method that can work with negative fitness values.

55 When using a GA for feature selection, a 'wrapper' approach is employed where the fitness function involves training and evaluating a specific ML model. What is the primary cause of the 'overfitting' risk in this context?

Feature selection using genetic algorithms Hard
A. The GA itself overfits to the population, leading to premature convergence before the optimal feature set is found.
B. The GA might select a feature subset that performs exceptionally well on the specific validation set used for fitness evaluation, but generalizes poorly to unseen test data.
C. The number of generations is too high, causing the GA to find a feature set that is too large and complex.
D. The binary chromosome representation is too simple and causes the underlying ML model to overfit.

56 In analyzing the exploratory vs. exploitative behavior of a GA, how does the dynamic between crossover and mutation typically evolve over the course of a run?

Convergence behaviour Hard
A. Both operators maintain a constant level of exploration and exploitation throughout the entire run.
B. Initially, when the population is diverse, crossover is highly exploratory. As the population converges, its exploratory power diminishes, and mutation becomes the primary source of exploration.
C. Initially, mutation is the main exploratory force. As good schemata are found, crossover takes over to exploit them by creating new combinations.
D. Crossover is always an exploitation operator, while mutation is always an exploration operator.

57 For a problem with continuous variables, what is the main theoretical argument for using Gray coding instead of standard binary encoding in a GA?

Genetic algorithms representation Hard
A. Gray coding is a form of real-valued encoding that eliminates the need for bit-string representations entirely.
B. Gray coding allows for a more compact representation, reducing the chromosome length and the size of the search space.
C. Gray coding increases the selection pressure of the algorithm, leading to faster convergence.
D. Gray coding ensures that any two adjacent integer values have a Hamming distance of exactly 1, preventing large, disruptive jumps in the phenotype space from a single bit-flip mutation (the 'Hamming cliff').

58 A GA is used to optimize the weights of a fixed-architecture neural network, as an alternative to backpropagation. In which scenario would this approach have a significant theoretical advantage over gradient-based methods like SGD?

Applications of genetic algorithms in machine learning Hard
A. When training on extremely large datasets (Big Data), as GAs can process batches more efficiently than SGD.
B. For training very deep networks (e.g., >100 layers), as GAs do not suffer from the vanishing/exploding gradient problem.
C. When a globally optimal set of weights is required, as GAs are guaranteed to find the global optimum.
D. When the network's activation functions are non-differentiable or the objective function is discontinuous, making it impossible to compute a reliable gradient.

59 Epistasis, in the context of GAs, refers to the interaction between genes, where the contribution of one gene to fitness depends on the values of other genes. How does high epistasis affect the fitness landscape and the performance of a simple GA?

Fitness landscape intuition and search difficulty Hard
A. It primarily affects the mutation operator, causing most mutations to be lethal and slowing down convergence.
B. It creates a rugged and deceptive landscape with many local optima, violating the Building Block Hypothesis and making it difficult for crossover to combine good partial solutions.
C. It has no effect on the landscape but requires the use of specialized representations like permutation encoding.
D. It creates a smooth, convex landscape, making the problem easier for a GA to solve than for a gradient-based optimizer.

60 Self-adaptation is an advanced technique in GAs where parameters like the mutation rate are not fixed but are encoded into the chromosome itself and evolve alongside the solution. What is the primary mechanism by which a 'good' mutation rate is selected for and propagated in the population?

crossover and mutation operators Hard
A. The mutation rate encoded on the chromosome does not affect the chromosome itself, but is used to mutate the other parent during crossover.
B. Mutation rates are averaged during crossover, ensuring that the population's average rate converges to an optimal value.
C. The GA maintains a global, population-level mutation rate that is increased when diversity is low and decreased when diversity is high.
D. A lower mutation rate is beneficial for an individual that is already highly fit, as it protects its good genes from disruption. A higher mutation rate is beneficial for a low-fitness individual, as it increases the chance of a large, beneficial jump. Selection for the solution indirectly selects for the associated mutation rate.