Unit 5 - Practice Quiz

INT345 60 Questions
0 Correct 0 Wrong 60 Left
0/60

1 Why is color information highly valuable in computer vision applications?

Importance of color in computer vision Easy
A. It increases the processing speed of algorithms
B. It eliminates the need for edge detection
C. It reduces the memory required to store images
D. It provides a powerful descriptor that simplifies object identification and extraction

2 Which of the following tasks benefits the most from adding color information to a standard grayscale image?

Importance of color in computer vision Easy
A. Calculating the spatial frequency of an image
B. Differentiating a red apple from a green apple of the same brightness
C. Finding the geometric center of a shape
D. Detecting straight lines using the Hough transform

3 Which color model is primarily based on how the human eye perceives colors and is standard for displaying images on digital screens?

color models Easy
A. CMYK
B. HSV
C. YIQ
D. RGB

4 What do the letters in the HSV color model stand for?

color models Easy
A. Hue, Saturation, Value
B. High, Standard, Variable
C. Hue, Shade, Volume
D. Hue, Saturation, Variance

5 Why is it common to convert images from RGB to the HSV color space in computer vision?

conversion between color spaces Easy
A. To convert a color image into a 3D model
B. To increase the resolution of the image
C. To compress the image size
D. To separate color information from intensity or lighting information

6 When converting an RGB image to a grayscale image, what mathematical operation is typically performed on the pixels?

conversion between color spaces Easy
A. Taking a weighted average of the , , and channels
B. Multiplying all three channels together
C. Finding the maximum value among , , and
D. Subtracting the channel from the channel

7 What is the primary purpose of applying color augmentation techniques to an image dataset?

color augmentation techniques Easy
A. To reduce the color depth to 8-bit
B. To artificially increase dataset diversity and make models robust to lighting changes
C. To compress the dataset for storage
D. To convert 2D images into 3D point clouds

8 Which of the following is a common color augmentation technique?

color augmentation techniques Easy
A. Max pooling
B. Edge detection
C. Image rotation
D. Color jittering (randomly changing brightness, contrast, and saturation)

9 What does 'color constancy' refer to in computer vision?

color constancy Easy
A. The ability to perceive the true color of an object despite changes in the color of the illumination
B. The technique of using only primary colors in an image
C. The process of keeping the file size constant when adding color
D. The ability to convert a color image into grayscale

10 Which simple algorithm is often used as a baseline to achieve color constancy by assuming that the average color of a scene is neutral gray?

color constancy Easy
A. The Gray World assumption
B. The Hough Transform
C. The RGB to HSV conversion
D. The Canny Edge Detector

11 What does the pixel value represent in a range image?

introduction to range images Easy
A. The intensity of light reflected by an object
B. The color hue of the object
C. The distance or depth from the sensor to the object
D. The temperature of the object

12 What is another common name for a range image?

introduction to range images Easy
A. Grayscale map
B. Depth map
C. Thermal map
D. Texture map

13 Which of the following best describes the difference between a 2D intensity image and a 3D range image?

difference between 2D intensity images and 3D range images Easy
A. Intensity images depend on light reflectance; range images measure geometric distance.
B. Intensity images are 3D; range images are 2D.
C. Intensity images can only be captured actively; range images are captured passively.
D. Intensity images capture distance; range images capture color.

14 Which type of image is generally immune to shadows and variations in ambient lighting?

difference between 2D intensity images and 3D range images Easy
A. Grayscale images
B. 3D range images
C. 2D intensity images
D. RGB images

15 How do active range sensors operate?

active range sensors Easy
A. They passively measure the ambient sunlight reflecting off objects.
B. They estimate depth strictly using two passive cameras (stereo vision).
C. They calculate depth based purely on the color of objects.
D. They emit their own energy (like light or sound) and measure the reflection to calculate distance.

16 Which of the following is an example of an active range sensor?

active range sensors Easy
A. Passive Stereo Camera
B. LiDAR (Light Detection and Ranging)
C. Thermal imaging camera
D. Standard RGB Web camera

17 Why is preprocessing often necessary for raw range data?

preprocessing of range data Easy
A. To add artificial colors to the image
B. To convert the 3D data into 2D intensity data
C. To increase the physical range of the sensor
D. To remove missing data points and noise spikes common in depth sensors

18 Which filter is commonly used in preprocessing range images to remove outlier noise (salt-and-pepper noise) without blurring the sharp geometric edges?

preprocessing of range data Easy
A. Median filter
B. Fourier filter
C. High-pass filter
D. Mean filter

19 Which of the following fields heavily relies on range data for obstacle avoidance and mapping?

applications of range data Easy
A. Autonomous driving and robotics
B. Audio processing
C. Digital typography
D. Text document scanning

20 How are range images primarily used in modern smartphone facial recognition systems (like FaceID)?

applications of range data Easy
A. To check the skin tone of the user
B. To capture a 3D structural map of the face, preventing spoofing with 2D photos
C. To measure the room temperature
D. To improve the color saturation of selfies

21 Why is color information often considered more robust than grayscale for object tracking under partial occlusion?

Importance of color in computer vision Medium
A. Color provides a multi-dimensional feature space that helps distinguish objects from similar-intensity backgrounds.
B. Color components are entirely invariant to extreme changes in illumination.
C. Color images require less computational power to process during real-time tracking.
D. Grayscale images cannot be used to compute optical flow, making color essential for tracking.

22 In image segmentation, how does the inclusion of color channels typically affect the performance of clustering algorithms like K-means compared to using only intensity?

Importance of color in computer vision Medium
A. It forces the algorithm to rely solely on edge detection rather than regional properties.
B. It transforms the problem into a linear regression task, bypassing the need for clustering.
C. It increases the separability of classes by mapping pixels into a 3D feature space instead of a 1D space.
D. It reduces the separability of classes by introducing noise into the clustering process.

23 Which color model is most suitable for developing an application that requires intuitive manipulation of "tint" and "vividness" by a human user?

Color models Medium
A. RGB (Red, Green, Blue)
B. HSV (Hue, Saturation, Value)
C. YUV (Luminance, Chrominance)
D. CMYK (Cyan, Magenta, Yellow, Key)

24 In the YCbCr color model used in video compression, why is the 'Y' channel typically sampled at a higher resolution than 'Cb' and 'Cr'?

Color models Medium
A. Compression algorithms can only process the 'Cb' and 'Cr' channels at lower resolutions.
B. The human visual system is more sensitive to spatial variations in brightness than in color.
C. The 'Y' channel is used to synchronize the audio and video streams.
D. The 'Y' channel contains the color information, which requires more bits to store.

25 When converting from an RGB image to grayscale using the standard NTSC formula , why are the weights unequal?

Conversion between color spaces Medium
A. To account for the non-linear response of digital camera sensors.
B. Because the human eye has varying sensitivity to different wavelengths, being most sensitive to green.
C. To compensate for the fact that blue pixels use more electrical power in LCD displays.
D. Because the RGB color space is inherently flawed and requires statistical correction.

26 What happens if the non-linear gamma correction step is omitted during the conversion from linear RGB to sRGB space?

Conversion between color spaces Medium
A. The color hue will shift by 180 degrees, resulting in a negative image.
B. The image will become completely desaturated, appearing as grayscale.
C. The resulting image will appear unnaturally dark on standard display monitors.
D. The image resolution will be mathematically halved.

27 When applying color jittering as an augmentation technique to train a robust neural network, which parameter should be perturbed to simulate varying lighting intensities without changing the object's intrinsic color?

Color augmentation techniques Medium
A. Saturation in the HSV space
B. Hue in the HSV space
C. Value/Brightness in the HSV space
D. The a* channel in the CIELAB space

28 What is the primary purpose of applying PCA-based color augmentation (like Fancy PCA) on RGB images during deep learning model training?

Color augmentation techniques Medium
A. To convert the RGB image into a binary mask for faster feature extraction.
B. To perfectly normalize the image so all pixel values equal exactly zero mean and unit variance.
C. To alter RGB intensities proportionally along their principal components, preserving natural illumination variations.
D. To compress the image size and speed up training epochs.

29 Which algorithm applies the assumption that the spatial average of surface reflectances in a scene is achromatic (gray) to achieve color constancy?

Color constancy Medium
A. Histogram Equalization
B. Gamut Mapping
C. Gray World Assumption
D. White Patch Retinex

30 An image taken under a strong yellow tungsten light looks unnaturally warm. How does the White Patch (Max-RGB) algorithm attempt to correct this illumination bias?

Color constancy Medium
A. By finding the maximum pixel value in each color channel and scaling all pixels so these maximums become pure white.
B. By applying a median filter to remove the yellow wavelengths from the image spectrum.
C. By calculating the average of all pixels and subtracting it from the yellow channel.
D. By converting the image to grayscale and colorizing it using a pre-trained neural network.

31 In a standard range image, what does the scalar value at pixel coordinate explicitly represent?

Introduction to range images Medium
A. The velocity of the object relative to the camera.
B. The amount of light reflected from the object surface to the sensor.
C. The color intensity of the object in the near-infrared spectrum.
D. The physical distance from the sensor to the surface of the scene at that spatial location.

32 Which of the following best describes the structural representation of a standard range image?

Introduction to range images Medium
A. A fully volumetric 3D voxel grid.
B. A continuous mathematical surface defined by a set of B-spline equations.
C. A 2D array where each element stores a depth value, often referred to as a 2.5D representation.
D. A 1D array of distance measurements sorted by magnitude.

33 Why are edge detection algorithms applied to range images fundamentally different in physical interpretation compared to 2D intensity images?

Difference between 2D intensity images and 3D range images Medium
A. Range images have fewer pixels, making edge detection mathematically simpler but less accurate.
B. Intensity images only have straight edges, while range images capture curved geometric edges.
C. Edge detection cannot be applied to range images because depth values are not continuous.
D. Edges in range images correspond to physical depth discontinuities, whereas in intensity images they may represent shadows or texture boundaries.

34 How does the presence of ambient illumination changes affect 2D intensity images compared to 3D range images?

Difference between 2D intensity images and 3D range images Medium
A. Range images are highly sensitive to illumination changes, while intensity images are invariant.
B. Ambient illumination changes only affect the spatial resolution of range images.
C. Intensity images are highly sensitive to illumination changes, while range images are generally invariant to ambient lighting.
D. Both are equally degraded by changes in ambient illumination.

35 How does a Time-of-Flight (ToF) camera estimate the depth of a scene?

Active range sensors Medium
A. By capturing two images simultaneously from different angles and computing stereo disparity.
B. By projecting a grid pattern and measuring the geometric distortion of the grid on the object.
C. By analyzing the blur radius of objects moving rapidly across the sensor's field of view.
D. By measuring the phase shift or time delay of an emitted light pulse reflecting off the scene and returning to the sensor.

36 In a structured light active range sensor, what is the primary purpose of projecting a known pattern (e.g., a grid or stripes) onto the scene?

Active range sensors Medium
A. To directly measure the speed of light reflecting off the object's surface.
B. To illuminate the scene brightly enough so the camera can capture color.
C. To establish pixel correspondences between the projector and the camera for triangulation.
D. To confuse ambient infrared sensors that might interfere with the measurement.

37 Range images obtained from active sensors often contain "flying pixels" (mixed pixels) at object boundaries. Which filter is most effective for removing these artifacts while preserving sharp depth edges?

Preprocessing of range data Medium
A. Gaussian filter
B. Mean filter
C. Median filter
D. High-pass filter

38 What is a common technique used to fill in missing depth values (holes) in range images caused by highly reflective surfaces or occlusions?

Preprocessing of range data Medium
A. Bilateral filtering or spatial interpolation using valid neighboring depth pixels.
B. Converting the depth map to the frequency domain and applying a low-pass filter.
C. Multiplying the depth map by the ambient lighting intensity.
D. Applying a global Fourier transform to reconstruct missing frequencies.

39 In autonomous navigation, how is range data primarily utilized for obstacle avoidance?

Applications of range data Medium
A. By analyzing the color signature of objects to determine if they are moving vehicles.
B. By reading text on traffic signs using optical character recognition (OCR).
C. By constructing a 3D occupancy grid or point cloud to identify the exact distance and geometry of obstacles in the vehicle's path.
D. By tracking the sun's position to estimate the vehicle's global coordinates.

40 How are range images utilized in industrial quality inspection for manufactured parts?

Applications of range data Medium
A. By evaluating the color consistency of the paint applied to the manufactured part.
B. By extracting 3D surface geometry from the scan and comparing it against a reference CAD model to detect dimensional defects.
C. By analyzing the 2D shadows cast by the part to estimate its weight.
D. By listening to the acoustic resonance of the part when hit with an infrared laser.

41 Metamerism poses a significant challenge in color-based object recognition. Two physically distinct surfaces are considered metamers under a specific illuminant if they exhibit which of the following properties?

Importance of color in computer vision Hard
A. They possess identical spectral reflectance curves but yield different RGB sensor responses.
B. They possess identical spectral reflectance curves and yield identical sensor responses under all possible illuminants.
C. They possess different spectral reflectance curves but yield identical sensor responses when integrated over the sensor's spectral sensitivities and the illuminant's power distribution.
D. They possess different spectral reflectance curves and yield different sensor responses, but map to the same chromaticity coordinates in the CIE xy diagram.

42 According to the Dichromatic Reflection Model, the total radiance from a dielectric inhomogeneous material is given by . If an algorithm successfully factors out , what is the primary consequence for computer vision tasks?

Importance of color in computer vision Hard
A. The algorithm completely neutralizes the color of the illuminant, achieving perfect color constancy.
B. The algorithm maps all out-of-gamut colors to the closest boundary on the CIE spectral locus.
C. The algorithm eliminates specular highlights, ensuring the remaining radiance is purely a function of the material's albedo and Lambertian shading.
D. The algorithm removes the Lambertian diffuse reflection, leaving only the material's structural geometry.

43 The sRGB color space applies a non-linear transfer function (gamma correction) to linear RGB values. From a mathematical and information-theoretic perspective, what is the primary purpose of applying this specific non-linearity prior to 8-bit quantization?

Color models Hard
A. To distribute quantization steps more uniformly according to the human visual system's logarithmic sensitivity to luminance, thereby minimizing banding in dark regions.
B. To map the theoretically infinite dynamic range of physical scene radiance to the bounded interval required for digital processing.
C. To compensate for the non-linear voltage-to-luminance response of modern OLED and LCD displays.
D. To orthogonalize the color channels, ensuring that modifications in the Red channel do not affect the perceived brightness of the Blue channel.

44 In the YCbCr color model, chroma subsampling (e.g., 4:2:0) reduces bandwidth. Which mathematical property of the RGB to YCbCr conversion makes this subsampling perceptually viable?

Color models Hard
A. The conversion applies a localized Fourier transform, allowing the and channels to inherently represent only low-frequency spatial patterns.
B. The transformation orthogonalizes the space such that the channel contains a weighted sum of RGB optimized for human luminance sensitivity, while and represent purely chromatic differences to which high-frequency human spatial sensitivity is low.
C. The transformation is highly non-linear, pushing high-frequency chromatic data into the channel.
D. The and channels are mathematically derived from the eigenvectors of the human retina's cone responses.

45 When converting from linear RGB to the HSI (Hue, Saturation, Intensity) color space, the Hue becomes mathematically undefined when . How does this singularity impact gradient-based optimization in deep learning models operating directly in HSI space?

Conversion between color spaces Hard
A. It has no impact, because automatic differentiation engines inherently handle divisions by substituting a gradient of $1.0$.
B. It induces infinite or undefined gradients (NaNs) when traversing the achromatic axis, leading to severe instability during backpropagation.
C. It guarantees that the gradient of the loss with respect to hue is zero, naturally freezing the hue parameters.
D. It requires the optimizer to use complex-valued arithmetic to traverse the hue circle smoothly.

46 The conversion from CIE XYZ to CIE Lab* incorporates a non-linear function (for ). What is the exact geometrical objective of this specific transformation?

Conversion between color spaces Hard
A. To compress the gamut of highly saturated colors to prevent out-of-bound errors during 8-bit quantization.
B. To achieve perceptual uniformity, ensuring that the Euclidean distance between two coordinates roughly corresponds to the perceived color difference by the human eye.
C. To map the white point of the illuminant exactly to the origin in Lab* space.
D. To decouple the chromaticity coordinates from luminance, allowing for perfectly linear alpha blending.

47 In Fancy PCA (used in AlexNet) for color augmentation, an RGB pixel is augmented by adding , where and are eigenvectors and eigenvalues of the RGB covariance matrix. What specific invariance is this technique injecting into the model?

Color augmentation techniques Hard
A. Invariance to spatial frequency shifts in the color channels.
B. Invariance to changes in the intensity and color of the scene illumination by shifting pixel values along the principal axes of the dataset's color distribution.
C. Invariance to local geometric distortions and chromatic aberration.
D. Invariance to extreme nonlinear gamma shifts caused by different camera sensors.

48 When applying color jittering (randomly varying Hue, Saturation, and Value) in HSV space to augment data, which mathematical precaution MUST be taken for the Hue channel that is not required for Saturation or Value?

Color augmentation techniques Hard
A. The Hue channel must be augmented using modulo arithmetic (e.g., modulo or $1.0$) because it represents an angular, periodic space.
B. The Hue channel must be transformed logarithmically to prevent perceptually massive color shifts.
C. The Hue channel must be clamped strictly to the range before converting back to RGB.
D. The Hue channel must be normalized to have zero mean and unit variance before jittering.

49 The Grey-World assumption is a popular algorithm for color constancy. Under which of the following real-world scenarios will the Grey-World algorithm fail most catastrophically, producing severely distorted colors?

Color constancy Hard
A. An image of a diverse outdoor landscape containing sky, grass, and dirt.
B. An image of a uniformly bright red brick wall taking up 90% of the camera's field of view.
C. An image with a wide dynamic range containing both extremely dark shadows and bright specular highlights.
D. An image of a highly textured, multi-colored carpet taken under a standard D65 illuminant.

50 The von Kries transform models illuminant changes using a diagonal matrix such that . For this diagonal approximation to be strictly true mathematically (without error), what underlying constraint must be satisfied regarding the camera sensors?

Color constancy Hard
A. The camera sensor spectral sensitivities must be perfectly non-overlapping (narrowband or Dirac delta functions).
B. The camera sensors must possess completely overlapping spectral sensitivities with differing amplitudes.
C. The camera sensors must have identical peak sensitivities at exactly $450$nm, $550$nm, and $650$nm.
D. The camera must have a linear dynamic range exceeding 14 bits per channel.

51 In a range image, pixel values represent distance from a reference plane or sensor. If a range image is generated using perspective projection, how does the spatial resolution (the physical area covered by a single pixel) change as the depth value increases?

Introduction to range images Hard
A. The physical area covered by a pixel decreases linearly with depth.
B. The physical area covered by a pixel increases logarithmically with depth.
C. The spatial resolution remains strictly constant regardless of depth.
D. The physical area covered by a pixel increases quadratically with depth.

52 When computing geometric properties, how does the evaluation of Gaussian Curvature on a 3D range image fundamentally differ from applying edge detection (e.g., Canny) on a 2D intensity image?

Difference between 2D intensity images and 3D range images Hard
A. There is no fundamental difference; both operators mathematically evaluate the second-order partial derivatives of the image grid.
B. Gaussian Curvature is an extrinsic property dependent on the viewpoint, whereas 2D edge detection is invariant to perspective transformations.
C. Gaussian Curvature requires a perfectly Lambertian surface to be computed accurately, whereas 2D edge detection works on any surface type.
D. Gaussian Curvature measures an intrinsic, viewpoint-invariant geometric property of the underlying surface, whereas 2D edges are heavily dependent on viewpoint, illumination, and albedo.

53 A key challenge in 2D computer vision is 'scale ambiguity', which is generally resolved in 3D range images. However, 3D range imaging suffers from a distinct physical limitation not typically affecting ideal 2D pinhole cameras. Which of the following is it?

Difference between 2D intensity images and 3D range images Hard
A. Range images are inherently limited by the speed of light, rendering them useless for moving objects.
B. Range images cannot resolve high-frequency texture information due to chromatic aberration.
C. Range images suffer from perspective foreshortening, making distant objects appear smaller.
D. Active range sensors inherently struggle with multi-path interference and specular reflections that distort geometry.

54 In an Amplitude Modulated Continuous Wave (AMCW) Time-of-Flight sensor, the distance is calculated using the phase shift of the modulated signal at frequency . The measurement is subject to phase wrapping. What is the expression for the unambiguous range interval ?

Active range sensors Hard
A.
B.
C.
D.

55 Structured light scanners project known patterns to establish corresponding points. In a highly cluttered scene with strong inter-reflections, a standard binary Gray code projection often fails. Why is phase-shifting (using sinusoidal patterns) often combined with Gray codes to mitigate this?

Active range sensors Hard
A. Phase-shifting relies on high-frequency temporal changes that inherently cancel out low-frequency inter-reflections.
B. Phase-shifting provides continuous sub-pixel accuracy that relies on relative intensity rather than absolute thresholding, though it requires Gray codes to unwrap the phase globally.
C. Phase-shifting uses distinct color channels to separate the direct signal from the inter-reflected signal.
D. Gray codes cause heavy quantization errors that physically scatter photons, whereas smooth sinusoids do not.

56 When smoothing a noisy range image (depth map), applying a standard linear Gaussian filter causes severe blurring across object boundaries (depth discontinuities). The Bilateral Filter solves this by modifying the convolution kernel. How does the Bilateral Filter achieve this mathematically?

Preprocessing of range data Hard
A. It weights adjacent pixels not only by their spatial Euclidean distance but also by the difference in their depth values, assigning lower weights to pixels across depth discontinuities.
B. It applies the Fourier transform to remove high-frequency components before performing spatial convolution.
C. It calculates the surface normals first and only convolves pixels whose normals are strictly parallel.
D. It replaces the weighted average with the median value of the local neighborhood, which naturally preserves edges.

57 In the Iterative Closest Point (ICP) algorithm used for range data registration, using a 'point-to-plane' error metric instead of a 'point-to-point' metric generally results in which of the following optimization behaviors?

Preprocessing of range data Hard
A. It turns the optimization into a strictly convex problem, preventing convergence to local minima.
B. It restricts transformations to purely translations, eliminating rotational degrees of freedom.
C. It allows the source points to slide smoothly along the target surfaces, leading to significantly faster convergence in flat regions.
D. It eliminates the need for an initial alignment guess, guaranteeing a global optimum.

58 'Flying pixels' are artifacts in ToF range images occurring at depth discontinuities, caused by the sensor integrating light from both foreground and background objects within a single pixel. Which morphological filter approach is most suited to systematically identify and remove these without eroding legitimate thin structures?

Preprocessing of range data Hard
A. Applying a Laplacian of Gaussian (LoG) filter and removing zero-crossings.
B. Checking local gradient magnitude; pixels with a depth gradient exceeding a dynamic threshold relative to the surrounding variance are flagged and removed.
C. Applying a standard 3x3 median filter iteratively until convergence.
D. Global intensity thresholding based on the active illumination amplitude.

59 To estimate the surface normal at a specific point in a 3D point cloud (derived from range data), one standard method involves applying Principal Component Analysis (PCA) to the local -nearest neighbors. What does the eigenvector corresponding to the smallest eigenvalue of the local covariance matrix represent?

Applications of range data Hard
A. The principal axis of the local surface texture.
B. The vector pointing directly toward the sensor origin.
C. The direction of maximum curvature on the surface.
D. The estimated surface normal vector at that point.

60 In 3D object recognition from range data, a 'Spin Image' is a local descriptor. How does the Spin Image achieve invariance to rigid 3D transformations (rotations and translations)?

Applications of range data Hard
A. By defining the descriptor strictly through the eigenvalues of the global covariance matrix, which are invariant to rotation.
B. By establishing a local cylindrical coordinate system centered at a specific point, aligned with that point's surface normal, and accumulating neighbor points into a 2D histogram of radial distance and elevation.
C. By projecting the 3D data onto the three principal Cartesian planes (XY, YZ, XZ) and extracting 2D SIFT features.
D. By computing a global 3D Fourier transform of the entire object and discarding the phase information.