1 $Why is color information highly valuable in computer vision applications?$

Importance of color in computer vision Easy

A.

It increases the processing speed of algorithms

B.

It eliminates the need for edge detection

C.

It provides a powerful descriptor that simplifies object identification and extraction

D.

It reduces the memory required to store images

2 $Which of the following tasks benefits the most from adding color information to a standard grayscale image?$

Importance of color in computer vision Easy

A.

Differentiating a red apple from a green apple of the same brightness

B.

Calculating the spatial frequency of an image

C.

Finding the geometric center of a shape

D.

Detecting straight lines using the Hough transform

3 $Which color model is primarily based on how the human eye perceives colors and is standard for displaying images on digital screens?$

color models Easy

A.

HSV

B.

YIQ

C.

CMYK

D.

RGB

4 $What do the letters in the HSV color model stand for?$

color models Easy

A.

Hue, Shade, Volume

B.

High, Standard, Variable

C.

Hue, Saturation, Value

D.

Hue, Saturation, Variance

5 $Why is it common to convert images from RGB to the HSV color space in computer vision?$

conversion between color spaces Easy

A.

To increase the resolution of the image

B.

To separate color information from intensity or lighting information

C.

To compress the image size

D.

To convert a color image into a 3D model

6 $When converting an RGB image to a grayscale image, what mathematical operation is typically performed on the pixels?$

conversion between color spaces Easy

A.

Multiplying all three channels together

B.

Taking a weighted average of the,, and channels

C.

Finding the maximum value among,, and

D.

Subtracting the channel from the channel

7 $What is the primary purpose of applying color augmentation techniques to an image dataset?$

color augmentation techniques Easy

A.

To convert 2D images into 3D point clouds

B.

To compress the dataset for storage

C.

To reduce the color depth to 8-bit

D.

To artificially increase dataset diversity and make models robust to lighting changes

8 $Which of the following is a common color augmentation technique?$

color augmentation techniques Easy

A.

Edge detection

B.

Image rotation

C.

Max pooling

D.

Color jittering (randomly changing brightness, contrast, and saturation)

9 $What does 'color constancy' refer to in computer vision?$

color constancy Easy

A.

The process of keeping the file size constant when adding color

B.

The ability to perceive the true color of an object despite changes in the color of the illumination

C.

The technique of using only primary colors in an image

D.

The ability to convert a color image into grayscale

10 $Which simple algorithm is often used as a baseline to achieve color constancy by assuming that the average color of a scene is neutral gray?$

color constancy Easy

A.

The Gray World assumption

B.

The RGB to HSV conversion

C.

The Hough Transform

D.

The Canny Edge Detector

11 $What does the pixel value represent in a range image?$

introduction to range images Easy

A.

The temperature of the object

B.

The intensity of light reflected by an object

C.

The distance or depth from the sensor to the object

D.

The color hue of the object

12 $What is another common name for a range image?$

introduction to range images Easy

A.

Texture map

B.

Thermal map

C.

Depth map

D.

Grayscale map

13 $Which of the following best describes the difference between a 2D intensity image and a 3D range image?$

difference between 2D intensity images and 3D range images Easy

A.

Intensity images can only be captured actively; range images are captured passively.

B.

Intensity images are 3D; range images are 2D.

C.

Intensity images capture distance; range images capture color.

D.

Intensity images depend on light reflectance; range images measure geometric distance.

14 $Which type of image is generally immune to shadows and variations in ambient lighting?$

difference between 2D intensity images and 3D range images Easy

A.

RGB images

B.

2D intensity images

C.

3D range images

D.

Grayscale images

15 $How do active range sensors operate?$

active range sensors Easy

A.

They calculate depth based purely on the color of objects.

B.

They emit their own energy (like light or sound) and measure the reflection to calculate distance.

C.

They estimate depth strictly using two passive cameras (stereo vision).

D.

They passively measure the ambient sunlight reflecting off objects.

16 $Which of the following is an example of an active range sensor?$

active range sensors Easy

A.

LiDAR (Light Detection and Ranging)

B.

Standard RGB Web camera

C.

Passive Stereo Camera

D.

Thermal imaging camera

17 $Why is preprocessing often necessary for raw range data?$

preprocessing of range data Easy

A.

To add artificial colors to the image

B.

To convert the 3D data into 2D intensity data

C.

To remove missing data points and noise spikes common in depth sensors

D.

To increase the physical range of the sensor

18 $Which filter is commonly used in preprocessing range images to remove outlier noise (salt-and-pepper noise) without blurring the sharp geometric edges?$

preprocessing of range data Easy

A.

Median filter

B.

High-pass filter

C.

Mean filter

D.

Fourier filter

19 $Which of the following fields heavily relies on range data for obstacle avoidance and mapping?$

applications of range data Easy

A.

Text document scanning

B.

Autonomous driving and robotics

C.

Digital typography

D.

Audio processing

20 $How are range images primarily used in modern smartphone facial recognition systems (like FaceID)?$

applications of range data Easy

A.

To improve the color saturation of selfies

B.

To check the skin tone of the user

C.

To measure the room temperature

D.

To capture a 3D structural map of the face, preventing spoofing with 2D photos

21 $Why is color information often considered more robust than grayscale for object tracking under partial occlusion?$

Importance of color in computer vision Medium

A.

Color components are entirely invariant to extreme changes in illumination.

B.

Color images require less computational power to process during real-time tracking.

C.

Grayscale images cannot be used to compute optical flow, making color essential for tracking.

D.

Color provides a multi-dimensional feature space that helps distinguish objects from similar-intensity backgrounds.

22 $In image segmentation, how does the inclusion of color channels typically affect the performance of clustering algorithms like K-means compared to using only intensity?$

Importance of color in computer vision Medium

A.

It forces the algorithm to rely solely on edge detection rather than regional properties.

B.

It increases the separability of classes by mapping pixels into a 3D feature space instead of a 1D space.

C.

It reduces the separability of classes by introducing noise into the clustering process.

D.

It transforms the problem into a linear regression task, bypassing the need for clustering.

23 $Which color model is most suitable for developing an application that requires intuitive manipulation of "tint" and "vividness" by a human user?$

Color models Medium

A.

CMYK (Cyan, Magenta, Yellow, Key)

B.

YUV (Luminance, Chrominance)

C.

HSV (Hue, Saturation, Value)

D.

RGB (Red, Green, Blue)

24 $In the YCbCr color model used in video compression, why is the 'Y' channel typically sampled at a higher resolution than 'Cb' and 'Cr'?$

Color models Medium

A.

The human visual system is more sensitive to spatial variations in brightness than in color.

B.

The 'Y' channel is used to synchronize the audio and video streams.

C.

Compression algorithms can only process the 'Cb' and 'Cr' channels at lower resolutions.

D.

The 'Y' channel contains the color information, which requires more bits to store.

25 $When converting from an RGB image to grayscale using the standard NTSC formula, why are the weights unequal?$

Conversion between color spaces Medium

A.

To account for the non-linear response of digital camera sensors.

B.

Because the RGB color space is inherently flawed and requires statistical correction.

C.

Because the human eye has varying sensitivity to different wavelengths, being most sensitive to green.

D.

To compensate for the fact that blue pixels use more electrical power in LCD displays.

26 $What happens if the non-linear gamma correction step is omitted during the conversion from linear RGB to sRGB space?$

Conversion between color spaces Medium

A.

The image will become completely desaturated, appearing as grayscale.

B.

The resulting image will appear unnaturally dark on standard display monitors.

C.

The image resolution will be mathematically halved.

D.

The color hue will shift by 180 degrees, resulting in a negative image.

27 $When applying color jittering as an augmentation technique to train a robust neural network, which parameter should be perturbed to simulate varying lighting intensities without changing the object's intrinsic color?$

Color augmentation techniques Medium

A.

The a* channel in the CIELAB space

B.

Value/Brightness in the HSV space

C.

Saturation in the HSV space

D.

Hue in the HSV space

28 $What is the primary purpose of applying PCA-based color augmentation (like Fancy PCA) on RGB images during deep learning model training?$

Color augmentation techniques Medium

A.

To perfectly normalize the image so all pixel values equal exactly zero mean and unit variance.

B.

To compress the image size and speed up training epochs.

C.

To alter RGB intensities proportionally along their principal components, preserving natural illumination variations.

D.

To convert the RGB image into a binary mask for faster feature extraction.

29 $Which algorithm applies the assumption that the spatial average of surface reflectances in a scene is achromatic (gray) to achieve color constancy?$

Color constancy Medium

A.

Gray World Assumption

B.

White Patch Retinex

C.

Histogram Equalization

D.

Gamut Mapping

30 $An image taken under a strong yellow tungsten light looks unnaturally warm. How does the White Patch (Max-RGB) algorithm attempt to correct this illumination bias?$

Color constancy Medium

A.

By finding the maximum pixel value in each color channel and scaling all pixels so these maximums become pure white.

B.

By applying a median filter to remove the yellow wavelengths from the image spectrum.

C.

By converting the image to grayscale and colorizing it using a pre-trained neural network.

D.

By calculating the average of all pixels and subtracting it from the yellow channel.

31 $In a standard range image, what does the scalar value at pixel coordinate explicitly represent?$

Introduction to range images Medium

A.

The color intensity of the object in the near-infrared spectrum.

B.

The amount of light reflected from the object surface to the sensor.

C.

The velocity of the object relative to the camera.

D.

The physical distance from the sensor to the surface of the scene at that spatial location.

32 $Which of the following best describes the structural representation of a standard range image?$

Introduction to range images Medium

A.

A fully volumetric 3D voxel grid.

B.

A continuous mathematical surface defined by a set of B-spline equations.

C.

A 1D array of distance measurements sorted by magnitude.

D.

A 2D array where each element stores a depth value, often referred to as a 2.5D representation.

33 $Why are edge detection algorithms applied to range images fundamentally different in physical interpretation compared to 2D intensity images?$

Difference between 2D intensity images and 3D range images Medium

A.

Range images have fewer pixels, making edge detection mathematically simpler but less accurate.

B.

Intensity images only have straight edges, while range images capture curved geometric edges.

C.

Edges in range images correspond to physical depth discontinuities, whereas in intensity images they may represent shadows or texture boundaries.

D.

Edge detection cannot be applied to range images because depth values are not continuous.

34 $How does the presence of ambient illumination changes affect 2D intensity images compared to 3D range images?$

Difference between 2D intensity images and 3D range images Medium

A.

Range images are highly sensitive to illumination changes, while intensity images are invariant.

B.

Ambient illumination changes only affect the spatial resolution of range images.

C.

Intensity images are highly sensitive to illumination changes, while range images are generally invariant to ambient lighting.

D.

Both are equally degraded by changes in ambient illumination.

35 $How does a Time-of-Flight (ToF) camera estimate the depth of a scene?$

Active range sensors Medium

A.

By analyzing the blur radius of objects moving rapidly across the sensor's field of view.

B.

By measuring the phase shift or time delay of an emitted light pulse reflecting off the scene and returning to the sensor.

C.

By projecting a grid pattern and measuring the geometric distortion of the grid on the object.

D.

By capturing two images simultaneously from different angles and computing stereo disparity.

36 $In a structured light active range sensor, what is the primary purpose of projecting a known pattern (e.g., a grid or stripes) onto the scene?$

Active range sensors Medium

A.

To establish pixel correspondences between the projector and the camera for triangulation.

B.

To illuminate the scene brightly enough so the camera can capture color.

C.

To confuse ambient infrared sensors that might interfere with the measurement.

D.

To directly measure the speed of light reflecting off the object's surface.

37 $Range images obtained from active sensors often contain "flying pixels" (mixed pixels) at object boundaries. Which filter is most effective for removing these artifacts while preserving sharp depth edges?$

Preprocessing of range data Medium

A.

Mean filter

B.

High-pass filter

C.

Median filter

D.

Gaussian filter

38 $What is a common technique used to fill in missing depth values (holes) in range images caused by highly reflective surfaces or occlusions?$

Preprocessing of range data Medium

A.

Multiplying the depth map by the ambient lighting intensity.

B.

Bilateral filtering or spatial interpolation using valid neighboring depth pixels.

C.

Applying a global Fourier transform to reconstruct missing frequencies.

D.

Converting the depth map to the frequency domain and applying a low-pass filter.

39 $In autonomous navigation, how is range data primarily utilized for obstacle avoidance?$

Applications of range data Medium

A.

By tracking the sun's position to estimate the vehicle's global coordinates.

B.

By analyzing the color signature of objects to determine if they are moving vehicles.

C.

By reading text on traffic signs using optical character recognition (OCR).

D.

By constructing a 3D occupancy grid or point cloud to identify the exact distance and geometry of obstacles in the vehicle's path.

40 $How are range images utilized in industrial quality inspection for manufactured parts?$

Applications of range data Medium

A.

By listening to the acoustic resonance of the part when hit with an infrared laser.

B.

By extracting 3D surface geometry from the scan and comparing it against a reference CAD model to detect dimensional defects.

C.

By evaluating the color consistency of the paint applied to the manufactured part.

D.

By analyzing the 2D shadows cast by the part to estimate its weight.

41 $Metamerism poses a significant challenge in color-based object recognition. Two physically distinct surfaces are considered metamers under a specific illuminant if they exhibit which of the following properties?$

Importance of color in computer vision Hard

A.

They possess different spectral reflectance curves but yield identical sensor responses when integrated over the sensor's spectral sensitivities and the illuminant's power distribution.

B.

They possess different spectral reflectance curves and yield different sensor responses, but map to the same chromaticity coordinates in the CIE xy diagram.

C.

They possess identical spectral reflectance curves but yield different RGB sensor responses.

D.

They possess identical spectral reflectance curves and yield identical sensor responses under all possible illuminants.

42 $According to the Dichromatic Reflection Model, the total radiance from a dielectric inhomogeneous material is given by . If an algorithm successfully factors out, what is the primary consequence for computer vision tasks?$

Importance of color in computer vision Hard

A.

The algorithm completely neutralizes the color of the illuminant, achieving perfect color constancy.

B.

The algorithm removes the Lambertian diffuse reflection, leaving only the material's structural geometry.

C.

The algorithm eliminates specular highlights, ensuring the remaining radiance is purely a function of the material's albedo and Lambertian shading.

D.

The algorithm maps all out-of-gamut colors to the closest boundary on the CIE spectral locus.

43 $The sRGB color space applies a non-linear transfer function (gamma correction) to linear RGB values. From a mathematical and information-theoretic perspective, what is the primary purpose of applying this specific non-linearity prior to 8-bit quantization?$

Color models Hard

A.

To distribute quantization steps more uniformly according to the human visual system's logarithmic sensitivity to luminance, thereby minimizing banding in dark regions.

B.

To compensate for the non-linear voltage-to-luminance response of modern OLED and LCD displays.

C.

To orthogonalize the color channels, ensuring that modifications in the Red channel do not affect the perceived brightness of the Blue channel.

D.

To map the theoretically infinite dynamic range of physical scene radiance to the bounded interval required for digital processing.

44 $In the YCbCr color model, chroma subsampling (e.g., 4:2:0) reduces bandwidth. Which mathematical property of the RGB to YCbCr conversion makes this subsampling perceptually viable?$

Color models Hard

A.

The and channels are mathematically derived from the eigenvectors of the human retina's cone responses.

B.

The transformation is highly non-linear, pushing high-frequency chromatic data into the channel.

C.

The conversion applies a localized Fourier transform, allowing the and channels to inherently represent only low-frequency spatial patterns.

D.

The transformation orthogonalizes the space such that the channel contains a weighted sum of RGB optimized for human luminance sensitivity, while and represent purely chromatic differences to which high-frequency human spatial sensitivity is low.

45 $When converting from linear RGB to the HSI (Hue, Saturation, Intensity) color space, the Hue becomes mathematically undefined when . How does this singularity impact gradient-based optimization in deep learning models operating directly in HSI space?$

Conversion between color spaces Hard

A.

It induces infinite or undefined gradients (NaNs) when traversing the achromatic axis, leading to severe instability during backpropagation.

B.

It guarantees that the gradient of the loss with respect to hue is zero, naturally freezing the hue parameters.

C.

It has no impact, because automatic differentiation engines inherently handle divisions by substituting a gradient of $1.0$.

D.

It requires the optimizer to use complex-valued arithmetic to traverse the hue circle smoothly.

46 $The conversion from CIE XYZ to CIE L a b* incorporates a non-linear function (for). What is the exact geometrical objective of this specific transformation?$

Conversion between color spaces Hard

A.

To map the white point of the illuminant exactly to the origin in L a b* space.

B.

To decouple the chromaticity coordinates from luminance, allowing for perfectly linear alpha blending.

C.

To achieve perceptual uniformity, ensuring that the Euclidean distance between two coordinates roughly corresponds to the perceived color difference by the human eye.

D.

To compress the gamut of highly saturated colors to prevent out-of-bound errors during 8-bit quantization.

47 $In Fancy PCA (used in AlexNet) for color augmentation, an RGB pixel is augmented by adding, where and are eigenvectors and eigenvalues of the RGB covariance matrix. What specific invariance is this technique injecting into the model?$

Color augmentation techniques Hard

A.

Invariance to changes in the intensity and color of the scene illumination by shifting pixel values along the principal axes of the dataset's color distribution.

B.

Invariance to extreme nonlinear gamma shifts caused by different camera sensors.

C.

Invariance to local geometric distortions and chromatic aberration.

D.

Invariance to spatial frequency shifts in the color channels.

48 $When applying color jittering (randomly varying Hue, Saturation, and Value) in HSV space to augment data, which mathematical precaution MUST be taken for the Hue channel that is not required for Saturation or Value?$

Color augmentation techniques Hard

A.

The Hue channel must be augmented using modulo arithmetic (e.g., modulo or $1.0$) because it represents an angular, periodic space.

B.

The Hue channel must be transformed logarithmically to prevent perceptually massive color shifts.

C.

The Hue channel must be clamped strictly to the range before converting back to RGB.

D.

The Hue channel must be normalized to have zero mean and unit variance before jittering.

49 $The Grey-World assumption is a popular algorithm for color constancy. Under which of the following real-world scenarios will the Grey-World algorithm fail most catastrophically, producing severely distorted colors?$

Color constancy Hard

A.

An image with a wide dynamic range containing both extremely dark shadows and bright specular highlights.

B.

An image of a diverse outdoor landscape containing sky, grass, and dirt.

C.

An image of a uniformly bright red brick wall taking up 90% of the camera's field of view.

D.

An image of a highly textured, multi-colored carpet taken under a standard D65 illuminant.

50 $The von Kries transform models illuminant changes using a diagonal matrix such that . For this diagonal approximation to be strictly true mathematically (without error), what underlying constraint must be satisfied regarding the camera sensors?$

Color constancy Hard

A.

The camera must have a linear dynamic range exceeding 14 bits per channel.

B.

The camera sensor spectral sensitivities must be perfectly non-overlapping (narrowband or Dirac delta functions).

C.

The camera sensors must have identical peak sensitivities at exactly $450$nm, $550$nm, and $650$nm.

D.

The camera sensors must possess completely overlapping spectral sensitivities with differing amplitudes.

51 $In a range image, pixel values represent distance from a reference plane or sensor. If a range image is generated using perspective projection, how does the spatial resolution (the physical area covered by a single pixel) change as the depth value increases?$

Introduction to range images Hard

A.

The physical area covered by a pixel increases logarithmically with depth.

B.

The physical area covered by a pixel increases quadratically with depth.

C.

The spatial resolution remains strictly constant regardless of depth.

D.

The physical area covered by a pixel decreases linearly with depth.

52 $When computing geometric properties, how does the evaluation of Gaussian Curvature on a 3D range image fundamentally differ from applying edge detection (e.g., Canny) on a 2D intensity image?$

Difference between 2D intensity images and 3D range images Hard

A.

Gaussian Curvature is an extrinsic property dependent on the viewpoint, whereas 2D edge detection is invariant to perspective transformations.

B.

Gaussian Curvature requires a perfectly Lambertian surface to be computed accurately, whereas 2D edge detection works on any surface type.

C.

There is no fundamental difference; both operators mathematically evaluate the second-order partial derivatives of the image grid.

D.

Gaussian Curvature measures an intrinsic, viewpoint-invariant geometric property of the underlying surface, whereas 2D edges are heavily dependent on viewpoint, illumination, and albedo.

53 $A key challenge in 2D computer vision is 'scale ambiguity', which is generally resolved in 3D range images. However, 3D range imaging suffers from a distinct physical limitation not typically affecting ideal 2D pinhole cameras. Which of the following is it?$

Difference between 2D intensity images and 3D range images Hard

A.

Range images cannot resolve high-frequency texture information due to chromatic aberration.

B.

Range images are inherently limited by the speed of light, rendering them useless for moving objects.

C.

Active range sensors inherently struggle with multi-path interference and specular reflections that distort geometry.

D.

Range images suffer from perspective foreshortening, making distant objects appear smaller.

54 $In an Amplitude Modulated Continuous Wave (AMCW) Time-of-Flight sensor, the distance is calculated using the phase shift of the modulated signal at frequency . The measurement is subject to phase wrapping. What is the expression for the unambiguous range interval ?$

Active range sensors Hard

A.

B.

C.

D.

55 $Structured light scanners project known patterns to establish corresponding points. In a highly cluttered scene with strong inter-reflections, a standard binary Gray code projection often fails. Why is phase-shifting (using sinusoidal patterns) often combined with Gray codes to mitigate this?$

Active range sensors Hard

A.

Phase-shifting uses distinct color channels to separate the direct signal from the inter-reflected signal.

B.

Phase-shifting provides continuous sub-pixel accuracy that relies on relative intensity rather than absolute thresholding, though it requires Gray codes to unwrap the phase globally.

C.

Phase-shifting relies on high-frequency temporal changes that inherently cancel out low-frequency inter-reflections.

D.

Gray codes cause heavy quantization errors that physically scatter photons, whereas smooth sinusoids do not.

56 $When smoothing a noisy range image (depth map), applying a standard linear Gaussian filter causes severe blurring across object boundaries (depth discontinuities). The Bilateral Filter solves this by modifying the convolution kernel. How does the Bilateral Filter achieve this mathematically?$

Preprocessing of range data Hard

A.

It applies the Fourier transform to remove high-frequency components before performing spatial convolution.

B.

It replaces the weighted average with the median value of the local neighborhood, which naturally preserves edges.

C.

It calculates the surface normals first and only convolves pixels whose normals are strictly parallel.

D.

It weights adjacent pixels not only by their spatial Euclidean distance but also by the difference in their depth values, assigning lower weights to pixels across depth discontinuities.

57 $In the Iterative Closest Point (ICP) algorithm used for range data registration, using a 'point-to-plane' error metric instead of a 'point-to-point' metric generally results in which of the following optimization behaviors?$

Preprocessing of range data Hard

A.

It turns the optimization into a strictly convex problem, preventing convergence to local minima.

B.

It eliminates the need for an initial alignment guess, guaranteeing a global optimum.

C.

It restricts transformations to purely translations, eliminating rotational degrees of freedom.

D.

It allows the source points to slide smoothly along the target surfaces, leading to significantly faster convergence in flat regions.

58 $'Flying pixels' are artifacts in ToF range images occurring at depth discontinuities, caused by the sensor integrating light from both foreground and background objects within a single pixel. Which morphological filter approach is most suited to systematically identify and remove these without eroding legitimate thin structures?$

Preprocessing of range data Hard

A.

Applying a Laplacian of Gaussian (LoG) filter and removing zero-crossings.

B.

Applying a standard 3x3 median filter iteratively until convergence.

C.

Global intensity thresholding based on the active illumination amplitude.

D.

Checking local gradient magnitude; pixels with a depth gradient exceeding a dynamic threshold relative to the surrounding variance are flagged and removed.

59 $To estimate the surface normal at a specific point in a 3D point cloud (derived from range data), one standard method involves applying Principal Component Analysis (PCA) to the local -nearest neighbors. What does the eigenvector corresponding to the smallest eigenvalue of the local covariance matrix represent?$

Applications of range data Hard

A.

The vector pointing directly toward the sensor origin.

B.

The principal axis of the local surface texture.

C.

The estimated surface normal vector at that point.

D.

The direction of maximum curvature on the surface.

60 $In 3D object recognition from range data, a 'Spin Image' is a local descriptor. How does the Spin Image achieve invariance to rigid 3D transformations (rotations and translations)?$

Applications of range data Hard

A.

By establishing a local cylindrical coordinate system centered at a specific point, aligned with that point's surface normal, and accumulating neighbor points into a 2D histogram of radial distance and elevation.

B.

By projecting the 3D data onto the three principal Cartesian planes (XY, YZ, XZ) and extracting 2D SIFT features.

C.

By defining the descriptor strictly through the eigenvalues of the global covariance matrix, which are invariant to rotation.

D.

By computing a global 3D Fourier transform of the entire object and discarding the phase information.

Unit 5 - Practice Quiz