1 $Which programming language environment is strictly required to be installed before running Apache Cassandra?$

Installation of Apache Cassandra Easy

A.

Ruby

B.

Python

C.

Java

D.

C++

2 $What type of architecture does Apache Cassandra use to ensure there is no single point of failure?$

Cassandra Architecture – peer-to-peer architecture Easy

A.

Master-Slave architecture

B.

Peer-to-peer (Masterless) architecture

C.

Hub-and-Spoke architecture

D.

Client-Server architecture

3 $Which protocol does Cassandra use for nodes to communicate and share state information about themselves and other nodes?$

Gossip protocol Easy

A.

FTP

B.

HTTP/2

C.

SMTP

D.

Gossip protocol

4 $In Cassandra, what does the 'Replication Factor' determine?$

Replication and consistency levels Easy

A.

The number of users allowed to read data

B.

The speed at which data is written

C.

The total number of copies of data across the cluster

D.

The number of disks per node

5 $Which consistency level requires only a single replica node to acknowledge a read or write operation to be considered successful?$

Replication and consistency levels Easy

A.

ANY

B.

ONE

C.

ALL

D.

QUORUM

6 $During a write operation, where does Cassandra store the data in memory before flushing it to disk?$

Read and write paths Easy

A.

Memtable

B.

Cache

C.

Commit Log

D.

SSTable

7 $What is the append-only file on disk used by Cassandra to ensure data durability and aid in crash recovery?$

Read and write paths Easy

A.

SSTable

B.

Commit log

C.

Memtable

D.

Bloom filter

8 $Which Cassandra object acts as a container for tables and is analogous to a database in a Relational Database Management System (RDBMS)?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Easy

A.

Keyspace

B.

Partition

C.

Column Family

D.

Cluster

9 $Which component of the primary key is responsible for determining the specific node where a row's data is stored?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Easy

A.

Clustering column

B.

Super column

C.

Secondary index

D.

Partition key

10 $What is the primary function of clustering columns in Cassandra's data model?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Easy

A.

To encrypt the data

B.

To determine the node location

C.

To sort data within a specific partition

D.

To create a secondary index

11 $What term is used to describe a partition in Cassandra that contains a very large number of rows/columns?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Easy

A.

Tall tables

B.

Wide rows

C.

Skinny rows

D.

Fat partitions

12 $Which query language is used by developers to interact with Apache Cassandra?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

CQL (Cassandra Query Language)

B.

HQL (Hive Query Language)

C.

SQL (Structured Query Language)

D.

Pig Latin

13 $Which CQL command is used to add new records into a Cassandra table?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

UPLOAD

B.

INSERT

C.

PUT

D.

ADD

14 $In CQL, what keyword must often be appended to a SELECT statement to query on a column that is not part of the primary key (assuming no secondary index)?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

FORCE FILTER

B.

BYPASS INDEX

C.

ENABLE SCAN

D.

ALLOW FILTERING

15 $Which CQL command is used to remove an entire keyspace and all of its contents?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

TRUNCATE KEYSPACE

B.

DROP KEYSPACE

C.

DELETE KEYSPACE

D.

REMOVE KEYSPACE

16 $What is the purpose of creating an index in Cassandra?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

To trigger the compaction process

B.

To increase the replication factor

C.

To allow efficient querying on a non-primary key column

D.

To sort the partition key

17 $Which Cassandra administration process merges multiple SSTables into a single new SSTable to reclaim disk space?$

Cassandra Administration - compaction Easy

A.

Compaction

B.

Replication

C.

Gossip

D.

Flushing

18 $Because Cassandra uses a masterless architecture, what can be said about the capabilities of every node in the cluster?$

Cassandra Architecture – peer-to-peer architecture Easy

A.

Only one node can handle writes.

B.

Nodes must request permission from a master node to serve data.

C.

Read operations are faster than write operations on all nodes.

D.

Every node can accept read and write requests equally.

19 $Which of the following is a valid native data type in CQL used for storing whole numbers?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

VARCHAR

B.

INT

C.

STRING

D.

DECIMAL

20 $Which CQL command modifies an existing table structure, such as adding a new column?$

CQL (Cassandra Query Language) – data types, creating keyspaces and tables, insert, update, delete, select, filtering, indexing Easy

A.

MODIFY TABLE

B.

CHANGE TABLE

C.

ALTER TABLE

D.

UPDATE TABLE

21 $In a 5-node Cassandra cluster, node C experiences a sudden hardware failure. How does the Cassandra architecture ensure continued availability of data originally managed by node C?$

Cassandra Architecture – peer-to-peer architecture Medium

A.

A leader election is triggered among the remaining nodes to elect a replacement for node C.

B.

A master node detects the failure and reassigns node C's tokens to another node.

C.

Other nodes containing replicas of node C's data continue to serve read and write requests based on the replication factor.

D.

The cluster immediately locks all read and write operations until node C is manually restored.

22 $How does a Cassandra node dynamically discover the current state, load, and availability of all other nodes in a large cluster?$

Gossip protocol Medium

A.

By broadcasting UDP multicast messages to every node in the datacenter simultaneously.

B.

By querying the central Apache ZooKeeper service for metadata updates.

C.

By performing continuous DNS lookups on a centralized registry.

D.

By exchanging state information continuously with up to three other nodes every second using the Gossip protocol.

23 $A keyspace is configured with a Replication Factor (RF) of 3. If a client executes a write operation with a Consistency Level of QUORUM, how many replica nodes must acknowledge the write for it to be successful?$

Replication and consistency levels Medium

A.

3

B.

2

C.

1

D.

4
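
The quorum arithmetic behind question 23 can be checked with a minimal Python sketch, assuming the usual definition of a quorum as floor(RF / 2) + 1. The function name `quorum` is illustrative and is not part of any Cassandra API.

```python
def quorum(replication_factor: int) -> int:
    """Replicas that must acknowledge at QUORUM: floor(RF / 2) + 1."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return replication_factor // 2 + 1


# Print the quorum size for a few common replication factors.
for rf in (1, 2, 3, 5):
    print(f"RF={rf} -> QUORUM needs {quorum(rf)} acknowledgement(s)")
```

Note that RF=2 also needs 2 acknowledgements, so a two-replica keyspace cannot lose a replica and still satisfy QUORUM.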

24 $In a cluster with a Replication Factor of 3, which combination of Read and Write Consistency Levels provides 'Strong Consistency' (ensuring a read always reflects the latest write)?$

Replication and consistency levels Medium

A.

Write ONE, Read ONE

B.

Write ANY, Read QUORUM

C.

Write QUORUM, Read QUORUM

D.

Write LOCAL_ONE, Read ONE
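
The overlap rule that question 24 relies on (a read is guaranteed to see the latest write when write replicas + read replicas > RF) can be sketched in Python. This is an illustrative model only: the helper names are hypothetical, a single datacenter is assumed so that LOCAL_ONE behaves like ONE, and ANY is counted as guaranteeing zero replicas because a stored hint alone can satisfy it.

```python
def replicas_required(level: str, rf: int) -> int:
    """Replicas guaranteed to hold or return the data at a consistency level."""
    table = {
        "ANY": 0,            # assumption: a hint alone may satisfy ANY
        "ONE": 1,
        "LOCAL_ONE": 1,      # assumption: single datacenter
        "QUORUM": rf // 2 + 1,
        "ALL": rf,
    }
    return table[level]


def reads_see_latest_write(write_level: str, read_level: str, rf: int) -> bool:
    """True when the read and write replica sets must overlap (W + R > RF)."""
    return replicas_required(write_level, rf) + replicas_required(read_level, rf) > rf


# Evaluate the four combinations listed in the question for RF = 3.
for write_cl, read_cl in [("ONE", "ONE"), ("ANY", "QUORUM"),
                          ("QUORUM", "QUORUM"), ("LOCAL_ONE", "ONE")]:
    print(write_cl, read_cl, reads_see_latest_write(write_cl, read_cl, 3))
```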

25 $When a client sends a write request to a Cassandra node, what is the exact sequence of internal components involved before acknowledging success to the client?$

Read and write paths Medium

A.

It writes to the Commit Log on disk, then writes to the Memtable in memory.

B.

It writes to the Memtable, then flushes directly to the SSTable.

C.

It writes to the SSTable on disk, then updates the Commit Log.

D.

It writes to the Memtable in memory, then asynchronously appends to the Commit Log.

26 $During a read operation, if a Cassandra node discovers that data for the same row exists in the Memtable and multiple SSTables with different values, how does it resolve the conflict?$

Read and write paths Medium

A.

It raises a version conflict exception to the client.

B.

It merges the data and uses the cell with the most recent timestamp (Last-Write-Wins).

C.

It assumes the Memtable always contains the correct data because it is in memory.

D.

It prompts the Gossip protocol to vote on the correct value.
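
The timestamp-based reconciliation described in question 26 can be illustrated with a small Python sketch. It is a simplified model, not Cassandra's implementation: each source (Memtable or SSTable) is assumed to be a dictionary mapping a column name to a (timestamp, value) pair, the names `merge_row`, `memtable`, `sstable_1`, and `sstable_2` are hypothetical, and tie-breaking for equal timestamps is omitted.

```python
from typing import Dict, Iterable, Tuple

Cell = Tuple[int, str]  # (write timestamp, value)


def merge_row(sources: Iterable[Dict[str, Cell]]) -> Dict[str, Cell]:
    """Keep, for each column, the cell with the highest write timestamp."""
    merged: Dict[str, Cell] = {}
    for source in sources:
        for column, cell in source.items():
            if column not in merged or cell[0] > merged[column][0]:
                merged[column] = cell
    return merged


# Illustrative data: the same row seen in memory and in two files on disk.
memtable = {"email": (300, "new@test.com")}
sstable_1 = {"email": (100, "old@test.com"), "name": (100, "Ada")}
sstable_2 = {"name": (200, "Ada L.")}

row = merge_row([memtable, sstable_1, sstable_2])
print(row)
```

The merge is per column, so the newest `email` can come from the Memtable while the newest `name` comes from an SSTable.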

27 $You are designing a globally distributed application that requires Cassandra to replicate data across two different datacenters (e.g., US-East and EU-West). Which replication strategy should you use when creating the keyspace?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Medium

A.

SimpleStrategy

B.

NetworkTopologyStrategy

C.

GlobalReplicationStrategy

D.

DatacenterStrategy

28 $What specifically defines a 'wide row' in Cassandra's data model architecture?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Medium

A.

A physical row on disk that exceeds 2 GB of storage space.

B.

A single database partition that contains multiple logical rows organized by clustering columns.

C.

A table that has more than 100 column definitions in its CQL schema.

D.

A row that spans across multiple nodes in a cluster to balance the load.

29 $Given a table created with PRIMARY KEY ((sensor_id, date), time), how is the data distributed across nodes and sorted on disk?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Medium

A.

Distributed sequentially by all three fields combined.

B.

Distributed by a composite of sensor_id and date, sorted by time.

C.

Distributed by sensor_id, sorted by date and time.

D.

Distributed by time, sorted by sensor_id and date.

30 $You execute an INSERT statement in CQL, but a row with the exact same primary key already exists in the table. What is the outcome?$

CQL (Cassandra Query Language) – insert, update, delete, select Medium

A.

Cassandra overwrites the existing row with the new values without checking if it existed.

B.

Cassandra throws a DuplicateKeyException.

C.

Cassandra silently ignores the query and retains the old data.

D.

Cassandra creates a secondary partition for the duplicate row.

31 $If you need to store an unordered collection of unique email addresses for a user within a single column, which CQL data type is the most appropriate?$

CQL (Cassandra Query Language) – data types Medium

A.

list<text>

B.

map<text, text>

C.

tuple<text>

D.

set<text>

32 $How does Cassandra internally execute a CQL UPDATE statement?$

CQL (Cassandra Query Language) – update Medium

A.

It reads the current row, modifies the specified columns, and writes the complete row back to disk.

B.

It writes the updated column values with a new timestamp as a new entry, without reading the existing data.

C.

It searches the SSTables for the existing row and modifies it in-place.

D.

It locks the partition across all replica nodes to ensure serializability, then updates the data.

33 $When a user executes a DELETE query in CQL, how does Cassandra process this request to ensure replicas accurately reflect the deletion?$

CQL (Cassandra Query Language) – delete Medium

A.

It flags the node's JVM to perform garbage collection on the specific row memory.

B.

It replaces the deleted column values with null pointers.

C.

It immediately erases the data from the Memtable and SSTables.

D.

It inserts a marker called a 'Tombstone' with a timestamp indicating the data is deleted.

34 $A developer runs a SELECT query with a WHERE clause on a column that is neither a partition key nor indexed. They append ALLOW FILTERING to make it execute. What is the performance impact?$

CQL (Cassandra Query Language) – filtering Medium

A.

Cassandra may scan all partitions on all nodes to find the requested data, causing severe performance degradation.

B.

Cassandra temporarily builds an in-memory index for the duration of the query.

C.

The query runs locally on a single node without contacting replicas, sacrificing consistency for speed.

D.

The query executes efficiently by utilizing the partition key cache.

35 $In Cassandra, creating a Secondary Index is generally considered an anti-pattern and highly inefficient in which of the following scenarios?$

CQL (Cassandra Query Language) – indexing Medium

A.

When created on a column with extremely high cardinality, such as a user's unique email or UUID.

B.

When querying an indexed column alongside a specific partition key.

C.

When created on a column storing standard static categories like 'department'.

D.

When created on a clustering column to filter within a partition.

36 $During the installation and configuration of an Apache Cassandra node, which file is primarily modified to define the cluster_name, listen_address, and seeds list?$

Installation of Apache Cassandra Medium

A.

cassandra.yaml

B.

cassandra-env.sh

C.

jvm.options

D.

logback.xml

37 $What is the primary function of the compaction process in Cassandra administration?$

Cassandra Administration - compaction Medium

A.

To automatically distribute data partitions evenly across newly added nodes.

B.

To force Memtables to flush their in-memory contents to disk immediately.

C.

To merge multiple SSTables into a single new SSTable, purging tombstones and keeping only the most recent data.

D.

To compress the network payload during node-to-node Gossip communication.
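
The merge-and-purge behaviour that question 37 asks about can be sketched in Python. This is a deliberately simplified model under stated assumptions: each SSTable is a dictionary of key to (timestamp, value), a value of `None` stands in for a tombstone, and a tombstone is purged once it is older than `gc_grace_seconds`. The function name `compact` is hypothetical, and real compaction also checks whether other SSTables still overlap the deleted data.

```python
from typing import Dict, List, Optional, Tuple

Entry = Tuple[int, Optional[str]]  # (timestamp in seconds, value); None marks a tombstone


def compact(sstables: List[Dict[str, Entry]], now: int,
            gc_grace_seconds: int) -> Dict[str, Entry]:
    """Merge SSTables, keep the newest entry per key, drop expired tombstones."""
    newest: Dict[str, Entry] = {}
    for table in sstables:
        for key, entry in table.items():
            if key not in newest or entry[0] > newest[key][0]:
                newest[key] = entry
    # Live data is always kept; a tombstone survives only inside the grace period.
    return {
        key: (ts, value)
        for key, (ts, value) in newest.items()
        if value is not None or now - ts < gc_grace_seconds
    }


older = {"a": (10, "x"), "b": (10, "y")}
newer = {"a": (20, "z"), "b": (30, None)}  # "b" was deleted at t=30

print(compact([older, newer], now=1000, gc_grace_seconds=100))
print(compact([older, newer], now=1000, gc_grace_seconds=10000))
```

In the first call the tombstone for "b" is past the grace period and is purged along with the data it shadowed; in the second it is retained so that replicas that missed the delete can still learn about it.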

38 $Which compaction strategy is optimized for time-series data workloads where older data is rarely queried or updated, allowing entire SSTables to be dropped when their TTL expires?$

Cassandra Administration - compaction Medium

A.

NetworkTopologyCompactionStrategy (NTCS)

B.

LeveledCompactionStrategy (LCS)

C.

TimeWindowCompactionStrategy (TWCS)

D.

SizeTieredCompactionStrategy (STCS)

39 $Assume a table employees is defined with PRIMARY KEY (department, employee_id). Which of the following SELECT queries will execute efficiently without requiring ALLOW FILTERING?$

CQL (Cassandra Query Language) – select Medium

A.

SELECT * FROM employees WHERE department > 'IT';

B.

SELECT * FROM employees WHERE employee_id = 1005;

C.

SELECT * FROM employees WHERE department = 'Sales' ORDER BY department DESC;

D.

SELECT * FROM employees WHERE department = 'Sales' AND employee_id = 1005;

40 $You want to design a table to store temperature readings from sensors. The primary key is ((sensor_id), reading_time). By default, Cassandra sorts clustering columns in ascending order. How can you ensure the newest readings are retrieved first without specifying ORDER BY in your queries?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Medium

A.

Create a secondary index on reading_time with a descending flag.

B.

Specify WITH CLUSTERING ORDER BY (reading_time DESC) at the end of the CREATE TABLE statement.

C.

It is impossible; sorting must always be handled by the client application.

D.

Declare reading_time DESC inside the PRIMARY KEY definition.

41 $In a 10-node Cassandra cluster using virtual nodes (vnodes) with num_tokens: 256, a single node catastrophically fails and its data must be rebuilt. Which of the following best describes the network traffic pattern during the streaming recovery process?$

Cassandra Architecture – peer-to-peer architecture Hard

A.

The replacement node will stream data simultaneously from all other 9 nodes in the cluster, distributing the load.

B.

Streaming is strictly constrained to the nodes residing in the same rack as the replacement node to avoid cross-rack traffic.

C.

A single replica node containing the primary token range will stream all necessary data to the replacement node.

D.

The cluster elects a temporary master node to coordinate the streaming of exactly 256 token ranges to the new node.

42 $Cassandra utilizes the Phi Accrual Failure Detector as part of its Gossip protocol. How does this algorithm differ from a traditional heartbeat mechanism in determining node failures?$

Gossip protocol Hard

A.

It outputs a continuous suspicion value (phi) representing how likely it is that a node has failed, adapting dynamically to network latency and jitter.

B.

It mandates that a node is marked offline only if at least a QUORUM of nodes agree on the failure via Paxos.

C.

It uses a fixed timeout threshold, but leverages UDP instead of TCP for faster heartbeat detection.

D.

It increments a discrete 'generation number' every time a node misses a heartbeat, marking it dead when the number reaches a configured limit.

43 $A cluster uses NetworkTopologyStrategy with two datacenters: DC1 with a Replication Factor (RF) of 3, and DC2 with an RF of 2. An application issues a write with Consistency Level EACH_QUORUM. If one node in DC1 goes offline, what happens to this write request?$

Replication and consistency levels Hard

A.

It succeeds, because the LOCAL_QUORUM in DC1 is 2, and the 2 remaining nodes can acknowledge, while DC2's LOCAL_QUORUM is 2.

B.

It succeeds only if Hinted Handoff is enabled to temporarily store the missed write for the offline node.

C.

It fails, because DC1 cannot reach its LOCAL_QUORUM of 3 nodes.

D.

It fails, because EACH_QUORUM requires all nodes in all datacenters to acknowledge the write.
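
The per-datacenter check implied by EACH_QUORUM in question 43 can be worked through with a short Python sketch. It assumes that a quorum in each datacenter is floor(RF / 2) + 1 of that datacenter's replicas; the function name `each_quorum_met` and the dictionaries are illustrative only.

```python
from typing import Dict


def each_quorum_met(rf_by_dc: Dict[str, int], live_by_dc: Dict[str, int]) -> bool:
    """True when every datacenter has at least a local quorum of live replicas."""
    return all(
        live_by_dc.get(dc, 0) >= rf // 2 + 1
        for dc, rf in rf_by_dc.items()
    )


rf = {"DC1": 3, "DC2": 2}

# One DC1 replica offline: DC1 still has 2 of 3, DC2 has 2 of 2.
print(each_quorum_met(rf, {"DC1": 2, "DC2": 2}))

# One DC2 replica offline: DC2 has only 1 of 2, below its quorum of 2.
print(each_quorum_met(rf, {"DC1": 3, "DC2": 1}))
```

The second call shows why an RF of 2 is fragile under EACH_QUORUM: that datacenter cannot tolerate the loss of a single replica.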

44 $During a write operation, power is lost to a Cassandra node immediately after an insert is written to the CommitLog but before the Memtable is flushed to an SSTable. What happens to this data when the node is restarted?$

Read and write paths Hard

A.

Cassandra automatically requests the missing data from the coordinator node using Hinted Handoff.

B.

The data is recovered by reading the Bloom filter and generating a synthetic Memtable entry.

C.

The data is permanently lost and must be recovered via an anti-entropy repair from replica nodes.

D.

During startup, Cassandra replays the CommitLog to reconstruct the Memtable, ensuring no data loss.

45 $Consider the read path in Cassandra. If an application queries a specific partition key, and the Bloom filter for a given SSTable returns a 'positive' result, but the data is actually absent from that SSTable (a false positive), what sequence of events occurs next?$

Read and write paths Hard

A.

Cassandra updates the Bloom filter to correct the false positive and immediately initiates a read repair.

B.

Cassandra bypasses the Key Cache, reads the Partition Summary, locates the data in the Partition Index, reads the data block, and throws a TombstoneOverwhelmingException.

C.

Cassandra checks the Key Cache (if enabled), reads the Partition Summary, scans the Partition Index, seeks to the data block on disk, discovers the data is missing, and returns no data from this SSTable.

D.

Cassandra throws a ReadTimeoutException because the false positive causes an infinite loop in the Compression Offset map.

46 $Given the table schema: CREATE TABLE sensor_data (sensor_id int, year int, month int, ts timestamp, value double, PRIMARY KEY ((sensor_id, year, month), ts)). Which of the following queries will FAIL to execute without ALLOW FILTERING?$

Cassandra Data Model – primary key, partition key, clustering columns Hard

A.

SELECT * FROM sensor_data WHERE sensor_id = 123 AND year = 2023 AND month = 10 ORDER BY ts DESC;

B.

SELECT * FROM sensor_data WHERE sensor_id = 123 AND year = 2023 AND month = 10;

C.

SELECT * FROM sensor_data WHERE sensor_id = 123 AND year = 2023;

D.

SELECT * FROM sensor_data WHERE sensor_id = 123 AND year = 2023 AND month = 10 AND ts > '2023-10-01';

47 $A developer heavily deletes individual clustering rows within a very large partition (a 'wide row'). Sometime later, read queries on this partition begin failing with TombstoneOverwhelmingException. What is the architectural reason for this failure?$

Cassandra Data Model – wide rows Hard

A.

Tombstones act as secondary indexes, and exceeding the threshold causes index corruption in wide rows.

B.

To satisfy the read, Cassandra must scan and hold in memory every tombstone it encounters in order to filter out deleted data; once the count exceeds tombstone_failure_threshold, the query is aborted to protect the node from memory exhaustion.

C.

Cassandra keeps all tombstones in the Memtable indefinitely, causing an OutOfMemoryError.

D.

The Bloom filter size exceeds the JVM heap limit because it must index every deleted cell.

48 $A table contains a column defined as frozen<map<text, text>>. How does the frozen keyword modify the behavior of this collection compared to a standard (non-frozen) collection?$

CQL (Cassandra Query Language) – data types Hard

A.

It allows the map to be updated concurrently by multiple clients using lightweight transactions (LWT).

B.

It serializes the entire collection into a single, immutable binary value, meaning elements cannot be added or updated individually; the whole map must be overwritten.

C.

It prevents the collection from generating tombstones when the row is deleted, bypassing garbage collection pauses.

D.

It forces the collection to be stored in the Partition Cache rather than on disk, improving read performance for small maps.

49 $When creating a keyspace, why is NetworkTopologyStrategy generally preferred over SimpleStrategy even for a single-datacenter deployment at the beginning of a project?$

CQL (Cassandra Query Language) – creating keyspaces and tables Hard

A.

NetworkTopologyStrategy automatically provisions vnodes, whereas SimpleStrategy restricts the cluster to single-token architecture.

B.

SimpleStrategy uses a fixed Gossip interval which causes network flooding, whereas NetworkTopologyStrategy dynamically adjusts Gossip frequency.

C.

SimpleStrategy distributes replicas on the same physical rack, risking simultaneous failure, while NetworkTopologyStrategy inherently forces rack-awareness.

D.

SimpleStrategy places replicas contiguously on the token ring ignoring topology. If a second datacenter is ever added, migrating from SimpleStrategy to NetworkTopologyStrategy requires extensive downtime and manual intervention.

50 $A developer executes the following CQL statement: UPDATE users SET email = 'new@test.com' WHERE user_id = 999 IF email = 'old@test.com'; Which of the following describes the internal protocol used to execute this statement?$

CQL (Cassandra Query Language) – insert, update, delete Hard

A.

Hinted Handoff, utilizing a background queue to ensure the update is applied eventually.

B.

Anti-entropy repair, triggering an immediate Merkle tree comparison to ensure the previous email value is consistent.

C.

Paxos consensus protocol, requiring four round-trips (Prepare/Promise, Read/Results, Propose/Accept, Commit/Ack) to ensure linearizability.

D.

Two-Phase Commit (2PC), locking the row across all replicas before applying the update.

51 $Which of the following scenarios is the most inappropriate use case for a native Cassandra Secondary Index, leading to severe performance degradation (a 'scatter-gather' problem)?$

CQL (Cassandra Query Language) – select, filtering, indexing Hard

A.

Indexing a clustering column to filter within a specific partition key.

B.

Indexing a frozen map to query exact matches of the entire map payload.

C.

Indexing a column with very low cardinality (e.g., a boolean is_active flag) across a small cluster.

D.

Indexing a highly unique column (e.g., user_email) in a cluster with hundreds of nodes without providing the partition key in the query.

52 $A team configures TimeWindowCompactionStrategy (TWCS) for a time-series table holding application logs. They occasionally receive logs that are timestamped 30 days in the past (late-arriving data). How will this late-arriving data impact the compaction strategy?$

Cassandra Administration - compaction Hard

A.

The data will be rejected by Cassandra because TWCS enforces a strict monotonic write pattern.

B.

TWCS will automatically update the timestamps of the late-arriving logs to the current time to maintain strict chronological SSTables.

C.

It will force the creation of new SSTables in older time windows, which may never be compacted with the original SSTables for that period, reducing read performance and breaking TTL whole-file drop efficiency.

D.

TWCS will dynamically expand the current time window by 30 days to encompass the late-arriving data, triggering a massive major compaction.

53 $When tuning the JVM for an Apache Cassandra 4.x node with 128GB of RAM, what is the primary architectural rationale for using G1GC (Garbage-First Garbage Collector) rather than CMS (Concurrent Mark Sweep)?$

Installation of Apache Cassandra Hard

A.

G1GC dynamically resizes the Memtable to match heap usage, whereas CMS uses static off-heap allocation.

B.

G1GC intercepts Gossip protocol messages directly in kernel space, reducing CPU overhead during garbage collection.

C.

CMS inherently limits the heap size to 8GB, whereas G1GC supports heaps up to 1TB.

D.

G1GC is designed to handle larger heap sizes (e.g., 31GB) by compacting memory in regions, virtually eliminating the long Stop-The-World (STW) fragmentation pauses that plague CMS on large heaps.

54 $A cluster uses a Replication Factor of 3. Node A goes down for 4 hours. The max_hint_window_in_ms is set to 10800000 (3 hours). When Node A comes back online, what is the state of the writes that occurred during its downtime, and what action must be taken?$

Read and write paths Hard

A.

Node A has missed 4 hours of writes. The coordinator automatically replays all hints from the 4-hour window.

B.

Node A receives the first 3 hours of hints via Hinted Handoff, but misses the 4th hour. An anti-entropy repair (e.g., nodetool repair) must be run to synchronize the missing data.

C.

Node A uses the Gossip protocol to pull the missing 4 hours of CommitLogs directly from Node B and Node C.

D.

Node A receives no hints because Hinted Handoff is disabled the moment a node exceeds the max window. The data is permanently lost.
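
The hint-window arithmetic in question 54 can be worked through with a minimal Python sketch. It models only the time accounting, under the simplifying assumption that coordinators store hints for the first `max_hint_window_in_ms` of a node's downtime and none afterwards; the function name `hinted_and_missed_ms` is hypothetical.

```python
from typing import Tuple

HOUR_MS = 60 * 60 * 1000


def hinted_and_missed_ms(downtime_ms: int, max_hint_window_ms: int) -> Tuple[int, int]:
    """Split a downtime into the part covered by hints and the part that is not."""
    hinted = min(downtime_ms, max_hint_window_ms)
    return hinted, downtime_ms - hinted


hinted, missed = hinted_and_missed_ms(4 * HOUR_MS, 3 * HOUR_MS)
print(f"covered by hints: {hinted // HOUR_MS} h, needs repair: {missed // HOUR_MS} h")
```

Any non-zero second value means hints alone cannot bring the node back in sync, which is the case an anti-entropy repair addresses.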

55 $A team models IoT data with PRIMARY KEY (device_id, timestamp). Over several years, certain devices produce millions of readings, resulting in massive unbounded partitions. Which data modeling technique best resolves this 'wide row' anti-pattern while maintaining efficient reads?$

Cassandra Data Model – keyspace, table, primary key, partition key, clustering columns, wide rows Hard

A.

Using ALLOW FILTERING on all read queries so the coordinator can manage the memory payload dynamically.

B.

Implementing 'bucketing' by introducing a time-based artificial column (e.g., month_year) into the partition key, making it PRIMARY KEY ((device_id, month_year), timestamp).

C.

Adding a high-cardinality secondary index on the timestamp column.

D.

Changing the primary key to PRIMARY KEY (timestamp, device_id) to distribute data evenly across all days.
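
The time-bucketing idea in question 55 can be sketched on the application side in Python. This is an illustration under stated assumptions: the bucket is a "YYYY-MM" string derived from the reading time in UTC, and the helper names `month_bucket` and `partition_key` are hypothetical. The application would compute the bucket before every write and read so that each (device_id, month) pair maps to its own bounded partition.

```python
from datetime import datetime, timezone
from typing import Tuple


def month_bucket(reading_time: datetime) -> str:
    """Return the 'YYYY-MM' bucket for a timezone-aware reading time, in UTC."""
    return reading_time.astimezone(timezone.utc).strftime("%Y-%m")


def partition_key(device_id: str, reading_time: datetime) -> Tuple[str, str]:
    """Composite partition key: the device plus its monthly time bucket."""
    return (device_id, month_bucket(reading_time))


october = datetime(2023, 10, 5, 12, 0, tzinfo=timezone.utc)
november = datetime(2023, 11, 1, 0, 0, tzinfo=timezone.utc)

print(partition_key("dev-1", october))
print(partition_key("dev-1", november))
```

A query for a time range that spans several months then has to read one partition per bucket, which is the trade-off for keeping each partition bounded.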

56 $For a workload characterized by overwhelming write volume (e.g., heavy inserts, rare updates/deletes) and relatively few reads, which compaction strategy minimizes write amplification and CPU overhead?$

Cassandra Administration - compaction Hard

A.

SizeTieredCompactionStrategy (STCS)

B.

LeveledCompactionStrategy (LCS)

C.

DateTieredCompactionStrategy (DTCS)

D.

TimeWindowCompactionStrategy (TWCS) with an infinite window size

57 $Which of the following describes the difference in how Cassandra handles COUNTER columns compared to standard integer columns during an update?$

CQL (Cassandra Query Language) – data types Hard

A.

COUNTER values are synchronized across datacenters using Paxos, making them highly consistent.

B.

COUNTER updates require a read-before-write to fetch the current value, meaning they incur a higher latency penalty than standard idempotent writes.

C.

COUNTER columns cannot be used in tables alongside non-counter columns (other than primary keys).

D.

COUNTER columns bypass the Memtable and write directly to the SSTable to prevent concurrent modification exceptions.

58 $In a Cassandra cluster utilizing a token ring topology, what specific mechanism ensures that a hot spot does not form if one physical node has significantly more disk space and CPU capacity than the others?$

Cassandra Architecture – peer-to-peer architecture Hard

A.

Assigning a higher number of vnodes (tokens) to the more powerful node in the cassandra.yaml configuration.

B.

Configuring a dedicated load-balancer proxy in front of the Gossip protocol.

C.

Using LeveledCompactionStrategy specifically on the stronger node to dynamically allocate more data.

D.

Applying the WeightingSnitch to route more reads to the node with lower CPU utilization.

59 $If a developer executes an UPDATE statement in CQL on a row where the primary key does not currently exist in the database, what is the result?$

CQL (Cassandra Query Language) – insert, update, delete Hard

A.

Cassandra generates a tombstone for the non-existent row and increments the mutation clock.

B.

Cassandra waits for the row to be created via Hinted Handoff, blocking the query.

C.

Cassandra throws an InvalidQueryException indicating the row was not found.

D.

Cassandra applies the update, effectively performing an 'upsert', resulting in a new row being created.

60 $An application requires strong consistency (every read must reflect the most recent acknowledged write) for critical read and write operations across a multi-DC cluster. Which combination of Consistency Levels should be used to achieve this without relying on Paxos (Lightweight Transactions)?$

Replication and consistency levels Hard

A.

Write at EACH_QUORUM, Read at LOCAL_QUORUM

B.

Write at QUORUM, Read at LOCAL_QUORUM

C.

Write at LOCAL_QUORUM, Read at LOCAL_QUORUM

D.

Write at QUORUM, Read at QUORUM

Unit 6 - Practice Quiz