Modern Scalable Databases

Description

Test your knowledge of modern scalable databases, including sharding, NoSQL systems like MongoDB and Cassandra, and the trade-offs between normalized and denormalized data.
Eladio Rocha
Quiz by Eladio Rocha, updated about 1 month ago
Eladio Rocha
Created by Eladio Rocha about 1 month ago
1
0

Resource summary

Question 1

Question
What is the primary purpose of a router in a scalable database design?
Answer
  • To perform joins between shards.
  • To route client requests to the appropriate shard.
  • To store backup configurations for shards.
  • To hash keys for partitioning data.

Question 2

Question
What is a "shard" in the context of databases?
Answer
  • A method of encrypting sensitive data.
  • A backup server in a replica set.
  • A horizontal partition of a database.
  • A tool for monitoring database traffic.

Question 3

Question
What is a major advantage of sharding in database systems?
Answer
  • It eliminates the need for backups.
  • It increases both scalability and redundancy.
  • It simplifies complex SQL queries.
  • It reduces the need for a router.

Question 4

Question
What is a replica set in MongoDB?
Answer
  • A set of configuration servers.
  • A collection of shards distributed across regions.
  • A primary server and its associated secondary servers for redundancy.
  • A group of shards that handle write-heavy traffic.

Question 5

Question
How does MongoDB ensure high availability when a primary server fails?
Answer
  • Traffic is rerouted to the config server.
  • Secondary servers automatically elect a new primary.
  • The router assigns new shards dynamically.
  • The primary server is rebooted automatically.

Question 6

Question
What does "eventual consistency" mean in the context of Cassandra?
Answer
  • Data is always immediately consistent across all nodes.
  • Data might take time to propagate across nodes, leading to temporary inconsistencies.
  • Data must be manually synchronized across nodes.
  • Consistency is not guaranteed due to system failures.

Question 7

Question
What is the "celebrity problem" in sharded databases?
Answer
  • High traffic on specific shards causes uneven load distribution.
  • Celebrities’ data is stored differently for security reasons.
  • All shards are overloaded with high read and write requests.
  • Certain shards fail to replicate data efficiently.

Question 8

Question
What is re-sharding, and why is it challenging?
Answer
  • Redistributing data when adding or removing shards, which requires fault-tolerant methods.
  • Creating backups for new shards added to the system.
  • Replicating data across all shards during traffic spikes.
  • Distributing write-heavy traffic across existing shards.

Question 9

Question
What is a key advantage of using a normalized database schema?
Answer
  • It minimizes storage space and simplifies updates.
  • It allows faster reads with fewer database hits.
  • It duplicates data for high performance.
  • It eliminates the need for joins between tables.

Question 10

Question
Why might denormalized data be preferred in a NoSQL database?
Answer
  • It reduces redundancy and improves update efficiency..
  • It allows for single-query retrieval of all necessary data.
  • It enforces strict schema rules.
  • It is easier to scale horizontally.

Question 11

Question
What should you prioritize when designing a horizontally scalable database?
Answer
  • Using as many joins as possible for efficiency.
  • Designing around simple key-value lookups to avoid joins.
  • Avoiding hash functions for data distribution.
  • Relying heavily on a single primary server for consistency.

Question 12

Question
Which database systems are examples of NoSQL?
Answer
  • PostgreSQL and MySQL.
  • MongoDB, Cassandra, and DynamoDB.
  • Oracle and SQLite.
  • Redis and HBase and SQLite

Question 13

Question
In what scenarios might you choose to use normalized data over denormalized data in a NoSQL database?
Answer
  • When updates are frequent and space efficiency is important.
  • When scaling horizontally is a top priority.
  • When high traffic requires fewer database hits.
  • When you need to perform complex joins efficiently.
Show full summary Hide full summary

Similar

CAP Theorem and Database Trade-offs
Eladio Rocha
MongoDB, ¿son las bases de datos no relacionales el futuro?
lucho ibarguen
Unit 1. Databases and electronic libraries for English Studies
mgr UNED
UNIT 1. Databases & Electronic libraries for English studies
Roberto Rojas
Seguridad en bases de datos
KATERINE MARCELES
liderazgo
ROuz De La ROza
conflicto y negacion
ROuz De La ROza
Databases and electronic libraries
Begoña Tarrio
ACID Compliance and CAP Theorem
Eladio Rocha
Introducción a MongoDB
Jose Julian Ariza V.