07 — DynamoDB

What is DynamoDB?

Fully managed NoSQL database — single-digit millisecond performance at any scale. Serverless, no provisioning.

Core Concepts

Concept	Description
Table	Collection of items
Item	A row (JSON document, max 400 KB)
Attribute	A field in an item
Partition Key (PK)	Primary key — determines data distribution
Sort Key (SK)	Optional — enables range queries within a partition
GSI	Global Secondary Index — query on non-key attributes
LSI	Local Secondary Index — alternate sort key (same partition key)

Key Design

Table: Users
  PK: userId

Table: Orders
  PK: userId    SK: orderId
  → Get all orders for a user (Query on PK)
  → Get specific order (PK + SK)

Table: SingleTable
  PK: "USER#123"         SK: "PROFILE"          → User profile
  PK: "USER#123"         SK: "ORDER#2024-001"   → User's order
  PK: "PRODUCT#abc"      SK: "METADATA"         → Product info

Operations

import &#123; DynamoDBClient &#125; from '@aws-sdk/client-dynamodb';
import &#123; DynamoDBDocumentClient, PutCommand, GetCommand, QueryCommand, DeleteCommand &#125; from '@aws-sdk/lib-dynamodb';

const client = DynamoDBDocumentClient.from(new DynamoDBClient(&#123;&#125;));

// Put item
await client.send(new PutCommand(&#123;
  TableName: 'Orders',
  Item: &#123;
    userId: 'user-123',
    orderId: 'order-001',
    total: 49.99,
    status: 'pending',
    createdAt: new Date().toISOString(),
  &#125;,
&#125;));

// Get item (by exact PK + SK)
const &#123; Item &#125; = await client.send(new GetCommand(&#123;
  TableName: 'Orders',
  Key: &#123; userId: 'user-123', orderId: 'order-001' &#125;,
&#125;));

// Query (all orders for a user)
const &#123; Items &#125; = await client.send(new QueryCommand(&#123;
  TableName: 'Orders',
  KeyConditionExpression: 'userId = :uid AND begins_with(orderId, :prefix)',
  ExpressionAttributeValues: &#123;
    ':uid': 'user-123',
    ':prefix': 'order-',
  &#125;,
&#125;));

// Delete item
await client.send(new DeleteCommand(&#123;
  TableName: 'Orders',
  Key: &#123; userId: 'user-123', orderId: 'order-001' &#125;,
&#125;));

Capacity Modes

Mode	Description	Best For
On-Demand	Pay per request, auto-scales	Unpredictable workloads
Provisioned	Set RCU/WCU, use auto-scaling	Predictable workloads (cheaper)

RCU (Read Capacity Unit):
  1 RCU = 1 strongly consistent read/sec (up to 4 KB)
  1 RCU = 2 eventually consistent reads/sec

WCU (Write Capacity Unit):
  1 WCU = 1 write/sec (up to 1 KB)

Global Secondary Index (GSI)

Query on attributes other than the primary key.

Table: Orders (PK: userId, SK: orderId)

GSI: StatusIndex (PK: status, SK: createdAt)
  → Query all "pending" orders sorted by date
  → Query all "shipped" orders from last week

const &#123; Items &#125; = await client.send(new QueryCommand(&#123;
  TableName: 'Orders',
  IndexName: 'StatusIndex',
  KeyConditionExpression: '#s = :status',
  ExpressionAttributeNames: &#123; '#s': 'status' &#125;,
  ExpressionAttributeValues: &#123; ':status': 'pending' &#125;,
&#125;));

Single-Table Design

Store multiple entity types in one table — reduces joins and costs.

PK              SK                  Data
────────────    ──────────────      ─────────────────
USER#123        PROFILE             &#123; name, email &#125;
USER#123        ORDER#001           &#123; total, status &#125;
USER#123        ORDER#002           &#123; total, status &#125;
PRODUCT#abc     METADATA            &#123; name, price &#125;
PRODUCT#abc     REVIEW#r1           &#123; rating, text &#125;

GSI1PK          GSI1SK
────────────    ──────────────
ORDER#001       USER#123            (lookup order → user)
pending         2024-01-15          (query by status)

DynamoDB Streams

Capture item-level changes (insert, update, delete) → trigger Lambda.

Table change → DynamoDB Stream → Lambda
                                  → Update search index
                                  → Send notification
                                  → Replicate to another table

DAX (DynamoDB Accelerator)

In-memory cache in front of DynamoDB — microsecond reads.

App → DAX (cache) → DynamoDB
      Hit: ~microseconds
      Miss: ~milliseconds (reads from DynamoDB, caches result)

Key Takeaways

DynamoDB = serverless NoSQL — millisecond performance at any scale
Design keys for access patterns — PK for partitioning, SK for range queries
GSI for querying on non-key attributes (most tables need at least one)
Single-table design reduces cost and complexity for related entities
On-Demand for variable traffic; Provisioned for predictable (cheaper)
Streams + Lambda for event-driven reactions to data changes
Use DAX for microsecond read-heavy caching

06 — RDS & Databases 08 — Lambda & Serverless