nonobie — Real dev skills, explained like a friend would

What you'll learn

By the end of this chapter you will:

Explain the five reasons not to store files in PostgreSQL
Design a bucket layout with one bucket per environment and prefix-based access policies
Enable SSE-KMS encryption and force TLS-only access via bucket policy
Use sharp to generate image variants via an job
Configure lifecycle rules for temp, exports, and long-term receipt storage

The why

It is technically possible to store a file as a BYTEA column in PostgreSQL. Don't. There's an entire category of infrastructure — object storage — designed for one job: store bytes, retrieve bytes, charge by GB. Use the right tool.

Why files don't belong in your database

Problem	What goes wrong
Backup bloat	A 50 GB DB suddenly becomes 500 GB; nightly backups fall over.
Slow queries	`SELECT *` accidentally returns megabytes per row.
Memory pressure	PostgreSQL's TOAST mechanism pages large blobs in/out — wasted I/O.
Caching impossible	You cannot put a in front of a database.
Scaling blocked	Read replicas lag because they replicate huge blobs.

The DB stores only the metadata pointer. The object storage stores the bytes.

plaintext

PostgreSQL row:
  id:          a3f1...
  owner_id:    restaurant-123
  s3_key:      menu/restaurant-123/a3f1.jpg
  size_bytes:  482311
  sha256:      a92c...
  uploaded_at: 2026-05-02T10:23:01Z
 
S3 object:
  Bucket: quickbite-assets-prod
  Key:    menu/restaurant-123/a3f1.jpg
  Body:   <binary bytes>

Why files don't belong on the app server disk either

App servers are ephemeral — every deploy spins up new containers, the old disk is gone. App servers are horizontally scaled — Box A wrote the file, the next request lands on Box B which can't see it. Local disk has no replication, no encryption-at-, no lifecycle rules.

Tip

12-factor (Chapter 16): treat your processes as stateless. The disk under your app is scratch space, not a file system.

Bucket layout — get this right on day one

Buckets are a flat namespace. Renaming a bucket means copying every object and updating every reference. Decide the layout once.

One bucket per environment

plaintext

quickbite-assets-dev
quickbite-assets-staging
quickbite-assets-prod

Never share a bucket between dev and prod — a bug in dev that lists all objects will find production data.

Prefixes per logical area

plaintext

quickbite-assets-prod/
├── menu/{restaurant_id}/{uuid}.{ext}       ← menu item photos
├── profiles/{user_id}/{uuid}.{ext}         ← profile pictures
├── receipts/{year}/{month}/{order_id}.pdf  ← order receipts
├── exports/{user_id}/{job_id}.csv          ← data exports
└── temp/{user_id}/{uuid}                   ← lifecycle rule deletes after 24h

S3 has no real folders — / in the key is convention. But that convention enables prefix-based IAM policies, lifecycle rules, and listing.

Object key naming rules

Danger

❌ burger.jpg — collision-prone, leaks the content type in the key
❌ Mario Pizza Margherita 2026.jpg — spaces, PII, original filename
✅ menu/rst-123/a3f12c8e-uuid.jpg — owner-scoped, opaque, UUID-based

S3 keys show up in CloudTrail logs and access logs. Never put PII (names, emails) in them.

Encryption — turn it on, always

At rest

Enable bucket-level default encryption:

Mode	When to use
SSE-S3 (AES-256, AWS-managed key)	Default. Free. Good enough for most data.
SSE-KMS (AWS KMS key)	Compliance-driven (PCI, HIPAA, SOC2). Gives you key rotation and per-key audit.
SSE-C (customer-supplied key)	You hold the key, AWS forgets it. Almost never the right answer.

For a food delivery app with PCI-adjacent data (card tokens, order receipts): SSE-KMS with a dedicated CMK per environment.

In transit

Force TLS with a bucket policy:

json

{
  "Sid": "DenyInsecureTransport",
  "Effect": "Deny",
  "Principal": "*",
  "Action": "s3:*",
  "Resource": ["arn:aws:s3:::quickbite-assets-prod/*"],
  "Condition

Access control — private by default

Every bucket should be created with Block Public Access turned ON at the account level.

Watch out

Most public S3 leaks did NOT happen because someone "hacked AWS". They happened because someone clicked "make public" to debug something and forgot. All four Block Public Access toggles: ON.

Uploading from the backend

import { S3Client, PutObjectCommand } from '@aws-sdk/client-s3';
 
@Injectable()
export class S3Service {
  private readonly s3 = new S3Client({ region: this.config.get('AWS_REGION')

Notice: bucket name is in config (never hard-coded), ContentType is set explicitly, encryption is set per-object as defence-in-depth.

CDN in front for public-ish assets

For product images, marketing PDFs, public branding — put CloudFront (or Cloudflare) in front of S3:

plaintext

Browser → CloudFront edge cache (cached 24h) → S3 origin (only on miss)

Benefits: latency drops from ~200 ms to ~20 ms, S3 GET costs disappear on hits, you can revoke URLs without touching S3.

For private user content (order receipts, profile photos), use CloudFront + signed URLs (Chapter 20).

Image variants — never serve the original

Restaurant owners upload high-res food photos. Don’t serve raw originals to users — generate variants:

plaintext

menu/rst-123/{uuid}.jpg               ← 4 MB, original
menu/rst-123/{uuid}_thumb_200.jpg     ← 8 KB, menu list view
menu/rst-123/{uuid}_thumb_800.jpg     ← 60 KB, item detail view

On upload, fire an async job (Chapter 15) that uses sharp to resize and writes variants back to S3:

// resize.worker.ts
import sharp from 'sharp';
 
await Promise.all([
  sharp(buffer).resize(200, 200, { fit: 'cover' }).jpeg({ quality: 80

Lifecycle policies — let S3 do the housekeeping

plaintext

quickbite-assets-prod/temp/*          → DELETE after 1 day
quickbite-assets-prod/exports/*       → DELETE after 30 days
quickbite-assets-prod/receipts/*      → TRANSITION to Glacier after 90 days
                                      → DELETE after 7 years (compliance)
quickbite-assets-prod/menu/*          → no lifecycle rule (keep forever)

These rules run for free, every night, in the background. Storage costs grow forever unless you configure them.

Versioning — your safety net against "oops"

Enable bucket versioning. Each PUT to the same key keeps the old version too. If a bug overwrites an object, you can restore it.

Watch out

Versioning has a cost — every old version still occupies storage. Combine with a lifecycle rule that deletes non-current versions after 30 days.

Audit logging — who touched what

Turn on S3 Server Access Logging or CloudTrail Data Events for the bucket. Both record every GetObject, PutObject, DeleteObject with the IP, IAM principal, and timestamp.

Without these logs, after a security incident you cannot answer "did the attacker download every customer's passport, or just one?" That distinction changes whether you're notifying 1 person or 100,000.

New-bucket checklist

Block Public Access: ON (all four toggles)
Default encryption: SSE-KMS with a dedicated CMK
Bucket policy: deny non-TLS access
Versioning: ON, with a lifecycle rule for old versions
Access logging or CloudTrail data events: ON
Lifecycle rules for temp/* and exports/*
IAM role for the app: scoped to specific prefixes only
Bucket name in config, never hard-coded

One thing to remember

The DB stores metadata (owner, size, s3_key, upload time). S3 stores bytes. Never put PII in the S3 key — UUIDs only. Set Block Public Access on day one, because it's much easier than explaining to customers why their passport was publicly accessible.

Chapter 19 — File & Image Storage (S3)

The why

Why files don't belong in your database

Why files don't belong on the app server disk either

Bucket layout — get this right on day one

One bucket per environment

Prefixes per logical area

Object key naming rules

Encryption — turn it on, always

At rest

In transit

Access control — private by default

Uploading from the backend

CDN in front for public-ish assets

Image variants — never serve the original

Lifecycle policies — let S3 do the housekeeping

Versioning — your safety net against "oops"

Audit logging — who touched what

New-bucket checklist

One thing to remember