nonobie — Real dev skills, explained like a friend would

What you'll learn

By the end of this chapter you will:

Explain the three pillars of observability and what each one answers
Write structured JSON logs with the correct log level for each situation
Implement correlation IDs so every log line for a request can be found instantly
Know what metrics to ship from day 1 and why
Never use console.log in production code

💡The on-call war story

Without observability: You check logs that say Error: something went wrong and a trace pointing to Sequelize internals. You have no idea which request failed, who triggered it, or what the database saw.

With observability: You check the dashboard, see p99 latency spiked at 2:47am, find the trace for the slowest request, see it spent 28 seconds waiting for a lock on the orders table, grep the logs by trace ID, find the exact query. Fixed in 12 minutes.

Observability is not a nice-to-have. It's the difference between 12 minutes and 8 hours.

Pillar	Question answered	Tool in this project
Logs	What happened to this specific request?	Winston
Metrics	How is the system behaving overall?	Jaeger / Prometheus
Traces	Where did this slow request spend its time?	Jaeger / OpenTracing

// ❌ You can't query this, can't filter it, can't aggregate it
console.log('Order created: ' + orderId + ' for user ' + userId);
console.error(e);

// ✅ Every log is a queryable JSON object
this.logger.log('order.created', {
  order_id: orderId,
  user_id: userId,
  restaurant_id: restaurantId,
  trace_id: cid,
});
 
this.

Level	When to use	Who gets paged?
`error`	Something broke — a request failed unexpectedly	Yes
`warn`	Something suspicious — high retry count, slow query	No
`info`	Normal events you want an audit trail of	No
`debug`	Detailed diagnostic info — OFF in production	No

plaintext

2026-05-01T10:23:01Z  cid=7f3a-abc  svc=api     msg=POST /orders received
2026-05-01T10:23:01Z  cid=7f3a-abc  svc=api      msg=placing order  restaurant_id=rst-xyz
2026-05-01T10:23:01Z  cid=7f3a-abc  svc=orders   msg=validating items  count=3
2026-05-01T10:23:02Z  cid=7f3a-abc  svc=orders   msg=restaurant closed  opens_at=18:00 → RestaurantClosedException
2026-05-01T10:23:02Z  cid=7f3a-abc  svc=api     msg=returning 422

const cid = req.headers['x-request-id'] ?? randomUUID();
res.setHeader('x-request-id', cid);  // send back to client
asyncLocalStorage.run({ cid }, () =>

const { cid } = asyncLocalStorage.getStore() ?? {};
this.logger.log({ ...event, cid });

axios.post(url, body, {
  headers: { 'x-request-id': cid }
});

plaintext

Request total: 1.2 seconds
  ├── JwtGuard: 2ms
  ├── ValidationPipe: 1ms
  ├── OrdersService.create: 1197ms
  │     ├── restaurantsService.isOpen: 5ms
  │     ├── orderItems.validate: 12ms
  │     ├── paymentsService.charge: 1150ms   ← HERE IS THE PROBLEM
  │     │     └── payment gateway API call: 1100ms  (external timeout)
  │     └── Order.create: 28ms
  └── Response serialization: 1ms

Metric	Why
HTTP request count by route × status	Are error rates rising?
HTTP p99 latency by route	Are routes getting slower?
DB query duration	Is the database the bottleneck?
DB connection pool usage	Are we running out of connections?
Outbound HTTP duration per vendor	Is a vendor degrading?
depth	Are jobs piling up?
Business: orders created/min	Is the business healthy?

// ❌ Banned everywhere in this codebase
console.log('debugging...');
console.error(e);
 
// ✅ Use NestJS Logger
private readonly logger = new Logger(OrdersService.name);

One thing to remember

Structured logs + correlation IDs = you can answer any "what happened?" question in under 2 minutes. Without them, you're debugging blind at 3am. Every log message should have a trace_id.

Chapter 12 — Logging & Observability

The why

The three pillars

Logs — structured, not strings

Bad logging (never do this)

Good logging (structured JSON)

Log levels — use the right one

Never log these

Correlation IDs — trace a request through everything

How correlation IDs work in this project

Traces — find where time is spent

What to log on every endpoint

Metrics to ship from day 1

The `console.log` rule

One thing to remember

Chapter 12 — Logging & Observability

The why

The three pillars

Logs — structured, not strings

Bad logging (never do this)

Good logging (structured JSON)

Log levels — use the right one

Never log these

Correlation IDs — trace a request through everything

How correlation IDs work in this project

Traces — find where time is spent

What to log on every endpoint

Metrics to ship from day 1

The console.log rule

One thing to remember

The `console.log` rule