Quiz: Chapter 13 (24/7 Production SRE)
Questions
What is the first priority in the first minutes of an incident?
Which statement is correct?
- A) Decide mitigation first, collect evidence later.
- B) Collect evidence first, then choose mitigation.
- C) Wait for AI confidence to reach 100%.
Name the minimum evidence set required before a high-risk action.
Why are blameless postmortems important?
Who owns final production decisions when AI is used?
What makes an action item “good” in postmortem output?
If an issue recurs weekly, where should it be tracked besides the incident timeline?
What is the correct handling of an uncertain diagnosis?
- A) Make an immediate risky change.
- B) Reduce blast radius and gather more evidence.
- C) Close the incident as transient.
What should be true before incident closure?
Complete the principle:
- A) AI may auto-fix production if confidence is high.
- B) AI assists; humans remain accountable for actions.
- C) AI replaces on-call.
Answer Key (Short)
- Acknowledge, assign IC, and classify severity.
- B
- Metrics + traces + correlated logs.
- They focus on improving systems and learning rather than assigning blame.
- Human on-call/incident leadership.
- Clear owner, due date, and verification method.
- In a recurring-problem or hardening backlog.
- B
- Recovery verified and follow-up ownership assigned.
- B
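
Three of the answers above (the minimum evidence set, what makes an action item good, and the closure conditions) are checklists, so they can be written as small checks. The following is a minimal Python sketch, not something taken from the chapter: every name in it (REQUIRED_EVIDENCE, missing_evidence, ActionItem, can_close) is hypothetical, and it assumes "follow-up ownership assigned" means every follow-up item has an owner.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

# Hypothetical labels for the minimum evidence set named in the answer key.
REQUIRED_EVIDENCE = {"metrics", "traces", "correlated_logs"}


def missing_evidence(collected: set[str]) -> set[str]:
    """Return the evidence types still missing before a high-risk action."""
    return REQUIRED_EVIDENCE - collected


@dataclass
class ActionItem:
    """A postmortem action item (illustrative structure only)."""
    title: str
    owner: Optional[str] = None
    due: Optional[date] = None
    verification: Optional[str] = None

    def is_good(self) -> bool:
        # "Good" here means: clear owner, due date, and verification method.
        return all([self.owner, self.due, self.verification])


def can_close(recovery_verified: bool, follow_ups: list[ActionItem]) -> bool:
    """Closure requires verified recovery and an owner on every follow-up."""
    return recovery_verified and all(item.owner for item in follow_ups)
```

For example, missing_evidence({"metrics"}) reports that traces and correlated logs are still needed, and an ActionItem with only a title fails is_good(). The checks are deliberately advisory: they report what is missing and leave the decision to the human incident lead.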