Tommy, I’m seconding Matt’s request. There was not a proper post mortem the last time this happened (link) and now we’ve had a repeat at the same point of failure with still no more visibility into why this happened and what is being done to prevent future failures.
That status page is only as useful as its accuracy to the actual outages and neither of these incidences were reflected there. What’s your process for updating that page and can we expect this incident to be reflected in the log? The issue we run into is when there are issues that are not reflected in that page when we check it and then it requires a post here while we wait and manage expectations with our users.