Deeper Real Time Office 365 Performance Data from Office365Mon.Com

We’ve just released some new and improved reports that I think many of you will find valuable and interesting at Office365Mon.Com.  A couple of these reports give you performance information about your own Office 365 tenant that’s even deeper and more insightful than we’ve ever had before.  In addition to that, we’ve simplified some of our outage reports, and also added additional historical outage reporting options that let you drill down even further into the stability of the different services in your Office 365 subscription.

For real time performance data on your Office 365 tenant, we started monitoring your tenant health scores and request durations in June, 2018:  https://samlman.wordpress.com/2018/06/14/office365mon-launches-new-features-for-storing-office-365-logs-and-monitoring-health-scores-and-request-times/.  For a quick refresher, the health score is something we get for SharePoint Online and OneDrive for Business that is between 0 and 10 and represents the overall health of your tenant.  When your score is 0 things are as healthy as possible; the more the score increases, the less healthy your tenant is.  Request duration is the amount of time that it takes to process synthetic transactions that we send to your tenant while we monitor it.  As request durations increase, users begin to see it as “Office 365 is slow today” or “our network is slow”.

We’ve provided you a real-time view of the scores and durations since June 2018, and while there is a lot of value in that, the problem is that there wasn’t a good way for you to look at those numbers and understand if what you were seeing was really bad, or if those numbers are actually typical of what you have been getting from your tenant.  The new reports we have today plug that gap and give you some great and easy to understand insight so you know exactly how you’re doing.  To get started, here’s a screenshot of the new request duration report:

DurationOverlay

The first thing that probably catches your eye is the wavy blue and yellow data in the background.  That data is actually historical data for what the request duration has been during the exact time frame that you’re currently looking at.  So if you look at this report at 8AM and then again at 5PM, that wavy data will be different.  The idea is that you can see not only what your current performance is like, but what it has historically been like at this same exact time of day.  Not only that, but as we gather more historical data, then we fine-tune what you see here even further – down to the same exact day of week with the historical data.  So that means, for example, if you look at the current performance at 2PM on a Tuesday, then eventually the wavy data you see in the background will also be for 2PM on a Tuesday.  This is incredibly useful to be able to do as much of an “apples to apples” comparison as possible when you are looking for performance issues in your tenant, as well as to understand if the data you are currently seeing is “typical” for what you normally get in your tenant.  That’s exactly what you see in the chart above – so overall at the point in time we took this report screenshot, our tenant overall was actually performing a little better than past Wednesdays at this same time of day.

As with all of our graphical reports at Office365Mon.Com, you can also drill into this further by just clicking on items in the chart’s legend.  So for example, if you just want to look at what’s going on with SharePoint Online only, here’s what the chart looks like:

SpoDurationOverlay

Again, it’s very interesting in how you see some similar changes in performance to what we’ve previously seen on Wednesdays during this time frame, but overall still looking a little better than usual.

We also do the same thing with health scores, so for comparison, here’s the new health score report with this same historical data underlay:

HealthOverlay

Again, as you can see from this, it’s consistent with the request duration data in that overall the health scores are right about where they normally are at this time of day on a Wednesday.

 

Easier to Understand Outage Information

In addition to the deeper real time data, we’ve also simplified and improved the usability and level of detail around outages and the reasons they happen.  First, we’ve changed the existing reports we’ve had from the beginning on outages and outage reasons so that now they only show information on outages that have occurred in the last 60 days.  Previously, they contained data on every outage we ever monitored for you.

Next, we added a couple of new reports  – Outage History and Outage Reason History.  The Outage History report lets you see all of the outages that ever occurred, but it breaks it down by resource, so you can view all of the times you had a SharePoint outage, or an Exchange outage, etc.  As you can see from the screenshot here, it’s a much simpler way to view this data – in this case for OneDrive for Business:

OneDriveOutageHistory

For Outage Reason History, the data actually gets a lot more interesting and insightful.  First of all, we give you a breakdown of the different reasons for outages broken down by calendar quarter.  This allows you to see trends in where the supportability issues have been for your tenant.  Here’s an example:

OutageHistoryAll

Just at a quick glance you can see that originally back when we first started monitoring our Office 365 tenant, a lot of outages were Internal Server Errors and Unauthorized (usually meaning there was an Azure AD availability issue).  Again though, just like with the Outage History report, you can drill down into specific Office 365 services to see what the outage history is like for each one.  Here’s an example of that:

OutageHistorySharePoint

In this case we’re looking at the outage history for SharePoint Online.  Again, you can also see here how earlier the outages were primarily Unauthorized (meaning an Azure AD issue), but now we see more Service Unavailable errors.  Overall though, what’s also interesting is that you see we have far fewer outages with SharePoint Online than we did when we first started monitoring.  This is a great historical perspective to have and understand, as you see changes to the service and changes in the reliability in your tenant over time.

 

Some More Data, Some Better Data

I think these new reports will continue to help you Stay in the Know and Stay in Control of your Office 365 tenant.  You get deeper and more meaningful data than ever before, and we continue to build and expand the service to try and help keep you on top of your tenant at all times.  As always, if you have feedback on these or any other feature, I strongly encourage you to contact us at support@office365mon.com.  I read every customer recommendation and piece of feedback we receive.  If you haven’t started monitoring Office 365 yet, then please stop by our site at https://www.office365mon.com and click the big Start Now link on the home page for a free 90-day trial so you can see exactly how you can put monitoring to work for you.

From Sunny Phoenix,

Steve

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s