<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx</link><description>In the lifetime of a case, particularly one of high impact, a point is reached where a decision is necessary regarding the direction of the case. This decision impacts how the situation is approached going forward. The decision that must be made is: “Do</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#363517</link><pubDate>Mon, 31 Jan 2005 05:37:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:363517</guid><dc:creator>Christopher Summers</dc:creator><description>The real question is how do you satisfy a customer that wants both RCA and Relief?  More often then not we dont have the luxury of providing a RCA because of SLA's but we are still required to provide one even though we dont know what it really was.</description></item><item><title>A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#363702</link><pubDate>Mon, 31 Jan 2005 14:59:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:363702</guid><dc:creator>Welcome to Flaphead.com @ Home</dc:creator><description /></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#364321</link><pubDate>Tue, 01 Feb 2005 08:51:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:364321</guid><dc:creator>Steve</dc:creator><description>But at what point is it a loss for Microsoft? If you do not fully invesitgate the RCA and just provide Relief, Microsoft has just lost out on the chance to futher their ability to root out the causes of software issues. Example right now I have been working with MS on a case for over 43 hours (phone hours) MS has a relief answer, but I am determined to get the RCA because I want to further our database of troubleshooting and to save us the call in the future. After all if I experience the issue once chances are good that it will come back in the future and I need to save my company the $245 support call. On the other hand I have had issues where I just want relief and no RCA. So it is a 50/50 issue I guess.</description></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#364893</link><pubDate>Wed, 02 Feb 2005 00:24:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:364893</guid><dc:creator>Jeremy</dc:creator><description>Steve, you are absolutely right. Microsoft does loose out consistently in performing RCA on issues. It has to do with the decisions that are made by the customer during the incident regarding pursing RCA or relief. We have to honor what the customer wants to do and we can only be diligent about communicating this when we believe the junction has been reached, where further steps would jepordize the ability to provide RCA. &lt;br&gt;&lt;br&gt;Unfortunatly with the computers, you typically loose evidence of the problem in the steps of providing relief... catch 22...</description></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#365672</link><pubDate>Wed, 02 Feb 2005 22:25:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:365672</guid><dc:creator>Bill Wilson</dc:creator><description>This problem is not confined to software. Sometimes, to create a safe state, you have to take actions that you know will destroy evidence. This is frequently the case in high-hazard industrial settings. Other times, the pressure to simply get things cleaned up and back into production can be very intense.&lt;br&gt;&lt;br&gt;In my case (power production and heavy construction), there's generally somebody on-site at all times that can get in, collect a bunch of evidence, and get out quickly. We have procedures for incident response that cover this. Your case is different -- the problem is not at your site, you can't get in to collect evidence quickly, and the customer wants both RCA and relief, right now.&lt;br&gt;&lt;br&gt;That's a tough situation. I think you're probably doing the right thing... make it very clear to the customer that immediate relief will seriously hinder any RCA effort, and without RCA, the problem could come back. Then you have to leave the choice to them.</description></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#366678</link><pubDate>Fri, 04 Feb 2005 02:45:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:366678</guid><dc:creator>Chris Scharff</dc:creator><description>For me it depends in large part on two factors. &lt;br&gt;&lt;br&gt;1. The level of business impact the issue is causing. If it is a critical system, root cause (while important) generally has to take a backseat to relief. If the problem has happened before, the balance shifts somewhat I suppose.&lt;br&gt;&lt;br&gt;2. The confidence level of the engineer (and of me in the engineer's ability) that in losing RCA data the steps will actualy provide relief. &lt;br&gt;&lt;br&gt;</description></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#370028</link><pubDate>Thu, 10 Feb 2005 00:15:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:370028</guid><dc:creator>Chris</dc:creator><description>It seems to be a catch 22 for either side. Businesses cannot afford the downtime and don't care to investigate the RCS. On the other hand, IT typically has to be able to identify the root cause to properly fix the problem.&lt;br&gt;&lt;br&gt;Eventlogs and performance logs can typically lead you to the precise cause of failure. But in somecases, it may take an additional call to MS. It's important to relay everything that has happened to the support engineer in order to properly get a solid RCA.&lt;br&gt;&lt;br&gt;All your really doing when contacting MS is brining in an additional resource for assistance. All they know is what you tell them. It goes for any outsourced consultant.&lt;br&gt;&lt;br&gt;Althought, uptime is the most important thing, identifying the RCA has to be addressed by all times to fully support and update any SLA.&lt;br&gt;&lt;br&gt; </description></item><item><title>re: A critical junction in support issues: Root Cause VS. Relief </title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#370817</link><pubDate>Fri, 11 Feb 2005 06:46:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:370817</guid><dc:creator>David P</dc:creator><description>&lt;br&gt;This is a question that rarely comes up for your typical firefighter: relief (put out the fire!) goes before root cause analysis (it was a pan of grease on the stove).  Much easier prioritization decision!&lt;br&gt;&lt;br&gt;It's a balance, just like you have a balance when deciding whether to take all your servers down right now to apply the latest series of security patches.  Is the risk of a self-inflicted denial of service higher than the risk of lost sales while I reboot?  If so, you might want to consider patching over the weekend.  Is the risk of an exploit and associated losses higher than the risk of possible lost sales?  Then bring'em down, cowboy.&lt;br&gt;&lt;br&gt;The balance in this case is based on, amongst other things:&lt;br&gt;&lt;br&gt;A- how much the analysis time is costing in lost productivity or business opportunity while the problem is ongoing, vs. how quickly I could apply relief and reduce that cost&lt;br&gt;B- how much longer it will take to perform further analysis, before I can determine root cause &lt;br&gt;C- how soon the problem will reappear if I just apply relief.  If I reboot tonight and the problem goes away for two years, I'll be much more willing to select &amp;quot;relief&amp;quot; than a scenario where I'll be called back in an hour.&lt;br&gt;D- the level of confidence I have in my answers to the above questions.  Also, the level of confidence I have that the Relief I will be applying will actually work.&lt;br&gt;E- the value of the data lost when I apply relief.  If logs are critical to resolving the root cause, and I will lose them completely by applying relief, I'm less willing to do so.  If I can apply relief and still work on RCA, that's more palatable.&lt;br&gt;F- SLAs.  If my company gets financial penalties on downtime per incident, my incentive to search for root cause may be diminished.  If I get penalties on cumulative downtime, I may want to resolve the problem for good, in which case I want root cause.&lt;br&gt;G- closely related to the above: how loudly is the client shouting in my ear?&lt;br&gt;H- how long I've been working on the problem with no forward movement.  I'm more willing to provide relief and throw in the towel on RCA if I've been trying to fix the problem for 48 hours and I don't feel I've made progress.  If I feel the solution is &amp;quot;just around the corner&amp;quot; I'm more willing to continue analysis.&lt;br&gt;I- how much sleep I've had in the past 48 hours, and whether the coffee in the breakroom is any good.&lt;br&gt;&lt;br&gt;The last one is not completely in jest.  Root Cause analysis often requires a sharp, focused, alert! mind that is in tune with the environment and can detect minute anomalies or variations from the norm.  Rebooting takes one binary brain cell and an index finger.&lt;br&gt;&lt;br&gt;All of the factors above are balanced in the equation:&lt;br&gt;&lt;br&gt; X =  lim (( A / I^2) * (B - G/D) + C)&lt;br&gt;        H -&amp;gt; F&lt;br&gt;&lt;br&gt;Plus a constant, of course.  As X trends to 1, I'll be more willing to just go to Starbucks rather than drink any more of that overwarmed pot sludge.  &lt;br&gt;&lt;br&gt;-dp.</description></item><item><title> JeremyK s MSFT WebLog A critical junction in support issues Root | Insomnia Cure</title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#9719265</link><pubDate>Wed, 10 Jun 2009 03:23:21 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9719265</guid><dc:creator> JeremyK s MSFT WebLog A critical junction in support issues Root | Insomnia Cure</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://insomniacuresite.info/story.php?id=9510"&gt;http://insomniacuresite.info/story.php?id=9510&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> JeremyK s MSFT WebLog A critical junction in support issues Root | fire pit</title><link>http://blogs.msdn.com/jeremyk/archive/2005/01/29/363143.aspx#9780232</link><pubDate>Fri, 19 Jun 2009 09:16:24 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9780232</guid><dc:creator> JeremyK s MSFT WebLog A critical junction in support issues Root | fire pit</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://firepitidea.info/story.php?id=1348"&gt;http://firepitidea.info/story.php?id=1348&lt;/a&gt;&lt;/p&gt;
</description></item></channel></rss>