Welcome to MSDN Blogs Sign in | Join | Help

NLB (Network Load Balancing) and SharePoint... Troubleshooting and Configuration tips.

Joys of NLB. 

Customer was recently asking me about issues they were having with NLB and Kerberos.

He came across a recent KB 325608 where authentication delegation through Kerberos doesn't work straight forward in load balanced environments.

There are some recommendations for how to configure NLB with Kerberos in a load balanced environment.

A very good article is referenced in the KB... "Kerberos authentication for load balanced web sites" white paper.

For additional information about network load balancing, visit the following Microsoft Web site:
 
Couple of free tips...
 
Make your sessions sticky 
No matter what you use for load balancing try to keep the sessions sticky.  In NLB they call it affinity.  Although with load balancing you may think you want to have 50% of traffic go between the servers and load balance each request, in reality, it's not bad to have a 52% 48% load balance where the sessions are sticky and the users maintain the same server throughout their session.  The experience will be more consistent.  For example...  "Hey I'm getting a 500 error or server error,"  "Works for me" and me.  Oh, well I can repro it on this machine, but not this client.  Looks like you're having a problem with one of the servers in the load balanced cluster.  Had you had both with all requests load balanced, it would be.  "I see intermittent outages" when I refresh the page comes up.  It's really *wierd.*  The other thing is let's say it's an expired certificate.  You can imagine the experience with that, or let's say you decided to use basic over SSL.  You can imagine your users having to login twice!  Also, imagine trying to track down a session across two servers trying to figure out on which node, which action happened from an IIS log perspective.  You'll likely have to track it down anyway, but once you get there, you don't want to have to keep jumping back and forth.
 
Make sure the servers can talk 
Another tip for ya.  Let's say you put both your WFE's (web front ends) in an NLB cluster but you can't ping between them.  To our DC guys in IT they were always fine with it.  You don't need an extra network or cross over cable or HUB they use to say.  NOT SO! in SharePoint, especially in 2003.  If the WFE's can't talk to each other the Admin service would get upset and either pages would load very slow the first time or search requests would be super slow the first time.  You may see this an think it's an IIS or SharePoint .NET compilation or server caching issue.  Not so.  If you know the page has already been assembled or is hot and you get a slow response the first time, but not the second, you may want to check for communication issues between your WFEs and even communication between your WFEs and Search servers.  In IT we use to use cross over cables for the two node clusters, it was a cheap solution.  We'd use perfmon to determine that it was really working.  If it was even 40-60, we'd have to call it good some times.  You can simply use the web service, concurrent connections, although it is a funny and controversial counter, it will give you an idea if there is so many current sessions on one node or the other.
 
Avoid Collisions for communication
In 3 node NLB clusters we would either use a HUB or put pressure on the network folks to put the second NIC on a backend LAN with the SQL environment.  That was COOL.  We saw a lot of our network perf issues work themselves out by simply having front end NICs which took end user traffic with the second NIC being used for communication to the SQL server, and having server to server communication go over that NIC.  For SQL having a second NIC for backups was something we used in another company.  They had an entire "backup" VLAN.  It worked well for them.  Setting up static routes required documentation and some tribal knowledge when troubleshooting to be passed on, so from that perspective I preferred to use hubs and cross over cables.
 
When determining your load balancing strategy figure out how much you're willing to pay, what your security requirements are, what your availability requirements are.  NLB is cheap, but it does lack some of the intelligence you might think it has.  It does *not* know when the web service is stopped for example.  It doesn't know when SharePoint is down.  It's not very smart.  I've told the NLB folks this, but for the price of a NIC, and with some of your own intelligence it is a little bit better than round robin DNS.
 
If you decide to get serious with load balancing and start writing scripts around it to integrate with your MOM environment and/or web sites and services and you use ISA, you may want to look at the ISA 2006 load balancing stuff.  It's got some intelligence in it to determine that the web service isn't responding properly.
 
At one point I was trying to build some intelligence in a Web Sites and Services script that would force NLB to stop services, recycle IIS, and come back up after checking itself.  I never got past the Visio diagram stage, but did share it with the NLB Windows guys.  I'm attaching my Visio "SharePoint Uptime" logic from a past life.  In this world where you are the "person of the year," maybe you can build this and share it for the rest of us.
 
As far as good "How to Resources" for configuring or planning NLB.  Here's some references:
TechNet: Planning availability in WSS (more of a planning than a how to)
Mindsharp appears to have a "how to" whitepaper in their premium section... requires login.
The documentation on NLB is in development.  There are references to it being included in the WSS deployment guide.
 
Network Load Balancing: Frequently Asked Questions for Windows 2000 and Windows
Server 2003
http://www.microsoft.com/technet/prodtechnol/windowsserver2003/technologies/clustering/nlbfaq.mspx

NLB Troubleshooting Overview for Windows Server 2003
 
Good KBs on the Topic.
 
278431 - Using teaming adapters with network load balancing may cause network problems
http://support.microsoft.com/default.aspx?scid=kb;EN-US;278431
 
323437 - How To Configure Network Load Balancing Parameters in Windows Server 2003
http://support.microsoft.com/default.aspx?scid=kb;EN-US;323437
 
323431 - How To Set Up TCP/IP for Network Load Balancing in Windows Server 2003
http://support.microsoft.com/default.aspx?scid=kb;EN-US;323431
 
232190 - Description of Network Load Balancing features
http://support.microsoft.com/default.aspx?scid=kb;EN-US;232190
 
828258 - Network Load Balancing Manager does not work with imaged servers
 
890159 - The "NLB troubleshooting overview for Windows Server 2003" article is available
http://support.microsoft.com/default.aspx?scid=kb;EN-US;890159
Published Friday, January 05, 2007 10:33 PM by joelo
Attachment(s): SharePoint Uptime Logic.vsd

Comment Notification

If you would like to receive an email when updates are made to this post, please register here

Subscribe to this post's comments using RSS

Comments

Monday, January 08, 2007 2:43 PM by Jesse R

# re: NLB (Network Load Balancing) and SharePoint... Troubleshooting and Configuration tips.

Thanks for the tips. I will let you know another status update once I can figure this out. I wen't back to NTLM for the time being until I can sort this out. It has been a pain in my side!

Tuesday, January 16, 2007 8:52 AM by The Boiler Room - Mark Kruger, SharePoint MVP

# Great SharePoint Blog Articles by Joel Oleson

Why WSS 3.0 x64 and MOSS 2007 x64... What's the deal? NLB (Network Load Balancing) and SharePoint......

Wednesday, January 24, 2007 5:16 PM by Christophe Fiessinger's Blog

# Configuring load balanced PWA site

Install and configure a WSS Farm on ServerA Add a second server (ServerB) to the WSS Farm created above

Thursday, February 01, 2007 8:02 AM by The Mit's Blog

# Installation d'un MOSS 2007 : pas à pas

C'est un peu la question à la mode Si tout le monde sait aisement installé un WSS ou un MOSS sur un serveur

Monday, April 02, 2007 3:52 PM by EMS

# re: NLB (Network Load Balancing) and SharePoint... Troubleshooting and Configuration tips.

Does the issue with Kerberos and loadbalancing apply to using Windows 2003 NLB service only or a physical device (such as an F5 load balancer).

Monday, April 02, 2007 4:45 PM by Joel Oleson's SharePoint Land

# Replication and High Availability

The term replication comes up quite frequently in large deployments. It means a number of things to a

Thursday, April 05, 2007 6:31 AM by Spence

# re: NLB (Network Load Balancing) and SharePoint... Troubleshooting and Configuration tips.

The IIS guys are working on a addin for IIS which will add the intelligence features you talk about wrt nlb - think of it as AppCenter lite

Tuesday, August 07, 2007 1:13 AM by Peter Seale's weblog

# SharePoint Farms and My Site

Tuesday, August 07, 2007 1:15 AM by Peter Seale's weblog

# SharePoint Farms and My Site

Tuesday, August 07, 2007 1:17 AM by Peter Seale's weblog

# SharePoint Farms and My Site

Thursday, September 27, 2007 6:52 PM by blissful abstraction

# Error - This page has been modified since you opened it. You must open the page again.

Error - This page has been modified since you opened it. You must open the page again.

Thursday, February 28, 2008 2:04 PM by Brett's SharePoint Blog

# ECM Supplemental Material, Week 2

Thursday, February 28, 2008 2:24 PM by Noticias externas

# ECM Supplemental Material, Week 2

IRM/RMS Implementation General Deployment Resources: http://technet2.microsoft.com/windowsserver/en/technologies

Thursday, January 15, 2009 11:00 AM by vladkol on SharePoint

# Некоторые особенности развёртывания Office SharePoint Server 2007 в NLB, под Windows Server 2008, с HTTPS-доступом

Здесь мне хотелось бы рассказать об особенностях развертывания SharePoint под Windows Server 2008 И/ИЛИ...

Leave a Comment

(required) 
required 
(required) 

  
Enter Code Here: Required
 
Page view tracker