Michael Kaplan has hundreds (maybe over a thousand) of posts on obscure details of unicode/internationalization/sort orders/etc. Skim through his blog, it's pretty impressive. He recently had a post on Japanese Word Breaking: http://blogs.msdn.com/michkap/archive/2006/12/04/1203808.aspx