John R. Durant's WebLog

Blog of "The" Office Developer

WordProcessingML and Tables, CALS

We recently had a customer inquire about a WordProcessingML -> CALS conversion. Well, I had never heard of CALS, and neither my immediate colleagues. So, it was digging time. As a side note: when customer questions make it to the product team, we can become almost maniacal about finding an answer. I know that MS employees regularly give up "off the clock" hours (if there is such a thing) in order to solve customer problems.

Lucky for me, CALS was pretty easy to figure out. It is an SGML representation of tables. But, it has limitations, and it is a bit arcane. So, there is the Oasis representation, a sort of CALS-lite with some extra features. Converting to CALS from WordProcessingML may or may not be easy; I have not tried. But, I am guessing there is no "simple" XSLT to do the job. If there were, I would have already found one out there at least partially done. It's a skunk-works revenue opportunity waiting for a taker!

There is a cool tool I found that converts CALS tables to Excel spreadsheets. It's sensibly called X2XL. If anyone has experience with it, please let me know.

I also found this helpful article: Converting Tables to XML. The background info is useful. Finally, you may be interested in this article: The CALS Table Model History by Harvey Bingham. Good times.

OK, now on to the...

 Rock Thought for the Day: I'm sorry that I'm not on a positive note for the last two entries, but I just cannot be enthusiastic about everything that is legitmately rock and roll (although Bright Eyes is not rock and roll, I still needed to dampen the enthusiasm for his "music"). Today's equally deserving victim: My Chemical Romance. They have an album, Three Cheers for Sweet Revenge. In short, they just scream too much. Now, these aren't "cookie monster vocals", mind you. By the way, this constant shrieking is what makes some emo bands so distateful. It certainly doesn't work for My Chemical Romance's genre either. The punk ancestry can be detected in the DNA of this album, but it is corrupted by too noisy, less present and less gritty vocals. I love the rumble of a Harley Davidson. It's rough and real. But, put through a trumpet horn and given polished effects? In a word: No. My Chemical Romance is a botched experiment.

Rock On

Published Tuesday, August 09, 2005 12:06 PM by johnrdurant

Comments

 

David Giusto said:

John - First thanks for mentioning X2XL, it is a project that sprung from a customer need. We even integrated it with Word to convert to CALS tables to embedded Excel objects. Almost what you are looking for but not quite.

Since the release of the Office 2003 beta we have not been able to find anyone at MS who knew what a CALS table was or why it is important to high-end publishing and composition systems, so don't sweat it. In fact the CALS/Office combination is quite a niche but a niche that we are quite versed in. There are a few CALS to HTML via XSLT examples out there. (google "cals to html")

We also had an inqiury just recently from a VSTO 2005 Beta user asking about CALS and Word tables - I'm betting it's the same outfit that found their way to you. It's a small world in high-end publishing.

Speaking of Word and XML our latest project integrates Word with an XML content management system. Version Control, Document Component Sharing & Reuse, Fragment Editing, etc. See the Document Authoring section here: http://www.dmsi-world.com/adm.htm

August 17, 2005 9:22 PM
 

johnrdurant said:

Great stuff, David! I am happy to mention products that help our customers, and yours certainly does! (I accept company t-shirts I can wear around my little, too: size=M) :)

CALS is a technology that was new to me when the customer asked about it. I always wonder what else is out there that I know nothing about.

I'll check out your XML content management hook-up today.
September 2, 2005 10:52 AM
Anonymous comments are disabled

© 2009 Microsoft Corporation. All rights reserved. Terms of Use  |  Trademarks  |  Privacy Statement
Microsoft
Page view tracker