• Sign In
 
  • MSDN Blogs
  • Microsoft Blog Images
  • More ...
Search
  • Advanced search options...
Tags
  • .NET
  • Altova
  • blogging
  • code samples
  • Codeplex
  • Custom XML
  • DII
  • DIS29500
  • ECMA-376
  • IBM
  • Java
  • Monarch
  • ODF
  • Office 2007
  • OpenXMLDeveloper.org
  • PHP
  • Redmond
  • SharePoint
  • System.IO.Packaging
  • TechEd
  • UOF
  • VSTO
  • Windows
  • WordprocessingML
  • workshops
Archives
Archives
  • January 2012 (1)
  • October 2011 (1)
  • July 2011 (2)
  • April 2011 (1)
  • March 2011 (3)
  • December 2010 (1)
  • August 2010 (1)
  • June 2010 (1)
  • May 2010 (1)
  • April 2010 (3)
  • March 2010 (1)
  • November 2009 (4)
  • October 2009 (1)
  • September 2009 (2)
  • July 2009 (2)
  • June 2009 (4)
  • May 2009 (5)
  • April 2009 (4)
  • March 2009 (4)
  • February 2009 (2)
  • January 2009 (4)
  • December 2008 (4)
  • November 2008 (3)
  • October 2008 (4)
  • September 2008 (3)
  • August 2008 (2)
  • July 2008 (5)
  • June 2008 (7)
  • May 2008 (5)
  • April 2008 (8)
  • March 2008 (14)
  • February 2008 (15)
  • January 2008 (13)
  • December 2007 (12)
  • November 2007 (5)
  • October 2007 (9)
  • September 2007 (6)
  • August 2007 (10)
  • July 2007 (9)
  • June 2007 (8)
  • May 2007 (12)
  • April 2007 (14)
  • March 2007 (12)
  • February 2007 (10)
  • January 2007 (17)
  • December 2006 (14)
  • November 2006 (10)
  • October 2006 (11)
  • September 2006 (12)
  • August 2006 (12)
  • July 2006 (12)
  • June 2006 (23)
  • May 2006 (14)
Common Tasks
  • Blog Home
  • Email Blog Author
  • About
  • RSS for comments
  • RSS for posts

Open XML links for 01-11-2007

Doug Mahugh - Office Interoperability
MSDN Blogs > Doug Mahugh > Open XML links for 01-11-2007

Open XML links for 01-11-2007

Doug Mahugh
11 Jan 2008 8:12 PM
  • Comments 1

I've been pretty busy since the holidays, and consequently haven't got around to posting links to all of the great Open XML information that I've come across lately. So without further delay, here are some of my favorite Open XML blog posts and projects I've run into since the first of the year ...

Open source Open XML projects. I met Jason Harrop at the Office 2.0 conference in San Francisco last summer, and he showed me a very interesting Word add-in he was working on. Since then, Jason has started collaborating with a colleague to start up a series of open-source Open XML projects. I'll soon have more to say about some of the interesting work Jason and Jo are doing, but for now I wanted to point out a cool trick they mentioned on their blog that many developers aren't aware of: the ActiveDocument.WordOpenXML feature of the Word object model. On a related note, there are two other methods that developers may want to use for similar purposes: ActiveDocument.ExportFragment and Range.ExportFragment. Those work like SaveCopyAs in Excel, and can save to any supported format.

What's up, DOCX? Guy Creese's blog has some information about a free report available from the Burton Group entitled "What’s Up, .DOC? ODF, OOXML, and the Revolutionary Implications of XML in Productivity Applications." As Guy explains, "we'll probably ruffle some vendor feathers on this one, but we've tried hard to look into this objectively and in some detail (the report is 37 pages long)."

DOCX to HTML in ASP.NET. Maarten Balliauw, the driving force behind the PHPExcel API, is also a talented C# developer, and he has posted a useful article about how to preview DOCX files in HTML. His example takes advantage of LINQ to XML technology in the creation of an ASP.NET HttpHandler that transforms WordprocessingML into HTML. It's a simple example, but a great starting point for those who want to post DOCX files on web sites. Which is a very common need these days; here's another approach that starts from the DocX2Html.xsl that ships with SharePoint.

Taming the complexity of style inheritance. WordprocessingML's approach to style inheritance can be a complex topic, because a given string of text may have many formatting properties that apply: direct formatting, run properties, paragraph properties, list styles, table properties, etc. James Newton King has posted some thoughts on how to manage that complexity, by taking advantage of the fact that Open XML consistently uses property elements (rPr, pPr, etc.) for storing styling and formatting information.

Don't want macros? Remove them. Vineela Kavoori of Sonata Software has posted an article on the OpenXMLDeveloper site entitled Removing macro from WordProcessingML document using Java that demonstrates how to remove macros from a DOCM file and turn it into a DOCX. The sample uses no special libraries or tools, just the standard zip functionality in the util package.

Wouter's pretty developer tab. If you're doing Open XML development work with Word 2007 and you've not installed Wouter Van Vugt's Word add-ins, you're probably working too hard. Install the Databinding toolkit and the Word Source Viewer, and your Developer tab will be as useful — and look as good — as Wouter's.

The design goals of XML. The Open XML standards process has resulted in some interesting debate about the design of XML schemas. Rick Jelliffe's post on Design Goals of XML helps put that debate in its proper historical perspective.

  • 1 Comments
HTML, WordprocessingML, Java
Comments
  • Wouter
    12 Jan 2008 8:31 AM

    Nice list of references Doug. Just to complete it, the source viewer is at version 2:

    http://blogs.code-counsel.net/Wouter/Lists/Posts/Post.aspx?ID=28

    Now I am off to read the Burton report.

Page 1 of 1 (1 items)
  • © 2012 Microsoft Corporation.
  • Terms of Use
  • Trademarks
  • Privacy Statement
  • Report Abuse
  • 5.6.131.143