For those of us on the Office Interoperability team, as well as our colleagues throughout Office, today is a big day. We’ve released SP2 (Service Pack 2 for Office 2007), which includes a bunch of updated features. Gray Knowlton has a roundup of what’s new in SP2, but I think the feature of most interest to readers here is probably the built-in support for ODF 1.1.
I first mentioned our plans for ODF support in a blog post last year, and I’ve also blogged in the past about the guiding principles that we followed in our ODF implementation. Our decision to support ODF is just one aspect of Office's broad commitment to choice and interoperability, as covered by Tom Robertson today on the Microsoft on the Issues blog.
For today’s post, I thought I’d put together a hands-on example of a typical user experience when working with ODF and Office 2007 SP2. I’m going to focus on a typical document creation and editing scenario in Word. Specifically, I’ll go through these steps:
The starting point. As a first step, I’ll create a document we can use as a starting point to try out some things. So I select File/New in Word, add some text, insert a few of the things we all use regularly in documents (a title, headings of various levels, a numbered list, and a table), and do some simple formatting. Here's how it looks:
The next step is to save this as an ODT document. That’s pretty simple: just click the Office Button, move your mouse to “Save As”, and then select “OpenDocument Text” from the menu. Before I go any further, it’s worth noting a couple of things about this step:
Now I’ll open this document in OpenOffice version 3.0.1. In a future post I’ll look at differences between various existing ODF implementations, but for today’s post I’m just going to stick to OpenOffice 3.0.1 and Office 2007 SP2.
When I open my ODT document in OpenOffice Writer, here’s what it looks like:
As you can see, the document looks essentially the same in both applications. The page break is the only obvious difference – it occurs at a different point in the document due to differences between the default line-spacing values used in Word and OpenOffice. Other than that detail, the document looks the same in both applications, with the same fonts, formatting, headings and content.
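One way to see where the two applications might diverge is to check which paragraph styles in the saved file state their line height explicitly and which leave it to inheritance or to the application's default. The sketch below is only an illustration, not a description of how Word or OpenOffice work internally. It assumes the .odt file is a standard ODF package (a ZIP archive containing styles.xml) and that line height is expressed with the fo:line-height attribute. It looks only at named styles in styles.xml, and the function name and the file name mentioned afterwards are hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET

# ODF namespace URIs used in styles.xml.
STYLE_NS = "urn:oasis:names:tc:opendocument:xmlns:style:1.0"
FO_NS = "urn:oasis:names:tc:opendocument:xmlns:xsl-fo-compatible:1.0"


def paragraph_line_heights(odt_file):
    """Map each named paragraph style in styles.xml to its fo:line-height.

    Styles that do not set fo:line-height map to None, meaning the value
    is inherited or left to the consuming application.
    odt_file may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(odt_file) as package:
        root = ET.fromstring(package.read("styles.xml"))

    result = {}
    for style in root.iter(f"{{{STYLE_NS}}}style"):
        # Skip text, table, and other non-paragraph style families.
        if style.get(f"{{{STYLE_NS}}}family") != "paragraph":
            continue
        name = style.get(f"{{{STYLE_NS}}}name")
        props = style.find(f"{{{STYLE_NS}}}paragraph-properties")
        height = None if props is None else props.get(f"{{{FO_NS}}}line-height")
        result[name] = height
    return result
```

Calling paragraph_line_heights("sample.odt") on a saved document returns a dictionary of style names; the entries whose value is None are the ones that rely on inheritance or on each application's own default spacing.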
The line-spacing variation is something you can see in other ODT documents and other ODF implementations as well. For example, if you open the latest draft of the ODF 1.2 specification (OpenDocument-v1.2-cd01-rev06.odt) in IBM Lotus Symphony 1.2.0, it is 931 pages long, but if you open the same document in OpenOffice Writer 3.0.1, it’s 875 pages long. These types of variations demonstrate a fundamental difference between a fixed-layout format (such as PDF or XPS) and a flow-oriented format (such as ODF or Open XML). Flow-oriented formats work well for dynamic editing activities, whereas fixed-layout formats rigidly pin down the layout of a document so that it will be rendered exactly the same on different devices. For these reasons, most people prefer to use a flow-oriented format during document authoring and editing, and a fixed-layout format for published documents that are no longer being edited.
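To make the arithmetic behind that kind of page-count drift concrete, here is a minimal sketch of how a flow layout turns a small difference in line height into a different number of pages. Everything in it is an assumption for illustration: the function name, the 648 pt usable page height, the 12 pt and 15%-wider line heights, and the 1000-line document are made-up values, not measurements from Word, OpenOffice, or Symphony, and real layout engines also account for paragraph spacing, tables, and widow/orphan rules.

```python
import math


def page_count(total_lines, usable_height_pt, line_height_pt):
    """Pages needed when text flows line by line into fixed-height pages."""
    lines_per_page = math.floor(usable_height_pt / line_height_pt)
    if lines_per_page < 1:
        raise ValueError("line height exceeds usable page height")
    return math.ceil(total_lines / lines_per_page)


# Illustrative values only: 1000 lines of text on a page with
# 648 pt (9 inches) of usable height.
single = page_count(1000, 648, 12.0)         # 12 pt line height
wider = page_count(1000, 648, 12.0 * 1.15)   # 15% more line height
print(single, wider)  # prints "19 22"
```

The same content lands on 19 pages in one case and 22 in the other, even though nothing in the text changed. A fixed-layout format avoids this because the line and page breaks are stored in the file rather than recomputed by each application.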
Getting Fancier. Now let’s move on to some fancier formatting and see how that works. I’m going to open this document in Word and make a variety of changes:
As a result of these changes, my document now looks like this in Word:
And if I save that version as an ODT file and open it in OpenOffice, I see this:
You’ll notice that many things are identical in both Word and OpenOffice, and a few things look a little different in each application. Here are some things that are the same in both applications:
And here are some things that appear differently in the two applications:
If you’d like to test these sample documents yourself, they’re in a ZIP file attached to this blog post (below).
Getting more information. This demonstration was just a simple example, for those who are curious about how the new built-in ODF support works in Office. You can find more detailed information about SP2’s support for ODF 1.1, including which features are supported by Word, Excel and PowerPoint, at these links:
Going forward, I’ll be doing some blog posts that get down into more of the technical details, to help explain some of the engineering decisions that we made in our implementation. For example, tracked changes functionality is of interest to many users, so I’m working on a post to cover why we decided to not implement tracked changes in ODF.
What else would you like to understand about our implementation of ODF? Share your questions and thoughts in the comment thread, or email me (dmahugh at microsoft dot com) if you have suggestions for topics you’d like to see covered here. I’m very proud of the work my colleagues on the Word, Excel and PowerPoint teams have done to add ODF support, and I’m looking forward to discussing the details now that SP2 has been released.
From Microsoft: Today Microsoft is releasing Service Pack 2 for the 2007 Microsoft Office system. This
Wow, great news. Now we can talk about this as a released implementation of ODF. Congratulations.
Great news. I'm quite interested in how form fields are handled (haven't checked that one out, but it's tax season so all I can think of right now is filling out those forms ;-)
Keep up the good work.
Congratulations to this great step towards interoperability.
Office 2007 SP2 includes major performance enhancements for Office applications and servers, most notably
It can be downloaded from Microsoft Update. The most important goodies in this SP, in my opinion
The Office 2007 SP2 is available now for download: http://www.microsoft.com/downloads/details.aspx?familyid=B444BF18-79EA-46C6-8A81-9DB49B4AB6E5&displaylang=en
I'm assuming that the line spacing can be addressed directly: If the page or paragraph style has an explicit line height and leading, will that result in consistent text flow across page breaks, etc.?
Can you test using the test documents that are available from Oasis? This would show up any holes in the produced XML.
Maybe then we'll see it as a complete implementation.
Also, what can Office do that ODF cannot store?
Thanks.
I did my own quick test. I have a 525 page book I am nearly finished writing in Office 2007. It has a lot of complexity, so I thought it would be a good test.
With SP2 installed, I saved in ODF. There were the line spacing issues you mentioned. What I also found was that indents were incorrect (e.g., for a bulleted list, using the built-in List Bullet style). I would have thought that the indent would have gone over OK, but apparently not.
I did a second test. Using the latest OpenOffice, I directly opened the .docx file. The results were notably worse than using Office 2007 SP2. Footnotes did not appear numbered, but instead showed as field codes. The worst was the title page. It had random sentences from throughout the text superimposed over the picture on the dust jacket.
My conclusion is that it looks like Office SP2 .docx to .odt is the best route, rather than .docx directly into OpenOffice.
Dennis/Bart/Peter -- thanks.
Dave -- yes, if you use only styles that explicitly specify the line spacing, indents, margins, etc., you can get a much more consistent appearance and reduce the difference in vertical spacing that you see in my examples.
Darren -- We did test our implementation with the test suite available from http://testsuite.opendocumentfellowship.com/ to make sure that we can correctly read and write all of those. Are those the documents you are referring to? If not, can you provide a link?
Ian -- interesting test.
Thank you for this great feature! I hope Office will stay compatible with future versions of ODF too!
Congrats on the filter - for Word. However, since SP2 implements only ODF 1.1 (since ODF 1.2 is still only an advanced draft format), how are formulas stored in spreadsheets? I hear there's also a problem with tables in slideshows (which is strange, since obviously Word can do ODF tables; why can't PowerPoint?)
I also wonder about page styles: I'd like to see how a document that alternates page formats, filler blank pages and such work in both. How are master documents handled?
@Ian: .docx is a proprietary XML-based format that has a single implementation. OOo developers are having trouble developing an import filter for the following reasons:
- actual file format doesn't always conform to the published specification (encryption had to be reverse engineered, for example).
- there are several redundant features: tables in Word, Excel and PowerPoint are different objects that share 95% of their properties (tables in OOo/ODF are the same, as no difference is made between one document and another), and each requires a different import filter method to create a single object: a table
- some features don't match with OOo's internal structure (geometrical shapes and text: OOo has 2 renderers, a simple one and Writer. The simple renderer is used for these shapes, but OOXML requires a richer one)
- Office 2007/2008 is the only Office generation using .docx; Office 14 should use OXML, and OOo developers think that their time would be better spent developing an import filter that can manage most XML formats at once (better for support, reduces code redundancy)
Upcoming version 3.1 will solve several problems here, and there are already further improvements planned/started for 3.2.
Doug, I really wish you could issue a response to the Slashdot article. It brings up very interesting points worth answering.