The OpenXML4J project website is now up and running: http://www.openxml4j.org/

This is an open source project to create a Java library for consuming and generating files in the Open XML formats. Here's a diagram of their planned architecture:

As described on the site, the scenarios they are currently targeting are:

Styling content

Use this when you must enforce your company's organizational standards for document formatting.

This scenario takes any Open XML document and one stylesheet as input, and produces a restyled document that complies with your organization's formatting standards.

Content Inspection

There are many examples of this kind of scenario; here are the most common uses:

Confidential information removal

Remove comments, annotations, document properties, personal information, presentation notes, tracked changes, and so on from outbound documents.

Macros removal

For security purposes, remove macros, as well as inappropriate language and content, from inbound documents.
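
To give a feel for what content inspection involves, here is a minimal sketch that uses only the JDK's java.util.zip classes, not the OpenXML4J API (which I haven't seen yet). It relies on the fact that an Open XML file is a ZIP package, and it flags parts by the names Microsoft Office conventionally uses (docProps/ for document properties, comments.xml for Word comments, vbaProject.bin for macros). The class name PackageInspector and the name-matching rule are my own assumptions for illustration; a real tool would follow the package relationships and content types rather than trusting part names.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper, not part of OpenXML4J: lists package parts that
// commonly carry properties, comments, or macros.
public class PackageInspector {

    // Walks the ZIP entries of the package and collects the flagged part names.
    public static List<String> findSensitiveParts(InputStream packageStream) throws IOException {
        List<String> flagged = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(packageStream)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (isSensitive(entry.getName())) {
                    flagged.add(entry.getName());
                }
            }
        }
        return flagged;
    }

    // Assumption: matches conventional part names only, case-insensitively.
    public static boolean isSensitive(String partName) {
        String name = partName.toLowerCase(Locale.ROOT);
        return name.startsWith("docprops/")
                || name.endsWith("/comments.xml")
                || name.endsWith("vbaproject.bin");
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PackageInspector <file.docx>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            for (String part : findSensitiveParts(in)) {
                System.out.println(part);
            }
        }
    }
}
```

Note that this only detects the parts. Actually removing them safely also means updating the relationships and content types that point at them, which is exactly the kind of bookkeeping a library like this should take off your hands.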

Consuming documents

Users create reports as spreadsheet documents, or invoices as word-processing documents, which are then loaded into a back-end system on a server (LOB, CRM, and so on).

Note: in this diagram, we assume that OpenOffice.org supports the Open XML format.

Document assembly

In this scenario, you build a document on the server. For example, you want to make a presentation for your next meeting: you choose the slides that will compose your deck in a web interface, and then ask the server to generate your presentation. Or maybe you want last month's sales report: you ask the server to find the data from various sources, compute statistics, and build the spreadsheet for you.
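
As a rough illustration of server-side assembly, the sketch below writes a bare-bones wordprocessing document using nothing but the JDK's ZIP support. It is not how OpenXML4J does it; the class name MinimalDocxWriter and its methods are made up for this example. It assumes the smallest package layout I know of: a [Content_Types].xml part, a root relationships part, and a main document part with one paragraph per input string.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of OpenXML4J: assembles a minimal .docx package.
public class MinimalDocxWriter {

    private static final String XML_HEADER =
            "<?xml version=\"1.0\" encoding=\"UTF-8\" standalone=\"yes\"?>";

    // Declares the content type of each part in the package.
    private static final String CONTENT_TYPES = XML_HEADER
            + "<Types xmlns=\"http://schemas.openxmlformats.org/package/2006/content-types\">"
            + "<Default Extension=\"rels\" ContentType=\"application/vnd.openxmlformats-package.relationships+xml\"/>"
            + "<Default Extension=\"xml\" ContentType=\"application/xml\"/>"
            + "<Override PartName=\"/word/document.xml\" ContentType=\"application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml\"/>"
            + "</Types>";

    // Points from the package root to the main document part.
    private static final String ROOT_RELS = XML_HEADER
            + "<Relationships xmlns=\"http://schemas.openxmlformats.org/package/2006/relationships\">"
            + "<Relationship Id=\"rId1\" Type=\"http://schemas.openxmlformats.org/officeDocument/2006/relationships/officeDocument\" Target=\"word/document.xml\"/>"
            + "</Relationships>";

    // Writes the three parts to 'out'; closing the ZIP stream also closes 'out'.
    public static void write(List<String> paragraphs, OutputStream out) throws IOException {
        try (ZipOutputStream zip = new ZipOutputStream(out)) {
            addPart(zip, "[Content_Types].xml", CONTENT_TYPES);
            addPart(zip, "_rels/.rels", ROOT_RELS);
            addPart(zip, "word/document.xml", buildDocumentXml(paragraphs));
        }
    }

    // Builds the main document part: one <w:p> per paragraph string.
    public static String buildDocumentXml(List<String> paragraphs) {
        StringBuilder xml = new StringBuilder(XML_HEADER);
        xml.append("<w:document xmlns:w=\"http://schemas.openxmlformats.org/wordprocessingml/2006/main\"><w:body>");
        for (String text : paragraphs) {
            xml.append("<w:p><w:r><w:t xml:space=\"preserve\">")
               .append(escape(text))
               .append("</w:t></w:r></w:p>");
        }
        xml.append("</w:body></w:document>");
        return xml.toString();
    }

    // Escapes the characters that are not allowed in XML text content.
    private static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    private static void addPart(ZipOutputStream zip, String name, String content) throws IOException {
        zip.putNextEntry(new ZipEntry(name));
        zip.write(content.getBytes(StandardCharsets.UTF_8));
        zip.closeEntry();
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java MinimalDocxWriter <output.docx>");
            return;
        }
        List<String> lines = Arrays.asList("Sales report", "Totals for last month go here.");
        write(lines, Files.newOutputStream(Paths.get(args[0])));
    }
}
```

Even this toy version has to get three cooperating XML parts right before Word will open the file, which is a good argument for having a library manage the package plumbing.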

 

This should be a very cool project.

-Brian