Welcome to MSDN Blogs Sign in | Join | Help

The Microsoft Office Word Team's Blog

All things Microsoft Office Word, from the Word team.
The Document Inspector

Intro

I just wanted to introduce myself quickly as I am a new poster. My name is Reed Shaffner and I am the newly appointed world-wide product manager for Microsoft Word. In addition to working on Word, I also do a lot of work around accessibility and sustained engineering across the Office Client.

The Document Inspector

Comments, personal information, hidden text, and more have always been a major pain point for users, particularly when they send information outside of their corporation. So in the 2007 release we went about addressing the problem by proving users a way to quickly to remove this sort of information from a document. At a high level, the document inspector provides users with a single place to go in order to ensure their document doesn't contain hidden content that should not be distributed.

Using the Document Inspector

The document inspector is located in the prepare menu which can be can found by clicking on the Office Button (Alt+F, E, I). When launching the inspector, if you haven't saved your file before making the most recent changes, you will be prompted to do so. It's not required that you save; however, the reason we provide this dialogue is because once you run the inspector, and choose to remove certain components of a document, they are GONE.

Once the inspector has launched you are provided with different types of information and metadata the inspector can look for and remove. Once you click inspect, you will be provided with a dialogue that states what was found and you can decided whether or not to remove that information.

So, when would I use it?

So as the product manager for Word, I spend a ton of time producing content. This often has to go through numerous review cycles with legal, our PM's, and others. After numerous rounds of markups and review, what started as a regular old document is often littered with comments, revisions, and markups galore. In addition, because I created the document, my personal information is available to any curious reader. Once I am ready to publish my content I would run the inspector. Because I don't want readers seeing all the comments or reading my personal information, I would search for, and remove it. I probably also don't want people seeing hidden text so I would get rid of that too. I would want to hold onto things like headers and watermarks because I use them in my documents.

With this new functionality I can be reassured that when I am sending documents outside of my company, I'm not sharing information I didn't want to. So that covers a brief introduction to the document inspector but we will be posting again soon on the nitty-gritty details and extensibility.

Reed Shaffner

Product Manager

Microsoft Word

Posted: Thursday, March 15, 2007 11:32 PM by wrdblog

Comments

Francis said:

Does Word 2007 still reveal hidden text and comments when doing an e-mail merge to plain text?

I had some very embarrassing incidents because of this in Word 2003--it would be nice if, at the very least, Word had warned me that hidden text would appear, or prompted me to run the document inspector.

# March 16, 2007 9:13 AM

lyricspowder said:

Please search for lyrics at LyricsMountain.com :

[url=http://www.lyricsmountain.com]Lyrics[/url]   =)

Thanks,

Sam

http://www.lyricsmountain.com

# March 18, 2007 1:48 AM

Ahmed said:

I will test this program for converting Docx to Doc

# March 18, 2007 5:14 PM

m_anter said:

In word 2007, can I set a theme as the default theme for new documents?

thanks

# March 19, 2007 11:47 AM

Craig said:

Might be a little off topic but this is driving me nuts.  I have a multi-level list with Chapter 1 Heading 1, Heading 2, etc.  With this method I apply the Heading 1 style to chapter titles, Heading 2 to section titles, etc.  I want to insert the current chapter title and current section title into the header of the document document so that they are updated for each page.  I tried using cross-references but then I have to manually change the references whenever the chapter or section changes.  Is there a way to do this?

# March 21, 2007 4:40 AM

Stefan KZVB said:

@Craig: Have you already tried to use the field named "StyleRef" to reference to the content of the Heading styles?

# March 21, 2007 9:43 AM

Stefan KZVB said:

Sorry for being off-topic, but I just experienced a severe bug with the docx-file format (or the converter):

- I saved a document which has ONLY the Arial font in it with Word 2007 as docx

- parts of the docx where formatted in "Times New Roman" when opening it with Word 2000 (that's the bug!)

- When the docx is reopened with Word 2007 the font is Arial

- When it's saved with Word 2007 as doc and opened with Word 2000 it's also ok.

- When saving and reopening a newly created similiar docx with Word 2000 it's also ok.

Tell me if you need an example document.

# March 21, 2007 10:45 AM

wrdblog said:

Hi Stefan - It would be great if you could post the document somewhere so I could look into this. Please remove any personal information or any information you do not want to be made public from the document.

-Jonathan

# March 21, 2007 2:21 PM

Craig said:

Stefan,

I thought I had already tried that to no avail.  I took another look at it and you are indeed correct.  Not sure what I was smokin' the first time I looked at it.  Thanks for the quick response.

# March 21, 2007 8:03 PM

Stefan KZVB said:

@Jonathan: I removed hidden information with the Document inspector (now we're on-topic again :) ) and uploaded two example docx-files there:

http://pschmid.net/office2007/forums/viewtopic.php?p=657#657

When there are news or a KB-Article about this issue, I'd be happy to read them - i.e. in the forum I posted the attachments. Thank you.

# March 22, 2007 5:28 AM

wrdblog said:

Hi Stefan - Are the actual files available at: http://pschmid.net/office2007/forums/viewtopic.php?p=657#657

-Jonathan

# March 22, 2007 1:29 PM

Stefan KZVB said:

Hi Jonathan,

yes, 2 small example files are attached to the end of the linked post (right below the picture). You can download both files from there by clicking the Download-Buttons on the right side even without being logged in to the forum.

I've also examined example2.docx a little bit further now and figured out that "Times New Roman" seems to be defined in the styles.xml as standard document font.

After I saved the file with Word 2007 as a doc (!), Word 2000 opened the doc correctly with Arial. Now I saved it with Word 2000 as a docx. After this it reopened correctly with Arial also in Word 2000. But in the Word 2000-saved docx's styles.xml there is not defined Arial as standard document font as I would have expected - instead it sets it to some constant. Sorry, I'm at home now with no Word 2000 available so I cannot tell you which constant it is.

- Stefan

# March 22, 2007 3:19 PM

divo said:

Hi,

I tried the document inspector on a simple Word test file. I just wondered that the internal revision IDs are left in the document:

<w:p w:rsidR="001123B4" w:rsidRDefault="005F1F92">

Thus it is still possible to track which paraphraphs have been edited since the first save, in how many sessions the document has been edited and which paragraphs were modified in the same session.

I did not expected to see any of this information after running the inspector.

Couldn't the document inspector also remove those IDs?

# March 28, 2007 10:25 AM

The Microsoft Office Word Team's Blog said:

In my last post we talked about the document inspector (DI) functionality available out of the box in

# April 5, 2007 2:25 PM

The Microsoft Office Word Team's Blog said:

The document review scenario has been significantly improved in Word 2007. It's pretty straightforward

# August 6, 2007 8:51 PM

Noticias externas said:

The document review scenario has been significantly improved in Word 2007. It&#39;s pretty straightforward

# August 6, 2007 10:42 PM

The Microsoft Office Word Team's Blog said:

Personally identifiable information (PII) is a hot-button issue these days. If you've ever had your identity

# June 13, 2008 4:02 PM
New Comments to this post are disabled
Page view tracker