HealthVault Data Types

For Review: Medical Image Study

We've had several requests for a new data type to store medical images and we are now at a point in our research and design that we'd like to hear from our larger HealthVault community.  We've found that DICOM is the generally accepted format for storing medical images today so we've designed this data type to include structural and metadata similaities while limited the exposed data to what we believe is the most consumer friendly.

If you use DICOM today, those same images can be added to HealthVault in their native format using this data type.  All the metadata stored in a DICOM file remains intact for all systems to reuse with this approach.  By adding a level of structured XML that exposes some of this data, it allows for new and existing HealthVault applications to use the data without needing to fully parse the DICOM file or files.

There are a few scenarios we've targeted in the design of this data type:

  1. Systems generating medical images can now store those images in a patient's HealthVault record instead of creating a DVD, CD or other portable media. Today, when a patient requests a copy of their medical images from a CT, MRI, mamogram, etc, those images are burned to portable media which gets added to the shoebox of all the other studies the patient has requested.  The intent of this data type is to help replace that shoebox full of discs.
  2. The shoebox full of discs serves a purpose of being an electronic history of various studies that can be taken for a second opinion or further analysis.  For systems that understand medical images, they should also be able to understand medical images stored in HealthVault. 
  3. The consumer should be able to accurately transfer their existing medical images to their HealthVault record.  This new data type should be capable of storing all existing medical images that can be uploaded by a consumer.

We fully expect consuming applications to crack open the actual image files to get all the details of the study, series, but we hope that the defined schema will help make the decision on what to open a bit easier. 

The image data stored using this data type will be large in both quantity and size and our current XML storage approach doesn't optimize for binary content like images. To help mitigate this, HealthVault provides seperate binary storage we call blob storage (blobs).  Each instance of data in HealthVault can also include a blob of additional data and with the introduction of this data type, the HealthVault platform will support multiple named blobs per data instance. You'll find in this data type that we use blobs in a few places.  Specifically, blobs are used to store the set of medical images, a directory listing all blob names for the original images, a collection of key images, and a preview image for a study, series, and images. 

We have a few specific questions that we'd like your help in answering: 

  • The first is around the use of c-FIND.  In our research, we found that there are additional required elements for c-FIND that are perhaps less meaningful for consumers. If you use c-FIND today and/or have scenarios where it applies to HealthVault data, please let us know. We expect consuming applications to crack open the actual image files to get all the details of the study, series, but we hope that the defined schema will help make the decision on what to open a bit easier.
  • The second question is around XDS-i support.  We would like to hear from you if you currently use XDS-i and how you would plan to use this element in your HealthVault connected scenarios.
  • The last is whether the list of elements we have defined is the right set.  We've picked the ones we think are meaningful for consumers and that represent the data being stored.

Take a look and let us know what you think by using the Data Types forum

 

Medical Image Study type

A collection of medical images.  DICOM data is stored in the named blob portion of the data type, and it is recommended that normal DICOM conventions are used for naming. An application may store XDS-i manifest information in XML format in the xds-i element.

Element

Type

Occurrence

Description

when

date-time

1

The date and time the study was created. DICOM tags (0008,0020) and (0008,0030)

patient-name

string

0..1

The name of the patient as contained in the medical image. DICOM tag (0010,0010)

description

string

0..1

A description of the study. DICOM tag (0008,1030)

series

medical-image-study-series

1..N

A collection of series.

reason

codable-value

0..1

The reason for the study.  DICOM tag (0032,1030)

preview-blob-name

string

0..1

The name of the blob holding a smaller version of the image suitable for web viewing or email. For example jpeg.

key-images

medical-image-study-series-image

0..N

The important images in the study.

study-instance-uid

string

0..1

The study instance UID.  DICOM tag (0020,000D)

Medical Image Study Series

The details describing a specific series of images.

Element

Type

Occurrence

Description

acquisition-datetime

date-time

1

The date and time that the image was acquired. DICOM tag: (0008,0022), (0008,0032)

description

string

0..1

A description of the series. DICOM tag (0008,103E)

images

medical-image-study-series-image

1..N

The collection of images.

institution-name

organization

0..1

The name of the institution where the images were acquired. DICOM tag (0008,0080)

referring-physician

person

0..1

The physician who ordered the study. DICOM tag (0008,0090)

modality

codable-value

0..1

The method (or modality) in which the images were acquired. DICOM tag (0008,0060)

body-part

codable-value

0..1

The body part that was imaged. DICOM tag (0018,0015)

preview-blob-name

string

0..1

The name of the blob holding a smaller version of the image suitable for web viewing or email. For example jpeg.

series-instance-uid

string

0..1

The series instance UID.  DICOM tag (0020, 000E)

 

Medical Image Study Series Image

Information about a single image in a series.

Element

Type

Occurrence

Description

image-blob-name

string

1

The name of the blob holding the original data for this image.  DICOM tag (0008, 0018)

image-preview-blob-name

string

0..1

The name of the blob holding a smaller version of the image suitable for web viewing or email. For example jpeg.

Published Thursday, July 09, 2009 11:43 AM by timsell
Filed under:

Comments

 

Tomasz K. Helenowski, M.D. said:

It is RIDICULOUS to come up with another image format rather than support DICOM. What in the world is "Consumer Friendly." This is nonsense. WE DO NOT NEED ANOTHER PROPRIETARY FORMAT. It will only set back progress made in this field.

July 14, 2009 11:05 AM
 

ericgu said:

Tomasz,

Sorry if this wasn't clear in the writeup.

The data stored in the blob section is the raw DICOM data - we're not proposign a new format for the data. Because the data can be fairly large and there can be a lot of it, it will take some time for an application to fetch it. We therefore duplicated some of the DICOM metadata into the data type itself, so that applications can look at that metadata without having to fetch the DICOM data itself.

July 14, 2009 11:13 AM
 

John Gentle said:

Can you please provide a visual model of how data might flow from PACS to Healthvault to PACS using DICOM? Thanks?

July 14, 2009 12:25 PM
 

Philippe PELLERIN PhD,MD said:

Keep the DICOM format it has the advantage to be universal

Would you propose a new format to take the jpeg place format for photographies ?

July 14, 2009 1:11 PM
 

Matt Hayes CIIP said:

This looks like a good beginnning for handling the images.  But I would caution that any uptake/downloading/importing applications you design for the system just parse the DICOM header to fill in the blanks of you schema as it archives the images.  Making sure the DB you're using can keep the elmenents tracked is the basic essentials to any image management system.   It is possible to have such structured data only visible on the application layer to the consumer.  However, if a consuming system does request an image, it should be able to perform the same operation on the DICOM object for itself. Send the image, not the data type. This new data type should only be for functional operations w/in health vault and maintain the fidelity of the original objects.

DICOM overall is very robust in handling imaging sets. Don't reinvent this structure wheel that we're continuing to make improvements on.

July 14, 2009 4:14 PM
 

Greg Ofiesh said:

If I understand the issue at hand here, your new data type is orthogonal to the issue of bandwidth and query performance. A lean and mean DICOM IOD can be defined for your uses.

Is Microsoft working on .NET assemblies to make development for HL7/DICOM much more developer friendly? If not, I would love to see this happen.

July 14, 2009 8:40 PM
 

Mathieu Malaterre said:

That looks like what Oracle people are doing with there DICOM object:

http://www.oracle.com/technology/obe/11gr1_db/datamgmt/dicom/dicom.htm

I would avoid copying DICOM fields other than UIDs fields as copying field imply you are actually able to read & interpret DICOM data properly, which can be tricky for DA type (think of old ACR NEMA 10byte DA field vs DICOM V3 8byte DA type).

Patient Name is also a pretty bad idea as it is a Type 1C in most SOP (read: it can be empty !). Worse in typical clinical trial environment it is being anonymized thus those system would not be able to cope with identical SOP Instance UID with different Patient Name.

I would only use UID to build a graph of relation (think PS object), I think Patient Name can only be used in some restricted environment type.

July 15, 2009 6:00 AM
 

Tomasz K. Helenowski, M.D. said:

There is a lot of current software for visualizing DICOM images which is very useful. I use several daily in my practice. They usually work if the DICOM images are stored in a subdirectory as individual DICOM files and fail if not. All this software would have to re-written to use a CD/DVD created in your system which creates a new type of blob with the image directory to be parsed before these images can be viewed. This is a step backwards in terms of compatibility. DO NOT TRY TO SAVE SPACE WITH A BLOB. DISK SPACE IS CHEAP THESE DAYS. KEEP A SEPARATE DIRECTORY WITH ALL THE IMAGES IN ORIGINAL DICOM FORMAT.

July 15, 2009 9:06 AM
 

Jacques Fauquex said:

DICOM WG27 is working right now on an extension to WADO, in order it to work as a web service. Other complementary web services will be defined for query and notification.

DICOM WG27, too, is defining a subset of atributes for easy reference to the binary information of the images.

Please contact Emmanuel Cordonnier, chair of this workgroup, in order to eventually reach a common subset used by WADO and by healthvault.

July 15, 2009 9:17 AM
 

ericgu said:

Tomasz,

Our intention is not to create a format that is used outside of HealthVault. If an application were to implement an export of the data to disk, we expect that they would take the original dicom files (now stored as blobs) and write them out to disk in that format. At that point, all existing viewers should work as they do now, since they would be working on the original data.

The image directory in healthvault is there as an optimization so that applications that want to access the data directly in HealthVault have a way to get certain information (such as thumbnails) without having to download the entire dicom file from the blob store. The directory is stored in a blob because of technical limitations that HealthVault has on the size of the study - with large studies the image directory could exceed this limit.

We're working on a description of the scenarios that we hope will make this clearer.

July 15, 2009 12:34 PM
 

DICOM dood said:

Are you planning on providing an integrated viewer that can parse and properly represent the original DICOM SOP classes? If so, will it support IHE CPI and GSPS and how will you deal with visualizing large data sets? If not, what is the good of storing the DICOM data in the first place if consumers cannot view it?

July 15, 2009 1:14 PM
 

Greg Ofiesh said:

Storage is cheap, healthvault is envisioning an exceptionally large data, but this whole topic is orthogonal to the issue at hand. Data types are an engineering design decision, and generally have nothing to do with storage requirements.

DICOM dood raised the most important question - why store the images? Of course for viewing (by doctors). So why not just implement a scalable PACS with extensive access control management? you can then front end it with your "patient friendly" web pages, while maintain industry standard PACS access mechanisms.

July 15, 2009 5:05 PM
 

Peter Tate said:

In what format will you use for patient name?  

Will you support non-ASCII character sets?

If you support non-ASCII character sets, will HealthVault convert the name to UCS-16 or will the application be responsible for decoding it?

If the application is responsible will you also include the Specific Character Set tag (0008,0005) or will applications have to read the DICOM file to determine the patient name? (which will also affect other string parameters)

What about  East Asian character sets that also have ideographic and phonetic representations?

Are you only planning on storing images?  What about Structured Reports, GSPS?

Finally, I would recommend looking at what is stored in a DICOMDIR for the list of tags to replicate in your  study.  This will be what many viewers already support, and what users expect to find when selecting studies to open.

July 17, 2009 10:48 AM
 

Chris Diacov said:

I would agree with Peter - DICOMDIR is probably a good place to start. Maybe ingest (or even create from scratch?) the DICOMDIR structure and find a way to create easy mapping between the blob name and the actual image file name? This could open the door for many viewers out there to be adapted to work with HealthVault with somewhat modest changes.

July 22, 2009 4:01 PM
New Comments to this post are disabled

© 2009 Microsoft Corporation. All rights reserved. Terms of Use  |  Trademarks  |  Privacy Statement
Microsoft
Page view tracker