Microsoft Research Connections Blog

The Microsoft Research Connections blog shares stories of collaborations with computer scientists at academic and scientific institutions to advance technical innovations in computing, as well as related events, scholarships, and fellowships.

DataUp rises to new curatorial heights

DataUp rises to new curatorial heights

  • Comments 0

DataUp, an award-winning program for data curation, is now better than ever, thanks to a significant upgrade released today. This new, more robust version of DataUp includes substantial usability improvements for scientific users needing data management support and, in addition, has entirely new functionality to enable repository administrators to add and manage their repositories from within the DataUp application.

DataUp: simplify data sharing

In the new version of DataUp, repository administrators can set up associative metadata via the UI or by uploading an XML file. This gives them the flexibility to define what metadata is required—even on a discipline-by-discipline basis—and to constrain the file-level metadata that will be captured from the user upon data deposit. In addition, administrators can activate the Data Quality Check, a new data validation feature that enables the DataUp tool to verify whether a user’s uploaded file meets certain requirements for the repository.

The DataUp upgrade will be officially unveiled at the 2014 International Data Curation Conference (IDCC2014) in San Francisco, which runs from February 24 to 27. The code is available as open source (Apache 2.0) on Bitbucket as of today, February 24. We encourage you to download the code and share it with other data curators—and to let us know what you think. And if you’ll be attending IDCC2014, we would be delighted if you would participate in the Microsoft Research workshop on Data Management in the Cloud, which will cover various topics such as how to use DataUp to manage your data in the cloud; also, be sure to stop by the poster session at the IDCC event.

This release marks the culmination of a project that started in conjunction with the Gordon and Betty Moore Foundation and which has included substantial information gathering and user support work done by the California Digital Library (CDL). CDL has also written a blog post on the new release.

Presently, DataUp supports two different types of repositories, though more can be added via repository adapters: (1) a personal or organizational Microsoft OneDrive repository or (2) a repository that adheres to the ONEShare standard developed by the California Digital Library. 

One thing I can truly say is that a project like this takes a village. This release has been a long time coming and I am very thankful to my partners at the Moore Foundation and CDL; and my colleagues in the Education and Scholarly Communication and Earth, Energy and Environment teams that put in the time and effort to bring this release to fruition.

Kristin Tolle, Director for Environmental Development Infrastructure, Microsoft Research Connections

Learn more

Leave a Comment
  • Please add 7 and 4 and type the answer here:
  • Post