From November 9 to 12, 2011, Portland, Oregon, the City of Roses, becomes the City of Hoppers, as technology-minded women from across the United States flock to the Grace Hopper Celebration (GHC) of Women in Computing, an annual conference that brings the research and career interests of women in computing to the forefront. The conference is named for the legendary computer scientist, U.S. Navy Rear Admiral Grace Hopper, and past GHCs have drawn 1,500 or more participants and dozens of corporate sponsors. This year, a record number of attendees (more than 2,000) are expected.
As in the past, leading researchers will present their current work, and special sessions will focus on the role of women in computer science, information technology, research, and engineering—as well as trends in these fields. And as always, a large contingent of corporate recruiters will be on hand—including many from Microsoft—looking to snag the top talent that GHC attracts and to help researchers and technical professionals expand their computer science knowledge and networks.
It’s exciting to see the lineup of amazing speakers from academic institutions, governments, nonprofits, and industry—including more than a dozen from Microsoft. All in all, more than 100 Microsoft researchers and technical employees will be attending, and the company is involved in more than 16 plenaries and sessions (see the line-up of Microsoft speakers). We also will be actively involved in the career night, the poster session, and the Sponsor Night Party. Fact is, Microsoft is a Platinum Sponsor of the Grace Hopper Celebration, for the fifth year in a row. We are proud to support the GHC and the contributions of the Anita Borg Institute and the Association for Computing Machinery, which are critical in attracting and retaining the women who will create the new technologies and drive new innovations for our global future. Be sure to come visit our booth (Exhibit Hall B 417), learn about natural user interfaces, and try out Kinect for Xbox at our Kinect Lounge in Hall C next to CyberCenter.
Now, let me plug my hometown for just a minute. As the United States’ top green city, Portland derives half its power from renewable sources; a quarter of the workforce commutes by bike, carpool, or public transportation; and it has more than 35 buildings certified by the U.S. Green Building Council. Microsoft shares Portland’s focus on harnessing green technology and was recently named one of the Top Green IT organizations by ComputerWorld. In line with our efforts to reduce carbon emissions by 30 percent per unit of revenue by 2012, Microsoft will be going collateral free at this year’s GHC, so we encourage all attendees to visit our Grace Hopper event site to find the information that would typically have been available as booth handouts.
That said, we still want every Hopper to stop by the Microsoft booth to pick out a photosynthetic “research partner” from our Project Epiphyte nursery. You and your air plant will collaboratively recycle carbon dioxide and oxygen as you symbiotically photosynthesize and respire, and you will join the Project Epiphyte community of dedicated plant-human partners. What’s more, you might even beautify your workspace. The epiphyte is more than just a highly evolved organism that has transcended the limitations of its soil-bound ancestors. It symbolizes our desire to nurture a lasting relationship with GHC attendees and is a metaphor for the collaborative process of research, where knowledge is built on previous efforts and leads to entirely new fields of study. The first 1,500 attendees who visit our booth will receive an epiphyte and our renowned Microsoft Grace Hopper chocolate.
Stop by the Microsoft booth to participate in Project Epiphyte and learn what these items are all about.
Also, visit our recruiting booth (Exhibit Hall A566). In addition to full-time positions, we offer a number of internships, scholarships, and fellowships. We think Microsoft is a great place for technological women (and men) to realize their ambitions, and we aren’t alone. Just last month, Great Place to Work, a global research, consulting, and training firm, named Microsoft the world’s best global company at which to work. For the last 10 years, I have been telling all my friends that I work at the best company in the world; now they don’t have to take only my word for it! So while you stop to smell the roses in the City of Roses, set aside some time to sniff out the possibilities of becoming a “Softie.”
“What If” is this year’s theme of the GHC, and it aligns nicely with our theme across Microsoft this year: “Be What’s Next.” Everyone at the conference can “Be What’s Next” by answering and investigating all the possible “What Ifs.” And if that didn’t make sense, I’ll be glad to rephrase it in person at the GHC. See you in Portland.
—Rane Johnson-Stempson, Director of Women in Research, Science, and Engineering, Microsoft Research Connections
The challenge of DNA sequencing is central to all genomics research, and while the technology has existed since the 1970s, today’s massively parallel sequencing instruments are capable of producing gigabytes of raw genomic data quickly and increasingly cheaply. Reconstruction of a DNA sequence from this data (for example, through de novo assembly) is a compute-intensive task, and experimentation has shown that data quantity is no substitute for quality when it comes to the accurate reconstruction of a DNA sequence. Unfortunately, not all sequencing technologies produce reliable and accurate results, and experimental data will always contain varying rates of error. Therefore, a preliminary quality control (QC) step is regularly employed to detect and counteract such sequencing errors.
The QC of sequencing results may range from simple manual filtering procedures to comprehensive automated solutions. To contribute to this area of QC tools development, we present Sequence Quality Control Studio (SeQCoS), a Microsoft .NET software suite that is designed to perform an array of QC evaluations and post-QC manipulation of sequencing data. SeQCoS generates a series of standard plots that illustrate the quality of the input data. These plots (saved in JPEG file format) provide information on commonly observed measurements, such as GC content (the proportion of guanine and cytosine nucleotide bases in a DNA sequence), and distribution of quality scores at position-specific and sequence-specific levels. In order to filter out poorly performing sequences, SeQCoS also conducts basic trimming and discarding functions to manipulate sequence files.
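To make those measurements concrete, here is a minimal Python sketch of GC content, position-specific mean quality, and a simple quality trim. It is not SeQCoS code (SeQCoS is a C#/.NET suite); the function names are hypothetical, and it assumes FASTQ-style quality strings in the common Phred+33 encoding.

```python
from statistics import mean

def gc_content(seq):
    """Fraction of G and C bases in a sequence (0.0 for an empty one)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)

def phred_scores(quality, offset=33):
    """Decode a quality string into integer Phred scores (assumes Phred+33)."""
    return [ord(ch) - offset for ch in quality]

def mean_quality_by_position(quality_strings, offset=33):
    """Mean Phred score at each read position, across all reads."""
    longest = max((len(q) for q in quality_strings), default=0)
    means = []
    for i in range(longest):
        # Shorter reads simply do not contribute to later positions.
        column = [ord(q[i]) - offset for q in quality_strings if i < len(q)]
        means.append(mean(column))
    return means

def trim_3prime(seq, quality, min_q=20, offset=33):
    """Drop trailing bases whose Phred score is below min_q."""
    scores = phred_scores(quality, offset)
    end = len(scores)
    while end > 0 and scores[end - 1] < min_q:
        end -= 1
    return seq[:end], quality[:end]

# Two made-up reads: (sequence, quality string).
reads = [("GATTACAGGC", "IIIIIIII#!"), ("ATATGCGCAT", "IIIII5IIII")]

gc_values = [gc_content(seq) for seq, _ in reads]
position_means = mean_quality_by_position([q for _, q in reads])
trimmed = [trim_3prime(seq, q) for seq, q in reads]
```

In this example the first read loses its last two low-quality bases, while the second read is kept whole. A real QC tool would also plot these values and discard reads that end up too short after trimming.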
At Microsoft Research, the Microsoft Biology Initiative team is collaborating with academic research groups in the sequencing of various organisms. To ensure that the sequenced sample is not contaminated by other strains or sequencing vectors, SeQCoS optionally integrates NCBI BLAST for PCs running the Windows operating system to search against a BLAST-formatted database. We provide a pre-formatted database of NCBI UniVec, a repository of vector sequences, adapters, linkers and PCR (polymerase chain reaction) primers that are used in DNA sequencing; however, researchers can use a different database if they prefer.
About the Tools
SeQCoS was written in C#, using the .NET Bio (formerly the Microsoft Biology Foundation [MBF]) bioinformatics toolkit and Sho, a data analysis and visualization application. It is freely available as open-source code under the Apache 2.0 license. Further details and software downloads are available from Sequence Quality Control Studio.
.NET Bio is a library of common bioinformatics functions (file parsers, algorithms, and web service connectors) that simplify the creation of bioinformatics applications on the .NET platform and is an open-source project that is freely available for academic and commercial use under the Apache 2.0 license. While this project was initiated by Microsoft Research, it is owned by the Outercurve Foundation, a non-profit organization, and is governed by a growing community of users and contributors.
—Kevin Ha, Microsoft Research Intern
Question: What precocious five-year old is writing parallel code to make the most efficient use of multi-core processors?
Answer: The Barcelona Supercomputing Center (BSC)–Microsoft Research Centre in Barcelona, Spain, also known as BSCMSRC by those who enjoy trying to pronounce acronyms that contain no vowels.
From left to right: Andrew Blake, managing director, Microsoft Research Cambridge; Fabrizio Gagliardi, director, Microsoft Research Connections EMEA; Maria Ribera, dean of Barcelona School of Informatics; Rick Rashid, senior vice president of Microsoft Research; Antoni Giró, president, Rector of Technical University of Catalonia - Universitat Politècnica de Catalunya; and Mateo Valero, director, Barcelona Supercomputing Center
Okay, so it was a trick question. But the Centre, which celebrates its fifth anniversary on November 2, 2011, truly is a precocious operation, producing code that makes it easy for programmers to develop parallel-processing software. This is vital because everything—from smart phones and tablets, to PCs and supercomputers—is sprouting extra cores so users can do more. A joint venture of BSC and Microsoft Research, the BSCMSRC brings together the expertise of hardware and software researchers from BSC and software mavens from Microsoft Research.
One technology that the BSCMSRC researchers have been looking at is transactional memory (TM). TM makes it easier to write parallel programs that frequently share data, a process that otherwise requires complex and unwieldy programs. The Centre has developed two sophisticated TM applications to date: QuakeTM and Atomic Quake. These applications, which are based on the open-source Quake game server, will be useful in evaluating TM-equipped chips. As part of the €4 million VELOX project funded by the European Commission, BSCMSRC has coordinated the development of a fully integrated TM system that includes hardware simulators, language runtime systems, and compiler support alongside the new TM applications.
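For readers who have not met TM before, the idea can be sketched as a toy software version in Python. This is an illustration only, not the Centre's TM system or anything from QuakeTM; the names TVar, Transaction, and atomically are hypothetical. Each transaction buffers its writes, remembers the version of everything it read, and commits only if none of those versions has changed; otherwise it retries.

```python
import threading

_commit_lock = threading.Lock()  # guards reads of a cell and the commit step

class TVar:
    """A shared cell with a version number bumped on every committed write."""
    def __init__(self, value):
        self.value = value
        self.version = 0

class Conflict(Exception):
    """Raised when a value read by the transaction changed before commit."""

class Transaction:
    def __init__(self):
        self.reads = {}   # TVar -> version seen at first read
        self.writes = {}  # TVar -> buffered new value

    def read(self, tvar):
        if tvar in self.writes:          # read-your-own-writes
            return self.writes[tvar]
        with _commit_lock:
            version, value = tvar.version, tvar.value
        self.reads.setdefault(tvar, version)
        return value

    def write(self, tvar, value):
        self.writes[tvar] = value        # not visible to others until commit

    def commit(self):
        with _commit_lock:
            for tvar, seen in self.reads.items():
                if tvar.version != seen:
                    raise Conflict()
            for tvar, value in self.writes.items():
                tvar.value = value
                tvar.version += 1

def atomically(body):
    """Run body(tx) repeatedly until it commits without a conflict."""
    while True:
        tx = Transaction()
        try:
            result = body(tx)
            tx.commit()
            return result
        except Conflict:
            continue

def transfer(src, dst, amount):
    def body(tx):
        tx.write(src, tx.read(src) - amount)
        tx.write(dst, tx.read(dst) + amount)
    atomically(body)

a, b = TVar(100), TVar(0)
threads = [threading.Thread(target=transfer, args=(a, b, 1)) for _ in range(50)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

The programmer writes transfer as if it ran alone; no per-account locks appear in the application code. Hardware TM aims to provide the same guarantee with support from the chip rather than a software commit lock.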
BSCMSRC researchers have also developed a dataflow programming model called StarSs, in which data that is produced and consumed in applications automatically “flows” at program runtime. This frees the programmer from explicitly architecting data movements in his or her application and makes it much easier to develop software. BSCMSRC researchers are integrating the StarSs programming model with the Barrelfish research OS, a new message-passing, open-source operating system being developed by Microsoft Research and ETH Zurich.
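The dataflow idea described above can be sketched in a few lines of Python. This is a toy under stated assumptions, not the Centre's programming model or its syntax; DataflowRuntime and its task decorator are hypothetical names. Each task declares only the data it consumes and produces, and the runtime works out the execution order and runs independent tasks in parallel.

```python
from concurrent.futures import ThreadPoolExecutor

class DataflowRuntime:
    def __init__(self):
        self.tasks = []  # (function, input names, output name)

    def task(self, inputs, output):
        """Register a function together with the data it reads and writes."""
        def register(fn):
            self.tasks.append((fn, tuple(inputs), output))
            return fn
        return register

    def run(self, initial):
        data = dict(initial)
        pending = list(self.tasks)
        with ThreadPoolExecutor() as pool:
            while pending:
                # A task is ready once every input it declared exists.
                ready = [t for t in pending if all(n in data for n in t[1])]
                if not ready:
                    raise RuntimeError("missing input or cyclic dependency")
                futures = [
                    (out, pool.submit(fn, *(data[n] for n in ins)))
                    for fn, ins, out in ready
                ]
                for out, future in futures:
                    data[out] = future.result()
                pending = [t for t in pending if t not in ready]
        return data

rt = DataflowRuntime()

# Registered first, but runs last: its inputs are produced by the other tasks.
@rt.task(inputs=["squares", "count"], output="mean_square")
def mean_square(squares, count):
    return sum(squares) / count

@rt.task(inputs=["raw"], output="squares")
def square_all(raw):
    return [x * x for x in raw]

@rt.task(inputs=["raw"], output="count")
def count_items(raw):
    return len(raw)

result = rt.run({"raw": [1, 2, 3, 4]})
```

Nothing in the application code says which task runs first or how data moves between tasks; the declared inputs and outputs are enough for the runtime to schedule square_all and count_items together and mean_square afterward.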
“BSC’s expertise in computer architecture has been a great fit with our expertise in programming language implementation,” notes Tim Harris, senior researcher at Microsoft Research Cambridge. “This cross-disciplinary approach has led to proposals for new, general-purpose hardware features to accelerate the language runtime systems that underpin modern languages such as Haskell and C#.”
In marking the BSCMSRC’s fifth anniversary, BSC Director Mateo Valero commented, “I am proud of the impact of the work done by a very young team at the Centre in our five years of existence. With the multidisciplinary competences of our research personnel, the Centre is in a unique position to influence both hardware and software design. I am also very happy to see Microsoft Research being a major actor in our little Silicon Port at Barcelona in the Mediterranean.”
Fabrizio Gagliardi, Microsoft Research Connections director for Europe, the Middle East, and Africa—and Mateo’s counterpart in this adventure—adds, “Our collaboration with Mateo and his team of computer architects goes a long time back and was the foundation for this joint endeavor. I am very pleased and proud for the results of this collaboration and the resonance and the impact that this is having worldwide.”
—Kenji Takeda, Solutions Architect and Technical Manager, Microsoft Research Connections EMEA