Gunnar Kudrjavets

Iterative Identification of Fault-Prone Binaries Using In-Process Metrics

Interestingly enough yet another research paper on what I've worked on has been accepted to a software engineering conference. Lucas Layman, Nachi Nagappan, and I will be at ESEM 2008 presenting on 10/10/2008 the results of our work.

 Here's the summary:

Code churn, the amount of code change taking place within a software unit over time, has been correlated with fault-proneness in software systems. We investigate the use of code churn and static metrics collected at regular time intervals during the development cycle to predict faults in an iterative, in-process manner. We collected 159 churn and structure metrics from six, four-month snapshots of a 1 million LOC Microsoft product. The number of software faults fixed during each period is recorded per binary module. Using stepwise logistic regression, we create a prediction model to identify fault-prone binaries using three parameters: code churn (the number of new and changed blocks); class Fan In and class Fan Out (normalized by lines of code). The iteratively-built model is 80.0% accurate at predicting fault-prone and non-fault-prone binaries. These fault-prediction models have the advantage of allowing the engineers to observe how their fault-prediction profile evolves over time.

See you in Kaiserslautern, Germany. 

Published Monday, September 15, 2008 9:19 AM by gunnarku

Comments

No Comments
Anonymous comments are disabled

This Blog

Syndication

Tags

No tags have been created or used yet.

News

These postings are provided "AS IS" with no warranties, and confer no rights. Additionally, views expressed herein are my own and not those of my employer, Microsoft.

© 2009 Microsoft Corporation. All rights reserved. Terms of Use  |  Trademarks  |  Privacy Statement
Microsoft
Page view tracker