这篇文章挺好玩,本来是讲软件测试的adequacy criteria的:
M. Hutchins, H. Foster, T. Goradia, and T. Ostrand, "Experiments of the effectiveness of dataflow- and controlflow-based test adequacy criteria," in Proceedings of the 16th international conference on Software engineering, Sorrento, Italy, 1994, pp. 191-200.
被引用了500多次,但是大部分引用它的论文都是因为用到其提出的数据集,大名鼎鼎的西门子数据集(Siemens Test Suite),例如下面这几篇文章:
R. Santelices and M. J. Harrold, "Exploiting program dependencies for scalable multiple-path symbolic execution," in Proceedings of the 19th international symposium on Software testing and analysis, Trento, Italy, 2010, pp. 195-206.
H. Cheng, D. Lo, Y. Zhou, X. Wang, and X. Yan, "Identifying bug signatures using discriminative graph mining," in Proceedings of the eighteenth international symposium on Software testing and analysis, Chicago, IL, USA, 2009, pp. 141-152.
偶尔发现了一篇文章是引用其test adequacy criteria:
S. Elbaum, S. Kanduri and A. Andrews, "Trace anomalies as precursors of field failures: an empirical study," Empirical Softw. Engg., vol. 12, pp. 447-469, 2007.
原作者肯定想想不到,自己构造的数据集比方法影响力要广得多。
---------------------------------------------------------
今天是2012年5月6日,再补充一下,这个数据集可以在两个地方下载:
1. SIR网站:http://sir.unl.edu/php/index.php (需要注册)
2. aristotle Lab的网站:http://pleuma.cc.gatech.edu/aristotle/Tools/subjects/ (不需要注册)
补充这点内容,方便国内研究人员下载。(我可是对这些数据魂牵梦绕过好久)