Measuring the Heterogeneity of Cross-company Dataset
Document type: Conference paper
Authors | Chen J(陈嘉) ; Yang Y(杨叶) ; Zhang W(张文) ; Gregory Gay |
Publication date | 2010-06 |
Conference name | Profes 2010 |
Conference date | 2010-6-22 |
Conference location | University of Limerick, Ireland |
Keywords | Heterogeneous datasets; software effort estimation; parameter comparison; estimation model calibration |
Abstract | As a standard practice, general effort estimate models are calibrated from large cross-company datasets. However, many of the records within such datasets are taken from companies that have calibrated the model to match their own local practices. Locally calibrated models are a double-edged sword; they often improve estimate accuracy for that particular organization, but they also encourage the growth of local biases. Such biases remain present when projects from that firm are used in a new cross-company dataset. Over time, such biases compound, and the reliability and accuracy of a general model derived from the data will be affected by the increased level of heterogeneity. In this paper, we propose a statistical measure of the exact level of heterogeneity of a cross-company dataset. In experimental tests, we measure the heterogeneity of two COCOMO-based datasets and demonstrate that one is more homogeneous than the other. Such a measure has potentially important implications for both model maintainers and model users. Furthermore, a heterogeneity measure can be used to inform users of the appropriate data handling techniques. |
Subject | Software engineering |
Language | English |
Source URL | [http://ir.iscas.ac.cn/handle/311060/14786] |
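Collection | Institute of Software_Internet Software Technology Laboratory_Conference papers |
Recommended citation (GB/T 7714) | Chen J, Yang Y, Zhang W, et al. Measuring the Heterogeneity of Cross-company Dataset[C]. In: Profes 2010. University of Limerick, Ireland. 2010-6-22. |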
Ingest method: OAI harvesting
Source: Institute of Software