Howison, James, and Kevin Crowston. “The Perils and Pitfalls of Mining SourceForge”. Workshop on Mining Software Repositories, 26th International Conference on Software Engineering, 2004.
Abstract
SourceForge provides abundant accessible data from Open Source Software development projects, making it an attractive data source for software engineering research. However it is not without theoretical peril and practical pitfalls. In this paper, we outline practical lessons gained from our spidering, parsing and analysis of SourceForge data.
Year of Publication
2004
Conference Name
Workshop on Mining Software Repositories, 26th International Conference on Software Engineering
Conference Location
Edinburgh, Scotland