• Collecting Data from Continuous Practices: an Infrastructure to Support Team Development
Authors
Abstract
Through software analytics, raw data with low valueoriginates information that is valuable and able to provide
insights, enabling the support of claims that would otherwise
not be possible to verify. The software development ecosystem
has plenty of sources that can help understanding the quality of
processes and products but, to reach that goal, it is necessary to
collect and store the data. This paper describes an infrastructure
to allow the collection, storage and analysis of data from software
repositories. The scope of the research is an industrial case
study, which encompasses several specificities: tools and work
methodology. The current solution is able to collect information
from the continuous delivery & deployment pipeline, which
includes data sources such as the source code repository (SVN),
the static analysis tool (SonarQube), the continuous integration
server (from Jenkins jobs) and the continuous testing tool (an
in-house tool called Cerberus). Future work also includes the
implementation of components that will allow the collection of
unstructured data from the bug-tracking system and incident
management tool. As stated in the literature, correlating the
history of issues and incidents will allow the team to address,
or at least identify, areas of improvement.