This page summarises an exercise initiated by Jenny Linnerud of Statistics Norway to "text mine" the documentation of GSBPM. The aim was to identify empirically the "information objects" that the documentation recognises as relevant to (e.g. as inputs to and/or outputs from) the phases and sub-processes of the statistical business process that GSBPM describes.
The primary purpose of the work was to identify "Information Object" Candidates (IOCs) to be considered for possible inclusion in the GSIM (Generic Statistical Information Model) Common Reference Model. It represented a pragmatic, business-focused method of identifying possible terms and concepts for that model, balancing more abstract and theoretical approaches to identifying and describing information objects.
The results of this approach may also be useful, however, to other groups and individuals interested in understanding and analysing what the current documentation of GSBPM says about data and metadata flows during the statistical business process.
The results of the analysis are reported in the attached spreadsheet.
The first tab shows the results in terms of IOCs categorised by the information object groups in the GSIM Common Reference Model V0.1. The second tab shows IOCs categorised by the phase of the GSBPM in which they are referenced.
The methods used to text mine the documentation of GSBPM and arrive at the IOCs shown in the attached spreadsheet are summarised in the attached document.