A
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Acquisition Activity |
Business |
The set of executed processes and the actual resources required as inputs and produced as outputs to acquire data about a given Population for a particular reference period. It includes the process and resources required to acquire data in a Statistical Program consisting of gathering data via one or more Data Channels in order to create or feed one or more Data Resources . |
This object holds Statistical Activity information that relates specifically to data collection or acquisition. It inherits the relationships and attributes from the Statistical Activity type. |
|
|
Acquisition Design |
Business |
The specification of the resources required and processes used and description of relevant methodological information for a set of activities to collect data about a given Population . |
This object holds Statistical Program Design information that relates specifically to data collection or acquisition. It inherits the relationships and attributes from the Statistical Program Design type. Related to Acquisition Design is Acquisition Activity, which holds the detailed information about the conduct of the Acquisition Activity for a single reference period. The Acquisition Design describes the methodology and design elements that are intended to apply across all Acquisition Activities until such time as a decision is made to alter the design. |
|
|
Administrative Details |
Base |
A placeholder for extensions to the GSIM model. |
GSIM does not seek to replicate or embed constructs from the administration of objects held in metadata registries, but includes this placeholder to allow for future extensions. |
|
|
Analysis Population |
Concepts |
A Population used for the analysis, processing, or dissemination of statistical data. |
Population determined by parameters of an analysis |
object class, analytical population |
|
Analysis Unit |
Concepts |
A Unit that is defined for the analysis, processing, or dissemination of statistical data. |
Object corresponding to an Analysis Population |
analytical unit, unit of analysis |
|
Assessment |
Business |
An activity to analyze quality or effectiveness and consider available options. |
The Assessment is a generic class that groups different types of more specific assessments. An example of Assessment is a SWOT assessment that identifies the Strengths, Weaknesses, Opportunities and Threats of a specified proposal. Another example is a Gap Analysis that formalizes the difference between the current situation and the state that must be reached to meet certain requirements. An Assessment can use various objects as inputs, whether they are the main objects that the Assessment is about or auxiliary information objects that help to accomplish the assessment. |
|
|
Attribute Component |
Structures |
The role given to a Represented Variable in the context of a Data Structure . The role is to hold the pertinent information in addition to the identifiers and measures for a particular unit in a Data Set.
|
For example the publication status of an observation (e.g. provisional, final, revised), or information specific to the use of an Identifier in the context of a Data Set. |
|
B
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Business Case |
Business |
A proposal for a body of work that will deliver outputs designed to achieve outcomes. A Business Case will provide the reasoning for initiating a new Statistical Program Design for a Statistical Program, as well as the details of the change proposed. |
A Business Case is produced as a result of a detailed consideration of a Change Definition . It sets out a plan for how the change described by the Change Definition can be achieved. A Business Case usually comprises various evaluations, for example a SWOT assessment, or Gap Analyses for the different solutions that are considered for satisfying the Statistical Need . The Business Case will also specify the stakeholders that are impacted by the Statistical Need or by the different solutions that are required to implement it. |
|
|
Business Function |
Production |
Something an enterprise does, or needs to do, in order to achieve its objectives. |
A Business Function delivers added value from a business point of view. It is delivered by bringing together people, processes and technology (resources), for a specific business purpose.
Business Functions answer in a generic sense "What business purpose does this Process Step Design serve?" Through identifying the Business Function associated with each Process Step Design it becomes easier for someone in the future with an equivalent business need to identify Process Step Designs that they might reuse (in whole or in part). A Business Function may be defined directly with descriptive text and/or through reference to an existing catalogue of Business Functions. The phases and sub processes defined within GSBPM can be used as an internationally agreed basis for cataloguing high level Business Functions. A catalogue might also include Business Functions defined at a lower level than "sub process". For example, "Identify and address outliers" might be catalogued as a lower level Business Function within the "Review, validate and edit" function (5.3) defined within GSBPM. |
|
|
Business Service |
Production |
A defined interface for accessing business capabilities (an ability that an organization possesses, typically expressed in general and high level terms and requiring a combination of organization, people, processes and technology to achieve). |
A Business Service may provide one means of accessing a particular Business Function . Requesting a particular service through the defined interface may result in a business process (workflow) being executed.
The explicitly defined interface of a Business Service can be seen as representing a "service contract". If particular inputs are provided then the service will deliver particular outputs in compliance with specific parameters (for example, within a particular period of time).
In the case of GSIM, a Business Service typically implements a particular Process Method to perform a particular Business Function .
Note: The interface of a Business Service is not necessarily IT based. For example, a typical postal service will have a number of service interfaces: a public letter box for posting letters, and a counter at the post office for interacting with postal workers. |
|
C
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Category |
Concepts |
A Concept whose role is to extensionally define and measure a characteristic. |
Categories for the Concept of sex include: Male, Female
Note: An extensional definition is a description of a Concept by enumerating all of its subordinate Concepts under one criterion of subdivision.
For example, the Noble Gases (in the periodic table) are extensionally defined by the set of elements including Helium, Neon, Argon, Krypton, Xenon, Radon. (ISO 1087-1) |
class |
|
Category Item |
Concepts |
An element of a Category Set. |
A type of Node |
|
|
Category Set |
Concepts |
A list of Categories |
A kind of Node Set for which the Categories have no assigned Designations .
For example: Male Female |
|
|
Change Definition |
Business |
A structured, well-defined specification for a proposed change. |
A related object - the Statistical Need - is a change expression as it has been received by an organization. A Statistical Need is a raw expression of a proposed change, and is not necessarily well-defined. A Change Definition is created when a Statistical Need is analyzed by an organization, and expresses the raw need in well-defined, structured terms.
A Change Definition does not assess the feasibility of the change or propose solutions to deliver the change - this role is satisfied by the Business Case object. The precise structure or organization of a Change Definition can be further specified by rules or standards local to a given organization.
Once a Statistical Need has been received, the first step is to do the conceptual work to establish what it is we are trying to measure. The final output of this conceptual work is the Change Definition.
The next step is to assess how we are going to make the measurements - to design a solution and put forward a proposal for a body of work that will deliver on the requirements of the original Statistical Need . The Change Definition is an input to this Process Step and the final Business Case is an output. Depending on the needs of individual agencies a Change Definition may be created before or after a Business Case has been created, or even created to a basic extent before the Business Case development and further developed after a Business Case has been approved and a decision made to proceed with the change. |
|
|
Channel Activity Specification |
Business |
The description of the Data Channel made at run time. |
This object is a specialization of a Data Channel and is used to describe the behaviour of a Data Channel at execution time. |
|
|
Channel Design Specification |
Business |
The description of the Data Channel made at design time. |
This object is a specialization of a Data Channel, and is used to design the characteristics of a Data Channel before it is used. |
|
|
Classification |
Concepts |
A set of related Classification Schemes . The Classification relates Classification Schemes which differ as versions or variants of each other. |
For example, NAICS (North American Industry Classification System) is a Classification, but NAICS 2002 and NAICS 2007 are Classification Schemes, as they are different versions of NAICS. |
|
|
Classification Family |
Concepts |
A set of Classifications that are related from a certain point of view. |
The Classification Family includes Classifications devoted to describing the same subject matter, such as industries. |
|
|
Classification Item |
Concepts |
A Category at a certain Level within a Classification Scheme . |
|
|
|
Classification Scheme |
Concepts |
A structured list of mutually exclusive Categories . Such a structured list may be linear or hierarchically structured. |
Classification Scheme has two subtypes - Classification Version and Classification Variant. In a hierarchical Classification Scheme, Categories are organized into Levels determined by the hierarchy. The Categories in each Level are mutually exclusive and exhaustive. |
|
|
Classification Variant |
Concepts |
A Classification Variant is based on a Classification Version . In a variant, the Categories of the Classification Version are split, aggregated or regrouped to provide additions or alternatives to the standard order and structure of the base version. |
|
|
|
Classification Version |
Concepts |
A Classification Version is a list of mutually exclusive Categories representing the version-specific values of the classification variable. |
A Classification Version has a certain normative status and is valid for a given period of time. |
|
|
Code |
Concepts |
A Designation for a Category |
Codes are unique within their Code List. Example: M (Male) F (Female) |
|
|
Code Item |
Concepts |
An element of a Code List . |
A type of Node |
|
|
Code List |
Concepts |
A list of Categories where each Category has a predefined Code assigned to it. |
A kind of Node Set for which the Category contained in each Node has a Code assigned as a Designation .
For example: 1 - Male 2 - Female |
|
|
Code Value |
Concepts |
An alpha-numeric string used to represent a Code . |
This is a kind of Sign used for Codes . |
|
|
Collection Description |
Business |
The set of information that provides a textual description of the processes and methods used to undertake an Acquisition Activity . It provides a set of contextual and reference metadata about the acquisition process. |
|
|
|
Concept |
Concepts |
Unit of thought differentiated by characteristics |
ISO 1087-1 defines Concept as a "unit of knowledge created by a unique combination of characteristics". First, the term knowledge is poorly defined, and the word thought seems to capture the idea more cleanly. Second, different systems may try to capture the same thought but depend on different characteristics (i.e., attributes). For instance, typical demographic surveys care about age, sex, income, ethnicity, and education of persons. However, persons in a justice survey are either criminals or victims. |
|
|
Concept System |
Concepts |
Set of Concepts structured by the relations among them. |
Here are 2 examples 1) Concept of Sex: Male, Female, Other 2) ISIC (the list is too long to write down) |
|
|
Conceptual Domain |
Concepts |
Set of Categories , irrespective of any relations among them |
Here are 3 examples - 1) Sex categories (enumerated CD): male, female, other 2) Non-negative whole number (described CD) 3) Endowment categories (enumerated CD) $0-$99,999; $100,000-$999,999; $1,000,000 and above |
|
|
Contact Details |
Base |
A collection of modes and strings by which an Organization Item can be contacted. |
Contact modes can include (but are not limited to) telephone, e-mail or fax. In these cases, the relevant strings would be the telephone number, e-mail address and fax number. |
|
|
Context Key |
Base |
Gives semantic or structural meaning to the value of a Contextual String. |
Context Key has two sub classes - Type and Language. For example: Type = Short Name, or Language = French |
|
|
Contextual String |
Base |
A textual value, which is given context by one or more Context Keys. |
A Contextual String can be given context by one or more Context Keys. For example: Type = Short Name, or Language = French |
|
|
Control Transition |
Business |
Governs how to determine the next Instrument Control based on factors such as the current location in the Instrument , the response to the previous questions etc. |
|
|
|
Correspondence Table |
Concepts |
A tool for the linking of Classifications. A Correspondence Table systematically explains where, and to what extent, the Categories may be found in different Classification Schemes of the same Classification or in Classification Schemes of different Classifications. |
Given 2 Category Sets: 1) Marital Status A: Married, Single; 2) Marital Status B: Married, Single, Widowed, Divorced. A Correspondence Table harmonizing the 2 Category Sets will contain Maps that link Categories from each set: Married (A) -> Married (B); Single (A) <- Single (B), Widowed (B), Divorced (B), where the arrow points to the Category which is more generic. |
|
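The Code List and Correspondence Table entries above can be illustrated with a minimal Python sketch. All names here (`CodeItem`, `build_code_list`, `correspondence`, `to_generic`) are hypothetical and are not part of GSIM. The sketch assumes a Code List can be held as a mapping from Code to Category, and a Correspondence Table as a set of Maps from each Category in set B to the more generic Category in set A, following the Marital Status example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CodeItem:
    """Hypothetical stand-in for a Code Item: a Category plus its Code."""
    code: str
    category: str


def build_code_list(items):
    """Return a dict keyed by Code, rejecting duplicates,
    since Codes are unique within their Code List."""
    code_list = {}
    for item in items:
        if item.code in code_list:
            raise ValueError(f"duplicate Code: {item.code}")
        code_list[item.code] = item.category
    return code_list


# Code List from the glossary example: 1 - Male, 2 - Female.
sex_codes = build_code_list([CodeItem("1", "Male"), CodeItem("2", "Female")])

# Category Sets from the Marital Status example.
marital_a = ["Married", "Single"]
marital_b = ["Married", "Single", "Widowed", "Divorced"]

# Maps of the Correspondence Table: Category in B -> more generic Category in A.
correspondence = {
    "Married": "Married",
    "Single": "Single",
    "Widowed": "Single",
    "Divorced": "Single",
}


def to_generic(category_b):
    """Look up the more generic Category (set A) for a Category in set B."""
    return correspondence[category_b]
```

A plain dictionary is enough here because each Category in set B maps to exactly one Category in set A; a many-to-many correspondence would need a list of pairs instead.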
D
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Data Channel |
Business |
A means of exchanging data. |
A Data Channel is an abstract object that describes the means for communicating with Data Resource(s). The Data Channel identifies the Instrument Implementation, Mode, and Data Resource that are to be used in a process. In some cases the Data Channel that is used by the Data Provider to send its responses could be different from the one used by the statistical office or organization to request information; for example, the statistical office may provide electronic forms that can be downloaded by the Data Provider and, once answered, returned by traditional mail. Two specialized objects are used to implement this abstract object: Channel Design Specification used at design time and Channel Activity Specification used at run time. |
|
|
Data Consumer |
Base |
An organization that uses data or metadata as input for further processing. |
|
|
|
Data Flow |
Structures |
The Data Flow represents both the availability of data over time and the availability of sub sets of the possible data that could be made available according to a Data Structure . |
There may be many data sets structured according to a Data Structure , perhaps made available at a pre-defined frequency (for example, monthly).
There can be many Data Flows that share the same Data Structure : for instance data for National Accounts may be compartmentalized into a number of Data Flows for organizational purposes or for data discovery purposes (there can be different Data Flows for different sub sets of National Accounts where each sub set is structured by the same Data Structure ). |
|
|
Data Location |
Structures |
Identifies where a Data Set can be retrieved from. |
This could be a Data Set structured in a known format and retrievable via a URL, or the URL of a service that can be queried to return such a Data Set . It could also be the location of a publication. |
|
|
Data Point |
Structures |
A placeholder in a Data Set for an item of factual information obtained by measurement or created by a production process |
Example for Unit Data: (1212123, 43) could be the age in years on the 1st of January 2012 of a person ( Unit ) with the social security number 1212123. The social security number is an identifying variable for the person whereas the age, in this example, is a variable measured on the 1st of January 2012. |
|
|
Data Provider |
Base |
An organization, association, group or person who delivers information for a Statistical Activity. |
A Data Provider is an organization, association, group or person that possesses statistical information (that it has collected, produced, bought or otherwise acquired) and that is willing to supply those data and metadata to a statistical organization. |
data supplier |
|
Data Resource |
Structures |
An organized collection of stored information made of one or more Data Sets which may be sourced from multiple Acquisition or Statistical Activities . |
Data Resources are collections of structured or unstructured information that are used by a statistical activity to produce information. This information object is a specialization of an Information Resource . |
data source |
|
Data Set |
Structures |
An organized collection of data. |
Examples of Data Sets could be observation registers, time series, longitudinal data, survey data, rectangular data sets, event-history data, tables, data tables, cubes, registers, hypercubes, and matrixes. A broader term for Data Set could be data. A narrower term for Data Set could be data element, data record, cell, field |
database, data file, file, table |
|
Data Structure |
Structures |
Defines the structure of an organized collection of data ( Data Set ). |
The structure is described using Data Structure Components that can be either Attribute Components, Identifier Components or Measure Components . Examples for unit data include social security number, country of residence, age, citizenship, country of birth, where the social security number and the country of residence are both identifying components ( Unit Identifier Component ) and the others are measured variables obtained directly or indirectly from the person ( Unit ) and are Unit Measure Components . |
|
|
Data Structure Component |
Structures |
The identification of the Represented Variable used in the context of a Data Structure . |
A Data Structure Component can be an Attribute Component, Measure Component or an Identifier Component .
Example of Attribute Component : The publication status of an observation such as provisional, revised.
Example of Measure Component : age and height of a person in a Unit Data Set or number of citizens and number of households in a country in a Data Set for multiple countries ( Dimensional Data Set ).
Example of Identifier Component : The personal identification number of a Swedish citizen for unit data or the name of a country in the European Union for dimensional data. |
|
|
Data Type |
Concepts |
The computational model for some data, characterized by axioms and operations, and containing a set of distinct values. |
Here are 3 examples (with type families taken from ISO/IEC 11404) 1) State (nominal data): unordered, no arithmetic 2) Integer (interval data): Ordered, subtraction, bounded below 3) Enumerated (ordinal data): ordered, no arithmetic |
|
|
Datum |
Concepts |
Association of a Unit with an element of a Value Domain . |
A Datum is the actual instance of data that was collected. It is the value which populates a cell in a table. Here are 2 examples - 1. <M, male> (for unit Dan Gillman with respect to sex of US persons) 2. <3, $1,000,000 and above> (for unit Johns Hopkins with respect to endowments for US universities) |
|
|
Described Conceptual Domain |
Concepts |
A Conceptual Domain , with each Concept defined by a Rule . |
For example: All real numbers between 0 and 1 (where 'number' is a Concept, and 0 and 1 are possible designations.) |
non-enumerated conceptual domain |
|
Described Value Domain |
Concepts |
A Value Domain , with each Designation defined by a Rule . |
For example: All real decimal numbers between 0 and 1 (Where 'decimal number' is a Designation, such as the numeric string 0.5 for the number one half) |
non-enumerated value domain |
|
Design Context |
Business |
Methodological metadata that provide the basis for the specification of the information objects required as input to and output from the Process Step Design including Process Method and Rules . |
|
|
|
Designation |
Concepts |
The name given to an object so it can be identified. |
The association of a Concept with a Sign which denotes it. |
term, code, appellation |
|
Dimensional Attribute Component |
Structures |
A Represented Variable that is required to supply information in addition to the identification and measures of a Dimensional Data Set . |
Example: The publication status of an observation such as provisional, revised. |
|
|
Dimensional Data Point |
Structures |
A placeholder or cell in a Dimensional Data Set determined by the crossing of (all) the values for the Identifier Components to contain the value ( Datum ) for an Instance Variable (defined by a Measure Component ) with respect to a given Unit . |
A Dimensional Data Point is uniquely identified by the combination of exactly one value for each of the dimensions ( Dimensional Identifier Component ) and one measure ( Dimensional Measure Component ). There may be multiple values for the same Dimensional Data Point, that is, for the same combination of dimension values and the same measure. The different values represent different versions of the data in the Data Point. Values are only distinguished on the basis of quality, date/time of measurement or calculation, status, etc. This is handled through the mechanisms provided by the Datum information object. |
cell |
|
Dimensional Data Set |
Structures |
A collection of aggregated data that conforms to a known structure. |
|
hyper cube, macro data, n-cube, aggregated data, multi-dimensional data, dimensional data |
|
Dimensional Data Structure |
Structures |
Defines the structure of a collection of aggregated data by Represented Variables (in their respective roles as Dimensional Measure Components, Dimensional Attribute Component or Dimensional Identifier Components ) and their Value Domains . |
This is similar to the SDMX Data Structure Definition: Set of structural metadata associated to a Data Set , which includes information about how Concepts are associated with the measures, dimensions, and attributes of a data cube, along with information about the representation of data and related descriptive metadata. |
file description, data set description |
|
Dimensional Identifier Component |
Structures |
A Represented Variable that is required to identify or classify each observation value in a Dimensional Data Set . |
Example: The name of a country in the European Union, the type of dwelling, the gender of a person, age-category of person |
dimension |
|
Dimensional Measure Component |
Structures |
A Represented Variable that has been given a role in a collection of aggregated data to hold the summary values (means, mode, total, index, etc.) for a specific sub-population. |
Examples: average age or total income in a sub-population |
measure |
|
Dissemination Activity |
Business |
The set of executed processes and the actual resources required as inputs and produced as outputs in the dissemination of data for a given Population for a particular reference period, or of metadata. It describes the process and resources required in the dissemination of data and metadata in a Statistical Program. |
This object holds Statistical Activity information that relates specifically to data and metadata dissemination. It inherits the relationships and attributes from the Statistical Activity type. A special type of Dissemination Activity is Publication Activity . |
|
|
Dissemination Design |
Business |
The specification of the resources required and processes used and description of relevant methodological information for a set of activities to disseminate data about a given Population, or metadata. |
This object holds Statistical Program Design information that relates specifically to dissemination. It inherits the relationships and attributes from the Statistical Program Design type. |
|
|
Dissemination Service |
Structures |
The mechanism for delivering, and possibly creating, structured content dynamically in response to a consumer request and in accordance with defined parameters as provided by that consumer. |
A Dissemination Service will deliver a Representation created by a process that it invokes. The inputs into the Dissemination Service determine and feed the process that is to be invoked.
A Dissemination Service retrieves the information to be structured and delivered through an Information Resource. As part of the service execution, the consumer may be given a chance to browse or search through the collection of information available from the Information Resource exposed by the Dissemination Service. Based on the results, the consumer can then refine the Output Specification as (further) input to the Dissemination Service to complete the process of creating and delivering the information required in the form of a Representation to the consumer.
Example:
1. SDMX SOAP Data Web Services: The query XML message provides the Service with data selection and the specification of the preferred format (e.g. Generic format or Structured format, time series or cross-sectional). Based on this input the Service will retrieve a Data Set from the Data Resource and invoke a process that will format the data as an SDMX data message. 2. A manual service such as a response to a telephone request where the person answering the call based on the caller's request would mail a PDF (which might either be a Product or dynamically created from another source). |
|
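The Data Structure Component and Dimensional Data Point entries above can be sketched in Python as follows. This is only an illustration under stated assumptions: the class names (`Component`, `DimensionalDataSet`), the component names and the numeric values are all hypothetical and made up for the example, and GSIM does not prescribe any implementation. The sketch shows a Data Point being identified by exactly one value per identifier component plus one measure, with several versions of the value kept for the same point and distinguished by status.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Component:
    """Hypothetical stand-in for a Data Structure Component."""
    name: str
    role: str  # "identifier", "measure" or "attribute"


@dataclass
class DimensionalDataSet:
    components: list
    # (identifier values, measure name) -> list of value versions
    points: dict = field(default_factory=dict)

    def names(self, role):
        return [c.name for c in self.components if c.role == role]

    def put(self, identifiers, measure, value, status):
        """Store one version of a value for the addressed Data Point."""
        dims = self.names("identifier")
        if set(identifiers) != set(dims):
            raise KeyError(f"expected exactly one value for each of {dims}")
        if measure not in self.names("measure"):
            raise KeyError(f"unknown measure: {measure}")
        key = (tuple(identifiers[d] for d in dims), measure)
        self.points.setdefault(key, []).append({"value": value, "status": status})
        return key


structure = [
    Component("country", "identifier"),
    Component("age_category", "identifier"),
    Component("average_age", "measure"),
    Component("publication_status", "attribute"),
]
data_set = DimensionalDataSet(structure)
cell = {"country": "Sweden", "age_category": "15-24"}
key = data_set.put(cell, "average_age", 19.5, "provisional")  # made-up value
data_set.put(cell, "average_age", 19.7, "revised")            # made-up value
```

Keeping a list of versions per key mirrors the statement that multiple values may exist for the same Dimensional Data Point; a real system would more likely attach the status through a separate Datum object.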
E
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Enumerated Conceptual Domain |
Concepts |
A Conceptual Domain expressed as a list of Categories . |
Example: The Sex categories of 'Male' and 'Female'. |
|
|
Enumerated Value Domain |
Concepts |
A Value Domain expressed as a list of Designations . |
Example - Sex Codes <m, male>; <f, female>; <o, other> |
|
|
Environment Change |
Business |
A requirement for change (type of Statistical Need ) that originates from a change in the operating environment of the statistical activity. |
An Environment Change reflects variations in the context of execution of the Statistical Activity that create a need for a modification in the way that this activity is conducted. Environment Changes can be of different origins and also take different forms. They can result from a precise event (budget cut, new legislation enforced) or from a progressive process (technical or methodological progress, application or tool obsolescence). Other examples of Environment Changes include the availability of a new Data Resource, the opportunity for new collaboration between agencies, etc.
Environment Change objects may be structured in very diverse ways, but an object will usually group text material describing the type of change that has occurred and created the need for change. This allows the statistical organization to document precisely the (possibly multiple) changes in environment that have led to the Statistical Need . |
|
|
Evaluation Assessment |
Business |
A type of Assessment that evaluates the process outputs of a statistical activity based on a formalized methodological framework. |
The evaluation can be done in regard to various characteristics of the output, for example its quality, the efficiency of the production process, its conformance to a set of requirements, etc. The result of an Evaluation Assessment can lead to the creation of a Statistical Need : in this case, the Statistical Need will reference the Evaluation Assessment for traceability and documentary purposes. |
|
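The difference between an Enumerated Value Domain and a Described Value Domain can be shown with a short Python sketch. The function names are hypothetical, and the inclusive bounds in the rule are an assumption, since the glossary only says "between 0 and 1". The enumerated domain lists its Designations explicitly (using the Sex Codes example), while the described domain accepts any Designation that satisfies a Rule.

```python
# Enumerated Value Domain: every Designation is listed explicitly.
enumerated_sex = {"m": "male", "f": "female", "o": "other"}


def in_enumerated(designation):
    """True if the Designation appears in the enumerated list."""
    return designation in enumerated_sex


def in_described(designation):
    """Rule-based membership: a numeric string denoting a real number
    between 0 and 1 (inclusive bounds are an assumption of this sketch)."""
    try:
        value = float(designation)
    except ValueError:
        return False
    return 0.0 <= value <= 1.0
```

For example, `in_described("0.5")` accepts the numeric string for one half without that string ever being listed, which is the point of defining the domain by a Rule.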
F
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Frame Population |
Concepts |
A Population represented by records in a frame, which is the observable part of a Target Population and provides a reasonable approximation to it. |
Example: most recent population census frame |
object class |
G
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Gap Analysis |
Business |
An expression of the difference (the 'gap') between the current state and a desired future state. |
A Gap Analysis is a type of Assessment that compares the actual state of the activity with a potential state that would correspond to the implementation of a change. An organization will list the factors that define its current state and what is needed to reach its target state. This will for example document a Business Case and help to take the decision to implement the change or not. |
need assessment |
I
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Identifiable Artefact |
Base |
An abstract class that comprises the basic attributes and associations needed for identification, naming and other documentation. |
|
|
|
Identifier Component |
Structures |
The role given to a Represented Variable in the context of a Data Structure . The role is to identify the unit in an organized collection of data. |
An Identifier Component is a sub-type of Data Structure Component. Example: The personal identification number of a Swedish citizen for unit data or the name of a country in the European Union for dimensional data. |
|
|
Individual |
Base |
A person who acts, or is designated to act towards a specific purpose. |
|
|
|
Information Request |
Business |
An outline of a need for new data or metadata required for a particular purpose. |
An Information Request is a special case of Statistical Need that comes in a more organized way, for example by specifying on which Subject Field the information is required, or what type of Concept is to be measured, or even the type of Units that are under consideration. The Information Request can for example be expressed internally, or by another statistical organization or authority. |
|
|
Information Resource |
Structures |
An abstract notion that is any organized collection of information. |
The only concrete sub class is Data Resource . The Information Resource allows the model to be extended to other types of resource. |
|
|
Instance Interviewer Instruction |
Business |
The use of an Interviewer Instruction in a particular Instrument . |
|
|
|
Instance Question |
Business |
The use of a Question in a particular Instrument . |
|
|
|
Instance Question Block |
Business |
The use of a Question Block in a particular Instrument . |
|
|
|
Instance Statement |
Business |
The use of a Statement in a particular Instrument . |
|
|
|
Instance Variable |
Concepts |
The use of a Represented Variable within a Data Set . It may include information about the source of the data. |
The Instance Variable is used to describe actual instances of data that have been collected. Here are 3 examples: 1) Gender: Dan Gillman has gender <m, male>, Arofan Gregory has gender<m, male>, etc.
2) Number of employees: Microsoft has 90,000 employees; IBM has 433,000 employees, etc. 3) Endowment: Johns Hopkins has endowment of <3, $1,000,000 and above>, Yale has endowment of <3, $1,000,000 and above>, etc. |
|
|
Instrument |
Business |
A tool conceived to record the information that will be obtained from the Observation Units. |
The Instrument describes the tool used to collect data. It could be a traditional survey, a set of requirements for a software collection program, a clinical procedure, etc.
Instrument is described from the perspective of the statistical organization collecting the data. It includes the special type of Instrument used for the explicit purpose of gathering data through a questionnaire (Survey Instrument). The behavior and characteristics of a concrete Instrument are determined by an Instrument Implementation. Several implementations can be based on the same Instrument, making it possible to use multiple channels and to apply different collection techniques (Modes) to gather data.
An example of this is when a printed form used to collect information for a survey is replaced by a software program; in both cases the Instrument will collect the data from the Unit, but the behavior of the Instrument will differ according to its implementation. |
|
|
Instrument Control |
Business |
A record of the flow of an Instrument and its use of Questions , Interviewer Instructions and Statements . |
|
|
|
Instrument Implementation |
Business |
A concrete and usable tool for gathering information based on the rendering of the description made by an Instrument . |
This represents an implementation of an Instrument . It describes the way in which an Instrument has been translated from a design to a concrete tool. It could represent a printed form, a software program made following a specific technological paradigm (web service, web scraping robot, etc.), the software used by a specialized device to collect data, etc. When it describes a Survey Instrument , it can contain descriptions of how each construct (e.g. Questions , Value Domains , validation Rules contained in the Instrument ) is implemented. |
|
|
Interviewer Instruction |
Business |
Directions given to an interviewer to aid the completion of the Instrument |
Example: “Show prompt card before reading question” |
|
L
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Language |
Base |
The linguistic code used. This takes into account geographic variations, e.g. Canadian French or Australian English. |
|
|
|
Level |
Concepts |
Set of Concepts which are mutually exclusive and exhaustive |
For example, section, division, group and class in ISIC Rev. 4. A Level is often associated with a Concept, which defines it. |
|
|
Logical Record |
Structures |
Describes a type of Unit Data Record for one Unit within a Unit Data Set . |
A Logical Record describes the record using variables of which one or more can uniquely identify the record ( Identifier Component ). It represents characteristics of a real or artificially constructed Unit , which could be represented by a Concept . The relationships between Logical Records are given by Record Relationships .
Examples: household, person or dwelling record. |
|
M
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Maintenance Agency |
Base |
The organization or expert body that maintains an artefact. |
|
|
|
Map |
Concepts |
An expression of the relation between a Category in a source Classification Scheme and a corresponding Category in the target Classification Scheme . |
Given 2 Category Sets 1) Marital Status A · Married · Single
2) Marital Status B · Married · Single · Widowed · Divorced
The 2 Married Categories may be compared as follows Married (A) -> Married (B) where the arrow points to the Category which is more generic. |
|
|
Measure Component |
Structures |
The role given to a Represented Variable in the context of a Data Structure . The role is to hold the observed/derived values for a particular Unit in an organized collection of data. |
A Measure Component is a sub-type of Data Structure Component. Examples are the age and height of a person in a Unit Data Set, or the number of citizens and number of households in a country in a Data Set for multiple countries (Dimensional Data Set). |
|
|
Mode |
Business |
A set of characteristics that describe the technique (the "how") used for the data acquisition through a given Data Channel based on a specific Instrument Implementation. |
While the Data Channel describes the means used for data acquisition, the Instrument describes the "what" (i.e. the content, for example, in terms of questions in a questionnaire or a list of agreed time series codes in a data exchange template) and an Instrument Implementation describes the tool used to apply the Instrument ; the Mode describes "how" the Data Channel is going to be used. The Mode is relevant for all types of Data Channels , Instrument Implementations and Instruments and can change over time. The list of Modes will potentially grow in the future and vary from organization to organization. |
|
|
Multiple Question Item |
Business |
A construct that has all of the properties of a Question but additionally links to sub questions. |
A Multiple Question Item is a specific type of Question . |
|
N
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Node |
Concepts |
A combination of a Category and related attributes. |
A Node is created as a Category , Code or Classification Item for the purpose of defining the situation in which the Category is being used. |
|
|
Node Set |
Concepts |
A set of Nodes |
Node Set is a kind of Concept System . Here are 2 examples:
1) Sex Categories · Male · Female · Other
2) Sex Codes · <m, male> · <f, female> · <o, other> |
|
|
Non Structured Data Set |
Structures |
A Data Set whose structure is not described in a Data Structure. |
|
|
O
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Observation Unit |
Concepts |
A Unit for which information can actually be obtained during data collection. |
The sub-set of the Population of interest for which information can actually be obtained. For example, if the Population is the persons living in Ontario, the Observation Units might be persons currently residing in Ontario who are not in an institution, not in a remote northern location, and not temporarily out of the province. |
collection unit, unit of observation, unit of collection |
|
Organization Item |
Base |
An abstract class which has two sub classes: Organization Unit and Individual. |
|
|
|
Organization Item Role |
Base |
The function or activities of an Organization Item , in statistical processes such as collection, processing and dissemination. |
|
organization role |
|
Organization Scheme |
Base |
A maintained collection of Organization Items . |
|
|
|
Organization Unit |
Base |
A unique framework of authority within which a person or persons act, or are designated to act, towards some purpose. |
|
organization |
|
Output Specification |
Structures |
Contains the specifications for the dynamic creation and delivery of a Representation by a Dissemination Service . |
An Output Specification is a specialization of Parameter Input . It is in fact a request for the dynamic creation and delivery of a Representation . It contains references to the information (e.g. a Data Set , a Data Structure , a Code List, a publication plan) desired with specifications concerning selections, (technical) form and/or method of delivery.
The references to the information come from the collection of information sources provided by the Information Resource that is exposed by the Dissemination Service . The consumer may select any (combination) of those information sources by including the references in the Output Specification . Note that the Output Specification may be "soft" or "broad" in that it may identify groups of internal information objects rather than individual ones. For instance, all Data Sets within a certain (sub) category or theme. This may lead to multiple Representations being delivered.
As part of the Output Specification, the consumer may be given the option to select one of a number of possible formats for the Representation (e.g. SDMX, CSV, JSON or PDF) or to select one of a number of possible methods for delivery (web service response, email, FTP, mail delivery, etc.). The Dissemination Service may also be used to request future deliveries of Representations for information that is not yet available. This results in a subscription, where the specification of the Representations to be delivered in future is given in the Output Specification. |
|
P
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Parameter Input |
Production |
Inputs used to specify which configuration should be used for a specific Process Step which has been designed to be configurable. |
Parameter Inputs may be provided where Rules and/or Business Service interfaces associated with a particular Process Step have been designed to be configurable based on inputs passed in to the Process Step. |
|
|
Population |
Concepts |
The total membership of a defined class of people, objects or events |
Population has a number of subtypes. Here are 3 examples – 1. US adult persons 2. US computer companies 3. Universities in the US |
|
|
Process |
Production |
A nominated set of Process Step Designs , and associated Process Controls (flow), which have been highlighted for possible reuse. |
In a particular statistical business process, some Process Steps may be unique to that business process while others may be applicable to other business processes. A Process can be seen as a reusable template. It is a means to accelerate design processes and to achieve sharing and reuse of design patterns which have proved effective. Reuse of process patterns can also lead to reuse of relevant Business Services and business Rules. By deciding to reuse a Process, a designer is actually reusing the "pattern" of Process Step Designs and Process Controls associated with that Process. They will receive a new instance of the Process Step Designs and Process Controls. If they then tailor their "instance" of the Process Step Designs and Process Controls to better meet their needs, they will not change the definition of the reusable Process. |
|
|
Process Control |
Production |
A decision point which determines the flow between Process Steps . |
The typical use of Process Control is to determine what happens next after a Process Step Design is executed. The possible paths, and the decision criteria, associated with a Process Control are specified as part of designing a production process. There is typically a very close relationship between the design of Process Steps and the design of Process Controls .
It is possible to define a Process Control where the next Process Step that will be executed is a fixed value rather than a "choice" between two or more possibilities. Where such a design would be appropriate, this feature allows, for example, initiation of a Process Step representing the GSBPM Process Phase (5) to always lead to initiation of GSBPM sub-process Integrate Data (5.1) as the next step.
This allows a process designer to divide a business process into logical steps (for example, where each step performs a specific Business Function ) even if these Process Steps will always follow each other in the same order. In all cases, the Process Control defines and manages the flow between Process Steps , even where the flow is "trivial". Process Step Design is left to focus entirely on the design of the Process Step itself, not sequencing between steps. |
|
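The separation described above, where Process Controls hold the flow and steps hold only their own work, can be sketched in Python. This is an illustration only: the names `ProcessControl`, `fixed` and `run`, and the step names, are hypothetical and are not defined by GSIM. A "trivial" control is modelled as one whose decision ignores the step output.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical illustration only: these names are not defined by GSIM.
@dataclass
class ProcessControl:
    # Given the output of the step just executed, return the name of the
    # next step, or None to stop.
    decide: Callable[[dict], Optional[str]]

def fixed(next_step: Optional[str]) -> ProcessControl:
    """A 'trivial' control: the next step is a fixed value, not a choice."""
    return ProcessControl(decide=lambda _output: next_step)

def run(steps: Dict[str, Callable[[dict], dict]],
        controls: Dict[str, ProcessControl],
        start: str, data: dict) -> List[str]:
    """Execute steps in the order the controls dictate; return the trace."""
    trace: List[str] = []
    current: Optional[str] = start
    while current is not None:
        data = steps[current](data)               # the step does its own work
        trace.append(current)
        current = controls[current].decide(data)  # the control picks what follows
    return trace

# Assumed step names, loosely following the GSBPM example in the text.
steps = {
    "process": lambda d: d,
    "integrate_data": lambda d: {**d, "integrated": True},
    "edit": lambda d: {**d, "edited": True},
    "finish": lambda d: d,
}
controls = {
    "process": fixed("integrate_data"),  # always leads to the same step
    "integrate_data": ProcessControl(
        decide=lambda d: "edit" if d.get("needs_edit") else "finish"),
    "edit": fixed("finish"),
    "finish": fixed(None),
}

trace_clean = run(steps, controls, "process", {"needs_edit": False})
trace_dirty = run(steps, controls, "process", {"needs_edit": True})
```

Because the step functions never name their successor, a designer could reorder or re-route the flow by changing only the `controls` mapping.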
|
Process Input |
Production |
Any instance of an information object which is supplied to a process step at the time its execution is initiated. |
Process Input has three subtypes: Process Support Input , Parameter Input and Transformable Input , to be able to identify the range of roles that the Process Inputs perform in the course of a Process Step . A Process Input may be provided to a Process Step to: - "add value" to that input by producing an output which represents a "transformed" version of the input. - control (for example, as a parameter) or influence the behavior of the Process Step . - be used by the Process Step as either an input or a guide.
Note: The same instance of an information object may perform different roles in regard to different Process Steps . |
|
|
Process Input Specification |
Production |
A record of the types of inputs required for a Process Step Design |
The Process Input Specification enumerates the Process Inputs required at the time a Process Step Design is executed. For example, if five different Process Inputs are required at the time, the Process Input Specification will describe each of the five inputs. For each required Process Input the Process Input Specification will record:
1. the type of Process Input ( Parameter Input, Process Support Input or Transformable Input ); and 2. the type of information object (based on GSIM) which will be used as the Process Input (Example types might be a Dimensional Data Set or a Classification ).
The Process Input to be provided at the time of Process Step execution will then be a specific instance of the type of information object specified by the Process Input Specification . For example, if a Process Input Specification requires a Dimensional Data Set then the corresponding Process Input provided at the time of Process Step execution will be a particular Dimensional Data Set . |
|
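As a rough sketch of how a Process Input Specification might be checked against the inputs actually supplied at execution time, the following Python example records the two items listed above (type of Process Input and type of information object) for each requirement. The class names, the `check_inputs` helper and the instance identifier are assumptions made for illustration, not part of GSIM.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical names for illustration; GSIM does not prescribe an implementation.
@dataclass(frozen=True)
class InputRequirement:
    input_type: str    # "Parameter Input", "Process Support Input" or "Transformable Input"
    object_type: str   # GSIM information object type, e.g. "Dimensional Data Set"

@dataclass(frozen=True)
class ProcessInput:
    input_type: str
    object_type: str
    instance_id: str   # identifies the specific instance supplied at execution

def check_inputs(spec: List[InputRequirement],
                 supplied: List[ProcessInput]) -> List[InputRequirement]:
    """Return the requirements in `spec` that no supplied input satisfies."""
    remaining = list(supplied)
    missing = []
    for req in spec:
        match = next((p for p in remaining
                      if (p.input_type, p.object_type)
                      == (req.input_type, req.object_type)), None)
        if match is None:
            missing.append(req)
        else:
            remaining.remove(match)  # each supplied input satisfies one requirement
    return missing

spec = [
    InputRequirement("Transformable Input", "Dimensional Data Set"),
    InputRequirement("Process Support Input", "Classification"),
]
# "ds-example" is a made-up identifier for a particular Dimensional Data Set.
supplied = [ProcessInput("Transformable Input", "Dimensional Data Set", "ds-example")]
missing = check_inputs(spec, supplied)
```

Here `missing` holds the one unmet requirement, the Classification needed as a Process Support Input.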
|
Process Method |
Production |
A specification of the technique which will be used to perform the unit of work. |
The technique specified by a Process Method is independent from any choice of technologies and/or other tools which will be used to apply that technique in a particular instance. The definition of the technique may, however, intrinsically require the application of specific Rules (for example, mathematical or logical formulas).
A Process Method describes a particular method for performing a Business Function. Similarly to the way in which Business Function documents the high level purpose of a process step ("what business purpose does this process step serve?"), Process Method documents the high level methodological "how" associated with the Process Step . Where a Process Step Design applies a method which is not specifically statistical in nature, however, this can still be recorded as the Process Method . |
|
|
Process Metric |
Production |
A Process Output whose purpose is to measure and report some aspect of how the Process Step performed during execution. |
A Process Metric is a sub-type of Process Output which records information about the execution of a Process Step . For example, how long it took to complete execution of the Process Step and what percentage of records in the Transformable Input was updated by the Process Step to produce the Transformed Output.
One purpose for a Process Metric may be to provide a quality measure related to the Transformed Output . For example, a Process Step with the Business Function of imputing missing values is likely to result, as its Transformed Output , in a Data Set where values that were missing previously have been imputed. Statistical quality measures, captured as Process Metrics for that Process Step may include a measure of how many records were imputed, and a measure of how much difference, statistically, the imputed values make to the dataset overall. Another purpose for a Process Metric may be to measure an aspect of the Process Step which is not directly related to the Transformed Output it produced. For example, a Process Metric may record the time taken to complete the Process Step or other forms of resource utilization (for example, human and/or IT).
Often these two kinds of Process Metrics will be used in combination when seeking to, for example, monitor and tune a statistical business process so its statistical outputs achieve the highest level of quality possible based on the time, staff and/or IT resources that are available. |
|
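The two kinds of Process Metric described above can be illustrated with a small Python sketch of an imputation step that returns its Transformed Output together with its metrics. The function name, the metric keys and the use of mean imputation are assumptions chosen for brevity. The sketch also assumes at least one observed value is present.

```python
import time
from statistics import mean
from typing import Dict, List, Optional, Tuple

def impute_missing(values: List[Optional[float]]) -> Tuple[List[float], Dict[str, float]]:
    """Hypothetical Process Step: fill missing values with the observed mean.

    Returns the Transformed Output and a dict of Process Metrics.
    """
    started = time.perf_counter()
    observed = [v for v in values if v is not None]
    fill = mean(observed)  # simple mean imputation, chosen only for illustration
    transformed = [fill if v is None else v for v in values]
    imputed = len(values) - len(observed)
    metrics = {
        # Resource-related metric: not directly about the output itself.
        "elapsed_seconds": time.perf_counter() - started,
        # Quality-related metrics: describe how the output was affected.
        "records_imputed": imputed,
        "percent_records_updated": 100.0 * imputed / len(values),
    }
    return transformed, metrics

output, metrics = impute_missing([10.0, None, 30.0, None])
```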
|
Process Output |
Production |
Any instance of an information object which is produced by a Process Step as a result of its execution. |
Process Outputs are subtyped.
- Transformed Output is the result which provides the "reason for existence" of the Process Step . If that output were no longer required then there would be no need for the Process Step in its current form. Typically a Transformed Output is either a Process Input to a subsequent Process Step or it represents the final product from a statistical business process.
- A Process Metric records information about the execution of a Process Step . For example, how long it took to complete execution of the Process Step and what percentage of records in the Transformable Input was updated by the Process Step to produce the Transformed Output . |
|
|
Process Output Specification |
Production |
Identifies the types of Process Outputs the associated Process Step Design will produce when it is executed. |
The Process Output Specification enumerates the Process Outputs that will be generated at the time the associated Process Step Design is executed. For example, if five different Process Outputs will be generated at the time of Process Step execution the Process Output Specification will describe each of the five outputs. For each Process Output the Process Output Specification will record:
1. the type of Process Output ( Process Metric or Transformed Output ) 2. the type of GSIM information object which will be generated as the Process Output .
An example type might be a Dimensional Data Set . The Process Output generated at the time of Process Step execution will then be a specific instance of the type of information object specified by the Process Output Specification . For example, if a Process Output Specification refers to generation of a Dimensional Data Set then the corresponding Process Output generated at the time of Process Step execution will be a particular Dimensional Data Set . For each Process Step execution a different Dimensional Data Set will be generated. |
|
|
Process Step |
Production |
One in a series of tasks which comprise a statistical business process |
A Process Step implements the Process Step Design specified in order to produce the outputs for which the process step was designed. |
|
|
Process Step Design |
Production |
Defines how a Process Step will be performed. This includes specifying the Process Inputs to that work and the Process Outputs that will be produced. |
A Process Step can be as big or small as the designer of a particular business process chooses. From a design perspective, one Process Step can contain "sub-steps", each of which is conceptualized as a (smaller) Process Step in its own right. Each of those "sub-steps" may contain "sub-steps" within them and so on indefinitely. It is a decision for the process designer to what extent to subdivide steps. At some level it will be appropriate to consider a Process Step to be a discrete task (unit of work) without warranting further subdivision. At that level the Process Step is designed to process particular Process Inputs, using a particular Business Service, to produce particular Process Outputs. The flow between a Process Step and any sub-steps is managed via Process Control. |
|
|
Process Step Execution Record |
Production |
A record of the execution of a Process Step. The record includes the actual Process Inputs to, and Process Outputs from, each Process Step, as well as the evaluation of each Process Control (which, in turn, determines the specific sequence of Process Steps performed during execution). |
Each Process Step is an instance of executing a repeatable Process Step Design. At the time of Process Step execution, specific instances of input objects (for example, specific Data Sets, specific Variables) will be supplied.
Each instance of Process Step may produce unique results even though the Process Step Design remains constant. One reason is that specific instances of inputs are provided for each Process Step.
Even when the inputs remain the same, metrics such as the elapsed time to complete execution of the Process Step may vary from execution to execution. For this reason, each Process Step Execution Record records details of inputs and outputs for that instance of implementing the Process Step Design. It also records the outcome of Process Control evaluation at the end of the Process Step.
In this way it is possible to trace the flow of execution of a business process through all the process steps which were involved. |
|
|
Process Support Input |
Production |
A form of Process Input that influences the work performed by the Process Step , and therefore influences its outcome. |
Process Support Input is a sub-type of Process Input . Typical Process Support Inputs include metadata resources such as Classifications or structural information used in the processing of data.
Examples of Process Support Inputs could include - A Code List which will be used to check whether the Codes recorded in one dimension of a dataset are valid - An auxiliary Data Set which will influence imputation for, or editing of, a primary Data Set which has been submitted to the Process Step as the Transformable Input .
In these examples, which Code List to use, or which auxiliary Data Set to use, may be specified via a Parameter Input . The details of the Code List or the auxiliary Data Set are Process Support Inputs . |
|
|
Product |
Structures |
Static package of objects that can be disseminated as a whole. |
A Product is a static presentation of artefacts created by fixed processes. The artefacts may be representations of data, visualizations, explanation, interpretation etc. Example: Publications, press releases, articles, list of classifications, etc. |
publication |
|
Production Activity |
Business |
The set of executed processes and the actual resources required as inputs and produced as outputs in the production of data for a given Population for a particular reference period. It describes the process and resources required in the production of data in a Statistical Program . |
This object holds Statistical Activity information that relates specifically to data production. It inherits the relationships and attributes from the Statistical Activity type. |
|
|
Production Design |
Business |
The specification of the resources required and processes used and description of relevant methodological information for a set of activities to process data about a given Population . |
This object holds Statistical Program Design information that relates specifically to production - the act of taking data that have been collected and transforming them. It inherits the relationships and attributes from the Statistical Program Design type. |
|
|
Provision Agreement |
Structures |
A service-level agreement, a legal mandate, the terms of a mutual agreement, a memorandum of understanding, or any other terms/conditions which affect the provision of data. |
The Provision Agreement does not need to have any formal consent of the Data Provider. For instance, data collection via web scraping may identify the Data Provider but requires no formal agreement. A web service that provides data to anyone that queries it also may not need any formal agreement (save perhaps that of implicit agreement under the terms of the web service). Nevertheless, in both these cases the data may be structured according to a Data Structure which is associated with the Data Flow.
A Provision Agreement represents the union of a specific Data Provider and a specific Data Flow for which the Data Provider supplies data. The location of the Data Sets that are available for this Provision Agreement is given by the associated Data Location. |
|
|
Publication Activity |
Structures |
The mechanism for creating structured, static content in response to an internal trigger. |
A Publication Activity is a specific type of Dissemination Activity . A Publication Activity is triggered by an internal need to create a new Product . This is most commonly based on knowledge about a general need of potential consumers or the objective to actively provide information to consumers. Examples are the writing, editing and approval of a press release, web article or publication.
A Publication Activity may make use of Dissemination Services to get the necessary input. A Publication Activity may interpret or transform (e.g. visualize) statistical data, but cannot do any statistical processing.
A Publication Activity produces a Product and makes this available to Dissemination Services (possibly through an Information Resource ) for the actual dissemination. |
|
Q
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Question |
Business |
Describes the text used to interrogate a respondent, the Concept that is measured and the allowed responses. |
One specific type of Question is the Multiple Question Item. |
|
|
Question Block |
Business |
A set of Questions, Interviewer Instructions and Statements which are used together. |
A statistical organization will often have a number of Question Blocks which they reuse in a number of Instruments . Examples of Question Blocks include:
· Household Question Block · Income Question Block · Employment Question Block |
question module |
|
Question Group |
Business |
A set of Questions which are gathered or stored together for the purpose of discovery. |
Questions in Question Groups are similar in some way (for example, all the Questions relate to obesity).
Question Groups are often found in databases that can be searched to find Questions which meet specific criteria. |
question pool, question bank |
R
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Record Relationship |
Structures |
Describes relationships between Logical Records within a Unit Data Structure . It must have both a source Logical Record and a target Logical Record in order to define the relationship. |
All relationships are defined in pairs. Hence multiple relationships may be needed to clarify all Record Relationships within a Unit Data Set e.g. household and person, household and dwelling etc.
Example: Relationship between person and household Logical Records within a Unit Data Set . |
|
|
Representation |
Structures |
A "custom-built" artefact that has a consumable (human or machine) format. It is the output of a Dissemination Service . It is what is ultimately delivered to the consumer. |
A Representation brings together various maintainable artefacts and their related artefacts. It is essentially the application of rules to an artefact (and possibly its related artefacts) which transform the object into a format fit for consumption. This consumption may be something that is understandable to a person or a machine.
Representation can be in different forms; e.g. tables, graphs, structured data files. Examples: - A table of data. Based on a Data Set , the related Data Structure is used to label the column and row headings for the table. The Data Set is used to populate the cells in the table. Reference metadata is used to populate footnotes and cell notes on the table. Confidentiality rules are applied to the Data Set to suppress any disclosive cells. - A data file based on a standard (e.g. SDMX). - A PDF document describing a Classification . - Any structural metadata object expressed in a standard format (e.g. DDI 3.1 XML). - A list of Products or services (e.g. a product catalogue or a web services description language (WSDL) file). - A web page containing Classifications , descriptions of Variables , etc. |
presentation, publication, delivery, product |
|
Represented Variable |
Concepts |
The association of a Variable with a Value Domain which represents it. The Represented Variable is used as part of a Statistical Activity . |
Here are 3 examples – 1. Sex variable which will be collected using <m, male>, <f, female>, <o, other>
2. Number of Employees variable which will be collected using an Integer or Count of Individuals.
3. Endowment of Universities variable which will be collected using <1, $0-$99,999>, <2, $100,000-$999,999>, <3, $1,000,000 and above> |
|
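To make the pairing of a Variable with a Value Domain concrete, here is a minimal Python sketch built from the examples above. The class `RepresentedVariable`, its attribute names and the `decode` method are hypothetical, and a plain dict stands in for the values recorded in a Data Set (the Instance Variable).

```python
from dataclasses import dataclass
from typing import Dict

# Hypothetical class for illustration; attribute names are assumptions.
@dataclass
class RepresentedVariable:
    variable: str                 # the Variable, e.g. "Sex"
    value_domain: Dict[str, str]  # code -> category label

    def decode(self, code: str) -> str:
        """Return the category for a code, rejecting codes outside the domain."""
        if code not in self.value_domain:
            raise ValueError(f"{code!r} is not in the value domain of {self.variable}")
        return self.value_domain[code]

sex = RepresentedVariable("Sex", {"m": "male", "f": "female", "o": "other"})
endowment = RepresentedVariable(
    "Endowment of Universities",
    {"1": "$0-$99,999", "2": "$100,000-$999,999", "3": "$1,000,000 and above"},
)

# An Instance Variable is the Represented Variable as used in a Data Set;
# here a dict of unit -> recorded code plays that role.
endowment_data = {"Johns Hopkins": "3", "Yale": "3"}
decoded = {unit: endowment.decode(code) for unit, code in endowment_data.items()}
```

The same Represented Variable can back many Instance Variables, since the value domain is defined once and only the recorded codes differ per Data Set.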
|
Rule |
Production |
A specific mathematical or logical expression which can accept inputs and be evaluated based on those inputs. |
There are many forms of Rules and their purpose, character and expression can vary greatly. Evaluation Rules consist of computing an output which will result in a particular course of action. Rule covers both the logical Rules implemented by a Process Step and their implementations in executable form. A single Rule (at the conceptual level) may be expressed in different ways when using different notations and/or different software at the implementation level.
Rules can be "nested". In other words, a Rule can accept the outputs/evaluations from one or more other Rules as its inputs. This approach can be useful to achieve reuse of Rules .
A Rule can be used to generate new data (for example, determine values for a derived Variable) based on existing data. Rules can also be designed to apply "if then else" logic or "case" logic. Parameter Inputs can be included in the definition of a Rule and values provided for those parameters at the time the Rule is evaluated. |
|
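The points above about nesting, "if then else" logic and parameters supplied at evaluation time can be sketched in Python as follows. All names, record fields and threshold values here are hypothetical and serve only as illustration. A Rule is modelled as a function of a record and a dict of parameter values.

```python
from typing import Any, Callable, Dict

Record = Dict[str, Any]
Params = Dict[str, Any]
Rule = Callable[[Record, Params], Any]

def is_adult(rec: Record, params: Params) -> bool:
    """Rule whose threshold is a parameter supplied at evaluation time."""
    return rec["age"] >= params["adult_age"]

def is_employed(rec: Record, params: Params) -> bool:
    return bool(rec["employed"])

def all_of(*rules: Rule) -> Rule:
    """Nested Rule: its inputs are the evaluations of other Rules."""
    def combined(rec: Record, params: Params) -> bool:
        return all(rule(rec, params) for rule in rules)
    return combined

def age_group(rec: Record, params: Params) -> str:
    """Rule that generates a derived value using 'if then else' logic."""
    if rec["age"] < params["adult_age"]:
        return "minor"
    elif rec["age"] < params["senior_age"]:
        return "adult"
    else:
        return "senior"

adult_worker = all_of(is_adult, is_employed)   # reuses two existing Rules

params = {"adult_age": 18, "senior_age": 65}   # assumed parameter values
record = {"age": 42, "employed": True}
result = adult_worker(record, params)
group = age_group(record, params)
```

Passing a different `params` dict re-evaluates the same Rules under another configuration without redefining them.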
S
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Sign |
Concepts |
Something that suggests the presence or existence of a fact, condition, or quality. |
It is a perceivable object. This object is used to denote a Concept as a Designation . |
|
|
Statement |
Business |
A report of facts in an Instrument |
Statements are often included to provide further explanation to respondents. Example:
“The following questions are about your health” |
|
|
Statistical Activity |
Business |
The set of executed processes and the actual resources required as inputs and produced as outputs to investigate the characteristics of a given Population for a particular reference period. It may describe process and resources required to acquire ( Acquisition Activity ), produce ( Production Activity ), and disseminate ( Dissemination Activity ) data in a Statistical Program . |
A Statistical Activity includes the run-time information used to actually execute a set of processes. Activities occur in the context of each Statistical Program Cycle and execute a particular Statistical Program Design . |
|
|
Statistical Need |
Business |
A requirement, request or other notification that will be considered by an organization. A Statistical Need does not necessarily have structure or format - it is a 'raw' need as received by the organization. A Statistical Need may be of a variety of types including Environmental Change or Information Request. |
The Statistical Need is a proposed or imposed change as it has been received by an organization. A Statistical Need is a raw expression of a proposed change, and is not necessarily well-defined. A related object - Change Definition - is created when a Statistical Need is analyzed by an organization. Change Definition expresses the raw need in well-defined, structured terms. Once a Statistical Need has been received, the first step is to do the conceptual work to establish what it is we are trying to measure. The final output of this conceptual work is the Change Definition .
In some cases, the Statistical Need can result from the Assessment of the quality, efficiency, etc. of an existing process. |
|
|
Statistical Program |
Business |
A set of activities to investigate characteristics of a given Population . It describes the purpose and context of a set of Statistical Activities . |
The Statistical Program is one of a family of objects that provide the environmental context in which a set of statistical activities is conducted. Statistical Program is the top level object that describes the purpose and objectives of a set of activities. Statistical Program will usually correspond to an ongoing activity such as a survey or output series. Some examples of Statistical Program are: - Labour Force Survey - Multipurpose Household Survey - National Accounts - Demography - Overseas Arrivals and Departures
Related to the Statistical Program object there are Statistical Program Design and Statistical Program Cycle objects that hold the detailed information about the design and conduct of the Statistical Activity .
In the case of the traditional approach, an organization has received a Statistical Need and produced a Change Definition and an approved Business Case . The Business Case will specify either a change to the design or methodology of an existing Statistical Program , which will result in a new Statistical Program Design ; or a change to one or more existing Statistical Programs (for example, to add an additional objective to the Statistical Program ); or result in a new Statistical Program being created. |
|
|
Statistical Program Cycle |
Business |
A set of activities to investigate characteristics of a given Population for a particular reference period. |
A Statistical Program Cycle documents the execution of an iteration of a Statistical Program according to the associated Statistical Program Design for a certain reference period. It identifies the activities undertaken as part of the cycle, together with the specific resources, processes, and relevant methodological information used in this cycle, as defined by the Statistical Program Design . |
|
|
Statistical Program Design |
Business |
The specification of the resources required and processes used and description of relevant methodological information about the set of activities investigating characteristics of a given Population . Includes the Statistical Activities that are required to acquire ( Acquisition Activity ), produce ( Production Activity ), and disseminate ( Dissemination Activity ) data in a Statistical Program . |
The Statistical Program Design is one of a family of objects that provide the operational context in which a set of statistical activities is conducted.
A simple example is where a Statistical Program relates to a single survey, for example, the Labour Force Survey. The Statistical Program will have a series of Statistical Program Design objects that describe the methodology and design used throughout the life of the survey. When a methodological change is made to the survey, a new Statistical Program Design is created to record the details of the new design. |
|
|
Subject Field |
Concepts |
One or more Concept Systems used for the grouping of Concepts and Categories for the production of statistics. |
A Subject Field is a field of special knowledge under which a set of Concepts and their Designations is used. For example, labour market, environmental expenditure, tourism, etc. |
subject area, theme |
|
Survey Instrument |
Business |
A specialized kind of Instrument used for the explicit purpose of gathering statistical data. |
Survey Instrument is a tool used to gather information from a Data Resource . It can be applied in several ways using different formats and modes, for example, as paper forms in face-to-face interviews, as online self-administered interviews, as computer-assisted questionnaires in telephone interviews, or as electronic templates downloaded from the web and returned via email. The Survey Instrument provides a generic description of the data collection form independent of the format and mode. |
|
|
Survey Population |
Concepts |
A Population for which information can be obtained in a survey. |
A Population which can realistically be studied (example: people currently residing in the province of Ontario not in an institution nor in a remote northern location nor temporarily out of the province). The Survey Population is therefore often a subset of the Target Population . |
object class |
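The relationship between a Target Population and its Survey Population can be sketched as a simple filter. This is a minimal illustration only: the unit records, the field names (`in_institution`, `remote_north`, `temporarily_away`) and the helper `can_be_surveyed` are assumptions modelled on the Ontario example above and are not part of GSIM.

```python
# Hypothetical units; the field names are illustrative and not defined by GSIM.
target_population = [
    {"id": 1, "in_institution": False, "remote_north": False, "temporarily_away": False},
    {"id": 2, "in_institution": True, "remote_north": False, "temporarily_away": False},
    {"id": 3, "in_institution": False, "remote_north": True, "temporarily_away": False},
    {"id": 4, "in_institution": False, "remote_north": False, "temporarily_away": False},
]


def can_be_surveyed(unit):
    """Return True when no practical exclusion applies to the unit."""
    return not (
        unit["in_institution"] or unit["remote_north"] or unit["temporarily_away"]
    )


# The Survey Population is the subset of the Target Population that can be reached.
survey_population = [u for u in target_population if can_be_surveyed(u)]
survey_ids = [u["id"] for u in survey_population]
```

Here units 2 and 3 are excluded for practical reasons, so estimates are designed for four units while information can be obtained from only two of them.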
T
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Target Population |
Concepts |
A Population for which a Statistical Activity is designed to make estimates. |
Population for which estimates are desired in a Statistical Activity , though practical considerations may dictate that some units are excluded. If so, the resulting sub-set of units for which information can be obtained is the Survey Population . |
object class |
|
Transformable Input |
Production |
A type of Process Input whose content goes into a Process Step and is changed in some way by the execution of that Process Step . Some or all of the content will be represented in the Transformed Output. |
Transformable Input is a sub-type of Process Input . Producers of official statistics often conceptualize data (and sometimes metadata) flowing through the statistical business process, having statistical value added by each Process Step and being transformed along the way.
The concept of Transformable Input allows this notional flow of information through the production process to be traced, without confusing these inputs with other inputs - such as Parameter Inputs and Process Support Inputs that are controlling or influencing a particular Process Step but do not "flow through the business process" in the same sense. Typical Transformable Inputs are Data Sets and structural metadata (if changed by a process and needed to describe another output or as an object in their own right). |
|
|
Transformed Output |
Production |
A Process Output (a result) which provides the "reason for existence" for the Process Step . |
A Transformed Output is a sub-type of Process Output . Typically a Transformed Output is either a Process Input to a subsequent Process Step or it represents the final product from a statistical business process.
In many cases a Transformed Output may be readily identified as an updated ("value added") version of one or more Transformable Inputs supplied to the Process Step execution.
Note: If the output were no longer required then there would be no need for the Process Step in its current form. |
|
|
Type |
Base |
Identifies a narrower meaning for the value in the Contextual String. |
|
|
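The distinction between a Transformable Input and a Parameter Input, and the way a Transformed Output results from a Process Step, can be sketched as follows. This is an illustrative sketch under assumptions: the function `run_edit_step`, the `max_age` parameter, and the sample records are hypothetical and do not come from GSIM.

```python
def run_edit_step(transformable_input, max_age):
    """Hypothetical Process Step: keep only records with a plausible age.

    transformable_input: the data set that flows through the step.
    max_age: a Parameter Input that controls the step but does not flow through it.
    Returns the Transformed Output, a new list; the input is left unchanged.
    """
    return [
        record for record in transformable_input
        if 0 <= record["age"] <= max_age
    ]


# Transformable Input: a small data set of unit records (illustrative values).
raw_records = [
    {"id": 1, "age": 43},
    {"id": 2, "age": -5},
    {"id": 3, "age": 130},
]

# Transformed Output: the "value added" version of the input.
edited_records = run_edit_step(raw_records, max_age=120)
```

The output could in turn serve as the Transformable Input of a subsequent Process Step, while `max_age` would be supplied again separately for each execution.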
U
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Unit |
Concepts |
The object of interest in Statistical Activities and corresponds to at least one Population . |
Here are 3 examples - 1. Individual US persons (e.g., Arofan Gregory, Dan Gillman, Barack Obama) 2. Individual US computer companies (e.g., Microsoft, Apple, IBM) 3. Individual US universities (e.g., Johns Hopkins, University of Maryland, Yale) |
|
|
Unit Attribute Component |
Structures |
A Represented Variable that is required to supply information in addition to the identification and measures in a Unit Data Set . |
Example: The publication status of an observation such as provisional, revised. |
|
|
Unit Data Point |
Structures |
A placeholder in a Unit Data Record to contain the value ( Datum ) for an Instance Variable with respect to a given Unit . |
For example (1212123, 43) could be the age in years on the 1st of January 2012 of a person (Unit) with the social security number 1212123. The social security number is an identifying variable for the person whereas the age, in this example, is a variable measured on the 1st of January 2012. The value can be obtained directly from the Unit or indirectly via a process of some kind. |
|
|
Unit Data Record |
Structures |
Contains the specific values (as a collection of Unit Data Points ) related to a given Unit as defined in a Logical Record . |
For example (1212123, 48, American, United Kingdom) specifies the age (48) in years on the 1st of January 2012, the current citizenship (American), and the country of birth (United Kingdom) for a person with social security number 1212123.
The Unit Data Record is a collection of Unit Data Points that provide either a complete or restricted view of the state of a Unit as observed over a specific period or at a specific point in time. |
|
|
Unit Data Set |
Structures |
A collection of data that conforms to a known structure and describes aspects of one or more Units . |
Example: A synthetic unit record file is a collection of artificially constructed Unit Data Records , combined in a file to create a Unit Data Set . |
micro data, unit data, synthetic unit record file |
|
Unit Data Structure |
Structures |
Describes the structure of a Unit Data Set . |
For example (social security number, country of residence, age, citizenship, country of birth) where the social security number and the country of residence are the identifying components ( Unit Identifier Component ) and the others are measured variables obtained directly or indirectly from the person ( Unit ) and are Unit Measure Components of the Logical Record . |
file description, dataset description |
|
Unit Identifier Component |
Structures |
The role that has been given to a Represented Variable , in a Unit Data Structure , to identify the Unit . |
For example the person identification number in Norway. |
|
|
Unit Measure Component |
Structures |
The role that has been given to a specific Represented Variable to hold the observed or derived values related to a Unit as identified by the Unit Identifier Components , in an organized collection of data. |
For example, age and height of a person in a Unit Data Set . |
|
|
Unit of Measure |
Concepts |
Units by which some quantity is measured. |
Here are 3 examples - 1. Kilograms; 2. Count; 3. Dollars |
|
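The way a Unit Data Structure, Unit Data Record, and Unit Data Point fit together can be sketched with a few small classes. This is a minimal sketch, not a GSIM implementation: the class and function names are assumptions, and the structure is simplified to a single identifier component so that it matches the four-value record used in the Unit Data Record example.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class UnitDataStructure:
    """Names the components of a record and the role each one plays."""
    identifier_components: tuple[str, ...]
    measure_components: tuple[str, ...]

    @property
    def components(self) -> tuple[str, ...]:
        return self.identifier_components + self.measure_components


@dataclass(frozen=True)
class UnitDataPoint:
    """Holds the value (Datum) for one variable of one Unit."""
    variable: str
    value: object


def make_record(structure: UnitDataStructure, values: tuple) -> tuple[UnitDataPoint, ...]:
    """Build a Unit Data Record as a collection of Unit Data Points."""
    if len(values) != len(structure.components):
        raise ValueError("values do not match the Unit Data Structure")
    return tuple(
        UnitDataPoint(name, value)
        for name, value in zip(structure.components, values)
    )


structure = UnitDataStructure(
    identifier_components=("social_security_number",),
    measure_components=("age", "citizenship", "country_of_birth"),
)
record = make_record(structure, (1212123, 48, "American", "United Kingdom"))

# A Unit Data Set is then a collection of records sharing one structure.
unit_data_set = [record]
```

Keeping the structure separate from the records means the same description can validate every record in the data set.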
V
|
Object |
Group |
Definition |
Explanatory Text |
Synonyms |
|
Value Domain |
Concepts |
A set of allowed values (determinants). A Value Domain is a Concept System where all Concepts are designated, but in which there are no relations. |
Here are 3 examples - 1) Sex codes (enumerated Value Domain): m, male; f, female; o, other 2) Non-negative whole decimal number (described Value Domain), count of people 3) Endowment categories (enumerated Value Domain), dollars: 1, $0-$99,999; 2, $100,000-$999,999; 3, $1,000,000 and above |
|
|
Variable |
Concepts |
The use of a Concept as a characteristic of a Population that is intended to be measured as part of a Statistical Activity . |
Here are 3 examples - 1. Sex 2. Number of employees 3. Endowment |
|