Data CaptureThe process by which collected data are put in a machine-readable form. Elementary edit checks are often performed in sub-modules of the software that does data capture. Source: UNECE Data Editing Group
Data CheckingActivity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into the "error free" and "erroneous data". Data checking may be aimed at detecting error-free data or at detecting erroneous data. Source: UNECE Data Editing Group See also: Data Review
Data CollectionThe process of gathering data. Data maybe observed, measured, or collected by means of questioning, as in a survey or census response. Source: UNECE Data Editing Group
Data Correction (correction of errors in data)Activity of checking data which was declared (is possibly) erroneous. Source: UNECE Data Editing Group Data EditingThe activity aimed at detecting and correcting errors (logical inconsistencies) in data. Source: UNECE Data Editing Group Data FieldSee: Data Item Data ImputationSubstitution of estimated values for missing or inconsistent data items (fields). The substituted values are intended to create a data record that does not fail edits. Source: UNECE Data Editing Group Data Item (data field)The specific sub-components of a data record. For instance, in a population census, specific data items might be last name, first name, sex, and age. Source: UNECE Data Editing Group *Data QualityA measure (or measures) that indicate the quality of the data in a database. For example, if most records pass edits and the set of edit-failing records do not seriously affect certain aggregate and other measures, then the data may be said to be of (relatively) high quality. *Note: This definition may not be fully satisfactory. Source: UNECE Data Editing Group Data RedundancyWhen the value of data items (fields) can be partially or completely deduced from the values of other data items (fields). Source: UNECE Data Editing Group Data Review (data checking)Activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into the "error-free" and "erroneous" data. Data checking may be aimed at detecting error-free data or at detecting erroneous data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode. Data review/error detection may occur at many levels: a) within a questionnaire
b) across questionnaires / editing of logical units
Source: UNECE Data Editing Group See also: Data Checking Data ValidationAn activity aimed at verifying whether the value of a data item comes from the given (finite or infinite) set of acceptable values. For instance, a geographic code (field), say for a Canadian Province, may be checked against a table of acceptable values for the field. Source: UNECE Data Editing Group Data Validation According to a ListVerifying whether the data value is in the list of acceptable values of this data item. Source: UNECE Data Editing Group Deck ImputationImputation method where a donor questionnaire is used to supply the missing value.
Source: UNECE Data Editing Group See also: Automated Imputations, Hot-Deck, Cold-Deck Deductive ImputationAn imputation rule defined by a logical reasoning, as opposed to a statistical rule. Source: UNECE Data Editing Group Detection of Errors in Data (error detection)An activity aimed at detecting erroneous data. Usually predefined correctness criteria are used. Source: UNECE Data Editing Group Deterministic Checking RuleA checking rule which determines whether data items are incorrect with a probability of 1. Source: UNECE Data Editing Group Deterministic EditAn edit, which if violated, points to an error in the data with a probability of one. Contrast with stochastic edit. Example: Age 5 and Status = mother. Source: UNECE Data Editing Group See also: Stochastic Edit Deterministic ImputationThe situation, given specific values of other fields, when only one value of a field will cause the record to satisfy all of the edits. For instance, it might occur when the items that are supposed to add to a total do not add to the total. If only one item in the sum is imputed, then its value is uniquely determined by the values of the other items. This may be the first situation that is considered in the automated editing and imputation of survey data. Example: The missing sum at the bottom of a column of numbers Source: UNECE Data Editing Group See also: Automated Imputations Donor (imputation)In hot-deck edit/imputation, a donor is chosen from the set of edit-passing records based on its similarity to the fields in the record being donated to (being imputed within). Values of fields (variables) in the donor are used to replace the corresponding contradictory or missing values in the edit-failing record that is receiving information. This type of replacement may or may not assure that the imputed record satisfies edits. Source: UNECE Data Editing Group See also: Hot-Deck, Hot-Deck Imputation |