NATIONAL STATISTICS OFFICE OF GEORGIA (GEOSTAT)

 

 


 

Topic (iii)

IT innovations in the management of statistical information systems

Joint UNECE/Eurostat/OECD/ESCAP Meeting on the Management of

Statistical Information Systems (MSIS 2013)

(Paris, France, 23-25 April 2013)

 

Introduction

IT innovations and data flow in the management of statistical information systems at the National Statistics Office of Georgia (Geostat) mainly consist of data collection, validation, data cleaning, final data production and dissemination steps.

 

An IT development strategy was created and approved by Geostat in 2010. In line with this strategy, the Unified Statistical Database was developed; it covers all of Geostat's present needs and is easily expandable for future needs. The database contains several parts:

 

1.       Main part – the Unified Statistical Database, where all the data is collected, prepared and used to provide statistical data to internal and external users.

2.       Online questionnaires database – used to collect primary survey data from Geostat respondents.
Survey questionnaire data is filled in online by respondents or by internal interviewers (personal statisticians) and, after passing all procedures, is available within the main part of the database.

3.       External government sources database – used to collect primary data from several government institutions. Data is filled in by the appropriate establishment and is available after passing all necessary procedures.

4.       Data dissemination system – an online presentation database used to disseminate statistical data; it contains parts for data dissemination to PC-Axis, NADA, the cell phone dissemination system and the Geostat business portal.
 

Geostat is also developing software to collect, process and disseminate statistical data.

At present, IT innovations apply to the data collection, validation, user management and dissemination steps.

 

 

Data Collection

 

Data collection is the first and main part of the Unified Statistical Database and is directly responsible for the correct collection of primary data.

The first way to receive data is through the online survey system.

For this purpose Geostat has prepared an online statistical data collection tool, which contains a temporary online database for storing survey results, a survey publishing module, a primary data control module and synchronization procedures. With this tool it is easy to prepare online questionnaires, publish them, add external users to surveys and collect statistical survey data from external users. Geostat's internal data collectors can also enter large amounts of data from paper questionnaires.

Every survey passes through the primary data control and logical control module.

Supervisors, personal statisticians and coordinators can be attached to any survey in order to minimize the risk of data being filled in with mistakes. If data is filled in incorrectly, the survey questionnaire can be returned to the respondent, or Geostat staff can correct the data by phone interview. After the data has been sent by the external producer and checked by supervisors, it is ready to be synchronized with the Unified Statistical Database.

 

For new data collection, Geostat has prepared an online statistical data collection system, which is able to:

       Prepare web based online questionnaires,

       Publish them easily,

       Add external users to surveys,

       Collect statistical survey data from respondents/ external users.

 

After the synchronization step, the system makes the data available for review, for export in various formats such as CSV, Excel and text, and for primary data control (data validation).
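The export step can be illustrated with a minimal sketch in Python. It is not Geostat's implementation: the function name `export_csv`, the record layout and the column names are assumptions made only for this example, which uses the standard `csv` module to write selected columns of synchronized survey records as CSV text.

```python
import csv
import io


def export_csv(records, columns):
    """Return the given records as CSV text, restricted to `columns`.

    `records` is a list of dicts; keys not listed in `columns` are
    ignored. Both the layout and the column names are hypothetical.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=columns,
        extrasaction="ignore",   # skip keys that are not exported
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Illustrative records only; the values are not real statistics.
sample = [
    {"region": "Tbilisi", "year": 2012, "value": 1.5, "note": "internal"},
    {"region": "Batumi", "year": 2012, "value": 0.7, "note": "internal"},
]
csv_text = export_csv(sample, ["region", "year", "value"])
```

Writing to an in-memory buffer keeps the sketch independent of any file system; the same writer could target a file opened with `newline=""`.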

 

The second way to receive primary data is from external government sources. In this case the system can automatically collect data from sources such as the Public Registry, the Revenue Service, City Hall and others, and insert them into the unified database after validation, primary preparation and all necessary data processing procedures.

 

Primary data validation - All data collected via this system pass through the automated primary data validation and logical control module.

The primary data validation module is a flexible system that checks the data filled in by respondents. Where automatic correction is not applicable, it can return invalid or incorrectly filled data to the primary data producer or the person who filled in the questionnaire for correction. After correcting the mistakes, the user can resend the data to the system.
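The validate-and-return cycle described above can be sketched as follows. This is an illustration under stated assumptions, not the module Geostat built: the field names (`employees`, `turnover`, `wages`), the rules and the function names are all hypothetical. It separates a primary (field-level) control from a logical (cross-field) control, and routes a questionnaire either onward or back to the respondent with a list of errors.

```python
from dataclasses import dataclass, field


@dataclass
class ValidationResult:
    """Outcome of checking one questionnaire (hypothetical structure)."""
    errors: list = field(default_factory=list)

    @property
    def is_valid(self):
        return not self.errors


def validate_questionnaire(answers):
    """Apply primary (field-level) and logical (cross-field) checks.

    The field names and rules are illustrative assumptions only.
    """
    result = ValidationResult()

    # Primary control: required fields must be present and non-negative.
    for name in ("employees", "turnover", "wages"):
        value = answers.get(name)
        if value is None:
            result.errors.append(f"{name}: missing value")
        elif value < 0:
            result.errors.append(f"{name}: must not be negative")

    # Logical control: cross-field consistency, checked only when every
    # individual field has passed the primary control.
    if result.is_valid:
        if answers["employees"] == 0 and answers["wages"] > 0:
            result.errors.append("wages reported but no employees")

    return result


def route(answers):
    """Accept a questionnaire or return it to the respondent with errors."""
    result = validate_questionnaire(answers)
    if result.is_valid:
        return "accepted", []
    return "returned to respondent", result.errors
```

For example, `route({"employees": 0, "turnover": 1000.0, "wages": 300.0})` returns the questionnaire with the single logical-control error, which the respondent could correct before resending.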

 

Data preparation and cleaning - Internal data management software has been created for this process. With this software it is possible to preview and edit the data, prepare predefined and user-created reports on the data from a particular survey, carry out final data cleaning, export the prepared reports and submit the final data for dissemination.

It is also possible to export primary data for statistical analysis to external data processing systems such as Access, Excel, SPSS or others.

 

User management and internal reporting system - The system has external and internal users. External users (respondents) are responsible for filling in the questionnaires.

Internal users are divided into several categories with different rights. Some users can collect primary data; supervisors and coordinators can check data and return incorrectly filled questionnaires to their senders; others can apply data cleaning procedures, prepare final data and approve the data for dissemination.
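The division of rights between user categories can be pictured as a simple role-to-permission table. The sketch below is only an assumption about how such a scheme might look: the role names, the `Action` values and the `is_allowed` helper are hypothetical and do not describe the actual Geostat system.

```python
from enum import Enum, auto


class Action(Enum):
    """Operations mentioned in the text; the names are illustrative."""
    FILL_QUESTIONNAIRE = auto()
    COLLECT_PRIMARY_DATA = auto()
    CHECK_DATA = auto()
    RETURN_QUESTIONNAIRE = auto()
    CLEAN_DATA = auto()
    PREPARE_FINAL_DATA = auto()
    APPROVE_FOR_DISSEMINATION = auto()


# Hypothetical role names and permission sets.
ROLE_PERMISSIONS = {
    "respondent": {Action.FILL_QUESTIONNAIRE},
    "data_collector": {Action.COLLECT_PRIMARY_DATA},
    "supervisor": {Action.CHECK_DATA, Action.RETURN_QUESTIONNAIRE},
    "coordinator": {Action.CHECK_DATA, Action.RETURN_QUESTIONNAIRE},
    "data_manager": {
        Action.CLEAN_DATA,
        Action.PREPARE_FINAL_DATA,
        Action.APPROVE_FOR_DISSEMINATION,
    },
}


def is_allowed(role, action):
    """Return True if the given role may perform the action."""
    # Unknown roles get an empty permission set, i.e. no rights.
    return action in ROLE_PERMISSIONS.get(role, set())
```

Keeping the permissions in one table means a new user category can be added without changing the checking logic.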

 

For management use, the new statistical system contains many types of predefined reports on the statistical data filling process, employee work and user control. The system can also create reports of any type to check and monitor the efficiency of internal staff.

 

 

Statistical Data Dissemination

 

Statistical data provided by Geostat come in different formats and are provided in different ways:

 

1.       Geostat publishes data on its official website using predefined Excel spreadsheets and charts.
 

2.       Geostat has prepared and supports a data dissemination system for Android cell phones. Users can download and install this system from the Geostat official website.

This system allows users to view online the most up-to-date official statistics provided by Geostat. It also makes it possible to review statistical data for past years together with new, up-to-date information.
 

At present, statistics are available in the following fields:

       External Trade and Foreign Direct Investment,

       Price Statistics,

       Demography,

       Agriculture Statistics,

       Business Statistics,

       National Accounts.

 

Users can also read recent Geostat news using this system.

 

Key Features:

       Select Data by Category and Subcategory

       Select Data by Period

       View Statistical data

       View Geostat News

 

3.       Geostat is now preparing a localized Georgian/English version of NADA, a microdata dissemination system.

NADA is a web-based cataloging system that lets end users browse, search, compare, apply for access to, and download relevant survey information.

NADA provides a powerful instrument that facilitates the process of releasing study metadata and microdata to the user community.
 

NADA allows for:

       Increased quality and diversity of research,

       Improved reliability and relevance of data,

       Reduced duplication of data collection activities,

       Improved visibility of the institution as its data becomes more frequently used and more readily accessible,

       Increased donor and public trust in the institution,

       Improved publishing and dissemination efficiency at Geostat.


NADA allows end users to search and review metadata together with survey data, and will be used to disseminate microdata catalogs after the primary statistical data have been anonymized.

 

Key Features:

       Search By Research

       Search By Citation

       Search By category using predefined filter

       View Micro-data

       View Metadata

       View Aggregations

       Simple Grouping

 

4.       Statistical database dissemination via PC-AXIS (developed by Statistics Sweden) has been implemented.

PC-AXIS is a comprehensive data dissemination system that allows statistical data series, divided into categories and subcategories, to be published in an online system.
It allows users to:

 

       Select indicators to view from study,

       Select periods to view,

       Change data visualization style,

       Change charts or data table views,

       Export data in several formats.

 

PC-AXIS/PX-WEB Key Features:

       Make Selection of Categories or Periods

       View Selected Data in Tables

       View Selected Data in Charts

       Change Type of Chart or table

       Flexible Export System

 

5.       Geostat is also developing a Business Portal.

The Business Portal makes it possible to create statistical information reports in different ways.

The Business Portal allows users to:

       search any published survey results using keywords,

       search via predefined category/research list,

       group selected data by different criteria,

       apply user selections and receive on screen charts or tables of statistical data.
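The keyword search and grouping steps listed above can be sketched in a few lines of Python. The record fields (`title`, `category`, `period`) and the function names are assumptions chosen for illustration; they are not taken from the Business Portal itself.

```python
from collections import defaultdict


def search(records, keyword):
    """Return records whose title contains the keyword (case-insensitive)."""
    needle = keyword.lower()
    return [r for r in records if needle in r["title"].lower()]


def group_by(records, key):
    """Group records by the value of one field, e.g. 'category' or 'period'."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record)
    return dict(groups)


# Illustrative catalogue entries only; the field names are hypothetical.
catalogue = [
    {"title": "Consumer price index", "category": "Prices", "period": 2012},
    {"title": "Producer price index", "category": "Prices", "period": 2011},
    {"title": "Population by region", "category": "Demography", "period": 2012},
]
hits = search(catalogue, "PRICE")
by_period = group_by(hits, "period")
```

The grouped result could then be passed to whatever component renders the on-screen table or chart.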


Statistical data can be downloaded in different formats such as MS Excel, PDF and others.

Key Features:

       Search Data by Keywords

       Select Data by category, Research or Period

       Group Statistical Data in Different Ways

       View Statistical Data in Charts or tables

       Export Table or Chart

 

Much work has been done in organizing IT both inside the office and for dissemination. The work is often organized as projects with interdivisional working groups. This ensures the participation of subject matter experts as well as IT experts, which is vital for developing solutions that work from both the technical and the professional side.

 

Geostat has a good legal basis for dissemination and has implemented best practice concerning equal access for all users and an advance release calendar, which is being followed. Further development of IT innovations for new data sources is planned and highly welcomed.

IT innovations have had a significant impact on data collection, data processing, dissemination, visualization, and user and internal management processes at Geostat. The IT products developed are part of the new technologies, methods, practices and management approaches in the field of IT innovations in the management of statistical information systems.