Session 28

Back to Schedule

Title of session: Data collection, metadata

Chair: Thomas Burg

Room: S4A Mariacki

Time: 08:30 - 10:00

Date: 29 June

Session 28 - papers & presentations

Presenting AuthorAbstract
Marie Hélène Kérouanton
Title: <<< The French statistical metadata repository, RMéS: managing metadata throughout the whole statistical process >>>
Insee has had a long experience in metadata. Twenty years ago, Insee developed a software application for a system of structured documentation called DDS (Data structured documentation). Nowadays, DDS is quite outdated. Furthermore, the patterns used to structure metadata have not been stringent enough and don’t conform to any standard. And finally, all the documentary elements are actually collected at each phase of the process, via different files. They are entered at the end of the different steps into different instances of the DDS, generating redundancies and inconsistencies. Therefore, a new statistical metadata repository, called RMéS, is being set up. RMéS relies on two repositories. The one dedicated to the questionnaires, variables and their codification is called a Colectica Repository. Information is stored in an international format, the DDI format (Data Documentation Initiative). The other one hosts all the other metadata, described in more appropriate models. These models are compliant with W3C standards and with the Single Integrated Metadata Structure as required by Eurostat for the quality reports. In this repository we store not only concepts and classifications but also the description of all our statistical sources and their quality report in a RDF format. These two repositories are linked together. At the same time, we are developing management interfaces and services to enable other applications to use the metadata. We also created an application to design questionnaires, called Pogues. With this project, we aim to gather and share in a single application all the metadata in accordance with international standards. What is more innovative is the idea of using metadata throughout the whole process, from the analysis of the needs to the statistical results and the assessment, in order to develop metadata driven processes.
Rasmus Anker Kristiansen
Title: <<< Development of a statistical product portfolio tool >>>
A regularly asked question is; how many statistical products does Statistics Denmark produce? To answer this question, it is necessary to ask, what is a statistical product? One answer could be that a statistical product is a qualitative measurement of some phenomenon. A less philosophical answer could be that a statistical product, from Statistics Denmark that is, consists of a name, one or more tables in the StatBank, a quality report with documentation, a press release, a page on the website, a person responsible and an administrative placement in the organization. Statistics Denmark is currently creating a complete product portfolio tool, which provides this overview of all our statistical products. Statistics Denmark has in recent years been working systematically with coherent metadata, including adoption of the Single Integrated Metadata Structure (SIMS). The idea is write once, use everywhere, but this raises the question; use where? The portfolio tool links all information about a given statistical product, in terms of naming, allocation in the subject structure, across dissemination platforms and administrative placement within the organization. The portfolio tool will retrieve real time information from existing administrative systems such as the metadata repository (Colectica), from different dissemination-, HR- and financial management systems and from the StatBank Application Programming Interface (API). There are multiple benefits and possibilities of such a tool. We get a better overview of our inventory, so it is clearer to ourselves and our users, what we actually produce. We can raise credibility when what we say we do in the annual work plan, aligns with what we actually do. We ensure coherent and consistent naming of the individual statistics and we can improve our management accounting procedures with a more precise activity-based costing (ABC) in the production of our statistical products.
Georgiana Ivan
Title: <<< Labour market characteristics of European citizens living in another European country than their own (emigration perspective) >>>
Migration flows within the European labour market tend to go in specific geographic directions; some EU Member States can mostly be defined as sending countries while the others mainly as receiving ones. While the statistical characteristics of immigrant populations and their performance on the labour market is more readily available based on the European Labour Force Survey, the labour market portrait of European citizens residing in a country other than their own by country of origin or citizenship is less known. Eurostat’s new innovative series of products (datasets and explanatory articles) on the characteristics of European citizens residing in another European country by country of origin or citizenship fills this gap. The work is of interest at a methodological level, as the construction of these indicators implies the aggregation of sub-populations coming from different national samples of the European Labour Force Survey. The article discusses the main results (comparing the population living abroad with the one in the country of origin or citizenship but also the one in the destination countries). It also describes methodological matters: the advantages of using this method, the calculation of confidence intervals and data validation using other data sources.

Back to Schedule

Font Resize