• Share this page to Facebook
  • Share this page to Twitter
  • Share this page to Google+
Linking methodology used by Statistics New Zealand in the Integrated Data Infrastructure project

Integrated Data Infrastructure (IDI) allows for statistical outputs and research on the transitions and outcomes of people through education, the labour market, benefits, justice, health and safety, migration, and business data.

The IDI is primarily based on administrative data and also contains a number of surveys undertaken by Statistics NZ and other agencies. Some of the datasets can be merged directly on common unique identifiers, and this is straightforward to do. However, other datasets do not have common unique identifiers and these can be linked by creating links using demographic information. These datasets are linked using record linkage techniques, and uses the software IBM QualityStage v8.5.

This report outlines the probabilistic record linkage used within the Integrated Data Infrastructure system and demonstrates with specific examples.

Read or download the report and tables from 'Available files' above. If you have problems viewing the files, see opening files and PDFs.

ISBN 978-0-478-42903-9 (online)
Published 4 July 2014

  • Share this page to Facebook
  • Share this page to Twitter
  • Share this page to Google+
Top
  • Share this page to Facebook
  • Share this page to Twitter
  • Share this page to Google+