Statistics NZ’s Integrated Data Infrastructure (IDI) is a linked longitudinal dataset covering a wide range of information on people's pathways and transitions. The IDI supports policy evaluation, research analysis, and the production of statistical outputs on people's transitions and outcomes. It currently includes economic, education, justice, health and safety, migration, tenancy, and business data.
The IDI is used by a number of researchers from across government to answer a wide range of important research, policy, and evaluation questions.
Approved research access
All research proposals are assessed using our microdata access protocols.
Integrated Data Infrastructure extension: Privacy impact assessment has been prepared in consultation with the Office of the Privacy Commissioner to ensure that we continue to follow best practice and are open and transparent about how we are protecting information.
We are required by law to protect the information we collect. These requirements are outlined in the Statistics Act 1975 and the Privacy Act 1993. Data available in the IDI is anonymised – personal identifying information such as names, addresses, and exact dates of birth has been removed and all unique identifiers are encrypted. All research is checked before it's released to ensure no information about individual people, households, or businesses is published or disseminated.
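As a rough illustration of the kind of anonymisation described above, the sketch below removes direct identifiers from a single record and replaces the unique identifier with a keyed value. It is not Statistics NZ's actual process. The field names (`name`, `address`, `date_of_birth`, `unique_id`) are hypothetical, keeping only the year and month of birth is an assumption about what removing "exact" dates of birth might mean, and a keyed hash (HMAC-SHA256) stands in for whatever encryption method is really used.

```python
import hashlib
import hmac

# Hypothetical field names; this page does not describe the real IDI schema.
DIRECT_IDENTIFIERS = ("name", "address")


def deidentify(record: dict, secret_key: bytes) -> dict:
    """Return a copy of `record` with direct identifiers removed.

    Illustrative only: not the method Statistics NZ uses.
    """
    # Drop names and addresses outright.
    out = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}

    # Assumption: keep only year and month of birth ("YYYY-MM"),
    # discarding the exact day. Input is expected as "YYYY-MM-DD".
    dob = out.pop("date_of_birth")
    out["birth_year_month"] = dob[:7]

    # Stand-in for encryption: a keyed hash, so the same identifier always
    # maps to the same value (records can still be linked) but cannot be
    # read back without the key.
    uid = str(out.pop("unique_id")).encode("utf-8")
    out["encrypted_id"] = hmac.new(secret_key, uid, hashlib.sha256).hexdigest()
    return out


if __name__ == "__main__":
    example = {
        "unique_id": "A123",
        "name": "Jane Doe",
        "address": "1 Example Street",
        "date_of_birth": "1980-04-17",
        "benefit_type": "example",
    }
    print(deidentify(example, secret_key=b"not-a-real-key"))
```

Using the same key for every source dataset keeps the replaced identifier consistent across them, which is what lets records about the same person be linked without exposing the original identifier.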
Statistics NZ operates within a ‘five safes’ framework to ensure that access to microdata is only provided if all of the following conditions can be met:
- safe people – researchers can be trusted to use data appropriately and follow procedures
- safe projects – the project has a statistical purpose and is in the public interest
- safe settings – security arrangements prevent unauthorised access to the data
- safe data – the data itself inherently limits the risk of disclosure
- safe output – the statistical results produced are not disclosive.
Information on how to apply for access is available through the Statistics NZ Data Lab.
History of the IDI
In December 2011, Statistics NZ successfully completed a prototype of the IDI, including economic, education, migration, and business data. The enhanced IDI replaced the prototype in December 2012.
The IDI was extended in 2013 to include justice sector and health and safety data.
The IDI includes data from the following agencies and surveys:
- Inland Revenue – person and business tax data, Student Loans and Allowances data
- Ministry of Social Development – benefit data, Student Loans and Allowances data
- Ministry of Education – primary and secondary school achievement and intervention data, tertiary education data
- Ministry of Health – National Health Index (NHI), Primary Health Organisation (PHO) and other health-related data
- Ministry of Justice – charges data
- Department of Corrections – sentencing data
- Accident Compensation Corporation – injury data
- Ministry of Business, Innovation and Employment – tenancy bond data, migration and movements data
- New Zealand Customs Service – departure and arrival cards data
- Statistics NZ – Household Labour Force Survey data
- Statistics NZ – New Zealand Income Survey data
- Statistics NZ – Survey of Family Income and Employment data
- Statistics NZ – Longitudinal Immigration Survey of New Zealand data
- Statistics NZ – Longitudinal Business Database data.
Each dataset in the IDI has a corresponding data dictionary that provides information about the data collected. View Data dictionaries for datasets in the IDI for a list of available data dictionaries.
From June 2014, tax data is available in the IDI three months after Inland Revenue supplies it. This embargo period was reduced so that researchers can access timely, relevant data for research and policy advice.
Page updated 12 May 2015