Skip to content

Getting our data out there

  • How can government release more data, faster?
  • Should I be promoting a more open data culture across my workplace?
  • How does privacy impact open data?
  • What needs to happen at my workplace to get our data out there?

Rochelle Stewart-Allen and Mark Kirkpatrick introduce Ellen.

These are some of the questions that government employees posted up on a whiteboard at the start of the ‘Open Data in a Day, for Government’ workshop last week.

The workshop was organised by Stats NZ and DIA and facilitated by Open Data Institute trainer Ellen Broad.

What is open data?

Ellen started by defining open data as “data that anyone can access, use or share”.

It is:

  • accessible
  • machine readable

and has an open licence.

Open government data:

  • typically doesn’t have any private information about individuals (although there are exceptions – such as MP expenses)
  • can be freely used, reused and distributed by other agencies and the public
  • has an open licence for re-use (under Creative Commons via NZGOAL).

Limitations

Datasets with personal information, third-party IP and commercially or culturally sensitive information often cannot be open.

However, sometimes datasets can be modified. In New Zealand, where personal information is “information about an identifiable individual”, data can sometimes be ‘de-identified’ or ‘anonymised’ to ensure individual privacy is maintained.

This process has to be undertaken with care. Making it impossible to identify individual people requires more than just removing identifying information such as names.

  • Even if names are removed, could someone identify an individual from fields such as job, gender or postcode?
  • Could the data be combined with other data to identify individuals?

Anonymising data safely can be hard while retaining its usability.

For datasets that contain personal information, Ellen recommends that agencies use the resources of the Office of the Privacy Commissioner to set up a Privacy Impact Assessment.

Ellen noted that some sectors may be bound by more than one set of privacy laws (eg, health, taxation, statistics).

Open data licences

Workshop participants.

Ellen believes that licensing open data sources is important. It’s what makes open data ‘open’.

A licence provides clarity: it sets out what users and re-users can and can’t do with the dataset.

The default standard for open government data licences in New Zealand is a Creative Commons (v.4) attribution licence (CC BY) – these are set out in NZGOAL.

Ellen pointed out that only CC-BY and CC BY-SA are open licences within the Creative Commons suite of licences.

“An open licence allows both commercial and non-commercial use, and allows you to alter the dataset – so you can mashup etc,” she said.

‘Share alike’ means that if you use an open data source, then you make whatever you publish or distribute open, under the same licence conditions.

Communication

Ellen talking about benefits of open data.

Some licences require users of the dataset to register. Ellen says that while this can be implemented as a barrier, encouraging some feedback loop with publishers can be good (for example, to notify an error or update).

This building of relationships is important. Communication works both ways – and one good data management practice is to provide a (real, with a person behind it!) email address and other contact details so users can ask further questions.

Resources

There are some useful tools you can use when publishing open data.

  • The data.govt.nz Toolkit has checklists, guides and videos for open data publishers.
  • The ODI Pathway is a self-assessment tool that can help you understand how well your organisation manages open data.
  • ODI Certificates are badges that provide guidance on data quality.

Want to know more?


Photo credits

  • Photos 1 and 2: Mike Riversdale (CC BY 4.0)
  • Photo 3: Anne Nelson (CC BY 4.0)
Back


Top