Of all the topics I might have expected David Cameron to speak in favour of, standardised data formats was not top of the list. So I’m grateful to Nick Booth for pointing to Cameron’s speech last Friday to the Conservative Councillors’ Association.
At the moment, local government bodies must provide the public with information about the services they provide, what goes on in council meetings and how councillors have voted on specific issues. But the information isn’t published in a standardised way. It’s impossible for the public, charities or private companies to effectively collate this data, compare and contrast your performance and hold you to account. That’s why the Government relies on expensive and bureaucratic schemes to try and hold local government to account.
We will turn that approach on its head. We will require local authorities to publish this information – about the services they provide, council meetings and how councillors vote – online and in a standardised format. That way, it can be collected and used by the public and third party groups.
A very timely call, especially in the light of discussions last week about Harry’s consultations site (some of which, I’m told, he’s updating manually?!). I particularly like the way Cameron ties this into the tangible benefits for councillors themselves, removing the need for so many performance measurement exercises.
I’ve had some dealings in this field, particularly during my time with National Statistics. And I’m afraid it’s going to be much, much more difficult than Cameron makes it sound. Too many legacy systems, a chaotic approach to statistical geography, and (frankly) too much opposition from statisticians. Very few statisticians appreciate why they do what they do; they just do it. The work takes on an almost monastic purity. They don’t trust mere mortals – media included – to represent it properly. I was one of a management team hired to drive a culture change in that regard. Our success was limited.
Cameron’s speech focuses on TheyWorkForYou as a role model for data reprocessing; but I’m not sure the comparison holds up too well. A database of numbers would be much more difficult and more sensitive than the absolutes of Hansard: who said what (subject to correction, of course), and who voted how. Realistically we need to get to greater standardisation of process first: starting with defining consistent geographic reasons allowing sensible comparisons over time.
And besides, we haven’t done a great job of producing standardised data in RSS format, one of the most simple and straightforward data standards out there.
Responses
From what you quote, Cameron is talking about services, meetings, and votes (and looking at his fuller speech, council expenditure, locations of parks, etc.), not a “database of numbers”. You say comparing to TheyWorkForYou isn’t a good example, and yet meetings and votes are exactly the sort of things it covers?
As for RSS being a simple and straightforward data standard: http://diveintomark.org/archives/2004/02/04/incompatible-rss or http://www.feedvalidator.org/docs/warning/CharacterData.html ๐
That’s an amazing subject for a speech by a party leader to make. Was he at BarCamp in disguise?
Shane that was exactly what I though. It is remarkable that a party leader should contemplate data standards as a subject for a speech – times they are a….
Your point about statisticians can also apply to archivists. A colleague of mine was picking up a gong for best use of archive in a TV documentary and got chatting to a film archivists. “We can’t afford to preserve or transfer our archive” – they moaned. “Well I’d like to make a programme using your material – and in the process we could transfer it onto new stock.” “Oh no..” was the reply “…it’s not for that sort of use. These films are for academics”
aaaaaaaaaaaaaaargh