Still no Proper Icon Dictionary

We see more and more icons and symbols that are meant to be language-independent and obvious, and they just aren’t.

Back at the end of the last century, I decided that everyone liked using icons and symbols, but people had no idea what they meant. Just what does An upright triangle with a cross over it mean on a clothes tag?

So I registered icondictionary.{com,org} and set to work.

And failed 😀

But it was a good idea.

It was going to be crowd-sourced from around the world. A Spaniard travelling to Norway would be able to find out what Norwegian-specific road signs (such as the brilliant “Merging by the zipper method“) means, in their own language; be it Castilian, Catalan or something else I don’t know about.

And perhaps, surprisingly, it would be of great help to people with visual impairment.

Because the user would be able to get descriptions too (the modern alt is rarely enough, as you need proper description in detail) – it must be so frustrating to be reading a book and told that “his nose looked like some road sign”, when you have never seen one clearly, or ever had it described to you.

In fact, what does a poison icon look like on a bottle? Sort of important to know and detect.

So I planned to be able to capture images and look them up. This was before smart phones, so cameras were a problem, but they were coming along, and once the data was there, it would all work. I already had a C-Pen that could scan lines of text, so it couldn’t be long. And then anyone could scan a bottle and find out if that obscure symbol that someone thought was obvious was in fact saying the contents would kill them.

There was even a bit of business proposition here. The site would have decent quality images on display, but also behind, wherever possible there would be SVG versions that could be purchased, if a designer wanted high quality. With payment going back to the crowd-person who created it. Oh, and I had moderators/editors taking responsibility for areas, such as flags or laundry symbols, and also languages. With all the database permissions that entailed.

I had a student (Peter Dibdin) build a java app that enabled me to hand craft SVG documents to their highest quality or even a perfect description, and keep them in collections. It would then allow export in jpg at different resolutions, for different purposes. It still works nicely, by the way.

Given that SVG was only submitted to W3C in 1998, and you needed an Adobe plugin to view in a browser, you may get a sense of how ambitious this all was!

Even language stuff was new. I wanted to do all the stuff to distinguish pt-br and pt-pt etc., but even RFC 1766, trying to standardise it all, had only came out in 1995.

Clearly all this was hugely ambitious. Although the biggest problem was of course that I didn’t really have the skills 😀. And when I tried moving from a database to an RDF store, as the Semantic Web developed, since what I really wanted to do was at a semantic level, that was clearly going to be the final nail in the coffin!

Also, Google had recently come along, and was developing now, so search seemed to be much easier, and surely these huge corporations could do it – it was only a matter of time.

Then, of course, Wikipedia came along, and it looked like that would make it all redundant – it was only a matter of time.

But NO, it hasn’t happened yet.

Google Lens tells me that the Norwegian zipper sign is “A Norwegian Road Sign” – woohoo! Yeah, I sort of knew that because it is on the side of the Norwegian road I am driving down.

LLMs & ChatGPT? – it's only a matter of time!

We see more and more icons and symbols that are meant to be language-independent and obvious, and they just aren’t. I still can’t point my camera at one and find out what it means. Even though my car shows me what speed limit sign I last passed!

I didn’t finally (almost) give up until a couple of years ago. It would all be so much easier now. But I have other things I am doing that are being successful, and starting over in my 70s is probably not the best thing. But I only let the domains go in 2022, though 😀. Possibly mainly because I can always find another one, in this modern world of not needing com or org 😉.

Flattr this!

Amazon Prime Media Protection

Right. I want to watch something on my Amazon Prime account, on my TV, which is connected to my ‘puter by USB-C to HDMI, or whatever it is.
But no, with my TV plugged in at all, it won’t even let me watch on the computer monitors.
I presume all this is so that I can’t easily snaffle a digital version and put it on bitTorrent, or whatevs, or maybe just because it is the third screen, as one of my monitors is also USB-C to HDMI.
So what is a chap to do?
I know, I’ll download a snaffled digital version from bitTorrent, and then I can watch it on any of my screens.
And since I have a legal account for watching it on Amazon, they would be hard pressed to do me for anything – just for the few moments when I am downloading when someone may get a bit of upload from me.
So the only things their awkward bloody protection acheives is
a) puts this customer to trouble and pisses him off;
b) gets me to share their content, albeit for a few seconds.

Flattr this!

Random Streams Numbers on the Freeview Multiplexes

Specifically, Channel 4 assigns the data to different stream numbers: one programme might have the mpeg2video on Stream #0:5, and the dvb_subtitle on Stream #0:8, and the very next programme on Stream #0:1 and Stream #0:4 respectively.

OK, so most people won’t understand, and even fewer care. 🙂

I have a dongle (EyeTV, very nice) that records broadcast Freeview onto my old Mac (quite old, late 2006, but it ain’t broke, so doesn’t need fixing). And I end up with the raw broadcast mpeg file on the machine. Now I want to be able to move it elsewhere, and also compress it somewhat (MPEG-2 is not great compression).

I can simply tell EyeTV to export the video, and it will, although it takes a while to process (remember, 1.66 GHz Core Duo 🙂 ). Or I could even just move the mpeg file and use VLC to view it.

But I like to have the subtitles.

Now, perhaps surprisingly, the subtitles on broadcast TV are not text – they are images! So the only thing to do is overlay the images on the movie.

Of course, when it comes to this sort of thing, FFMPEG is your friend
-filter_complex "[0:5][0:8]overlay", I finally worked out. But you will see that I need to know which streams to overlay. And the bastards keep changing them. And I don’t have a utility that will identify them (and FFMPEG doesn’t, as far as I can tell.)

This used to be a problem for BBC & ITV too, but now I have Get iPlayer Automator, which downloads those nicely for me.

Flattr this!

Apple’s New Safari Tabs

Doesn’t Apple read its own HIG? (No – ed.)

I keep misreading which tab I am in. Every other app., including Apple’s Finder and Terminal, uses a brighter hue for the current tab, and darker hues for the background tabs. The same with Firefox and Chrome.

And the same convention applies to dialogue boxes, and anything else that has a foreground and background.

But not the newest version of Safari, oh no.

The front tab is grey, while the other tab (or tabs) is white. And I keep getting it wrong, even after a few days.

Basically, fuckwits.

Flattr this!

The Digital Catapult Really Screwing up Trust about Personal Data

Executive Summary

In publishing a report about how organisations need to build trust with the public about the storage and use of their Personal Data, the Digital Catapult has done almost all the classic things to undermine that trust.
And they even missed a great opportunity – the research suggests that if an organisation does screw up, and then comes clean and corrects things, it ends up with a higher level of trust than if they hadn’t screwed up in the first place.
But they didn’t.

Now read on if you wish…

Wendy tweeted about a new report:

Wendy Hall ‏@DameWendyDBE 1d1 day ago
#PDTReview has been launched today by @DigiCatapult! Get the report here: http://bit.ly/SignUpPDTReview

I though it might be interesting, so I followed to find that it was called “TRUST IN PERSONAL DATA: A UK REVIEW”. When I finally got the report, it turned out it says things like “We must build consumer trust or risk future failure” and “Building trust in the use of personal data is the responsibility of every digital stakeholder in the UK”.
However, I get ahead of myself: in order to get it, I was required to fill in a short form, giving my names, email, organisation and the organisation type (as mandatory fields).
Like many people (apparently, when I read the report), I choose to be careful about where my personal information goes. And it seemed to me that there was no reason why the Digital Catapult should need more than my email address, even if it did decide that it was not going to make the PDF available for download, but only by personal request over email.
So I tweeted:

Hugh Glaser ‏@hughglaser Jul 29
@DigiCatapult Can I get a copy of #PDTReview without giving you my Personal Data please?

So far, disappointing, but OK.
Now the screw up starts:

Digital Catapult ‏@DigiCatapult Jul 29
@hughglaser Thanks for tweet. We ask anyone wishing to download the report to fill in a few fields so we can get the report to them swiftly.

Yup, “swiftly”!
Good practice to have a prompt response, but I can recognise a classic case of making up an answer, just to get rid of me. 🙂 This is known as flimflam or flannel by support staff. Like “Your database is running slowly because it is the wrong colour, we are repainting for the next release, so it will be better.” (cf http://dilbert.com/strip/1995-11-17)
But actually, this is displaying a lack of respect – the public aren’t fools.
So now I’m hooked!

Hugh Glaser ‏@hughglaser Jul 29
@DigiCatapult Huh? Swiftly?!!! My Organisation? Why not a PDF download link? It smells like you are actually touting to get a DB of people.

Digital Catapult ‏@DigiCatapult Jul 29
@hughglaser For clarity, we are not keeping a DB unless people opt-in. Equally happy to share report with you via non sign in. Please DM.

(Opt-in: There is a tick box on the page that invites people to sign up for a Network.)
But come on! This is more flimflammed flannel. No answer to why the “swiftly”. And why am I typing if it doesn’t get kept. Again, the public aren’t fools. And don’t take me for one. I know you are keeping the data – just own up to it!

Hugh Glaser ‏@hughglaser Jul 29
@DigiCatapult Thanks, may do. But hang on: if I don’t tick the box, you don’t keep anything I type? So why ask? And it says “SIGN UP…”.

Digital Catapult ‏@DigiCatapult Jul 29
@hughglaser Thanks for your feedback Hugh. We’ve amended the page so people can fill in details if they so wish, rather than ‘sign up’.

Classic! I’m not going to fix the bug – I’ll change the documentation – any Software Engineer worth their salt recognises that one. They did change it, so the page says “FILL IN…” instead of “SIGN UP…”.
And what is this “if they so wish” – more noise words of flimflam and flannel – there is no other option I can see.

Hugh Glaser ‏@hughglaser Jul 29
@DigiCatapult Sorry to be a bear with a sore head, but I’m still not making sense of this. What does “if they so wish” mean.

Now, there does seem to be some deeper investigating in order, so I went to the page and gave it my details and got the email with the report – and very interesting reading it is too.
At the same time, I emailed:

To: info@cde.catapult.org.uk
Subject: Personal Data Request
Please can you tell me what personal details you hold about me, as specified in section 1.7 of http://www.digitalcatapultcentre.org.uk/terms/
Thank you.

Within an hour (well done!) I got a lovely response, which I won’t quote because it was in email. But in there it said that they had my name, email, business name and organisation type – exactly the same details that I was being told aren’t being kept!
This needed a private response, I think – although these are really important issues, I had no wish to cause any serious embarrassment on twitter. It was something that the Digital Catapult should get the chance to put right, so it was an email back:

Subject: Re: personal data request
Dear xxxxx,
Many thanks for your swift and detailed response.
I appreciate an organisation that can do this - on the rare occasions I have done asked others, some of them simply fail!
Unfortunately, your response tells me that the person who does your twitter feed has basically lied to me:
Digital Catapult ‏@DigiCatapult 5h5 hours ago @hughglaser For clarity, we are not keeping a DB unless people opt-in. Equally happy to share report with you via non sign in. Please DM.
OK, it may be that the data you have is not in a real “DB", but in principle you have a problem here.
I clearly have a problem with your report being used to get personal data about people.
The deep, deep irony is that it is undermining trust in exactly the way the report says should be built up.
How should we resolve this?
Is your twitter feed going to give some honest and clear answers?
Best
Hugh

In the meantime, the next day I had no response to my question on Twitter, so I asked again:

Hugh Glaser ‏@hughglaser 8h8 hours ago
@DigiCatapult What does “if they so wish” mean? Is there another option if they don’t want to?

To which there was a swift response:

Digital Catapult ‏@DigiCatapult Jul 30
@hughglaser Hi Hugh, you can now directly access the review here: http://bit.ly/1DRpjpy

That’s good!
But it isn’t really what we need by now.
Any trust I might have had in the Digital Catapult has been completely undermined.
What went wrong?
Will it go wrong again?
Should I not expect an apology?
Later, I got a response to my email. It was very nice, inviting me to join the network and asking if I wanted my details removed. But it was also rather disappointing, in that it suggested that what were effectively the lies I had been told about the Digital Catapult keeping my Personal Data were because of the limitations of 140 characters in Twitter, and that the DB issue was because it was a different, “marketing” DB.
Oh dear – more flimflam and flannel.
So…

Thanks xxxxx,
Just a quick question before I respond at more length, please:
...
I think blaming the 140 is a bit disingenuous: "For clarity, we are not keeping a DB unless people opt-in.” is pretty unequivocal.
So what is the scope and purpose of the DB that you found my data in?
Best
Hugh

I also sent a message asking what is the scope and purpose of the DB that I am in.

So, coming to an end…
I exchanged a few more email,and then had a chat on the phone.
As best I see it:
The Digital Catapult wants/needs to report to the funding body what interactions they have with companies. So they gather the sort of information above. It is disturbing, because they don’t seem to think that this is “using” the information (it’s not for marketing, in particular). In fact, I am now really rather disturbed that the Digital Catapult, which has just issued a report relating to Personal Data, have significant people who actually don’t understand Data Protection, and the related regulations. We discussed how the Terms and Conditions could be improved to reflect this situation.

I asked for an apology on Twitter for taking me for a fool (at which point it felt like the call had become distinctly frosty 🙂 Or maybe it was because I mentioned I would blog about it.)

Now it is time to leave it – maybe they have learned something!

One lesson – make sure that the person on your twitter feed is technically savvy, or at least understands when they need to get advice.

Some more lessons?
Don’t take the public fools and just fudge things with brush off messages and flimflam, and when you make a mistake, come clean as soon as possible; oh, and apologise for it.
This is serious stuff – it was worth the Digital Catapult spending money to write a report about it!
And perhaps even paying Experian money to gather data, although that isn’t clear.
And, of course, “Oh the Irony!” That this should happen about a report on Trust and Personal Data!

Flattr this!

Open Data Projects that Don’t Make Their Data Open

Sounds bizarre, but it seems to be the norm.

The very people who are at the forefront of consuming Open Data, funded by my tax money, almost always fail to make the fruits of their labours available as Open Data in their turn.

Let’s see: You get some public funds to build a demonstrator that shows how great Open Data is.

So you go away and harvest all sorts of data from a variety of sources (both easy and hard), and then add lots of value by linking it together and then synthesising new values and providing new insights.

Then hopefully in some time that corresponds to the proposal timescales, you launch your wizzo site, with fancy visualisations and everything. You get the loud praise and tweets from the great and good saying how this demonstrates how great Open Data is.

Then…

I come along with a little email asking if I can please have the data (so that I can use it for sameAs.org and other stuff I have that is set up to use the Open Data food chain).

I then get the standard response: “Ah, yes, we intend to make the data public, but <fill in a reason here />, and give us just a little time.”

Following that, I usually wait a few weeks or months, and then ask again. I might get another response along the same lines. I will probably ask another 2 or 3 times over the course of the next year or so, sometimes getting a response, but then decide that the data is probably too old now to be of interest, and I am wasting my time in any case because it ain’t going to happen. (It is unlikely that the original project actually included a real plan for keeping the application up to date by refreshing the data, or that the technology is actually able to do that.)

The point that anyone working in Open Data knows is that if you don’t build in the mechanisms and technology from the start to make data open, then grafting it on at the end is hard, expensive, unreliable, and various other bad words. So why don’t people put it in the proposal, and why don’t the funders require it to be part of the projects?

And the worst thing is that sometimes you hear the people who have done these projects (that are essentially data sinks) actually complaining that they couldn’t get data they wanted for their project from some other source!

Now ain’t that hypocritical?!

I would actually prefer it, by the way, if the response to my request for the data was more along the lines of, well we built a demonstrator, and we aren’t going to make the data open. There are actually quite a few sensible reasons for doing that.

Right, I’m off to think about which of my datasets is Open… 🙂

Flattr this!

Misquoting Harold Wilson’s “White Heat” speech

Usually Harold Wilson’s famous 1963 speech is quoted as something like Britain needed to be forged in the white heat of technology, or technological revolution.

Both of these are wrong.

I think Wikiquote has it right:
“In all our plans for the future, we are re-defining and we are re-stating our Socialism in terms of the scientific revolution. But that revolution cannot become a reality unless we are prepared to make far-reaching changes in economic and social attitudes which permeate our whole system of society. The Britain that is going to be forged in the white heat of this revolution will be no place for restrictive practices or for outdated methods on either side of industry.”

Why rewrite science to technology? Wilson knew about technology, but he chose science. And his audience would have understood the word at that time.

A good and honest short quote for this might be “forged in the white heat of this [scientific] revolution”.

It is a powerful view, given by Wilson, that it is the scientific revolution and the science that drives the world, and enables the technology, and it is the science that should be supported, and certainly not just the technology.

We can only assume that people who misquote are being deliberately mendacious or simply ignorant.

And it is a shame in this world of technology and Web Science that we lose the import of the middle sentence: that we need to be prepared to make far-reaching changes in economic and social attitudes.

Flattr this!