Programmers are humans too

Apparent Rule

Looking at GA= Georgia, and FL= Florida, it appears that there is no real rule.

What I could work out is:

The first letter of the code is the first letter of the state.
If the state name is two words, then second letter of code is first letter of second word (e.g. NY = New York)
Otherwise, the second letter is an apparently random choice out of the other letters of the state name.

Codes

NE: Nevada or Nebraska?

It's Nebraska, but NB would have been a better choice

MI: Mississippi, Missouri, Michigan, or Minnisota?

Codes

NE: Nevada or Nebraska?

It's Nebraska, but NB would have been a better choice

MI: Mississippi, Missouri, Michigan, or Minnisota?

It's Michigan, but MG would have been a better choice

Codes

NE: Nevada or Nebraska?

It's Nebraska, but NB would have been a better choice

MI: Mississippi, Missouri, Michigan, or Minnisota?

It's Michigan, but MG would have been a better choice

MS: Mississippi, Missouri, or Minnisota?

Codes

NE: Nevada or Nebraska?

It's Nebraska, but NB would have been a better choice

MI: Mississippi, Missouri, Michigan, or Minnisota?

It's Michigan, but MG would have been a better choice

MS: Mississippi, Missouri, or Minnisota?

It's Mississippi, but MP would have been a better choice.

Doing it better

I couldn't believe it wasn't possible to do the 2-letter codes better.

So I wrote a program (in ABC as it happens; more on that later).

The best rule I came up with:

For single-word state names, use the 1st and 4th letter, except:
For the 4 states with a double letter in the first 4 letters (Minnesota, Mississippi, Missouri, Illinois) use the 1st and 5th letter.

The point

My point here is that the 2-letter codes were introduced because of automation.

But that is no excuse for ignoring the needs of people.

People are strange...

The fact is, people are strange, yes even you, even me.

For instance, someone did research in whether tools affected how people wrote.

People are strange...

The fact is, people are strange, yes even you, even me.

For instance, someone did research in whether tools affected how people wrote.

Writing with a pen and paper, people produced less text, but of high quality.

People are strange...

The fact is, people are strange, yes even you, even me.

For instance, someone did research in whether tools affected how people wrote.

Writing with a pen and paper, people produced less text, but of high quality.
Writing with a plain text editor, people wrote more text, but of a lower quality.

People are strange...

The fact is, people are strange, yes even you, even me.

For instance, someone did research in whether tools affected how people wrote.

Writing with a pen and paper, people produced less text, but of high quality.
Writing with a plain text editor, people wrote more text, but of a lower quality.
Writing with a WYSIWYG editor they produced more text of a high quality.

People are strange...

Research into interfaces for playing chess:

command interface
mouse interface
'direct manipulation' interface with real pieces

Mouse was by far the fastest

People are strange...

Research into interfaces for playing chess:

command interface
mouse interface
'direct manipulation' interface with real pieces

Mouse was by far the fastest

But with direct manipulation, people won more often...

Example

Hold your hand up.

Count the number of triangles on the next screen, and check your result.

Drop your hand when you have counted.

Usability

Usability is about designing things (software/programming languages/cookers) to allow people to do their work:

Faster
With less errors
Whilst enjoying it

Efficient, Error-free, Enjoyable or
Fast, Faultless and Fun

Don't confuse usability with learnability: they are distinct and different.

HCI

The problem is that the people designing things are usually not the people who will be using those things, and they tend to design for themselves.

So... you have to use HCI techniques:

design for the user, not for yourself or the computer
worry about implementation later
user test
design iteratively.

Numbers

For instance, Roman numerals:

OK for representing numbers: CXXVIII
Reasonably OK for addition:
CXXVIII+CXXVIII
=CCXXXXVVIIIIII
=CCXXXXVVVI
=CCXXXXXVI
=CCLVI
Terrible for multiplication (which was a university subject until the introduction of Arabic numerals after the renaissance.)

Homework

I was helping my children with their maths homework, and a question arose for me:

Why do they make things so difficult?

So I went back to first principles.

Warning

What follows was more than a year's work, using reams of paper. It is hard work making things simple. It resulted in a monograph that I wrote for my sons' birthdays. You can read it here: Numbers.

As a colleague who designs program interfaces complained: "When you succeed in making an interface as easy to use as a coffee machine, they treat you like a plumber"

Addition

We'll use base 1 numbers: // is 2, /// is 3

We define addition as sticking two numbers together:

//+///=/////

The complement of addition

We define subtraction as

(a+b) − b = a

This is a declarative definition. It tells you the what but not the how.

Because addition is commutative:

(a+b) = (b+a)

you can use subtraction to obtain both the left and right operands of addition:

(a+b) − a =

(b+a) − a = b

Mystery numbers

Subtraction creates a new sort of number we didn't start with, since we have expressions now like

3 − 5

the result of which we can't write with stripes.

Negative numbers upset mathematicians right up into the 19th century, being referred to as "fictitious" solutions.

Even banks didn't start using negative numbers until recently.

Zero

Whether subtraction also introduces zero, or whether it was there from the start is something mathematicians disagree on to this day.

At least historically though, it wasn't originally considered a number.

Zero is the identity for addition and subtraction since

(a+0) = a

We also have a monadic version of − as a shorthand:

−a = 0 − a

(It was from Dijkstra that I first learned this, but I cannot at the moment find the reference)

Multiplication

We go up a level, and define multiplication in terms of addition:

a × 3 = a + a + a

and it also has an identity value:

a×1 = a

(That was handwaving; the true definition of multiplication is:

(a + b) × c = a×c + b×c

It still defines multiplication in terms of addition.)

The complement of multiplication

We define division in exactly the same way as we did with subtraction:

(a×b) ÷ b = a

And since multiplication is also commutative, you can extract both a and b in the same way as we did with subtraction.

(I use ÷ instead of / because when user-testing with my target audience, they said "Please use the same symbols as on a calculator".)

More mystery numbers

Division adds the rationals to the mix, since we now have numbers such as

2÷3

(We may represent such numbers as ⅔, but we should recognise this as just an uncalculated expression, just as −5 is)

Monadic divide

Oddly, there is no monadic version of divide. But if we define it using the same pattern:

÷a = 1 ÷ a

something surprising happens: identities that we know from the lower level have exactly the same form as ones at this level. For instance:

Identities

−(a−b) = (b−a)
÷(a÷b) = (b÷a)

(This shows that while subtraction and division aren't commutative, they are commutable.)

Up another level

Just as multiplication is defined as repeated addition, so is raising to the power defined as repeated multiplication:

a↑3 = a × a × a

One difference at this level: ↑ is not commutative:

2↑3 ≠ 3↑2

Which means it has two complements, one to give a and one to give b.

The complements of power

First for a, which uses our regular pattern:

(a↑b)↓b = a

This is of course what we know of as root, where x↓y traditionally, and weirdly, is notated

^y√x

The other complement of power

Now we need b:

(a↑b)⇓a = b

This is logarithm, where x⇓y is equally weirdly notated traditionally as

log_y x

So if ↑ is a higher version of multiplication, then ↓ and ⇓ are just higher versions of division.

The point

Apart from making the close relationship between root and log obvious, defining them in this way suddenly exposes that literally dozens of well known identities have a similar form to identities at the two lower levels!

Just to pluck a few out:

Identities 2

a×(÷b) = a÷b
a↑(÷b) = a↓b

(I consider this so beautiful, that even if this had been the only result, the whole exercise would have been worth it!)

Identities 3

÷(a÷b) = b÷a
÷(a⇓b) = b⇓a

(1/ log_ba = log_ab)

(I may have learned this at school, but if I did, I had forgotten it. When I saw this result, I was so amazed, that even though I could prove it, I have to admit I checked it with some values as well, to make sure it was right).

Why is this notation better?

⇒ Consistent

⇒ Easier to use (once you've learnt it)

⇒ Exposes otherwise hidden relationships, such as the close relationship between log and root.

⇒ Use the same methods for solving equations:

a+2 = 4
a = 4−2

a × 2 = 4
a = 4 ÷ 2

a↑2 = 4
a = 4↓2

2↑a = 4
a = 4⇓2

Programming languages

A most important, but also most elusive, aspect of any tool is the influence on the habits of those who train themselves in its use. If the tool is a programming language, this influence is -- whether we like it or not -- an influence on our thinking habits. -- Dijkstra