generator, accidentally type out the script for Shakespeare’s Hamlet or write Tolstoy’s War and Peace? Is knowledge generation simply a numbers game? Leo Tolstoy’s War and Peace is generally assumed to be the longest novel ever written. This is not quite true. Wikipedia reckons the longest novel is a French book, Artamène, with over 2.1 million words. Tolstoy comes in sixteenth, with a mere half million! Written in 1869, War and Peace tells the story of five Russian families during the Napoleonic wars. Originally written in a mixture of Russian and French, and numbering over 500,000 words, it was quickly translated to other languages. The mistress of composer Franz Liszt translated it fully into French, where it expands to 550,000 words. Contrary to popular myth the length of the book drops slightly in German. If you really want to save paper Chinese is best. Because it uses a single symbol per word, the Chinese translation needs only 750,000 characters compared with the 3 million for English. It is wrong to assume this is necessarily more efficient than a phonetic language. Although it might save on paper, it is considerably more laborious to write. Three strokes are required to write ‘war’ in English whereas the Chinese pictogram requires ten. War in Chinese Computers work with numbers. It is a simple process to translate a book into numbers because books are composed of discrete symbols. All we need do is give each symbol a unique number and record those numbers in digital format. Artistic works involving pictures and sound are more difficult to represent because they are continuous in nature. We have to digitize them first. With music or painting this inevitably means some loss of information as we can’t cut a sound or image into an infinite number of pieces. The modern standard for translating text to numbers is Unicode. Each character is represented by a five-digit number ranging from 1 to 64,000 – two bytes for those of you who know computing. This is 130 Are the Androids Dreaming Yet? sufficient to code almost all the world’s symbols, so we can avoid any accusation of being language-ist! Here are some examples of the Ancient Greek, Japanese: Kanji, Katakana, Chinese, and Russia-Cyrillic Symbols characters represented by Unicode. For our discussion, it does not matter which language War and Peace is written in. We just treat the symbols as numbers. I am going assume the English translation which has around 500,000 words; a nice round number. Assuming a generous 10 characters per word, War and Peace is approximately 10-megabytes – that’s about the same size as a music track on iTunes. In practice, the book uses a bit more memory, as there is some overhead for formatting information. My laptop has a 500 Gigabyte hard disk so I could fit half a million copies of War and Peace on it! If we take a look at the contents of the file on my computer the book starts: 8710110810844801141051109910144115111 Can a computer calculate this number? The obvious answer is YES. It is just an integer like 1, 3 or 42. Granted it’s a large number, but the length of the number is simply the length of all the symbols in the book coded into Unicode – about 10 million digits. We have already determined this number can be stored on my hard disk half a million times, so it’s not an unimaginably large number. How long would it take to calculate the number corresponding to War and Peace? The simplest method is to count up starting at 1 then 2, 3, 4, 5, and so on until I try every number. Will this eventually get to the War and Peace number? The answer is yes. Eureka! All of human knowledge is computable. I have written this computation out as a simple computer program below. It says, in plain English, start at zero, go round a loop counting up one at a time and print each number as you go along. i==0; Loop i++ Print i; Knowledge 131 Easy! No, unfortunately. The problem is subtler than it first appears. First it will take a VEEEEERRRY long time. If I counted up from one, I would print out War and Peace eventually but it would take 120 billion, billion, billion, billion, billion… (I would need the entire length of this book to write out all the billions) years! For the physicists amongst you, I would need 10 30,000 years, assuming I could use every atom in the known universe counting in parallel at the plank interval. ‘The plank interval’ is the shortest time that can exist in the Universe as a discrete ‘tick’. Even going at this speed using with every atom in the known Universe would take 10 5,000 longer than the age of the Universe. This is stupendously long. Remember scientific notation means I have a 1 with 5000 zeroes after it. It is a deceptive notation as something as innocuous as 10 120 is equal to the number of atoms in the known universe. 10 5,000 is an absolutely enormous number. If you hear something is going to ‘take until the end of time’, we’re talking a lot longer than that! You may have spotted that in the process of counting up to the War and Peace number we also count through EVERY book ever written shorter than 500,000 words in all the world’s languages. Interestingly we counted through the Japanese and Chinese translations of War and Peace quite a bit before we reached the English and finally French translations. During the process, we also stepped through countless other wonderful works: proofs of amazing theorems, the complete works of William Shakespeare, and every composition ever written. Sadly, we never knew it. The problem is my program never stopped and told me it had found any of these wonderful things. I would have to sit staring at the screen to spot them. If I was off doing something else – making a cup of tea, taking the kids to school – I would miss all these wonders; the program never tells me if it has succeeded, but quietly prints out War and Peace and carries on. This is really annoying. It’s not a useful machine. What I need is a machine that rings a bell when it finds something interesting so I can break away from what I am doing and take a look. Reading every book it writes in every language and all the nonsense in between would take a ginormous amount of my time. (By-the-way, contrary to statements by school teachers that ginormous is not a word – it is!) I want a computer to come up with War and Peace without me having to do all the work. It’s no help if the machine writes everything down and lets me take a look in my own good time. That only puts off the time when I have to begin reading all the gibberish it produced. Another practical problem is the massive storage required. Just imagine the immense piles of printer 132 Are the Androids Dreaming Yet? paper! Stephen Hawking and Jacob Bekenstein have shown space appears to have a limit to the quantity of information it can store. The quantity of information we are looking at here is greater than the storage capacity of the Universe and would collapse space-time to a black hole before I got even a fraction of the way through. Let us try to be a bit cleverer about the task of creating this information. The simplest way to tie the computer down is to run a much stricter program. Ask it to count up from one until you get to a number representing the novel War and Peace and then print it, stop and ring a bell. Loop i++ until i == “War and Peace…”; Print i; ring-bell; This program succeeds! I am triumphant. I have calculated the War and Peace number, and this time I did not miss the event. But, if you consider this a little more deeply I gave the computer the answer! I told it the string “War and Peace…” and it was able to count up, stop, and tell me it reached it. In mathematical terminology, the program is said to have ‘halted’ when it reached the War and Peace number and in computer science speak it is a special purpose program designed to do only this one thing. This program is pointless. First, it would still take a ginormous amount of time to get there and, second, it is trivially the same as running the program: Print War and Peace. i = “War and Peace…”; Print i; It’s just the same as me taking my laptop, finding War and Peace and pressing print. In no way is this equivalent to Leo Tolstoy’s creative effort of writing War and Peace in the first place. What went wrong? I wanted my computer to find an interesting string I did not already know. War and Peace is trivially computable after Leo Tolstoy created it but the question is whether my computer could come up with War and Peace or some similar creative work on its own. Can it create and, more importantly, understand it has created something? We have linked the ideas of creativity and understanding, and this will prove to be the key to the problem. Knowledge 133 The Problem One suggestion put forward by Daniel Dennett is the creative process is a two-part task – generate ideas, then critically assess them. I can, in principle, make a program write out every possible book less than 500,000 words long. Provided I don’t store the results this will not collapse the Universe. This just leaves the problem of writing another program to read all the output and ring a bell each time it finds some interesting truth. This second program might be called an appreciation program. Let’s examine this approach. I can write out a very simple program to do this – provided I cheat and ignore the complexity of the term ‘something interesting’. In plain English: Count up from one until I get an interesting fact, write it down and stop. Loop i++ until i == (Something Interesting), Print i This generates two problems. We need to make a program that can tell if something is interesting and it will need to be fast because it is going to be handed a huge amount of junk. Clearly I have a process running in my brain that can determine if something is interesting, but it is quite slow. It takes me an appreciable time to open a book, leaf through the pages and declare it either junk or interesting. Leo Tolstoy had a process in his brain that allowed him to create something interesting but I want to prove he did not do this by generating random junk and sifting through it. Let’s look at the mathematics. We know simply counting sequentially through every number would take too much time, but why not generate random numbers and run our critical eye over them? Surely this would give a faster result. Let us try with a short poem. How hard would it be to come across something as simple as a four-line poem using this technique? This poem, by the late Spike Milligan, is only 23 words long, including the title, and I have a powerful computer. Wouldn’t it be possible to generate it using a computer? Unfortunately, no. We humans don’t have a good head for large numbers and this problem is much harder than it appears. Let’s use playing cards to get a feeling for large numbers. 134 Are the Androids Dreaming Yet? A Simple Poem Rain There are holes in the sky Where the rain gets in But they’re ever so small That’s why the rain is thin. Spike Milligan Spike Milligan Coming upon a poem by chance can be likened to the probability of dealing a perfect bridge hand. Shuffle the deck thoroughly and then deal four hands. What is the probability every player will have the ace through king in a single suit? It’s about 1 in 1,000,000,000,000,000 hands. Because lots of people play a lot of bridge around the world, this outcome has been reported quite a few times. The possibility appears within the bounds of human experience. Fifty-two playing cards seems close to the 80 characters that make up this poem and 13 choices of cards is about the same as the 26 letters of the Latin alphabet. Wouldn’t we expect poems of this complexity to crop up almost as often? NO. Knowledge 135 The 80 characters of this poem versus the 52 playing cards and the greater choice offered by 26 letters increases the problem geometrically. Taken together the probability of accidentally getting this poem is vastly less than a perfect hand of bridge, 1 in 10 83 against the perfect bridge hand of 1 in 10 20 . That’s the difference between the number of atoms in the known universe and the number of atoms in a jug of water! Numbers get big very quickly when we are looking at the permutation of information. And there is another problem with our bridge analogy. All the bridge players in the world are part of the machine finding the perfect hand. When a human sees a perfect bridge hand they are amazed. It is an event that usually hits the local newspapers and a couple of years ago one reached the national papers in Britain. Each bridge player looks at every hand, they play so there is a huge amount of processing going on during every bridge game. To replicate this for our poem, we would need millions of poetry classes spending hours each evening reading through computer printouts of gibberish. I should also add that sightings of perfect bridge hands are almost certainly hoaxes. The probability of it happening even once would require everyone on Earth to play bridge continuously for a thousand years. It is reported somewhere in the world about two or three times a year. If we are charitable, we might assume people failed to shuffle the deck properly but I suspect some mischief is going on! The numbers don’t stack up… You might think the problem is one of improving the efficiency of the filter so humans would only have to examine a smaller number of possibilities. Surely I could improve things by writing a simple program to ban all non-English characters, words and poor grammar; things that don’t pass the Microsoft Word grammar checker. This would generate a more manageable number of potential poems. Lewis Carroll shows this does not work; my idea to use a grammar and spelling checker to filter out gibberish just eliminated Jabberwocky, one of the most famous verses in the English language. Take a look at what Microsoft Word thinks of it. 136 Are the Androids Dreaming Yet? The Jabberwocky ’Twas brillig, and the slithy toves Did gyre and gimble in the wabe; All mimsy were the borogoves, And the mome raths outgrabe. “Beware the Jabberwock, my son! The jaws that bite, the claws that catch! Beware the Jubjub bird, and shun The frumious Bandersnatch!” He took his vorpal sword in hand: Long time the manxome foe he sought— So rested he by the Tumtum tree, And stood awhile in thought. And as in uffish thought he stood, The Jabberwock, with eyes of flame, Came whiffling through the tulgey wood, And burbled as it came! One, two! One, two! and through and through The vorpal blade went snicker-snack! He left it dead, and with its head He went galumphing back. “And hast thou slain the Jabberwock? Come to my arms, my beamish boy! O frabjous day! Callooh! Callay!” He chortled in his joy. Twas brillig, and the slithy toves Did gyre and gimble in the wabe; All mimsy were the borogoves, And the mome raths outgrabe. Lewis Carroll Lewis Carroll’s Jabberwocky Knowledge 137 The Jabberwocky Spell Check Microsoft Verdict on the Poem 39 of the 166 words in the poem are unknown to Word’s spelling checker and this is an optimistic analysis of how the algorithm would fare. Many of the words are in the spelling checker because of the poem: galumphing, for example. Lewis Carroll’s work was sufficiently influential that part of 138 Are the Androids Dreaming Yet? the English language was created in this poem. The same goes for much of Shakespeare. If we used a filter method, we would have just deleted most of Shakespeare from the English language! Indeed half the poems in my anthology of English verse are destined for the waste paper basket due to some minor infraction of ‘the rules’. If you want something that completely flummoxes my spelling checker here is the Loch Ness Monster Song by Scottish poet Edwin Morgan. I asked a Scottish friend whether Scottish spelling checkers fared any better and he assures me, no. The Loch Ness Monster’s Song Sssnnnwhuffffll? Hnwhuffl hhnnwfl hnfl hfl? Gdroblboblhobngbl gbl gl g g g g glbgl. Drublhaflablhaflubhafgabhaflhafl fl fl - gm grawwwww grf grawf awfgm graw gm. Hovoplodok - doplodovok - plovodokot - doplodokosh? Splgraw fok fok splgrafhatchgabrlgabrl fok splfok! Zgra kra gka fok! Grof grawff gahf? Gombl mbl bl - blm plm, blm plm, blm plm, blp Edwin Morgan The Loch Ness Monster Knowledge 139 The foibles of spell checkers have long been a personal pain to me because of my dyslexia. Although I can see the red underlining Microsoft Word kindly inserts so liberally into my text, I can’t easily see the occasions when I use a homonym. A fine poem illustrating the problem was kindly written by Jerrold H. Zar and published in The Journal of Irreproducible Results. It hangs on the wall behind my computer to remind me to check for these errors. Candidate for a Pullet Surprise By Jerrold H. Zar I have a spelling checker, It came with my PC. It plane lee marks four my revue Miss steaks aye can knot sea. Eye ran this poem threw it, Your sure reel glad two no. Its vary polished in it’s weigh. My checker tolled me sew. A checker is a bless sing, It freeze yew lodes of thyme. It helps me right awl stiles two reed, And aides me when eye rime. Each frays come posed up on my screen Eye trussed too bee a joule. The checker pours or every word Too cheque sum spelling rule. Bee fore a veiling checker’s Hour spelling mite decline, And if we’re lacks oar have a laps, We wood bee maid too wine. Butt now bee cause my spelling Is checked with such grate flare, Their are know fault’s with in my cite, Of nun eye am a wear. 140 Are the Androids Dreaming Yet? Now spelling does knot phase me, It does knot bring a tier. My pay purrs awl due glad den With wrapped word’s fare as hear. Too rite with care is quite a feet Of witch won should bee proud, And wee mussed dew the best wee can, Sew flaw’s are knot aloud. Sow ewe can sea why aye dew prays Such soft wear four pea seas, And why eye brake in two averse Buy righting want too pleas. The Search for Knowledge I hope this explanation shows you the simplest model for creativity – working through every possibility, and examining them all – is doomed to failure. It would take longer than until the end of time to even list all the options, let alone analyze them. You might wonder just how long it is until the end of time? It’s generally assumed there are two possible ends to the Universe, a Big Crunch or heat death. Either way the approximate estimate is our Universe will last somewhere between one and fifty times longer than it has lasted so far. That’s a long time, at least another 15 billion years, but just generating War and Peace would take 5000 orders of magnitude longer than this! More complex models such as a three-step process have been suggested. We could perhaps randomly create information and put it through a mechanical filter to bring it down to a manageable set of options and then give it to an appreciation algorithm to finally decide whether we have created something. The real problem with this model is the filters. If we try to reduce the effort by assembling works only from pre-existing words, we will have filtered away many works we know and love. Gone are Shakespeare, Lewis Carroll, Dylan Thomas and Roald Dahl, shall I go on? Indeed, once upon a time there were no words, every word was coined at some point. The process of creating art is continually creative and mechanical filters can’t be applied to things they have not seen before. Knowledge 141 You might argue we could devise a more sophisticated mechanical filter, something that contains an algorithm with an understanding of the rules of language. The problem is both the size of the task and the nature of understanding. If I devised some really good appreciation algorithm which did not delete all the creative words of the English language, it would still have to read and appreciate the huge quantities of input until it hit upon something good. There is no way for any machine to read all this information in the age of our Universe; the numbers are just too large. And there is no way for a machine to understand all the rules of language, they are not written down and constantly evolve. These descriptions should give you an intuitive feel for nature of the creative problem. If you try to deconstruct it into mechanical steps you end up with either a mechanism that needs to be infinitely specified or one that lets through an infinite quantity of nonsense. A human could never sift through all that garbage to find the occasional pearl of wisdom. Until the beginning of the 20 th century, most people thought knowledge and creativity must be just a matter of scale. A big enough, fast enough machine should be able to solve any problem. But early in the 1930s two mathematicians – Kurt Gödel and Alan Turing – showed knowledge was not so simple. Let me give you a feel for why. Knowing When You Know The essence of creating knowledge, is to know when you have done so. In a sense, counting from one to infinity means I know everything, and merely counting to 50 million creates every piece of significant symbolic knowledge that will ever be written – all the books, plays, mathematical theorems you could possibly want. But, if I were to list all these numbers in an enormous imaginary book it would hardly constitute knowing everything: I would be awash with numbers but not with knowledge. The essential feature of ‘knowing’ is to have a small number of steps that will definitely answer a problem. For example, if I wish to phone someone I can look up their details on my phone. The process will tell me their number in two or three steps. If you tell me the number is somewhere in the phone book this is not knowledge. It could mean I need an infinite number of steps. If I accidentally deleted all the names in my phone – a nightmare scenario – and just had a print out of numbers would I still ‘know’ them? Obviously I would recognize my mother’s number, but most of them would be useless. To know something, I need link the information to what it is for. A number with a name allows me to predict what will 142 Are the Androids Dreaming Yet? happen if I make a call. I will have an interesting conversation or pay my gas bill. It’s the same with most numbers. If I have a number that represents the design for a building or a mathematical theorem, these numbers have purpose. If I input these numbers to a computer along with some building design software or a copy of Mathematica they will do something interesting; allowing a construction firm to build a innovative building or a mathematician to check a theorem is sound. It’s a lot harder to prove numbers representing art are functionally useful. A work of art is in some sense not complete – it still needs to go through the process of being appreciated by someone. We could show it to a friend or exhibit it in a gallery but this is un unpredictable process. Van Gogh’s paintings were so criticized in his lifetime, many people would have denied them the label art, and Edwin Morgan’s Loch Ness Art or Information Monster poem is almost pure gibberish, but it’s undoubtedly art. Art is a tricky problem but, in practice, most of us agree on what constitutes good and bad art. We will look again at art, in Chapter 10. Classically we assume knowledge is discovered through random chance and iteration. To understand how this might work let’s lay out the world’s information in a way we can visualize. Imagine every piece of discoverable knowledge could be found in an infinitely large library. Knowledge 143 The infinity library would contain every possible symphony, theorem, novel, poem, and play ever written, or to be written. Its sister library next door, the continuum library, would contain all the analogue works of art; painting, sculpture, architecture, physical artifacts and the like. The curators of the two libraries would constantly argue over whose collection was the better. We’ll leave them to differ for the moment. The infinity library is interesting enough so let’s explore it first. After all, its sister, the continuum library, takes an infinite amount of time just to look at the first room, and we are in a hurry! Although the infinity library is infinite, we are probably only concerned with entries shorter than a million symbols. All the interesting papers, proofs and symphonies I know of are shorter than this. If I wanted to include all computer programs, I would still only need to increase it to 100 million symbols. Looking for knowledge is not itself an infinite task. For the sake of clarity, I will ask the infinity librarian to organize the collection. Any book or paper will be sorted according to its title and the contents of its pages, and similar books should be grouped together. I also only want to look at the English section of the library for the moment. I will still have a huge section to look through but at least every work is titled and readable by me. Much of the information will be junk but amongst the sea of rubbish will be islands of useful knowledge. Now, is there a way to find knowledge in this library in an automated fashion? Battleship 144 Are the Androids Dreaming Yet? The best analogy I can find to illustrate iterative knowledge discovery is the 1970s family game ‘Battleship’. The game consists of two 10 by 10 grids that you plug your ships into. All the ships are linear shapes of a few squares in length. The players cannot see each other’s ships and must guess where they are. A very simple way to do this would be to ask your opponent whether they have a ship on the top left square and continue systematically across the board, square by square, until you reach the bottom right hand corner. This would eventually find every ship. If every ship were a piece of knowledge we could discover all the knowledge in the world by simply stepping through the board one cell at a time, but it would take a long time. A better way to play Battleship is to pick a square at random. If you get a hit, explore linearly around the hit. This will efficiently find the rest of the ship. The same might be true for knowledge. We could take random shots, get lucky and move linearly to flesh out our knowledge. Once we had exhausted an area we could take a step away at random and again hope for another hit. This process is exactly the way some people imagine the frontier of knowledge expands. But, it is wrong. The monkey moon shot story explains… “I believe that this nation should commit itself to achieving the goal, before this decade is out, of landing a monkey on the moon and returning him safely to Earth.” President Monkey The monkey nation is asked to mount a moon shot. After a little time a monkey is asked to report on progress. “I can report,” says the monkey, “I have climbed a particularly tall tree on the tallest hill on my island and have made over seven hundred meters progress towards the moon, although this is only 0.0001% of the way there, this has been quick so I believe we are well on the way.” You see of course the problem. Progress in many problems is nonlinear. Moving a bit of the way towards the goal does not provide any actual progress: That is the problem with knowledge. It is not linear in structure. You need to take leaps to discover new knowledge. You can not simply look around in the general area. Such leaps are mathematically huge. The chance of making a successful one by pure chance is virtually zero. Knowledge 145 But Cats Can! As chance would have it, as I was writing this book about the impossibility of creating great literary works at random, our new kitten, Jessie, sat on my keyboard – she likes the warmth. To my great embarrassment I have been proven wrong. Here is her first literary work. I managed to capture her on camera a little later that evening, editing a spreadsheet. My brain interprets this string as the cat thanking me for good food. I wonder if you see the same thing? This is just a demonstration of the strength of human pattern detection algorithms and not, sadly, of feline communication. Cats Creation …. Kkkklnk gfoooooooofd0------- iiiii;;;;;;;;;;;ii…..fffffffffffffffffffffffff…… ================================================= ================================================= ================================================= ============================pppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp pppppppppppppppppppppppppppp..oppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppph Jessie Cat Jessie, Our Creative Kitten Chapter 6 KITTENS & GORILLAS Orangutan and Kitten “No kitten that loves fish is unteachable; No kitten without a tail can play with a gorilla; Kittens with whiskers will always love fish; No teachable kitten has green eyes; No kittens have tails unless they have whisters; hence...” Lewis Carroll “Once you eliminate the impossible, whatever remains, however improbably, must be the truth.” Sherlock Holmes, Arthur Conan Doyle s well as giving us Alice, the Jabberwocky, and the Cheshire Cat, Lewis Carroll lectured on mathematics at Oxford University. He wrote several books on logic, illustrated with wonderful problems involving fish, kittens, and gorillas – much less boring than the brown, grass-eating cows of modern textbooks. Kittens and gorillas are not usually in much contact, but I did find one hit on Google, pictured! The words we organize into books, poems and plays are not just a random jumble; they have structure and a logic to them. We group verbs, subjects and objects together to form sentences and, at a larger scale, characters have motivations and relationships: this character loves that character, the valet had the candlestick in the ballroom and could not have stabbed the butler in the kitchen, and so on. We have dictionaries to define words, but to truly understand the information they convey we need to understand the logical rules governing how they can be combined. Everyday conversation is fragmented and repetitive. Fortunately, now and again, we say something definitive. For example, “This gorilla is brown.” The statement links a property, ‘brownness’, to a thing, ‘a gorilla’. Logical statements are precise but often need to be put in context. If I were standing in a forest when I made my statement you must guess I mean the nearest gorilla. The word ‘This’ implies nearness, but nearness is not well defined. Better to be precise. ‘The gorilla I am closest to, measured by line of sight distance is the Pantone shade dark brown.’ However, if I talked like this all day I would not have many friends. Logical Beginnings The formal study of logic began in 384BC with the publication of a treatise called the Organon by the Greek philosopher Aristotle. A student of Plato, Aristotle taught many of the famous leaders of his time, including Alexander The Great. Ancient Greece was not some idyllic think tank. If you annoyed the political establishment you might find yourself having to leave town in a hurry. This happened to Aristotle after Plato’s death, and he spent nearly a decade touring Europe. Eventually, he returned to Athens where he published his study on logic. In the Organon, Aristotle examined groups of up to four statements, each containing up to four relationships. For example: All kittens eat fish. Some kittens eat fish. No kittens love gorillas. No gorillas eat kittens – luckily. It is possible to put two statements back to back and infer things. 150 Are the Androids Dreaming Yet? I could say, “All gorillas eat leaves.” “All leaves are green.” Therefore I can infer all gorillas eat some green things. This is a valid inference. It is not correct to say, gorillas eat only green things. There are 256 ways you can arrange four Aristotle statements with four relationships but only 19 valid deductive conclusions can be drawn. The kitten puzzle at the start of the chapter is an example of such a logical puzzle. Can you reach the right conclusion? ddd TRY SOLVING THE KITTEN PUZZLE WITHOUT READING ON Aristotle’s syllogisms are only a start. There are many other types of logic. In antiquity, the Stoics developed a different brand of logic based on the idea of larger and smaller. Stoic logic allows us to answer questions of relative size. If a Mini is smaller than an Audi, and an Audi is smaller than a Rolls Royce, then a Mini is smaller than a Rolls Royce. The Stoics pursued their branch of logic until around 180AD when study of this sort died out. It’s not quite clear why. Perhaps the rise of religious power and the onset of the Dark Ages curtailed intellectual inquiry. Even after the Enlightenment began around 1650 it took some time for the discipline of logic to re-emerge. If you want to learn more about syllogistic logic and how to solve Lewis Carroll’s puzzle you should read his book The Game of Logic. The definitive book on the logic of language, in my opinion, is Logic by Wilfrid Hodges. Logic for Computers Western civilization mostly survived on syllogisms and stoic logic for nearly two thousand years before George Boole devised his theory of binary logic in 1847. Boole developed an elegant mathematical system for representing logical statements that allowed simple arithmetical operations to answer logical questions. We now call this system Boolean logic and he gave us the modern convention of using one for true, and zero for false. Computers use his principles all the time. For example, if it is true my bank account shows less than zero, then make it true that someone will send me a letter warning me I am overdrawn. The best way to get your head around Boolean logic is to solve the ancient puzzle of the Two Guards. The puzzle featured in the 1986 movie, The Labyrinth, Kittens & Gorillas 151 starring David Bowie and Jennifer Connelly. If you want to cheat watch the film to see the answer. Here is the puzzle. I’ll put the answer on my website. Two guards stand barring your way and behind them are two doors. One guard always speaks the truth, while the other always lies. You are only allowed to ask one question of one of the guards. Your life depends on picking the right question to ask as, based on the answer, you must pick a door. One leads to life, the other to certain death. Is there a question you can ask to ensure you pick the door leading to life? TRY SOLVING THE GUARD PUZZLE ddd Twin Guards - Left door or Right If you are reading this, you picked the correct door and lived. 152 Are the Androids Dreaming Yet? Logic for Humans Syllogisms can be used for practical purposes. Take, for example, the following set of statements, “I want a hot drink.” “Coffee and tea are hot drinks.” “I always drink milk with tea,” “We have no milk.” What drink should I choose? I’m sure you can work it out. This logical problem follows a simple chain and results in me getting the hot drink I like. We use Boolean logic on a day-to-day basis. The simplest form is a checklist. Pilots use checklists all the time; do I have wings, fuel and a copilot? If they are all there, go ahead and fly. Otherwise do not. Mathematically speaking, a checklist is simply the product of the options. If they are all one, then the product is one – in this case we can fly. If any is false – represented by a zero – the product will be zero and we cannot fly. Life is often more complicated and we have many logical tools at our disposal. Let’s take a look at a few, starting with a famous historical one. Benjamin Franklin invented the lightning rod and bifocal glasses, as well as charting the Gulf Stream and all manner of other scientific discoveries. He described his process for decision-making when there are many pros and cons to consider. “... my Way is, to divide half a Sheet of Paper by a Line into two Columns, writing over the one Pro, and over the other Con. Then during three or four Days Consideration I put down under the different Heads short Hints of the different Motives that at different Times occur to me for or against the Measure. When I have thus got them all together in one View, I endeavor to estimate their respective Weights; and where I find two, one on each side, that seem equal, I strike them both out: If I find a Reason pro equal to some two Reasons con, I strike out the three. If I judge some two Reasons con equal to some three Reasons pro, I strike out the five; and thus proceeding I find at length where the Balance lies; and if after a Day or two of farther Consideration nothing new that is of Importance occurs on either side, I come to a Determination accordingly.” Another important piece of logic is reductio ad absurdum. Reduction to the absurd allows us to disprove something because, if it were true, it would lead to an absurd conclusion. An alibi is a familiar form. If I was seen in the pub when the murder occurred in the ballroom of the manor house and you claim I committed the murder, I must have been in two places at once. People can’t be in two places at once – that would be absurd. Conclusion: I am innocent! Kittens & Gorillas 153 Notice I not only prove I am not guilty I also prove the opposite: I am innocent. When a mathematician uses this trick, it is called an indirect proof and works the same way as the alibi. Assume the opposite is true of some theory you want to prove (I am guilty). If it generates a contradiction or paradox (can’t be in two places at once) you can deduce the opposite must be true (innocence). Mathematicians use this all the time. It assumes, of course, mathematics is consistent and that true and false are opposites. Some mathematicians argue this is too strong an assumption. Why should we assume consistency and recognize only two logical states, true and false? These mathematicians believe the only way to prove a theorem is with positive argument rather than using the opposite of a negative argument. They don’t allow indirect proofs in their mathematical models. This type of mathematics is unsurprisingly called positivism. It’s a pure theory but, unfortunately, if you try to follow it you lose much of our current mathematical knowledge and understanding. Most modern mathematicians think it a historical curiosity, but it does pop up from time to time. Modern mathematics is founded on the axioms that true and false are the opposite of each other and that inconsistency is forbidden within the system. Mathematical proofs submitted to journals are not permitted to contain inconsistencies or result in paradoxes. Paradoxes – When Logic Fails “I would not be a member of any club that would admit me.” Groucho Marx Paradoxes occur when a statement makes no sense, or results in an internal contradiction as with Groucho Marx’s famous quote. They are widely used in mathematics to implement indirect proofs. To do this, we suppose something is true, and if it results in a paradox then the Groucho Marx 154 Are the Androids Dreaming Yet? thing we thought true must be false and the opposite is true. This is a somewhat circuitous route to prove things, but it is often the only practical way. Two paradoxes we are taught as children are the liar’s paradox and Zeno’s paradox – also known as the story of the tortoise and hare. The first is a real paradox but the second is a false paradox. The liar’s paradox is just the simple statement: “This sentence is false.” It is a paradox because of the internal inconsistency: We cannot determine if it is a true or false. First assume it is true, but it says it is false, so it is not true. Then try it the other way around. Assume it is false but the sentence states it is false, so it must be true. If that were so it must be false by the first argument and so on ad infinitum. Either way around, the sentence contradicts itself. A paradox. Zeno’s Paradox, on the other hand, is a false paradox. Here is the story. Once upon a time there was a hare. He was a very arrogant hare and believed he could outrun any animal. A tortoise was walking along the way and the hare jumped out in front of him. “You are so slow,” said the hare. The tortoise replied, “You may be the fastest hare in the kingdom but I am the most persuasive tortoise. I bet I can persuade you of anything, including that I am faster than you.” “I don’t believe you,” said the hare. “OK,” said the tortoise, “let me show you. Give me 100 meters head start since you are so fast. Then, we’ll both start to run. After 10 seconds you will have run 100 meters and arrived where I used to be, but I will now be ten meters ahead. After another second you will be where I am now, but I will be 1 meter ahead again. So you can never catch me.” The hare pondered for a while but, being a hare of little brain, could not make out the true answer. It is a false paradox. The time intervals are getting shorter. The question for a mathematician is, does the problem converge to a solution. The answer is yes, and I can reframe the problem to see how it is solved. Let’s simply look at who would be ahead after 20 seconds: the hare! Kittens & Gorillas 155 The mathematical reason for it being a false paradox is that some series converge and some do not. If I move progressively closer and closer to something in smaller and smaller time intervals then I may indeed reach it. On the other hand, some series never converge. I will never reach infinity how ever many steps I take. The Barber Paradox Now, for a slightly harder paradox, let’s suppose there is a town with just one barber. In this town, every man keeps himself clean-shaven by either shaving himself or going to the barber; the barber shaves all the men in town who do not shave themselves. All this seems perfectly logical, until we pose the question: who shaves the barber? This question results in a paradox because, according to the statement above, he can either be shaven by himself or the barber, which is he. However, neither of these possibilities is valid! This is because if the barber shaves himself, then the barber must not shave himself and if the barber does not shave himself, then the barber must shave himself. You might think this paradox an oddity but, using this simple idea, Bertrand Russell changed the course of mathematical history and it is the fundamental paradox used to show computers are Turing limited. The Russell Paradox In the late 19 th century, mathematicians began to think about the nature of numbers. What is a number? It is certainly not an object we can hold. I can’t hold a two, unless it’s the brass number plate, for my front door. And, in that case I am holding one number plate, so I am not holding the idea of two, but rather the idea of one: one brass plate in the shape of a two. The ‘idea’ of a number is to say something about the things I have in my hand: two apples, two oranges and two brass number plates. These are all sets of two things and ‘two’ is the collection of all these sets. In 1890, Gottlob Frege completed his theory of sets. The project had taken him five years. Unfortunately, just before sending the book to the publisher, Bertrand Russell wrote to him and pointed out the following paradox. What about the set of sets that does not contain itself? Think about it... 156 Are the Androids Dreaming Yet? It is the barber paradox with the word ‘set’ substituted for ‘barber’ and ‘contains’ rather than ‘shave’. But it’s essentially the same logical problem. You might find this rather contrived but mathematicians must have a system totally free from paradox, otherwise there is no certainty. Frege’s system was holed below the water line. Eventually, after much further work, a theory of sets was worked out that does not contain the Russell Paradox. It’s called Zermelo-Fraenkel set theory, or ZF for short. It solves the Frege problem by forbidding sets to refer to themselves. It’s a bit like Microsoft Excel’s solution to dividing something by zero. It is simply forbidden and generates an error message. Set theory was fixed and is now the basis of most mathematical thinking. What is Logic for? Logic is the foundation of mathematics. Applying it enables us to make irrefutable statements about things: numbers, lines, planes, equations and the so on – the things you learned at school – and to prove statements about these things beyond any doubt. This is not the ‘reasonable doubt’ hurdle of our law courts, but an absolute measure: No possible doubt whatever. Let’s look at one of the earliest mathematical proofs: Euclid’s proof there are an infinite number of prime numbers. Euclid created this proof in ancient Greece around 300BC – so far back that logic was in its infancy Euclid’s Elements 100AD Kittens & Gorillas 157 and numbers had not yet been properly invented. Euclid used distances rather than numbers for all his proofs but I will use the word ‘number’ in this explanation. First a little revision. A prime number is a number that can only be divided by itself and one, for example three, five, seven, and eleven. All numbers can be split into primes using a couple of tricks. First, all numbers are divisible by a set of primes. Ten is five times two – two primes. We are also fairly sure we can form any number by adding two primes together. This is Goldbach’s Conjecture, set as a question in a letter written to Euler in 1742. It is still unproven! Euclid proved there are an infinite number of primes by using reductio ad absurdum. Imagine we have a complete list of prime numbers – James’ list of primes. It contains every prime number. (This is the setup. We are proposing something we suspect is incorrect and will lead to a paradox or contradiction. When it does, we will have proven the opposite fact. The proof relies on the fact that a number can either be prime or not prime. There is no middle ground.) Let’s make a new number by multiplying all the numbers on my list together and adding one. There are two possibilities: this new number is either prime or not prime. If the number is prime, it is a new prime number that was not on my list and I have disproved the theory. If it is not prime then it must be divisible by two prime numbers already on my list. However, neither of these numbers could have been on my list, because dividing by one of them would give me a remainder of one. Remember I multiplied all prime numbers together and added one. It must, therefore, be a new prime number, which had previously not been on my list. Once again, I disprove the theory. Since both routes fail, James’ list of prime numbers is not complete and, therefore, prime numbers are infinite. Feynman’s Proof My favorite piece of logic is Richard Feynman’s disproof of the existence of polywater. It’s a strange logical proof bordering on philosophy, but it shows just how far you can take logic. In 1969, an urban legend spread around the world that there was a substance called polywater. It even made it into an episode of Star Trek. Polywater was believed to be a lower energy state of water, more viscous than ordinary water. If this substance did exist, it would be possible to mine the oceans of the world converting water to polywater and 158 Are the Androids Dreaming Yet? therefore generate energy. There was a concern that if the right catalyst was accidentally introduced into the oceans they would solidify into polywater thus dooming the human race, or at the very least making water sports impossible! Feynman was consulted and stated, “If there were such a substance as polywater then there would have evolved an animal that eats water and excretes polywater, using the liberated energy as its power source. Since there is no such animal, polywater does not exist.” Feynman’s proof is an elegant indirect proof coupled with a syllogism. Polywater exists. Polywater is a lower energy form of the highenergy substance called water. Food is a high-energy substance that can be converted to a low energy substance by a process we shall call ‘being good to eat.’ All things on earth that are good to eat have something that eats them. Polywater is a food and therefore good to eat. Therefore an animal must exist that eats polywater. No such animal exists, so either something in our chain of logic is wrong, or the premise is unsound. Since the chain is sound, the original premise must be wrong: Polywater cannot exist. In short, Feynman’s proof says: if a thing is so, then the inevitable consequence is the evolution of something else, and since that something else does not exist, the original thing cannot be so. QED: disproof by nonexistent consequence. The polywater disproof neatly demonstrates the important elements of Feynman’s Evolutionary proof. First, life must be continuously exposed to the thing in question, in this case water. This is clearly so as most life on planet Earth lives in the oceans or is intimately entwined with water. Evolution takes time, so enough time must be allowed for life to evolve. It must be a nearly linear problem so that a solution proceeds in steps where each step is an improvement and no step requires too high a level of mutation or adaption. We can illustrate the boundary between a linear problem and one requiring a step change by describing how triple drug therapy works in the treatment of AIDS. Until triple drug therapy entered the picture progress against AIDS had been a depressing story of drug discovery followed by the almost immediate evolution of the virus to evade the drug. The AIDS virus is a retrovirus with a shell composed of sugar molecules. It is almost trivial for an AIDS virus to mutate these outer markings to look different, even from one day to the next. This is the way the virus continually and nimbly evades our immune system. However, the AIDS virus does have some components that it can’t easily mutate because they are not merely aesthetic, they have a functional purpose. Why not target them? Kittens & Gorillas 159 Unfortunately, it turns out the AIDS virus can even mutate its functional parts, but this is harder. The probability of a successful functional mutation is 1000 times less likely than a simple aesthetic mutation to the sugar coat. Triple drug therapy works by attacking three different functional elements of the virus simultaneously. It is possible for the virus to modify all these functional elements but the likelihood of it doing so is tiny. One mutation alone does not help because the drug cocktail will still target the other two elements and kill the virus. The AIDS virus does not understand that it is facing a triple drug cocktail. It cannot reason like a sentient being and random chance is not sufficient to make the big leap necessary to overcome the cocktail of drugs. Unless you can mutate all three elements at once your time as a virus particle on this planet is over. Most problems we have to solve in this world require more than one simultaneous logical step and these don’t happen by chance. Chapter 7 COMPLEXITY & CHAOS Mandelbrot Set “Life is really simple, but we insist on making it complicated.” Confucius “Any darn fool can make something complex; it takes a genius to make something simple.” Pete Seeger There was once a great King who lived in a marvelous palace. To fend off boredom he collected all manner of interesting games and puzzles. One day an inventor came to his palace and told the King he had a game of such subtle complexity, yet apparent simplicity, the King would play no other. The King learned the game and soon agreed it was, without doubt, the best of all games. The game was, of course, ‘chess’. The King asked the price of this game and the inventor told him it was a mere trifle. The King should give him one grain of rice on the first square of the board, two on the second, four on the third, and so on, doubling each square until he filled the board. The King called his treasurer to honor the bargain and the first bags were brought from the storehouse. The grains were placed on the board in each square but soon there was not enough space and the grains had to be piled on the table next to the board. Soon this, too, was not enough and every table and chair in the hall had to be covered. Even this was not enough and they began to stack whole bags up in the courtyard. When they reached the thirtieth square, the treasurer turned white. He sat and calculated for a while before saying with a trembling voice, “My great ruler, there is not enough rice in all the world to cover this board.” The ruler called the inventor and told him he could not honor the debt and the inventor should name another price. The King had two beautiful daughters, the first knew she was beautiful and deported herself accordingly, and the second, was bookish and shy, but perhaps more beautiful for this. The inventor asked for the hand of the second daughter and lived happily ever after. In the less favorable version of this story, the King becomes very angry and has the inventor beheaded. I prefer the romantic version. Placing rice on a chessboard and doubling it successively leads to wildly large numbers. Covering it completely requires 18,446,744,073,709,551,615 grains, about four hundred trillion tons and equivalent to one thousand years of worldwide rice production. Like the king, humans do not intuitively grasp the enormity of this problem because we’re not good with large numbers. Although the number of grains needed to cover a chess board is very large, it is not hard to calculate. The treasurer is the one who should have lost his head for not being able to do the calculation. The equation is simply two, doubled sixty-four times, less one, 2 64 -1. A pocket calculator can produce this number in a thousandth of a second: it’s just long multiplication. Although calculating this number is quick, it is not always the case. Answers to some problems have short cuts, while others do not. 164 Are the Androids Dreaming Yet? Mathematicians have catalogued the universe of problems into classes rather as biologists have catalogued animals into species. Each problem is examined and put into a genus with a name. Sadly the names are not as readable as the Latin names for animals. For example, ‘nlogn’ is the complexity class of most sorting programs, while traversing a maze typically sits in the class NP or P/POLY. Although the classifications look complex the basis of cataloging is simple, a class name signifies the time needed to solve a problem using the best possible algorithm, and the scale this is measured in is ‘Big O’. Big O Every problem has a complexity. In mathematics this is expressed using ‘Big O’ notation, where ‘O’ stands for order-of-magnitude. The simplest problems have order 1. If I am working at my computer on a Word document and I press print, the printer will spring to action and print the document. This problem is of flat complexity, notated O(1). It does not matter how large the file is; one click is all I need. I am, of course, assuming sufficient paper in the printer and ink in the cartridges. The next complexity class is a linear problem, O(n). For example, walking to the store to buy a pint of milk. The farther the store, the longer the walk. The time needed to get to the store is directly proportional to the distance: if I am walking, a single step multiplied by the number of steps required to cover the distance. You might think adding two numbers together is a linear problem – the bigger the number, the harder the problem – but there’s a clever trick to speed it up. You can get 10 people to add each column in parallel. They’ll need to coordinate when someone ends up with a number larger than ten and has to carry the extra digit but this can be easily solved. A problem gets its classification only once we’ve used the cleverest possible trick to solve it. Most problems we meet in mathematics are somewhere in between flat and linear but there are some that are much harder. The most common hard problem we come across in our daily lives is sorting. Rather than go through a tedious written description, check out the video link on my website. Sorting without using any spare space requires a bubble sort. This is an example of something that needs n squared operations and, since n squared is the simplest example of a polynomial, it is said to be in the polynomial time, or ‘P’ time classification. Complexity & Chaos 165 Bubble Sort Ballet The Hardest Problems You probably hope cracking the encryption used to secure the Internet is one of the hardest problems known to man but I’m sorry to tell you it is not. When you use your credit card to buy something from an online shop, your web browser changes from http to https, the ‘s’ stands for secure. The data you send to the Internet is coded using a system developed in 1977 by Ron Rivest, Adi Shamir and Leonard Adleman of MIT, which is why it is called RSA encryption. Any information you send is raised to the power of a very large number – usually around one hundred digits long. Raising something to the power simply means multiplying it by itself that many times. What makes decrypting a message hard is that division is a slow process; it is called ‘long division’ for a reason. It turns out there is no way to speed it up on a conventional computer so, unless you know the right number to divide by you will have to try every number. It is this that makes decrypting RSA messages hard. Although RSA messages are difficult to decipher, they are nowhere near the hardest problems. That accolade is commonly believed to go to non-deterministic polynomial problems known as ‘NP’ problems. NP problems are easy to describe but fiendishly difficult to solve. Nondeterministic means each time you come to a branch in the problem there is no way to tell which branch is the best to pursue without exploring it all the way to the end. It’s the same as a maze; at each junction in the maze you can decide which path to take, but the junction gives you no 166 Are the Androids Dreaming Yet? Maze clue which one will be better. Beware the confusing naming system, ‘N’ stands for nondeterministic in this case, whereas in normal complexity classes ‘n’ stands for number. Sorry. That’s just the way it is. Let me give you an example of one of these NP problems. Let us assume we have one of those complicated recipes from the latest celebrity chef cookbook. If all the ingredients can be bought from one store, making the dish is straightforward, but if they come from different stores, you will have your work cut out. What is the best order to visit them? With 2 shops, it’s trivial. Either order will do. With 3 it is a little harder and with 4 there is quite a bit of choice. This is known as the ‘traveling salesman’ problem because the original formulation described a salesman wishing to find the shortest route between all the cities in which he had customers. The complexity of this problem rises much faster than the Rice and Chess Board problem. Try it for yourself. It doesn’t matter if you imagine you are visiting customers or shops. I have given you a grid to count off distances. Try to solve a problem for 3 cities, 5 and 10. What is the shortest path allowing you to visit each place? Complexity & Chaos 167 TRY THE PUZZLE ON THE WEB Warning: Don’t spend too long on these problems. ddd The reason I warned you not to spend too long is that solving the 50-city problem would take longer than the age of the known universe. NP problems get harder very fast as the number of elements goes up. A 50-city problem is hugely larger than a five-city problem, not just ten times harder. The Clay Mathematics Institute has offered a $1 million prize for anyone who can say whether NP problems are really as hard as they appear. It may be there is a general trick or a series of tricks that allow you to solve any NP problem in a shorter time. If you could do this, the problem would be demoted to P, allowing fast computers to tackle it. No one has yet found a proof of the P=NP problem. At the time of writing several proofs are sitting with the Clay Prize judges but don’t hold your breath. Most people assume there is no solution. If you want to have a crack at the problem let me state it in simple terms. Traveling Salesman 168 Are the Androids Dreaming Yet? Imagine you wanted to find the center of a maze. Is there a way to speed searching the maze, so you do not have to test every branch? If you can provide a mathematical proof that there is or is not, you win the prize. Places Game While it is commonly assumed NP problems are the hardest, this is not the case. There are quite a few that are harder still. One such is called a PSPACE problem. It’s quite difficult to explain but luckily many of you will have played a form of it on long car trips when you were a child: My family calls it The Places Game. I will pick a place – ‘London,’ and you must then pick another place, say, ‘New York’, that starts with the letter my place ends with. I’ll then pick ‘Canterbury’ and my kids will laugh at my dyslexia and I’ll have to switch to ‘Kansas’ and so on. Once you use a place you can’t use it again. The mathematical question is to predict who will win given each player has a finite list of places they know? It turns out this type of problem is even harder to solve than an NP problem. This is because on each turn a player gets to pick any name from their list. With the traveling salesman problem, there is only one ‘player’ – the salesman – so we can write out a route and check it. In the Places Game there is no single route through the game because, after I pick my favorite town ‘London,’ you can pick any place beginning with ‘N’. I have to anticipate an enormous table of possible paths through the game. The table takes huge physical space – which is where PSPACE gets its name. Remember I’m just playing the simplest mathematical games with bits of paper and discrete ideas. I haven’t strayed into the quantum world yet. That brings with it a whole new level of complexity to explore. Complexity is such a diverse subject that Scott Aaronson of MIT has created a web site called the complexity zoo to catalogue all the different ‘species’. It is much to complex to reproduce here but let me provide a sketch. The Complexity Hierarchy My table below represents the hierarchical complexity of knowledge. We start off with the problems both humans and computers find easy, then rapidly move onto problems that even the fastest machines find difficult: a perfect game of chess or predicting the weather. Above these computable problems are the non-computable ones which no computer Complexity & Chaos 169 running any algorithm can solve, and then there are the free will problems: how do we pick a problem in the first place? How do inventors come up with problems no one had ever thought to solve in the first place, such as the invention of the Rubik’s Cube? Ernő Rubik’s Cube Problem Example Flat Print File (for Human) nlogn Searching a list Linear Finding the lowest number in a list Logarithmic Long Multiplication Exponential Long Division P Most Algorithms Near NP Factor Prime Number NP-non-complete Perfect Game of Chess NP-Complete, tractable Travelling Salesman, SAT Chaotic Weather NP-Complete, Quantum Modeling a Quantum Process NP-Complete, intractable Busy Beaver, Towers of Hanoi PSPACE Graph Problems, Places Game Creativity, Finding Fermat Theorem for a Non-computable Turing machine, Tiling the plane with Penrose Triangles Non-deterministic, Non- time divisible, Non- Free will computable Halting problem for a Turing Machine, some mathematical theorems such as the Continuum Impossible Hypothesis in ZF+AC (Hilberts 1st). Travelling faster than the speed of light. Understanding the American tax code. Known Unknowns I know that I don’t know either way. Unknown Unknowns I have not thought to ask that question yet. Inventing the Rubik’s Cube Butterfly “Does the flap of a butterfly’s wings in Brazil set off a tornado in Texas?” Philip Merilees, improving on Edward Lorenz Chaos Chaos is the twin of complexity. It burst into the public psyche in 1987 with the publication of James Gleick’s book Chaos. It’s not a difficult concept to grasp. Complex systems can be formed using simple rules, and very small changes in starting conditions can profoundly affect future events. I experience this if I miss my train to work in the morning: 30 seconds either way will change the whole pattern of my day, the people I meet and the level of stress I experience. I’m sure you can think of similar experiences. Henri Poincaré, a French mathematician, first studied the effect back in 1880. Poincaré was trying to solve an old mathematical problem called the Three Body Problem originally set by Isaac Newton. Take the Earth, Mars and the Sun. These three bodies orbit each other, or strictly speaking a point in space somewhere between them. Is there an equation that will tell you where the bodies will be in, say, 100 years’ time? The answer is surprising, no. The three bodies will orbit in a non-repeating way. There is no analytical short cut, no equation that will predict where they will be Poincaré 172 Are the Androids Dreaming Yet? at some point in the future. The only way to know is to build a perfect model of the system and see what happens. Poincaré won a valuable prize for his proof from the King of Bavaria. You can see some amazingly complex orbits plotted below. Remember these are still deterministic and predictable – after all, they were calculated with a computer – they are just chaotic. Four Body Problem Butterflies and Sliding Doors After Poincaré, the field of chaos remained fairly quiet until Edward Lorenz began studying weather patterns using computers in the 1960s. The story goes, one day his computer was misbehaving and he had to rekey some data into the machine. Rather than using eight decimal places he used only six to save time, and was amazed when the results of his program came out completely different. Dropping the seventh and eighth decimal place represents a change of only one part in a million, yet the patterns of weather predicted by the computer were completely altered. Complexity & Chaos 173 Lorenz went on to study the effect and created a new branch of mathematics. His quote about the beat of a butterfly wing creating tornados has entered the public psyche and is central to the plot of numerous Hollywood movies. One of his functions – known as the Lorenz Attractor – nicely illustrates the nature of chaos. A very simple equation plots the beautiful, apparently three-dimensional, non-repeating shape. Chaosville Chaos, taken to its logical conclusion could explain our Universe. Stephen Wolfram in A New Kind of Science, makes the argument that simple rules could explain the extraordinary complexity we see in our Universe. He applies rules to elements in a two-dimensional grid programmed on the computer which form ‘cellular automaton’ that function a little like simple animals, generating all manner of complex shapes and behaviors. The inspiration for this approach is almost certainly Conway’s Game of Life developed by John Conway in the 1960’s. In his computer game, animals and machines seem to appear on the screen but in truth they derive from the most simple set of rules. You can check out the website to see a live version of Conway’s Game of Life. It’s a lot of fun. Wolfram’s Strange Attractor 174 Are the Androids Dreaming Yet? thesis is that we could all be living in one of these games. Perhaps our Universe is a form of Mandelbrot diagram – albeit a 3D version with stars and planets. If you look at the picture of a nebula and compare it to the Mandelbrot set, you can see how this is a tempting conclusion. In the Game of Life the rules are simple yet the behavior simulates little animals being created and destroyed. Of course, there are no actual animals. The things you see on the screen, ‘gliders’, ‘walkers’, and ‘cannons’, just hang together accidentally. But, Wolfram considers these little digital creatures are animals. He argues our Universe is just like the Game of Life: A set of simple rules leading to complex behavior. If we are Nebula Complexity & Chaos 175 Cellular Automaton prepared to call ourselves animals, so should the little creatures which emerge within the game. We simply emerged in a similar but slightly more complex game. This proposal would mean our Universe is entirely deterministic, our lives the result of a gigantic computer program that we live within and form part of. Chaos might make it impossible to predict the future without running the program and watching what happened, but the results would be inevitable, set in motion at the dawn of time. There is no place for free will in such a Universe, no place for reason. The world would simply be. But a strange idea will come to our aid to show us the limits of computation and allow us to question whether we live in a predetermined world. This idea is Aleph 1 – something larger than infinity. And it is infinity we will explore next. Conway’s Game of Life Chapter 8 ∞ Hilbert’s Hotel “All infinities are equal, but some are more equal than others.” George Orwell, paraphrase “Only two things are infinite, the universe and human stupidity, and I’m not sure about the former.” Albert Einstein “God gave us the integers, all else is the work of man.” Kronecker Health warning! The man who discovered infinity had a mental breakdown. This subject may tax your brain. Georg Cantor didn’t really ‘discover’ infinity but he was the first mathematician to put it on a firm theoretical footing. In the late 19 th century, most mathematicians thought infinity was a curious idea with no proper place in mathematics. They treated Cantor’s attempts to make it into a real mathematical object with contempt. This affected Cantor’s morale and caused him to suffer several bouts of deep depression, retreating to a sanatorium from time to time. Infinity is a difficult idea to grasp but it is vital to our study of information. It behaves counter-intuitively but is not impossible to grasp. The reason it is important is that information can always be translated into numbers and numbers go on to infinity. If you want to know all about information, you must understand infinite numbers. History Indian scholars began studying infinity in the 4 th Century BC. It turns up naturally in all manner of places. In geometry, parallel lines extend forever in either direction without ever meeting. To define a parallel line you must contemplate infinity. In arithmetic, even if you pick the largest number you can imagine, there is always a larger one; just add one. In the physical world if you look up at the night sky it appears to go on forever. Again you have infinity. Historically there were two interpretations of infinity. The first, favored by Plato, was a journey. When you embark upon a journey, you can always take another step. Infinity is the idea of ‘one more’ or neverending. It can never be reached. The second definition is more radical. Infinity is a thing, a number so big you could not imagine anything bigger, but it is one number. Plato thought this second definition tantamount to madness. Today we embrace this madness and go a whole lot further. Let me show you how. If infinity were a number, you should be able to perform mathematics with it; add it, multiply it, and even raise it to a power. This is not as radical as it might first seem. Until comparatively recently, zero was not accepted as a number – if you consider recent to be one thousand years! Nowadays it is. At the end of the first millennium Indian scholars found, against their intuition, that you can use zero as a number without generating contradictions. Take addition. I can have zero cakes, add one, and I have one cake, add another, and have two cakes and so on. In this way, 180 Are the Androids Dreaming Yet? the number zero behaves just like any other counting number. It also works with multiplication. If I have zero lots of 4 cakes, I have no cakes. Zero times four is zero, so multiplication with zero works. There is one embarrassing exception, if I divide by zero I seem to get infinity. When I was a child this was a definition for infinity, but nowadays mathematicians simply forbid the operation. Division by zero is not allowed and if you try it on your computer, you will get the not terribly useful, #DIV/0! Error. That’s progress I guess! Zero had been tamed. What about infinity? Cantor showed that while you could think of infinity as a number, it might not be just one number. He proposed there are many infinities. In fact, there are a greater than infinite number of them! He did this through a rigorous analysis of a new branch of mathematics called set theory. Set theory is now the cornerstone of modern mathematics, but it was treated with suspicion in Cantor’s time. Rather than embrace the new thinking, many mathematicians ridiculed it; Poincaré wrote that Cantor’s ideas were a grave disease infecting the discipline of mathematics! This seems odd given our modern propensity to embrace innovation, but the tone of science back then was different: innovation was not necessarily considered a good thing. At the turn of the 20 th century, scientists were on a mission to tidy things up. Lord Kelvin announced in 1890 that mankind had discovered everything there was to know and the role of future scientists was simply to catalogue and observe the consequences of these laws, and to improve the accuracy of measurement. The last thing scientists wanted was a completely new set of numbers that behaved in strange ways. Cantor was upsetting the apple cart, but he was in good company. Just a few miles away in Berlin, a young Albert Einstein was beginning to study physics in his spare time. Those studies would culminate in his four papers of 1905, two on Quantum Mechanics and two on Relativity, ushering in the modern age of physics. How to Count To understand infinity you need to count in a particular way. You’re probably used to counting with numbers. You count apples: one, two, three, and say, “I have three apples.” You can do the same with oranges. If you have three apples and three oranges, the totals are the same and you can declare you have the same number of fruits. This is the first way to count. ∞ 181 But there is a second way of counting. Take your apples and put each next to an orange. If they match up, you can easily see they are equal in number. “Look,” I say, “I have the same number of apples as oranges.” This method is more primitive and does not require the concept of numbers, but it is very useful. If I’m a shepherd I can hold a set of counters in a bag, one for each sheep. To ensure all my flock are gathered in for the night I drop one counter into the bag as each sheep enters the enclosure. I don’t need to give the counters number names. The Munduruku tribe, from the Amazon rainforest, have no concept of number names beyond five. Their counting system simply goes one, two, three, four, five, many. Yet this second way of counting allows them to function successfully, deciding whether two groups of things have the same number of elements, even if there are more than five of them. For example, if they need to determine if they have enough spears for a hunt, each person simply stands next to their spear. If everyone has one, they’re ready. If not, then the empty handed Munduruku simply make one. No need for pesky numbers or mathematics lessons. This second way of counting is particularly useful when tackling infinity because we are not sure what infinity is. Treating it the same way the Munduruku treat the number ‘many’ is the safest thing to do. The first question we would like to answer is whether all infinite things are the same. Spears and Hunters 182 Are the Androids Dreaming Yet? We know from our childhood that infinity plus one is still infinity. Is there anything we can do to make infinity bigger? Perhaps multiplying infinity by infinity will do the trick. Infinity times infinity can be visualized as a square with edges of infinite length. We can show that this square is the same size as a onedimensional infinity through a clever trick – the zigzag method. Mark the infinity square into a grid. Start in the corner square, go across, diagonally down, then across, diagonally up, and so on. I’ll draw you a picture. We visit every square in our grid using a single line. We can then lay down our infinite zigzag line next to the infinite line of one of the edges. The lines are the same length as they are both infinitely long! So infinity, times infinity, can be matched to infinity, they are the same. Cantor thought this a very strange result and wrote to a fellow mathematician, Dedekind, “Je le vois, mais je ne le crois pas!”, “I see it, but I don’t believe it!” If you are struggling with this, don’t worry. We just jumped forward to quite a complex concept. Let’s take it more slowly. One way to get a better grip on infinity is through the stories of David Hilbert and the Infinity Hotel. Infinity for Dummies Hilbert’s Hotel is a mythical building with an infinite number of rooms. Other than this strange feature it is a regular hotel complete with minibar, dodgy TV, and slightly mad manager. The rooms are numbered sequential starting at one, then two, three, four, and so on. The hotel allows you to play a series of mathematical games to see how infinity behaves. Are there the same number of minibars as there are rooms? That’s easy. I said every room has a minibar. We can use the matching technique to match minibars with rooms. Go to the first room. There is a number on the door and a minibar inside. The same goes for room 2 and 3 and this goes on forever. I’ve just proven two infinite things are the same – rooms and bars, but I still have not shown you why the zigzag line is the same length as the edge line. When you first explain infinity to a child they immediately ask “What’s infinity plus one.” A particularly smart kid I met, Dermot, asked, “What’s infinity plus three?” Hilbert’s Hotel allows us to answer this problem in a way we can visualize. ∞ 183 Traversing an Infinite Plane with a Line The infinite hotel is full. A man comes to the front desk and asks for a room. The hotel manager says, “I’m terribly sorry, but we are full… But I may be able to help you. Let me think.” He ponders for a moment and then says, “OK – I’ve found you a room.” He calls the people in room 1 and asks them to move into room 2. He calls the people in room 2 and explains that due to a double booking they must move out of their room to let the people from room 1 in. But it’s OK; they can move into room 3. Everyone moves up a room and the new guest gets the checks into the now vacant room 1. This is a little harder to understand. We did not have a perfect oneto-one match as with the rooms and mini-bars. We had a mismatch of guests to rooms. But, we were able to show it is possible to re-establish a one-to-one match by doing something to every guest, having them move up a room. There is no problem with the last guest because it is an infinite hotel, there is no last guest! Another way to visualize the problem is to ask ever hunter to pass their spear to the right in the picture below. 184 Are the Androids Dreaming Yet? Hunter with Spears Provided there are an infinite number of hunters there is always someone to hand the spear to and the person at the front of the line now has space for another spear. You can probably see how to answer Dermot’s question. The hotel manager calls the guest in the first room and asks him to move 3 rooms up rather than one. He then calls the remaining guests and tells them the same thing. Thus, he has managed to fit three more people into the infinite hotel. Infinity plus 3 is infinity. You may worry that it takes the manager an infinite time to call all the rooms, but it’s OK; he lives infinitely long so it all works out. What about fitting an infinite number of new guests into the already full hotel? Surely then we will get stuck. No, Hilbert’s Hotel can fit an infinite number of extra guests. Here’s the trick: ask all the people currently in the hotel to move to the room with double the number they are currently in – 1 goes to 2, 2 goes to 4, 3 goes to 6, and so on. Now all the odd numbers are empty and you can fit an infinite number of people into the empty odd rooms. Infinity plus infinity is infinity. Voila. ∞ 185 Now, a very clever or annoying student asks, “What happens if an infinite number of infinitely large buses arrive at the hotel. Can they all fit in?” The mathematical question is “does infinity times infinity, equal infinity?” Let us ask all the guests to get out of the bus and line up in the parking lot in neat rows. Passengers from bus one in line 1, those from bus 2 in line 2, and so on. All the guests now form a two-dimensional grid. We already know how to map a two-dimensional grid to onedimension using the zigzag method. We can fit them all in the hotel and we are done! Is Anything Larger than Infinity? Is there any bus or combination of buses that would cause the manager of Hilbert’s Hotel a problem. The answer is yes and it involves a subtle change to the contents of the bus. An infinite number of buses turn up but this time the buses are filled with men and women. The hotel manager is asked to put everyone in a room and once again he obliges using the zigzag method. At the end of the process the tour guide comes to him. “I think you have missed some people,” he says. “Since I am just one person, I know you can fit me in. But, I have a whole bus in the car park you completely missed.” “No,” says the manager. “I did every bus.” Infinity Plus Infinity Equals Infinity 186 Are the Androids Dreaming Yet? “Ah, no,” says the tour guide. “The first bus you accommodated had a man in the first seat but this has a woman. The second bus had a woman in the second seat but this one has a man and so on. This bus has a different gender in at least one seat to every bus you so far accommodated. It is a new bus.” The manager finds room for the passengers from the new bus but the tour guide comes back a moment later. “You have missed another bus. This one has a different gender in at least one seat to every previous bus, including the one you just accommodated. It looks like there are an infinite number of buses you missed, all lined up to get into the infinite hotel.” What is it about these buses that make them so difficult to accommodate? They are all just filled with people after all. The manager is defeated by the more complex information held in the contents of the buses. An infinitely large bus full of binary information has more information in it than an infinitely large bus specified only by its size. This is a larger infinity than the counting infinity. The permutation of all the possible options for the occupants of the bus is larger than infinity. Real Numbers What about the real world we live in? Is the larger infinity we failed to fit into Hilbert’s Hotel present, or was it just a mathematical fiction? Hold up your thumb and index finger for a moment. The gap between them is a distance. Most likely this is a whole number with an infinite decimal digits after it – say 2.2320394386…. centimeters. The infinite set of decimal digits in this measurement is the larger type of infinity: called the continuum. Distances in space form a continuous unbroken line of points, with no gaps in between. The counting numbers, on the other hand, form a broken line. We take discrete steps from one number to the next. This is a hard distinction to grasp but it is the same distinction we used in Hilbert’s Hotel. Imagine you believe you have a list of all the real numbers in the world. You can take the first decimal digit from the first number and add one, the second digit from the second number add one and so on generating new numbers not on the original list. Therefore, you cannot have a list all the real numbers; they are not countable. Let’s take a closer look at these real numbers. Here’s a quick test. Which is the larger number, the first or the second? ∞ 187 Holding a Real Number in your Hand ddd First: 3.1233249837583462136421472374 Second: 3.1233249837583462134421472374 You have 2 seconds to answer! TRY ANSWERING WITHOUT READING ON The first is larger. I changed one digit. Can you see? Notice, you need time to read each digit and process the information. If you were an obedient reader and attempted it in two seconds you either guessed or gave up. Two seconds is too short to take in all the digits. Let me give you another test. Again, I’ll ask you the question, “Is the first number larger than the second?” ddd First 3.12332498375834621364214723751646464646464636… Second 3.12332498375834621364214723751646464646464636… 188 Are the Androids Dreaming Yet? I know you’re looking for the difference but you won’t find one, as I did not have time to write the numbers out in full. The 10 20000th digit is different, but even if I took the whole age of the universe and counted as fast as possible I would not reach this digit. Any number greater than, 10 120 /10 -43 digits cannot be distinguished from another in the age of the observable universe. Real numbers are in practice subject to an uncertainty principle. Some mathematicians even wonder whether they really exist. But, they do exist in our minds and our thought experiments. In my view, any model of the Universe that ignores them is likely to be wrong. Random Numbers Which of the following numbers is random? ddd 11111111111111111 34289460370124001 49293741762343083 THINK ABOUT YOUR ANSWER THEN READ ON Each of the numbers could be random. There is no reason any set of 10 digits is more likely than another, but it feels very unlikely that if I tried to generate a random number I would get 15 consecutive digits. What a human means by random is a jumbled up number: one with varying digits that have no real pattern. An American mathematician, George Chaitin has been able to explain this by saying that a random number is uncompressible. This means there is no way to describe the number more efficiently than writing it out in full. A string of ones can be compressed. “Write a million 1s” takes only 18 characters, yet accurately describes a number that is a million digits long. By contrast 8988376132 can’t be compressed very much at all, its information is just a jumble. There are many interesting numbers around. Some numbers are Hamlet; some numbers are pi. One interesting number is the following: 17733173332032037377. It is the genetic sequence for the virus smallpox, or at least the first 20 digits. Copies of the full sequence sit under lock and key in the Pasteur Institute in France and the CDC in Atlanta. This number is a candidate for an ‘evil’ number. You might think there are many numbers that could represent smallpox because there are ∞ 189 Smallpox Virus many languages in the world and many ways you could code the genetic sequence of GATC. But, there will be one most efficient binary coding for smallpox and that number is the nearest we have to an evil number. The other important element of random numbers is the process by which they are created. Computers can’t genuinely generate random numbers. The numbers they generate are predictable and eventually Child Survivor of Small Pox 190 Are the Androids Dreaming Yet? repeat. To create the random number in my example above I went to www. random.org, a website that uses fluctuations in atmospheric quantum noise to generate random numbers. As far as we know quantum effects are truly random and have neither rhyme nor reason. Numbers are more complex than they first appear. They are infinite, yet there are different infinities, and they have meaning. The smallpox example above and the Turing numbers we will discover shortly suggest numbers do have meaning independent of culture and language. The next two chapters will show us what happens when we think about the meaning of numbers. We will also explain one more ‘super infinity’ and this will be the key to understanding creativity. “There are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns - the ones we don’t know we don’t know.” Donald Rumsfeld United States Secretary of Defense (2001-2006, 1975-1977) Chapter 9 KNOWN UNKNOWNS Donald Rumsfeld In the spring of 1981, London staged its first marathon. The field of runners included 1200 international athletes and 20,000 amateurs. An estimated 20 million viewers watched from around the world. The top international runners stayed together for the first twenty miles and then two runners, American Dick Beardsley and Norwegian Inge Simonsen, made a push for the finish. They were long-standing rivals and, as they ran the final mile each man challenged the other to see if they could get ahead and gain the advantage. Because of the fine balance human muscles maintain between anaerobic and aerobic metabolism, the small set advantage could prove insurmountable. The other runner would need to sprint to catch up and the resultant lactic acid generated would turn their legs to jelly. As the two runners neared the finish line they glanced at each other, smiled, reached out and held hands as they crossed the line. Who won? We all instinctively know the answer. The race was a draw, but the rules of the International Athletics Federation are clear. Read rule 164. RULE 164 The Finish 1. The finish of a race shall be denoted by a white line 5 cm wide. 2. The athletes shall be placed in the order in which any part of their bodies (i.e. torso, as distinguished from the head, neck, arms, legs, hands or feet) reaches the vertical plane of the finish line. The organizing committee held a brief conference and the result declared a draw. They had interpreted the rules in the same way 20 million TV viewers already ‘knew’ to be true. This story should set your minds thinking about the nature of rules and truth and how the two are often different. According to the rules, one person crossed the line a little ahead of the other. The truth, as we all instinctively know, is that the race was a draw. Maybe the rulebook is missing a rule – ‘The contact draw rule’. Clearly you could amend the rulebook to add this one rule. I checked the current athletics rules and they don’t contain this amendment. If the rules were amended the mischievous amongst you will realize an unsporting athlete could grab the hand of their opponent as they crossed the line to force a draw. The rules would have to stipulate that holding hands must be voluntary for both parties, and refinements could go on for some time. What if I held your hand but you tripped and let go? What if my attempt to hold your hand Known Unknowns 193 caused you to trip? You could go on forever, generating rules to cover every eventuality. Clearly, in the fuzzy world of human endeavor, truth and rules often part company. Yet, we all assume mathematics is free of such uncertainty. Let me tell you this is not so. The brilliant mathematician Kurt Gödel proved this when he was just 22, and his proof says something fundamental about the nature of knowledge. The story of his discovery involves Kurt Gödel some of the greatest mathematical thinkers in history. My introduction to it came about from a chance accident. I became ill in my first year at University (mononucleosis, otherwise know as glandular fever, if you’re curious) and was eventually sent home to recover. Lying in bed for two months is boring. So to pass the time my mother suggested I read Bertrand Russell’s, The History of Western Philosophy. I think she figured I had plenty of time, so picked a thick book. This nearly 800-page tome charts the entire history of philosophy from the time of the ancient Greeks. I presumed Russell was a philosophy professor, but he was originally a mathematician. He was a mathematician. And because he lived and worked productively for almost all of his 97 years, spanning much of the 19 th and 20 th centuries, he crops up repeatedly as a central figure in many areas of intellectual life. Russell the politician, Russell the philosopher, Russell the mathematician and Russell the peace campaigner are all the same man – not, as I had incorrectly first guessed, a prolific family. In his early career, Bertrand Russell was a Fellow of Trinity College, Cambridge, working on a broad range of mathematical problems. Meanwhile, in Germany, his contemporary David Hilbert, also a polymath, held the chair of mathematics at Göttingen University. Both men shared a common objective: to tidy up the loose ends in mathematics and set down the rules once and for all. This movement was called Formalism. Formalism David Hilbert and Bertrand Russell believed you should be able to set out all the rules of mathematics even though it might be a complicated affair. Without contradiction or inconsistency you should be able to 194 Are the Androids Dreaming Yet? write down the rules and then play the ‘game of mathematics’ to derive every possible truth. Hilbert despised the idea that there could be unknowable things and was a forthright speaker. His battle cry was: Wir müssen wissen — wir werden wissen! “We must know — we will know!” He believed there were no fundamental unknowns in the world. Donald Rumsfeld famously summed up the problem of unknowns in an attempt to clarify a question from a journalist at a Whitehouse press conference: “There are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns – the ones we don’t know we don’t know.” Interestingly Donald Rumsfeld, like Bertrand Russell, is another person to span a huge swath of time in the public eye. He was both the youngest and the oldest serving U.S. Secretary of Defense, serving under both Richard Nixon and George W. Bush. We will shortly discover Rumsfeld’s convoluted view of the world turns out to be closer to the truth than Hilbert’s tidy mathematical aspiration. As well as believing there were no unknowable unknowns Hilbert thought mathematics was completely abstract. You did not need to know what you were talking about. Whether the symbols meant dogs, cats or numbers all you needed to do was apply the rules and all would be well. His belief is captured in his quote below. “It must be possible to replace in all geometric statements the words point, line, plane by table, chair, beer mug.” David Hilbert Geometry with Beer and Furniture Known Unknowns 195 Newton’s Principia PM In 1890, the Cambridge mathematicians Alfred North Whitehead and Bertrand Russell embarked on the mammoth task of writing out all the rules of mathematics and publishing them in a set of books called Principia Mathematica. Every rule is written down in meticulous detail. The books are heavy going and look like more like computer programs than text. They set out precisely what you can, and cannot, do with numbers, and are the most impenetrable textbook you will ever read. Just to give you a flavor here is one line where Russell proves 1+1=2. It has taken about 100 pages of densely packed equations to get to this point! One Plus One Equals Two, PM PM is a 3-volume set of books. Volume One costs £480 on Amazon. This is a significant work and a collector’s item. The last time a first edition volume came up at auction in 2007 it went for over £800. Cambridge University Press printed only 750 copies and I suspect they 196 Are the Androids Dreaming Yet? Amazon Listing for Principia Mathematica are undervalued. When mathematicians use the letters ‘PM’, they are usually referring to Russell and Whitehead’s Principia Mathematica rather than the afternoon. Hilbert’s Problems In 1900, while Russell and Whitehead were in full flow writing out their rules, David Hilbert was invited to deliver the annual lecture at the International Congress of Mathematicians in Paris. He asked a mathematician friend what subject he should pick for the talk and, in a moment of inspiration, the friend suggested laying out a vision for the future of mathematics. Rather than tell people how wonderful mathematicians were, and why their discipline was the pinnacle of human scientific endeavor, why not try modesty and list all the problems on which they were stumped? Hilbert liked the idea and devoted his talk to all the problems he thought mathematicians would solve in the 20 th century. Hilbert’s Problems were simply an intellectual challenge. He offered no prizes. At the turn of the 21 st century, the Clay Institute created the Millennium Prizes for solving the most important modern mathematical problems. Each solution wins a prize of a million dollars! There are 23 numbered Hilbert Problems in all: ten in the original lecture and a further 13 in the written transcript. In 1928, he clarified the 2 nd and 10 th problems, refining them into three distinct questions: Is mathematics consistent, complete and decidable? Ironically this means that Hilbert’s 23 problems actually number 24! The most important Hilbert questions where these last three. They ask whether Russell and Whitehead would be successful – can you write out all the rules of mathematics and then simply calculate the answer to any problem or derive any proof. This is known as the Decision Problem. Can you mechanically decide any mathematical question without doubt? To explain Hilbert’s Problems, I need to define mathematics properly. Giuseppe Peano, Mathematician “A mathematician is a blind man in a dark room looking for a black cat which isn’t there.” Charles Darwin The Game of Math One of my most vivid childhood memories is driving my mother distraction by asking the ‘why’ question. Most children go through this phase: Me: “Why is a sponge wet?” My mother: “Because it has soaked up water.” Me: “Why has it soaked up water?” My mother: “Because it has small holes in it.” Me: “But what makes water wet?” My mother: “Because it is made of wet stuff.” a bit weak now. Me: “What is wet stuff?” … You can ramble on indefinitely unpeeling a never-ending onion. Sometimes, if you are unlucky, you may get stuck in a loop. For example, “where did the chicken come from?” “An egg,” “and where did the egg come from?”... Mathematics breaks this cycle! In mathematics, there is no danger of an infinite number of ‘why’ questions because at its core are a clearly defined set of absolute rules called axioms. You cannot ask the ‘why’ question of an axiom. It is a RULE! Starting from an absolute minimum of fundamental rules everything else is built up so that no step requires any leap of faith nor generates any contradiction. Let me give you a concrete example and, in the process, show you how numbers are defined. Known Unknowns 199 Numbers It was not until the late 18 th century that numbers were properly codified. The mathematician Giuseppe Peano gave us the rules, so they are called Peano axioms. Here are his ‘axioms’ in natural language. Peano Axioms 1. The first number is named zero. 2. Every number has a next number (called its successor). Example: the next number after one is two. 3. Numbers are singular. Every number with the same name is the same thing. 4. If something is true of a number, it should be true of the next