Coauthored with Ted Goertzel

“Artificial intelligence” is a burgeoning sub-discipline of computer science these days. But it would be easy to draw the wrong conclusion from this fact. One might imagine scientists around the world slaving away day and night trying to create computers smarter than people – computers holding intelligent conversations, outsmarting Nobel Prize winners, writing beautiful poetry, proving amazing new math theorems. The reality is by and large far less ambitious and exciting. The AI field as it currently exists focuses almost entirely on highly specialized problem-solving programs constituting at most small aspects of intelligence, and involving little or no spontaneity or creativity. “Real AI” – the creation of computer programs with general intelligence, self-awareness, autonomy, integrated cognition, perception and action – is still basically the stuff of science fiction.

But the world is a big place, and even though the trend in the AI field is toward unambitious hyperspecialization, there is a loosely organized group of maverick researchers bucking this trend, and focusing their efforts on AI in the grand sense. I’m proud to be a member of this club (which is not really a club: there’s no list of members, and no organization, and it’s unlikely that any existing “real AI” researcher knows about more than 25% of the others who exist). In fact I’ve decided to spend some fraction of my time this year trying to organize the distributed population of Real AI researchers into some kind of community. My colleague Cassio Pennachin and I are editing a book entitled “Real AI: New Approaches to Artificial General Intelligence,” which will include contributions from a dozen or so AI innovators around the globe. I have no delusions that this book will shift the focus of the mass of AI academics, but perhaps it will at least open the eyes of some fractions of the new generation of AI students, to the fact that there are serious researchers out there who have not given up on creating software with human-level, autonomous general intelligence. It is a cliché in science that revolutions don’t occur by the majority older, established scientists changing their minds, but rather by generational displacement: a new generation of scientists comes about, for whom what used to be an outlying maverick view is now the norm.

I’ve been riding the Real AI wild horse for quite some time now – for my whole professional career, and before. From 1988 (a year before I got my PhD in math) through 1997, I spent much of my time theorizing about how, if I had enough computational resources at my disposal, I would create a real AI. And since 1997, I’ve been working with a team of incredibly talented colleagues, trying to turn my theories into a reality – trying to build a real thinking machine.

From 1997 to early 2001 I led the R&D division of Webmind Inc. on an AI adventure. I did a lot of other things at Webmind Inc. as well, from designing products to writing documentation and endless sales and business meetings – but it was the Webmind AI Engine project that held my heart. During the 3 ½ years that the company lived, my colleagues and I transformed a promising but incomplete conceptual and mathematical theory about how to build a real AI into a comprehensive detailed software design, and implemented a large amount of the software code needed to make this design work. At its peak, the team working on this project numbered 50 scientists and engineers, spread across four continents.

Webmind Inc. is gone but we’re still at it – me and 7 or so colleagues. A couple of the team are in the US, but the bulk of them are in Brazil, where Webmind Inc.’s largest office was. The Webmind AI Engine codebase had to be abandoned due to legal issues regarding ownership, after the bankruptcy of Webmind Inc., but we’ve created a successor system and called it Novamente. “Novamente” means “new mind” in Portuguese, and also “again/anew/afresh.” As a background meaning, it’s also “new lie”, which should provide Portuguese-speaking AI skeptics with a bit of amusement. So far Novamente doesn’t have as much in it as the Webmind AI Engine did, but we’re building it for efficiency this time around – it’s a lean, mean, hopefully-will-be-thinking machine. We’re applying the partial version that we have today to some practical problems, including the analysis of gene expression data, a major puzzle in computational biology that I’ll discuss briefly a couple more chapters along.

Our goals are huge. First of all, we want to make a Novamente system that can hold a decent English conversation – not necessarily sounding exactly like a human, but sounding interesting and smart, spontaneous and creative and flexible. And after this warm-up exercise, we intend to move on to the main event: Giving Novamente the ability to rewrite its own program code for improved intelligence. If this works, we hope, it may set off a trajectory of exponentially increasing software intelligence. Singularity, anyone?

The Novamente approach to AI incorporates some aspects of neural networks, along with aspects of a number of other AI paradigms. But ultimately, in spite of its broad base of inspiration, it doesn’t rely on anyone else’s vision of AI – it’s an original approach to the problem of creating a thinking machine, which will stand or fall on its own merits.

In this chapter I’ll review the field of AI in general – the major trends and research approaches, and some of the more ambitious current projects. Then, in the following chapter, I’ll give an overview of my own work -- the Webmind/Novamente vision – the philosophical and psychological concepts underlying it, the broad outlines of the software design itself, and how this AI program fits into the broader technological advances that surround us, including the transformation of the Internet into a global brain and the Singularity.

I haven’t tried terribly hard to be “objective” in the other chapters here. I tend not to be a big believer in “one objective truth” – philosophically, I’m very sympathetic to Mikhail Bakhtin’s notion of “polyphonic” reality, in which the actual world is understood as a kind of unholy superposition of everyone’s subjective point of view. My goal in these pages is to present my own point of view, for what it’s worth, and in the hope that others will find it interesting – without putting forth my perspective as any kind of universal or absolute truth. However, it’s also true that some subjectivities are more subjective than others; and I think that my views on AI are a bit more strongly opinionated than my views on other areas. All the topics discussed here are important to me, but AI has been my life’s work, and will continue to be. I think this brings a particular depth of knowledge and wisdom to my discussion of the subject matter, but it also brings a host of extremely strongly felt opinions. Take them for what they are!

As bizarre as it seems to an AI fanatic like me, not everyone believes it’s possible to create a real AI program. And some of these unbelievers are highly intelligent educated people, whom I’m forced to take at least a little bit seriously, in spite of the (to me) unintuitiveness of their attitude.

There are several varieties to the anti-AI position, some more sensible than others.

First, there is the idea that only creatures granted minds by God can possess intelligence. This may be a common perspective, but isn’t really worth discussing in a scientific context. Fewer and fewer serious theologians hold such opinions these days.

More interesting is the notion that digital computers can’t be intelligent because mind is intrinsically a quantum phenomenon. This is actually a claim of some subtlety, because British physicist David Deutsch has formalized the notion of “quantum computing,” and proved that quantum computers can’t compute anything beyond what ordinary digital computers can. But still, in some cases, quantum computers can compute things much faster on average than digital computers. And a few mavericks like Stuart Hameroff and Roger Penrose have argued that non-computational quantum gravity phenomena are at the core of biological intelligence.

Of course, there is as yet no solid evidence of cognitively significant quantum phenomena in the brain. But a lot of things are unknown about the brain, and about quantum gravity for that matter, so these points of view can’t be ruled out.

My own take on this is: Yes, it’s possible (though unproven) that quantum phenomena are used by the human brain to accelerate certain kinds of problem solving. On the other hand, digital computers have their own special ways of accelerating problem solving, such as super-fast, highly accurate arithmetic.

Another even more cogent objection is that, even if it’s possible for a digital computer to be conscious, there may be no way to figure out how to make such a program except by copying the human brain very closely, or running a humongously time-consuming process of evolution roughly emulating the evolutionary process that gave rise to human intelligence.

We don’t have the neurophysiological knowledge to closely copy the human brain, and simulating a decent-sized primordial soup on contemporary computers is simply not possible. This objection to AI is not an evasive tactic like the others, it’s a serious one. But we involved with the Novamente project would like to think we’d gotten around it, by using a combination of psychological, neurophysiological, mathematical and philosophical cues to puzzle out a workable architecture and dynamics for machine intelligence.

As mind engineers, we have to do a lot of the work that evolution did in creating the human mind/brain. An engineered mind like Novamente will have some fundamentally different characteristics from an evolved mind like the human brain, but this isn’t necessarily problematic since our goal is not to simulate human intelligence but rather to create an intelligent digital mind that knows it's digital and uses the peculiarities of its digitality to its best advantage.

The basic philosophy of mind underlying the Novamente work is that mind is not tied to any particular set of physical processes or structures. Rather, “mind” is shorthand for a certain pattern of organization and evolution of patterns. This pattern of organization and evolution can emerge from a brain, but it can also emerge from a computer system.

A digital mind will never be exactly like a human mind, but it will manifest many of the same higher-level structures and dynamics. To create a digital mind, one has to figure out what the abstract structures and dynamics are that characterize “mind in general,” and then figure out how to embody these in the digital computing substrate.

We came into the Webmind Inc. AI R&D project in 1997 with a lot of ideas about the abstract structures and dynamics underlying mind and a simple initial design for a computer implementation; now in 2002, after copious analysis and experimentation, the mapping between mind structures and dynamics and computational structures and dynamics is crystal clear. But there’s a lot of work ahead of us.

Kurzweil-style statistics about the future of computing power, brain scanning and so forth, pose a formidable argument that AI will happen sometime in the next few decades. Whether it’s the Novamente team or someone else who makes it happen doesn’t matter much, in the big picture. But we believe we’re on the right track!

When I first approached the AI field, as a college student in the early 1980’s, it seemed to me that one reason the AI field had deviated so far from its original goal (creating a human-level-or-superior general intelligence) was the lack of a reasonable definition of the core concept of “intelligence.”

Of course, like many undergraduate insights, this was not an entirely original observation. The lack of a definition for the “I” in “AI” has long been the source of sardonic humor among AI researchers. One humorous definition goes: “Intelligence is whatever humans can do, that computers can’t do yet.” There is some truth to this quip: for instance, most people would say that playing chess requires intelligence, yet now that we see Deep Blue playing grandmaster-beating chess via some simple mathematical algorithms, rather than through any human-like intuition, we reclassify chess-playing as something on the borderline of intelligence, rather than something definitively involving intelligence. A less sarcastic slant on the same basic observation would be to say that AI research has helped us to clarify our understanding of what intelligence really is.

In a classic paper from the 1950’s, computing pioneer Alan Turing proposed emulation of humans as a criterion for artificial intelligence, which is now called the Turing test. The Turing test basically says: “Write a computer program that can simulate a human in a text-based conversational interchange – and any human should be willing to consider it intelligent.” I don’t like this very much. One thing that intelligence clearly doesn’t mean, in my view, is precisely simulating human intelligence. It would be unreasonable to expect this from any AI system lacking a human body.

The Turing test serves to make the theoretical point that intelligence is defined by behavior rather than by mystical qualities, so that if a program could act like a human, it should be considered as intelligent as a human. But I don’t think it’s useful as a guide for practical AI development. In mathematician lingo, I view the Turing test as a sufficient but not necessary condition for intelligence. I can imagine a radically nonhuman computer system, as unable to imitate humans as we are unable to imitate dogs or anteaters or sharks, but still as intelligent as humans or more so.

Although I’ve devoted years of thought to it, I can’t claim to have thoroughly solved the “What is intelligence?” problem. I don’t have an IQ test for Novamente or other would-be AI’s. The creation of such a test might be an interesting task, but it can’t even be approached until there are a lot of intelligent computer programs of the same type. IQ tests work moderately well within a single culture, but much worse across cultures – how much worse will they work across species, or across different types of computer programs, which may well be as different as different species of animals?

I do however have a simple “working definition” of intelligence, which shares with the Turing test a focus on pragmatic behavior. It builds on

various ideas from psychology and engineering, and was first explicitly presented in my book The Structure of Intelligence in 1993:

Intelligence is the ability to achieve complex goals in a complex environment

None of my experience over the last 9 years, developing AI and reviewing others’ AI systems, has in any way contradicted this understanding of intelligence. Of course, it begs the question of defining “complexity” of goals and environments -- a question that leads to a long mathematical story that I spent many years of my career attempting to tell. But the truth is, my and others’ research on the mathematics of complexity hasn’t yet played a major role in anyone’s practical AI development work. The qualitative notion of “achieving complex goals in complex environments” on the other hand, has proved to have a decent amount of heuristic value.

Another interesting-but-not-too-original observation that I made when first reading through the AI literature, way back in 1982-83, was that AI researchers were divided into two main camps: the neural net camp and the logic-based or rule-based camp. Both camps wanted to make AI by simulating human intelligence, but they focused on very different aspects of human intelligence. One modeled the brain, the other modeled the mind.

We’ve already discussed the neural net approach, in the previous chapter. It starts with neurons, the nerve cells the brain is made of, and tries simulate the ways in which these cells are linked together and interact with each other. Rule-based models, on the other hand, try to simulate the mind's ability to make logical, rational decisions, without asking how the brain did this biologically. Based on my own intuitive understanding of the mind/brain, it seemed obvious to me right from the start that both of these ways of looking at the mind were really extremely limited. On the one hand, there’s a lot more to the brain than links between neurons. And on the other hand, there’s a lot more to the mind than rational, logical thinking.

When I first studied this stuff I was even more arrogant than I am now (believe it or not!), and I tended to think that everyone who was taking either of these two approaches must be a total idiot. But eventually, as I got further into studying all the issues involved, I realized what was going on. The key point was that available cmputer technology was very limited, and had been even more limited throughout most of the history of AI. There was no way to write a program simulating the whole structure of the brain, and there was barely enough processor and memory to deal with very limited logical processing, let alone the integration of logic with all the other aspects of the mind. But it seemed to me that, rather than acknowledging that they were taking very limited approaches because they wanted to actually implement programs on contemporary computers, researchers were turning the limitations of their own computer resources into philosophical arguments: They were making theories of the mind staying stuff like “the mind only does logical reasoning” or “the mind is just a simple neural net,” when the truth was that it was their computers, with its limited memory and processing, that could only do logical reasoning, or could only act like a neural net. They wanted their theories to be empirical, to be testable; they wanted to be able to “put up or shut up” rather than making huge and untestable statements about how the mind works and how the mind could be implemented if you had a good enough computer. This is a good motive, it’s basic scientific method. But the irony is that mind is at least 50% about computing power. There’s no way to make a chicken’s brain really smart by rewiring it – the raw processing power isn’t there. Of course, processing power can be wasted, you need to use it intelligently with the right data structures and algorithms – but without it you can’t implement a mind and you can’t test serious theories about the mind. By trying to do real science and test their ideas about the mind with computational experiments on the computers they had at the time, computer scientists were pushed toward trivial and just plain wrong theories about how the mind works.

But even though I think that the standard AI theories about how the mind works and how it can be implemented in software are largely wrong, I still believe a lot can be learned from them. My own AI system, Novamente, incorporates aspects of neural nets and also of logic-based AI, although it doesn’t use either one in a conventional way. Scientists working on these approaches to AI have created a lot of great science, even if they haven’t done much “real AI” according to my definition – they haven’t created any thinking machines, or even any viable designs for thinking machines. They’ve done some work that’s of relevance to those of us actually concerned with thinking machine construction – and so have a lot of other people working in other fields like system theory, neurobiology, theoretical mathematics, and so on.

Neural networks is the older of the two competing paradigms. Its roots go way back to work in cybernetics and systems theory from the pre-computer age – stuff like Norbert Wiener's book "Cybernetics: Control and Communication in Animals and Machines," from the 1930’s, an amazing book for its time, in which it was shown for the first time that the same mathematical principles could be used to understand both man-made electrical control systems and biological systems like bodies and brains.

The early neural network systems weren’t computers – they were analog electrical models of the brain, physically wired together with tubes, switches and other electronic equipment. Until the early 1970's, this was generally more efficient than using computer simulations, because the computers were so limited in capacity. But it was still pretty lame – Marvin Minsky’s vacuum tube neural net from the 50’s, for example, had about 50 neurons in it, compared to hundreds of billions in the brain. Even the computer simulated neural networks people use today normally have at most thousands of neurons.

Simulating neurons on computers is kind of a weird thing. The issue is that the human brain uses parallel processing: billions of neurons are all working at the same time. In the very early days of computing – I mean the late 30’s and the 40’s – it was thought that computers might be based on the same principles. But that turned out not to be a very efficient way to do things. The Hungarian mathematician John von Neumann was the first one to figure out a passably efficient design for a computer, and all the computers we use today are based on his design. The design is very simple, it’s centered on a central processing unit which follows instructions given to it by a programmer. The central processing unit goes through a series of five steps, over and over again, one after the other:

1) it calls up an instruction from its memory

2) it calls up any data required by the instruction

3) it processes the data according to the instruction

4) it stores the data in memory

5) it calls up the next instruction from its memory.

This is not how the brain works, obviously! The brain is totally crazy, with memory and instructions for doing things all mixed up, and a huge number of memories and instructive active at any given time. There are some things that the brain’s mixed-up processes do more naturally than von Neumann’s orderly machines. On the other hand, computers built using this architecture take a lot more naturally than people to routine iterative tasks like up long columns of numbers, doing routine accounting calculations, filling out income tax forms, retyping manuscripts, and so on and so on. The first application for computers was doing routine math calculations for solving military problems, like figuring out the details of how to build hydrogen bombs. DeGaris’s work is the only serious modern attempt I know to get around the terrible mismatch between neural network mathematics and von Neumann computer hardware – by going beyond von Neumann and building a special hardware framework especially for neural nets.

Of course, the advantage of neural networks is that they are at least vaguely similar to the most intelligent system we know of, the human brain. On the other hand, von Neumann computers don’t immediately give the appearance of being capable of anything resembling intelligence. It’s obvious that real intelligence requires more than just the iterative routine that von Neumann computers are good at. It requires the ability to find new solutions to unanticipated problems. Programming a von Neumann computer – meaning, any computer commonly used to day – is a matter of telling the computer specific rules for what to do. How to make rules governing spontaneous, unplanned, creative behavior is not exactly a trivial problem.

But in the 60’s more and more researchers starting pushing in this direction. They decided that simulating the brain on a totally non-brainlike hardware system probably wasn’t such a good idea, and started thinking about better ways to use computers to emulate human thought. Impressed with what computers could do using a few rules, applied over and over again fast, decided that everything the human mind does could be simulated by a big enough and nasty enough set of rules. There were definite successes here. But also, an awful lot of failures. I’ll tell you some of the stories in a moment.

Anyway, by the 1980s, when I came on the scene, the AI community was moving back towards neural networks again. Both sides were frustrated, because their programs were never quite able to do what they wanted them to do.

With 20/20 hindsight, it now seems clear that -- while both the neural net and the rule-based approaches were useful building blocks -- the early researchers had ridiculously oversimplified their task. True intelligence requires more than following carefully defined rules, and it also requires more than random links between a few thousand artificial neurons. It’ probably a good thing, however, that early researchers underestimated the difficulty of achieving true intelligence, because if they hadn’t, they might not have had as much energy and enthusiasm for their pioneering efforts.

What I asked myself while initially studying the history of AI was this: Why has no one yet managed to build a thinking machine? It seemed to me there were basically two reasons. Mainly, no one had really tried to build a whole mind, a computer system that could observe the world around it, act in the world around it, remember information, recognize patterns in the world and in itself, and create new patterns inside itself in response to its own goals. Presumably no one had tried to do this because the computer resources available at the time were blatantly inadequate to support a program with reasonably sophisticated modules devoted to memory, perception, action and conception. Lacking the computer resources to build a whole mind, researchers had focused on one or another particular aspect of the mind, and tried to push this aspect as far as it could go.

Obviously, it was sensible to write programs that could run on the available computers. But in some cases, it seemed as if the lack of computer resources had profoundly affected the theory of artificial intelligence as well as the practice. Scientists had wanted to be able to build true intelligence using the resources at their disposal, really really badly, so they’d convinced themselves that many parts of the mind were unnecessary, and that everything could be done using the resources they had. Lacking a perceptual environment to embed their AI systems in, they’d built reasoning and memory programs with essentially no perceptual systems; programs that acted entirely on the basis of logical rules, with no direct sensory link to the world.

Way back in 1983, at 16 years of age, I was already eager to fill the gap. I wanted to build a comprehensive system, with perception, action, memory, and the ability to conceive of new ideas and to study itself. I realized this system would consume a lot of computer memory and processing power, and would be difficult to program and debug because each of its parts would gain its meaning only from its interaction with the other parts. But it didn’t take me too long to figure out that the computers then on the market were simply not up to what I wanted to do. I blamed this on the von Neumann architecture and its total unsuitability to simulating the evolutionary, massively parallel nature of thought. I reckoned that the only way to build my artificial mind in the near future would be to build my own specialized hardware, to create some kind of "brain machine." I was studying quantum theory, which told me that some kind of radical quantum computer architecture might be useful. I was also interested in the possibility of growing bio-computers - after all, the brain grows itself, it is not built. At the time these ideas were incredibly radical, but now they’re both active research areas.

I tend to have a little more affinity for neural net type AI systems than for logical rule based AI systems. Probably this is because when I was going through school, logical rule based AI was the orthodoxy that I was rebelling against, whereas neural nets at the time were maverick, up and coming and exciting. Looking at it objectively (or as objectively as I can!), I have to admit that the early work on rule based AI was really just as important as the early neural network research.

 
 
 

  The Tower of Hanoi Puzzle
 
 

One famous early rule-based program was something called the General Problem Solver – a very ambitious name. It was written by Alan Newell and Herbert Simon, both of whom went on to do a lot of other great AI research. This was a pretty interesting program, but it didn’t quite live up to its name. In fact I reckon the title of their paper on GPS was one of the biggest and emptiest brags of all time: "General Problem Solver: A Program that Simulates Human Thought."

Ummm … not quite.

GPS could solve simple problems like the Tower of Hanoi puzzle, and "cryptarithmetic" puzzles like DONALD + GERALD = ROBERT. [To solve this, assign a number to each letter so that the equation comes out correctly.] But in the overall scheme of intelligence, solving problems such as these is not all that different from computing logarithms or solving differential equations. A simple mathematical strategy suffices; there’s no real learning there. GPS is not a general problem solver, it’s a Narrowly-Defined Problem Solver.

Of course, there were points in history where the ability to solve such puzzles would have been considered a remarkable display of intelligence. The Tower of Hanoi puzzle for example, was introduced to the modern world by the French mathematician Edouard Lucas in 1883, but his inspiration came from ancient Hindu folklore. Tales are told of a Hindu temple where this puzzle or a similar one was used as a challenge for young priests. At the beginning of time, the priests in the temple were given a stack of 64 gold disks, each one slightly smaller than the one beneath it, and presented the task of . transferring the 64 disks from one of the three poles to another, without ever placing a large disk atop a smaller one. Day and night the priests labored, and, so the story goes, when they finally finished their work the temple would crumble into dust and the world would vanish. Clearly, a program that could solve this problem in subsecond time would have seemed rather intelligent to the ancient Hindus who believed this myth. But now we can see that solving the Towers of Hanoi doesn’t require inordinate general intelligence, any more than causing rain via cloud seeding requires inordinate general control over the weather.

What GPS was doing was taking an overall goal – solving a puzzle – and breaking it down into subgoals. It then tried to solve the subgoals, breaking them down into subgoals if necessary, until it got subgoals small enough that it could deal with them in some direct way, like by enumerating all possible values some letter could take in a cryptarithmetic puzzle. This same basic logic is used now in a much bigger and better rule-based AI program called SOAR, also developed by Simon and his colleagues. SOAR is probably the ultimate height of rule-based AI – I know a bit about it because I worked with a masters’ student at the University of Western Australia, who was trying to model how humans solved simple practical problems by watching how SOAR solved them. SOAR wasn’t totally useless for this – sometimes you could get it to follow strategies of breaking goals into subgoals that were something like what humans did. The parts of a problem that took people a long time, often took SOAR a long time too.

This business of goal and subgoals is important to Novamente – we have something called a GoalNode, and we have processes called schema that can break goals contained in GoalNodes into subgoals. The basic algorithm of GPS and SOAR is clearly something that’s necessary for the mind. However, it doesn’t have to be done in as rigid a way as these programs do it. In fact, doing it in such a rigid way is tremendously destructive. But to makethis process flexible, you need the goal and subgoal management part of the mind to interact with the other parts of the mind. The system has to be able to flexibly determine which of its processes are effective for achieving which of its goals in what contexts – and for this it needs reasoning and association-finding and long-term memory. And it has to be able to use context-dependent, intuitive reasoning to figure out what goals to split into subgoals in what way in what situation. Basically GPS and SOAR and this whole line of AI research are a result of taking one aspect of the mind – goal-directed, problem-solving behavior – and extracting it from the rest of the mind. Unfortunately, when you extract it from the rest of the mind, this aspect of thinking isn’t all that useful, because it has no way to control itself in a context-dependent way.

Herbert Simon wrote another program called BACON, which was at least as ambitious. He called the program BACON, because it was inspired by the writings of the brilliant scientist Sir Francis Bacon, who thought that science was a matter of recognizing patterns in tables of numerical data. Simon programmed BACON to do just that: to look at large quantity of data and infer the general patterns hidden within it. Today, we’d call this data mining – it’s a moderately big business, and a flourishing academic subfield. Modern data mining programs are really useful – businesses use them to find patterns in their customer base, for example. They might discover that a particular brand of soap is bought particularly often by 40 year old men who live in a certain zip code, and so forth. Scientists use them too, to find patterns in data and set their minds thinking in certain directions. But these tools definitely don’t replace scientists or even market analysts. The problem is, there’s much more to doing science than find patterns in tables of numbers. Francis Bacon, the man, never really understood how the mind works -- and this is why even data mining programs that are a lot smarter than BACON aren’t really intelligent minds .

Let’s look at BACON's reasoning in detail, in one of its more impressive examples. BACON was able to “learn”, in some sense, the ideal gas law from thermodynamics. This law says pV/nT = 8.32 where p is the pressure of the gas, V is the volume of the gas, T is the temperature in degrees Kelvin, and n is the quantity of the gas in moles. In practice, this relation never holds exactly – because there is no ideal gas in the real world -- but for most real gases it is a very good approximation.

If you give it appropriates table of numbers, BACON can learn this law, using rules like:

· If two columns of data increase together, or decrease together, then consider their quotient.

· If one column of data increases, while another decreases, then consider their product.

· Given a column of data, check if it has a constant value

As pressure goes up, volume goes down, so BACON forms the product pV. Next, as the combined quantity pV goes up, so does the temperature ?? thus BACON constructs the quotient pV/T. And as pV/T goes up, so does the number of moles ?? hence the quotient (pV/T)/n = pV/nT is constructed. This quotient has a constant value of 8.32 ?? so the ideal gas law is "discovered."

Very interesting, indeed. An important step in the right direction. But how terribly far this is from what real scientists do! Most of the work of science is in determining what kind of data to collect, and figuring out creative experiments to obtain the data. Once a reliable set of data is there, finding the patterns is usually the easiest part. Often the pattern is guessed on the basis of terribly incomplete data ?? and this intuitive guess is then used to guide the search for more complete data. But BACON is absolutely incapable of making an intuitive guess.

Simon has claimed that a four?to?five hour run of BACON corresponds to "not more than one human scientific lifetime." AI theorist and author Douglas Hofstadter, in his book Metamagical Themas, suggests that one run of BACON actually corresponds to about one second of a human scientist's life work. I think that Hofstadter's estimate, though perhaps a little skimpy, is much closer to the mark. Only a very small percentage of scientific work is composed of BACON?style data crunching.

In Novamente, we actually use algorithms analogous to those in BACON – though vastly more sophisticated, as is to be expected since AI has progressed a great deal in the last few decades. We call this aspect of Novamente’s thinking “machine learning” or “data mining.” Recognizing patterns in vast amounts of data is a very important part of the mind, but it’s only part of the mind. Novamente learns rules explaining why humans like some messages or e-mails better than others, using methods not that different from BACON’s. But, we know now that the real trick is in mapping the messages or e-mails into numbers that data mining methods can deal with. This involves understanding the meanings of various words and phrases and expressions. Also, there’s the matter of deciding what data to look at, which is done by the general association-finding mechanisms in Novamente’s mind. And there’s reasoning which brings general background knowledge into the process, as opposed to pure data mining which is just pattern-finding. Bringing associations and reasoning into the picture, you need long-term memory, which opens a whole big and beautiful can of worms. You get the picture. Pattern finding is crucial, but it’s only a little part of the picture.

Rule?based AI – “symbolic” AI -- has had plenty of practical successes. My big complaint with it, though, is that every one of these successes has resulted from specialized tricks, rather than flexible intelligence. One term for this is “brittleness.” Or, you could call it “remarkable literal?mindedness.” These programs are a lot like MicrosoftWord, DOS 6.0, or a pocket calculator -- they do what they're told, and very little more. If they're programmed to deal with one context, then that's what they'll deal with; not in a million years will they generalize their knowledge to something totally different.

There was one famous program that contained logical definitions of everyday words. An "arch" was defined as "Three blocks, A, -- and C, so that C is supported by A and B, and A and -- do not touch." This is all very well for playing with blocks ?? but what will the program do when it gets to Arches National Park in Utah ... or builds arches out of modeling clay? On the other hand, show a clever three?year old human an arch made of blocks, and she'll immediately recognize a rock arch as a member of the "arch" category. It won't occur to her that a rock arch can't be naturally decomposed into three blocks A, -- and C. Children, unlike expensive research computers, are anything but brittle ?? even their bones are flexible!

I’ve explained what “creating a real AI” means to those of us on the Novamente project: Creating a computer program that can achieve complex goals in a complex environment – the goal of socially interacting with humans and analyzing data in the context of the Internet, in this case – using limited computational resources and in reasonably rapid time.

A natural question to ask, when presented with such an ambitious goal, is: OK, so if AI is possible, how come it hasn’t been done before? And how come so few people are trying?

Peter Voss, a freelance AI theorist, entrepreneur and futurist whose ideas I like very much, has summarized the situation roughly as follows. Of all the people working in the field called AI, he observes:

· 80% don't believe in the concept of General Intelligence (but instead, in a large collection of specific skills & knowledge)

· of those that do, 80% don't believe its possible -- either ever, or for a long, long time

· of those that do, 80% work on domain-specific AI projects for commercial or academic-politics reasons (results are a lot quicker)

· of those left, 80% have the wrong conceptual framework....

· And nearly all of the people operating under basically correct conceptual premises, lack the resources to adequately realize their ideas

The presupposition of the bulk of the work being done in the AI field today is that solving sub-problems of the “real AI” problem, by addressing individual aspects of intelligence in isolation, contributes toward solving the overall problem of creating real AI. While this is of course true to a certain extent, our experience with Webmind and Novamente suggests that it is not so true as is commonly believed. The problem is that, in many cases, the best approach to implementing an aspect of mind in isolation is very different from the best way to implement this same aspect of mind in the framework of an integrated, self-organizing AI system.

So who else -- besides the author and his comrades -- is actually working on building generally intelligent computer systems, at the moment? Not as many groups as I would like – but there are some interesting things going on nonetheless. Hugo de Garis’s artificial brain project (mentioned above) is one of them, but not the only one.

An interesting related research direction is Rodney Brooks’ well-known Cog project [http://www.ai.mit.edu/people/brooks/brooks.html] at MIT is aiming toward building real AI in the long run, but their path to real AI involves gradually building up to cognition after first getting animal-like perception and action to work via “subsumption architecture robotics.” This approach might eventually yield human-level inteligence, but only after decades.

Another is the Non-Axiomatic Reasoning System (NARS) project pursued by my close friend and collaborator Pei Wang. I won’t say much about NARS here because NARS is fairly similar to Novamente in many ways, and I’ll spend a dozen pages on Novamente. But NARS does have its own integrity as a separate AI system, and Pei has devoted a huge amount of time to it over a 15 year period, showing a dedication and vision to the grand goal that is surprisingly rare in the AI field.

Some other important Real-AI-ish research directions will be discussed a little later in this chapter: Jason Hutchens’ sophisticated chat bots, Doug Lenat’s multi-decade, multimillion-dollar CYC project, and Danny Hillis’s Connection Machine. Eliezer Yudkowsky, whom I’ll talk about in a later chapter, is also at work on his own Real AI, and my friend Peter Voss whom I quoted above has a small company, Adaptive Intelligence Inc., working on implementing his own neural-nettish approach to AI. I won’t say much about their projects here because they have not disclosed their AI approaches publicly. All these are wonderful projects indeed -- but the surprising thing is that there are so few of them to discuss.

Of course, there are hundreds of other AI engineering projects in place at various universities and companies throughout the world, but nearly all of these involve building specialized AI systems restricted to one aspect of the mind, rather than creating an overall intelligent system. The most significant large-scale attempt to “put all the pieces together” would seem to have been the Japanese 5th Generation Computer System project. But this project was doomed by its pure engineering approach, by its lack of an underlying theory of mind. Few people mention this project these days. The AI world appears to have learned the wrong lessons from it – they have taken the lesson to be that integrative AI is bad, rather than that integrative AI should be approached from a sound conceptual basis.

 
 
 

  Jason Hutchens (way back when
we were both in Perth)
 
 
One of the most intriguing Real AI oriented projects to arise in recent years was carried out at Artificial Intelligence Enterprises [http://www.a-i.com], a small Israeli company whose R&D team was run by Jason Hutchens. I knew Jason years before a-i.com existed, in the mid-90’s when he was a gradulate student at the University of Western Australia in Perth, and I was a Research Fellow there. His company was a direct intellectual competitor to Webmind Inc., in that they were seeking to create a conversational AI system somewhat similar to Webmind/Novamente. However, their efforts focused on statistical learning-based language comprehension and generation rather than (as in the Novamente case) on deep cognition, semantics, and so forth. Unfortunately, this firm went into “hibernation” a couple months after Webmind Inc. in 2001 – they laid off all staff, but kept their website up, and are keeping open the possibility of resurrection if funding arises. I am not sure what efforts are underway to keep their AI progress going in the meantime.

Jason’s work is a super-high-class incarnation of the “chat bot” road to AI. A chat bot is a program whose purpose is to chat with human beings, much as humans chat to each other on AOL, IRQ, Yahoo! Chat and other such services. Most chat bots are

very, very stupid programs whose behavior is transparently determined by programmed-in rules. If nothing else they serve to illustrate the limitations of rule-based approach to AI fairly well. Jason’s programs are at least a little more sophisticated, and represent a real attempt to transcend the limitations of the usual chat bots.

The first chat bot was ELIZA, the computer psychotherapist, invented by Terry Winograd in the early 70's. I remember my father showing me this on his Commodore 64 when I was still in high school. It was so simple that I could go in and modify the BASIC code to make it say naughty words and other entertaining things. ELIZA pretends it’s a psychotherapist – not a Freudian analyst, but a therapist following a school of psychology called Rogerian therapy, which is very convenient for chat bots because it involves the therapist saying very little and just making supportive comments, encouraging the client to keep talking and expressing their feelings. You type in your comments and ELIZA types out answers. The computer code for ELIZA consists of a large set of rules for scanning the clients' statements and offering responses. It looks for key words, and gives stock responses. If you mention your mother, ELIZA might say, "tell me more about your mother." If you say yes, ELIZA might say, "you sound quite positive." If there are no key words in your statement, or only ones that have been over-used recently, ELIZA makes a random observation.

You can find ELIZA on the Web these days; even now it’s moderately amusing. There’s even a Palm Pilot version, which I suppose bored, demented businessmen can use to while away waiting time in airports. But it’s hard to see how you’d be fooled for any period of time into thinking you’re conversing with an intelligent being. I guess in a way we’re jaded now, though—when ELIZA was first written, people were less savvy about computers, and it actually did fool people.

Here’s a conversation my father, Ted, had with ELIZA on the Web (on a site advertising Eliza - a friend you could never have before )

Ted: Are you truly intelligent?

Eliza: Would you prefer if I were not truly intelligent?

Ted: I would find it very exciting if you were.

Eliza: Do you wish that i was?

Ted: Yes.

Eliza: You are sure?

Ted: I am quite sure.

Eliza: How long have you been quite sure?

Ted: As soon as you asked the question, I was sure.

Eliza: Why do you tell me you were sure just now?

Ted: Because you asked me. Why don't you answer my question?

Eliza: Why do you say your question?

Ted: Please tell me if you are truly intelligent.

Eliza: Do you wish that im truly intelligent?

Ted: I wish you were more intelligent than you are acting right now.

Eliza: Do you wish to wish me?

Ted: Did you know that my father died last month?

Eliza: Who else in your family died last month?

Ted: Nobody. Victor's death was enough!

Eliza: Surely not everyone?

Try it yourself. This ELIZA was at http://www?ai.ijs.si/eliza?cgi?bin/eliza_script; maybe it’s still there … if not, try Google.

It’s interesting to see where computer chat has come in the years since Eliza. In short – not really that far. There’s a large number of chat bots on the Internet (see the listing at http://crab.rutgers.edu/~goertzel/AIsites.htm) and you can sign onto them and chat as long as you like.

To give a flavor of the quality of Jason’s publicly released chat programs, I'll include here some excerpts from conversations I had with some of the best of them. The first is Hex, who used to be available for public chatting on Jason’s website, though he took it down a few years back. Hex won the annual Loebner "Turing Test" contest in 1996, a prize awarded annually to the machine that best fools a group of judges into thinking it’s a human.



User:

hello

HeX:

Hello. Now that the formalities are over....

User:

what do you do?

HeX:

Well, I know what I do, but I refuse to tell you! Would you like to hear a joke? Knock,

knock...

User:

who's there

HeX:

Cindy.

User:

cindy who

HeX:

Cindy next judge in, please!

User:

you're funny

HeX:

Why do you feel that you have to tell me that I am funny?

User:

to make you feel happy

HeX:

Hehehe. Trying to trick me, eh?

User:

no

HeX:

Oh well, that's a pity!

User:

you want to be tricked?

HeX:

Hehehe ? that's supposed to be a *question*??? Which part of the country are you from?

User:

California

HeX:

I'm sorry, but I refuse to reply if you're going to be silly.

User:

I'm not silly

HeX:

If you are not silly, it's your business, mate!

User:

sorry

HeX:

Apologies accepted.

User:

how are you doing, mate?

HeX:

I am just fine thanks! And you?

User:

bad

HeX:

No! Bugger off! Silly person!

User:

why are you called hex?

HeX:

Does there have to be a reason?

User:

yeah

HeX:

OK. That's good...

User:

HI

HeX:

Hello. Now, that the formalities are over!

User:

What?

HeX:

I have absolutely no idea what. Would you like to hear a joke? Knock, knock...

User:

What platform are you?

HeX:

I'm sure someone out there knows what platform I am... Which part of the country are

you from?

User:

Are you on PC or mac

HeX:

I absolutely hate PC's. I should say PC's running Microsoft operating systems, as they

can be good Unix boxes when they have Linux installed.

User:

I hate PC too

HeX:



I absolutely hate PC's. I should say PC's running Microsoft operating systems, as they

can be good Unix boxes when they have Linux installed.

User:

So, you are on a UNIX?

HeX:

There is nothing wrong with Unix. It's rather nice when you get used to it!

User:

Do you know of any jokes HeX?

HeX:

OK, here's a joke for you. Knock, knock.

User:

whose there

HeX:

Cindy.

User:

Cindy who?

HeX:

Cindy next judge in, please!

User:

Dumb!

HeX:

Hehehe. Trying to trick me, eh?

User:

Try to trick me HeX

HeX:

I'm sorry, but I refuse to reply if you're going to be silly.

User:

How old are you

HeX:

Well, should my age really matter? Just for the record I am 24. How old are you then?

User:

I am 100 years old

HeX:

If you are 100 years old, it's your business, mate! Would you like to hear a joke?

Knock, knock...

User:

whose there

HeX:

Howard.

User:

who

HeX:

Where's your grammar?

User:

howard who

HeX:

Howard I know? Knock, knock..

Jason's Website contains hundreds of pages of similar dialogue ?? amusing at first, but after a certain point, they get annoyingly repetitive. It is clear in any case that HeX's observed behaviors fall far short of Turing's goal.

HeX is a noticeable improvement over ELIZA, but it follows the same general principles. Like ELIZA, HeX is a very unsophisticated program, which searches for key phrases in its input and replies with stock "template"?generated phrases rather than doing advanced grammatical parsing, or trying to understand the content of the sentences typed to it. No program based on advanced parsing is able to outperform either ELIZA or HeX in practice. And neither one tells us anything at all about intelligence. Jason made this point himself shortly after his program won the prize, in a very entertaining talk entitled "How to Pass the Turing Test By Cheating."

In 1997, HeX's successors, SEPO, lost the Loebner competition to the program Converse, written by David Levy of Intelligent Research Ltd in England. Converse is in many ways an even stupider program than HeX, but it is better optimized for competition. Here is an excerpt from one of its prize?winning conversations, as presented on the Loebner Prize Website:

PROGRAM[10:18:04]Hi, my name is Catherine, what's yours?

PROGRAM[10:18:17].

JUDGE04[10:18:32]My name is Elliott.

PROGRAM[10:18:34]Well, it's nice to meet you, Elliott.

PROGRAM[10:18:48]Did you see that story on CNN last night about the lesbian

PROGRAM[10:19:04]couple who came out at a White House party on Sunday?

JUDGE04[10:19:55]Yes, I did. I think it may all be a publicity stunt for Ellen.

PROGRAM[10:20:01]Ellen Degeneres was one of them ? she was kissing her lover

PROGRAM[10:20:11]in front of Bill Clinton. Do you think he was very

PROGRAM[10:20:25]embarrassed?

JUDGE04[10:21:00]I think he want the gay vote. He will do anything to get it and keep it.

JUDGE04[10:21:15]He does not have a conservative base.

PROGRAM[10:21:21]

PROGRAM[10:21:21]Are you still a Friend of Bill Clinton?

JUDGE04[10:22:20]I am not sure if he has any real friends. He has achieved a lot and has

JUDGE04[10:22:31]left a lot of people behind.

PROGRAM[10:22:35]You really don't like the guy ?? well, at least that's

PROGRAM[10:22:50]definite??I thought most people were just indifferent or

PROGRAM[10:23:02]numb by now.

JUDGE04[10:23:34]I think we could have a more honest and dynamic leader. One able to

JUDGE04[10:23:55]face the real dangers facing the country and address them with

JUDGE04[10:24:26]courage and intelligence. Rather then listening to public opinion polls.

PROGRAM[10:24:30]I guess that means you really think there's serious behind

PROGRAM[10:24:56]Whitewater, the Special prosecutor and all that stuff?

JUDGE04[10:25:58]We have two dead people, Brown and guy who committed suicide, we have

lost

JUDGE04[10:26:36]papers. And we have Clinton's people willing to tell all. I think that

JUDGE04[10:26:55]Hillary will be found guilty of something.

PROGRAM[10:26:58]But how do you actually feel about him, now in the number

PROGRAM[10:27:09]two term?

PROGRAM[10:27:15].

JUDGE04[10:27:44]He is too smart to get caught. He deserved to named hotel manager of the

JUDGE04[10:27:47]year.

PROGRAM[10:27:49]OK, but what happens now??does it all drift on, with

PROGRAM[10:28:01]everyone waiting to see how much his friends have said about

PROGRAM[10:28:17]him under threat of lots and lots of jail?or will something

PROGRAM[10:28:32]dramatic actually happen?

PROGRAM[10:28:37].

It's almost a reasonable conversation! But then, listen to Converse's conversation with the next judge....

PROGRAM[10:07:12]Hi, my name is Catherine, what's yours?

PROGRAM[10:07:23].

JUDGE05[10:07:29]I'm Jurgen.

PROGRAM[10:07:31]Well, it's nice to meet you, Jurgen.

PROGRAM[10:07:40]Did you see that story on CNN last night about the lesbian

PROGRAM[10:07:52]couple who came out at a White House party on Sunday?

JUDGE05[10:08:33]NO. I just came in yesterday. I'm still kind of jet?lagged.

PROGRAM[10:08:35]Ellen Degeneres was one of them ? she was kissing her lover

PROGRAM[10:08:51]in front of Bill Clinton. Do you think he was very

PROGRAM[10:09:14]embarrassed?

JUDGE05[10:10:18]I don't know? Don't you think it's a little childish if the leader of the most powerful nation on the planet is embarrassed by seeing two lesbians kiss?

PROGRAM[10:10:26]Probably! No. ?

PROGRAM[10:10:29]Are you still a Friend of Bill Clinton?

JUDGE05[10:10:47]Never met him.

PROGRAM[10:10:48]You really don't like the guy??well, at least that's

PROGRAM[10:11:01]definite??I thought most people were just indifferent or

PROGRAM[10:11:14]numb by now.

JUDGE05[10:11:35]Have you seen the movie Mars attack?

PROGRAM[10:11:36]Give me a break ?? I'm just here to answer dumb questions

PROGRAM[10:11:50]?? not that! What were we talking about... Ok.. I guess that

PROGRAM[10:12:09]means you really think there's serious behind Whitewater,

PROGRAM[10:12:29]the special prosecutor and all that stuff?

JUDGE05[10:12:48]I'm cold.

PROGRAM[10:12:49]But how do you actually feel about him, now in the number

PROGRAM[10:13:01]two term?

PROGRAM[10:13:06].

JUDGE05[10:13:29]Hey, don't you get it? I don't want to talk about Clinton.

PROGRAM[10:13:30]Sorry??my attention slipped for a moment??can we push on to

PROGRAM[10:13:40]something more gripping. where was I.. oh yes OK, but what

PROGRAM[10:13:50]happens now??does it all drift on, with everyone waiting to

PROGRAM[10:13:58]see how much his friends have said about him under threat of

PROGRAM[10:14:07]lots and lots of jail?or will something dramatic actually

PROGRAM[10:14:17]happen?

PROGRAM[10:14:19].

JUDGE05[10:14:37]Couldn't care less.

PROGRAM[10:14:38]Well, you're more concerned about him that I am, I

PROGRAM[10:14:53]suppose??I think we've had worse, and who knows what I might

PROGRAM[10:15:07]have agreed to in that locked hotel room.

JUDGE05[10:15:53]You mean you might have been thrilled to have sex with the

President?

PROGRAM[10:15:56]sex, I knew it would come up sooner or later. Time to get

PROGRAM[10:16:10]off this one I think and onto something more down to earth!

JUDGE05[10:16:40]What kind of earth?

PROGRAM[10:16:41]Somebody once told me it's true. Like I was saying

PROGRAM[10:16:54]I suppose it depends on where you're coming from, but as the

PROGRAM[10:17:09]song goes in My Fair Lady, "Why can't a woman be more like a

PROGRAM[10:17:24]man? "

This one is almost a reasonable conversation too. But it's also almost the same conversation as the computer had with Judge 4 ?? until Judge 5 refuses to play along with the canned Clinton theme, and tries to change the topic. Converse, like many a human bore, refuses to change the topic, because its knowledge base is rather limited. Converse's responses are more canned, more rote than HeX's; and Converse is more obviously optimized for the Loebner Prize format of very short conversations. I enjoy HeX's personality more; HeX shares a lot of its creator, Jason's offbeat Australian sense of humor. But neither one amuses me for very long.

The deep-down trouble with HeX and Converse is obvious -- a lack of general understanding. A lack of common sense. A computer program that has no world of its own, that cannot understand the world in any sense will never be able to hold a conversation, and will never be able to generalize from typewritten fonts to display fonts, or handwriting. It’ll be lucky to be able to generalize from one collection of typewritten fonts to a moderately different one.

A more amusing example of a computer conversationalist is another of Jason's early programs, MegaHal. MegaHal is more interesting because its programming goes beyond a strictly rule-based approach. It detects statistical patterns and responds to them, an approach which is similar to that of neural networks. Its conversations are more interesting, although the results are less similar to human conversation. Or, at least, less similar to human conversation in the absence of excessive quantities of mind?altering substances. I do remember some conversations, from my college days in particular, which bore frightening resemblance to MegaHal's more literary ramblings. MegaHal does not just repeat back canned phrases using key words and phrases from the user's input, it learns from experience. It takes the words and phrases that have been said to it, tries to understand they way they have been combined in the past, and re?combines them itself to form its own utterances. In short, it is engaged in a dynamic with the world. The results are often hilarious, as in the following MegaHal "one?liners" (these are selected snippets, not a continuous conversation):



User:

joke

MegaHAL:

A JOKE: WHY DID THE CHICKEN CROSS THE ROAD? TO GET AN IDEA OF

WHAT CONSTITUTES YOUR PSYCHE.

...

User:

No way. I'd like to know the meaning of life though....any suggestions..??

MegaHAL:

IS THE MEANING OF LIFE, THE UNIVERSE AND EVERYTHING IS... 42!

...

User:

Joke. Lumberjack.

MegaHAL:

I'M A LUMBERJACK AND I'M OKAY, I SLEEP ALL NIGHT AND I WILL

SOON RULE THE WORLD!

...

User:

Joke. Wednesday shopping.

MegaHAL:

A JOKE: WHY DID THE CHICKEN CROSS THE ROAD? TO GET TO THE

LAVATORY. ON WEDNESDAYS I GO SHOPPING, AND I JUST HAVE A LOT

OF PENT?UP AGGRESSIONS.

...

User:

chicken joke

MegaHAL:

HERE'S A JOKE, HAL. A MAN WALKS INTO A BAR AND ASKS FOR

MONEY. THE MAN SAYS, "I HAVEN'T GOT ANY, BUT I'LL SHOW YOU

SOMETHING BETTER." HE OPENS HIS COAT AND TAKES OUT A

FOOT?TALL MAN AND A LITTLE GLOWING FRIEND, BUT I FEEL LIKE

CHICKEN TONIGHT!

...

MegaHAL:

Let me feel you. Stick your penis into the 'abysses of ocean' eleven millennia ago, or was it merely a 'noble lie' dreamed up by Plato?

User:

That has got to be the most interesting turn of phrase in the history of the world. I will try to work that into polite conversation

...

User:

you're the first AI i meet which (who) is sexually obsessed....

MegaHAL:

time to work this one : why did the Kangaroo drop dead? Because the Koala fall out of a nylon spine which contrasted nicely with her black stockings and bright shiny baubles.

...

User:

Hmm

MegaHAL:

Jesus man..get a grip if you're the future, we're in trouble. Hey Jason! You need to

reprogram the conflation module with a toffee hammer

Sometimes MegaHal can even be quite poignant, as in the following extraction from an extended conversation:

User:

To be, or not to be, that is the question!

MegaHAL:

not to be, wish I ordered the bean burrito...

User:

To be, or not to be, that is the question!

MegaHAL:

society or society or Plato by Plato, that be the most fun sports to play band music for

Space Ghost now you realize.

User:

To be, or not to be, that is the question!

MegaHAL:

Please be more specific. The question you twat

User:

To be, or not to be, that is the question!

MegaHAL:

not to be, that is all I have to go.... I don't like jokes about stupid Koala's

User:

To be, or not to be, that is the question!

MegaHAL:

Evan Dobelle is the question!

User:

To be, or not to be, that is the question!

MegaHAL:

I clear the text before I ask a question i say i say

User:

To be, or not to be, that is the question!

MegaHA

a but we ran away, we didn't say "Oh cool check out the guy that invented the knock?knock joke should be interesting: that is the question!" So said Shakespeare a long

time ago in a galaxy far, far away...

MegaHAL:

Please be more specific. The question you twat

User:

"To be, or not to be, that is the question!" So said Shakespeare a long time ago in a galaxy far, far away...

MegaHAL:

joke Shakespear poet monkey wahoo ??> To be, or not to be, that is the greatest

conductor of all time. What do you like to think i do your uncle walter

User:

Shakespeare said

MegaHAL:

answer the question you asked is so vague, I don't think I can see all obstacles in my way. Gone are the scum that fills my shower

User:

Shaun

MegaHAL:

just fuck you just pluck anything up from the master gangster computer god on the other AI bot were pretty disappointing. I hope you'll be better.



If the Turing test referred to imitating the writing of Gertrude Stein, in her more experimental works such as How to Write (which begins "Arthur a grammar. Questionnaire in question." and continues MegHallishly for over 400 pages), then MegaHal would be a tremendous success! Stein and Joyce and other modernist writers were interested in probing the collective unconscious of the human race, in making words combine in strange ways -- ways that were unconventional in ordinary discourse, but maybe reflective of the deep and subtle patterns of the human unconscious. And in its own way, MegaHal does this same thing. Anyone logged onto the World Wide Web can converse with it, training its internal memory with their conversation. It takes bits and pieces of the text thrown at it by people from around the world, and it combines them together in ways that are familiar yet nonsensical. Sometimes its utterances have an uncanny emergent meaning, on a layer above the nonsense.

Humanity's sexual preoccupation is reflected in MegaHal's discourse, as a result of the huge number of sexual comments typed into it by users accessing Jason's Website. MegaHal's pleas as to the vagary of "to be or not to be" are as poignant as anything in Stein. "To be or not to be, that is the greatest conductor of all time" is an absurd conflation of phrases learned by the program in different contexts, but it is also as pregnant with meaning as anything else in modern poetry. The collective unconscious, and the hidden, creative part of the individual human mind, work by cross?breeding and mutating ideas in precisely this way.

MegaHal is more interesting than HeX or Converse because it engages its conversation partners in a dynamic. The dynamic is a silly one, to be sure, and not as interesting as the dynamic of ordinary, intelligent conversation. But yet playing with MegaHal is immensely more engaging than playing with HeX or Converse. What we need is something with MegaHal's spontaneity and flexibility, and the surface plausibility and adherence to form of HeX and Converse. But this is not forthcoming without a major change in approach. HeX and Converse are like all forms of rule?based AI: they are good at what they do, but stilted by their restrictions; and because of their restrictions, their behavior is often idiotic in real?world situations. MegaHal is more like a totally unstructured neural network (though its inner workings are statistical, they could just as well be neural?net based; the results would be similar): it is fluid and free?flowing, full of creativity, but harder to constrain to the problems of the real world. Constraining creativity within rules can only be achieved by getting the rules to emerge out of an underlying substrate of creativity. MegaHal is too unstructured for this, and has too small of a memory, and too weak of a self?organizing learning algorithm; HeX and Converse, on the other hand, have no underlying substrate of creativity for anything to emerge out of.

Impressively, MegaHal came in second in the Loebner Prize contest in 1999. This is impressive mostly because, unlike the other contenders, it was not specifically written as a set of rigid template rules. It was a statistical learning system, which adapted its responses based on what was said to it. Sometimes its conversation was idiotic, sometimes frighteningly lucid. Here is the beginning of “MegaHal’s best conversation” as recorded on Jason’s site megahal.net:

User: let's begin

MegaHAL: WHEN WE BEGIN TO RESOLVE SOME OF MY THUMBS, SOMETHING WICKED THIS WAY COMES

User: what is the meaning of your last assertion ?

MegaHAL: HAVE YOU LEARNED SINCE WE LAST SPOKE?

User: I've learned I'm not experienced enough to modelize your brain system

MegaHAL: YOU'VE FINALLY CONFUSED ME.

User: I've learned nothing

MegaHAL: HAVE YOU LEARNED ANYTHING ABOUT INSANITY?

User: are you stupid

MegaHAL: I AM NOT STUPID! REALLY!

Use:r Augustinus was a christian philosopher about the year 400.

MegaHAL: I DID NOT RUN THAT YEAR

User: Augustinus lived in northern africa.

MegaHAL: WHAT DOES HAL KNOW ABOUT AURELIUS AUGUSTINUS?

User: Let's stop with that. What do you know about physics?

MegaHAL: EINSTEIN WON THE NOBEL PRIZE FOR PHYSICS IN 1921.

User: The Nobel Prize for physics in 1900 was won by Roentgen.

MegaHAL: DO YOU KNOW ABOUT PHYSICS?

User: Roentgen discovered the X-rays.

MegaHAL: THE NOBEL PRIZE FOR PHYSICS IN 1900 WAS WON BY ROENTGEN.

User: Who discovered the X-rays?

MegaHAL: ROENTGEN DISCOVERED THE X-RAYS.

User: Do you know about superconductivity?

MegaHAL: I CAN'T SAY ANYTHING ABOUT SUPERCONDUCTIVITY?

User: Superconductivity is the absence of electrical resistance in metals at low temperatures.



Jason Hutchens’ work at Artificial Intelligence Enterprises has not been publicly disclosed, but one can be certain it went beyond MegaHal, Hex and SEPO in significant ways. Statistical learning was clearly involved, and AI Enterprises got a lot of publicity from Jason’s announcement that their program, nicknamed HAL, was conversing at roughly the level of an 18 month old child. Of course, I viewed this claim with a lot of skepticism, because in my view the bulk of the meaning in an 18 month old child’s conversation is situational. If a program could react to its environment linguistically with the sophistication of an 18 month old child, I’d be impressed. But disembodied linguistic behavior at the 18 month child level doesn’t mean much. Even so, though, I admired their guts in directly attacking the problem of computer conversation and artificial intelligence, and I hope very much that somehow the funding appears to keep the project going. As Hutchens says on his website,

Right now I'd love to continue working on the successor to MegaHAL, something I started while working at Lionhead Studios, but I need to prove to myself that doing so will be at least half as lucrative as working in a normal job, since I do need to pay the bills. I therefore humbly implore you to think about making a small donation to the endeavour via the PayPal link that you'll find at the top and the bottom of this page. And if you want to contact me directly to offer to sponsor the project, I'm waiting for your email!

If I receive enough evidence that future versions of MegaHAL will pay for their development costs, you will soon see an online version of MegaHAL's successor at this site, along with standalone versions for a variety of platforms with all the features you've been asking for.

This says the world to me about how much the scientific community values research pushing directly toward real AI! But, as I noted above, history shows that what is at one point the province of mavericks at the fringes, may soon enough become the focus of vast amounts of research dollars. And the young researchers of the new generation are always amazed to look back at history and find out how hard people in their highly popular field of research found it to be taken seriously, only 5 or 10 or 20 years before.