By Paul Almond, 11 May 2008
Abstract
This is the third in a series of articles exploring the relationship between minds and physical systems (substrates) on which they are based. The first article was Minds, Substrate, Measure and Value, Part 1: Substrate Dependence (available at
http://www.paul-almond.com/Substrate1.pdf [1], http://www.paul-almond.com/Substrate1.htm [1] or http://www.machineslikeus.com/cms/minds-substrate-measure-and-value-part-1-substrate-dependence.html [2]). The second article was Minds, Substrate, Measure and Value, Part 2: Extra Information About Substrate Dependence (available at http://www.paul-almond.com/Substrate2.pdf [3], http://www.paul-almond.com/Substrate2.htm [3] or http://www.machineslikeus.com/cms/extra-information-about-substrate-dependence.html [4]).
The previous articles argued that probability in some thought experiments in which you are unsure about your status is based on the substrates on which algorithms are running and that the nature of the substrate is statistically relevant to the measure with which an algorithm runs and affects the measure of any minds associated with such algorithms.
This article provides a deeper explanation for this by discussing the real problem, which is arbitrariness of interpretation. Previously called "multiple realizability" by John Searle [6,7], this is the problem caused by the need to apply an interpretation to a physical system to say that it is running an algorithm and the possibility of applying any interpretation to obtain any algorithm, leading to an apparent observer subjectivity in the algorithms that a physical system is running. The idea of interpretation is formalized with a hypothetical machine, an algorithm detector, which uses a detection program to make interpretations, and other, similar, devices. It will be shown that the problem of arbitrariness of interpretation is really one of arbitrariness about selection of an appropriate detection program to use in an algorithm detector.
Searle argues that multiple realizability makes the strong AI hypothesis incoherent. This conclusion is unnecessary, although the strong AI hypothesis needs some clarification. Any observer subjectivity can be removed by admitting all interpretations as corresponding to algorithms that are running. This does not mean that all algorithms are equal. Some algorithms, and algorithms of some types, will occur with greater measure than others, due to a larger proportion of all interpretations "finding" them. The set of all interpretations is infinite, so absolute numbers of interpretations cannot be counted. Instead, the measure of an algorithm, or a type of algorithm, relative to that of another algorithm, or type of algorithm, is defined as being the number of all interpretations that produce that algorithm, or type of algorithm, as a proportion of the number of interpretations that produce the other algorithm or type of algorithm, as the length of the detection program tends to (but does not reach) infinity.
This provides the real explanation for the "weak substrate dependence" or statistical substrate dependence which I argued to be necessary in the previous two articles [1,2,3,4]. It was suggested that "redundancy" or "inefficient use of matter" tends to increase the measure with which a particular algorithm may be running in a physical system. The reason for this is that redundancy is likely to allow more ways in which different interpretations can produce algorithms. As an example, if we imagine increasing the thickness of the wires in a computer then this allows more (speaking informally) ways for detection programs in an algorithm detector to make interpretations and find algorithms in that system, which will have an effect on the probability of being of being in various situations in thought experiments like those in the previous articles.
Admitting all interpretations like this removes observer subjectivity, making Searle's "multiple realizability" argument against the strong AI hypothesis invalid.
Previous Article by Nick Bostrom
Since writing the previous two articles in this series [1,2,3,4], I have become aware that Nick Bostrom published an article in 2006 [8] considering issues of splitting and combination of minds and suggesting that consciousness can somehow exist with some sort of degree depending on the nature of the substrate. I would like to ensure that readers are aware of this.
Introduction
"Could I write a mind? Could I sit down and describe a scene, a thought and an emotion so well that it took life on the page?"
- Divergence (Fiction), Tony Ballantyne, 2007 [9].
This article follows on from Minds, Substrate, Measure and Value, Part 1: Substrate Dependence (which is at http://www.paul-almond.com/Substrate1.pdf or http://www.paul-almond.com/Substrate1.htm [1]) and Minds, Substrate, Measure and Value, Part 2: Extra Information About Substrate Dependence (which is at http://www.paul-almond.com/Substrate2.pdf or http://www.paul-almond.com/Substrate2.htm [2]). These articles discussed the concept of substrate with regard to minds. A copy of Minds, Substrate, Measure and Value, Part 1: Substrate Dependence has also been posted on the www.machineslikeus.com website at http://www.machineslikeus.com/cms/minds-substrate-measure-and-value-part-1-substrate-dependence.html [3] and a copy of Minds, Substrate, Measure and Value, Part 2: Extra Information About Substrate Dependence on the same website at http://www.machineslikeus.com/cms/extra-information-about-substrate-dependence.html [4]. Comments about both of these articles can be posted here.
"Substrate" in artificial intelligence (AI) and philosophy of mind is the underlying system, often the physical system, that causes a mind to exist. Substrates might be human brains or computers (though some people like John Searle [5,6,7,10] dispute this). Minds, Substrate, Measure and Value, Part 1: Substrate Dependence [1,2] argued that complete substrate independence in AI is an incoherent concept, using a thought experiment about combining computers, and Minds, Substrate, Measure and Value, Part 2: Extra Information About Substrate Dependence [3,4] gave extra information to support this argument. The argument used in the first article was based on the assumption that the position being argued against was that different computers running identical algorithms correspond to different situations in which you could be and that the number of possible situations depends on the number of such computers. An alternative view would be that it is numbers of unique algorithms - not computers - that determines probability and my argument about combining computers would not be seen as a problem by someone who holds this view. To deal with this I said that the thought experiment about combining computers shows a special case of a more general problem, which is the problem of arbitrariness of interpretation. The problem is that the strong AI hypothesis states that minds are associated with processing of symbols by an algorithm running in some physical system, but interpretation is required to determine what algorithms are running in a physical system. Any symbols and computational states we want can be extracted from a physical system if we are prepared to interpret it in a sufficiently contrived way. This may seem only relevant in debates about the nature of mind, but it is actually relevant in thought experiments like those discussed in the previous two articles, and without a resolution of this issue probability is incoherent.
Minds, Substrate, Measure and Value, Part 2: Extra Information About Substrate Dependence [3,4] discussed the problem of arbitrariness of interpretation briefly. This article will give a more detailed discussion. It will be shown that there is vagueness in the generally understood idea of the strong AI hypothesis and that, without qualification, it is incoherent, meaning that we have no coherent method of calculating probabilities in thought experiments like those discussed earlier. The sort of problem being discussed here has been described by John Searle [6,7]. I will give a more formal description of the problem, which should make it clearer what the problem is, and that it really does exist, using a conceptual machine called an algorithm detector. Unlike Searle, I do not intend to try to destroy the strong AI hypothesis, but rather to show how it needs clarifying. The clarified strong AI hypothesis which results will not be a watered-down version: it will actually be an extreme version of the strong AI hypothesis.
This article will discuss the strong AI hypothesis more generally than the previous articles, but I will make sure that the relevance of physical substrate, an important idea in this series, is not overlooked.
The Strong AI Hypothesis
The term "Strong AI" comes from arguments made by John Searle [10] which distinguished between two philosophical positions about the mind. The "strong AI" view was regarded as being the philosophical position that a computational system can think and that it has a mind. The "weak AI" view was regarded as the philosophical position that a computational system might think and act like it has a mind, but that this does not necessarily imply that it really has a mind.
Some confusion might be caused by the term "strong AI" now because many AI researchers use it differently to how it was first used. They regard strong AI as merely the view that a suitably programmed computer can behave intelligently and behave as if it has a mind, regardless of the philosophical issue of whether or not it has a mind [11]. The term "strong AI hypothesis" is therefore used now to describe the claim that a properly programmed computer would have a mind.
In the earlier articles I discussed "strong AI" but I hope it is apparent that I was referring to "the strong AI hypothesis" - the term that I should have used.
The strong AI hypothesis, then, is basically the idea that computers can have minds, but there is an important implication of this. If an appropriately programmed computer can have a mind then this follows:
A mind exists when the appropriate algorithm is being run on a physical system.
We should not get distracted here by irrelevant issues. This does not necessarily mean that the algorithm must be written by humans, or must be run "tidily" with elements that we would recognize easily as instructions. For example, an artificial neural network in this view could have a mind if it is doing the right sort of processing because the interaction of all of its artificial neurons gets the correct algorithm run. Similarly, a human brain could be regarded as running the correct algorithm, even though human brains clearly are not programmed like conventional computers, because the interaction of all the parts in the human brain could, in principle, be formally described by an algorithm and the fact that you and I experience consciousness would indicate that whatever information processing occurs in such an algorithm is equivalent to a mind. This point is lost on many people who think that an artificial neural network is philosophically different to a conventional computer.
There is a problem with this view, however. I will call it the problem of arbitrariness of interpretation. It has previously been raised by John Searle who calls it multiple realizability. The problem is that there is no obvious, formal method of saying what algorithms a physical system is running.
The Problem of Arbitrariness of Interpretation
Searle's best-known argument against the strong AI hypothesis is probably his Chinese room argument, widely regarded as invalid by advocates of the strong AI hypothesis, but his observation of the problem of arbitrariness in interpretation is potentially deeper. This is what Searle says:
"…computation is not an intrinsic process in nature like digestion or photosynthesis, but exists only relative to some agent who gives a computational interpretation to the physics. The upshot is that computation is not intrinsic to nature but is relative to the observer or user." [6]
What this argument says is that symbols and meaning do not exist in a physical system by themselves, but need an observer to allocate meaning to the physics. For example, 1s and 0s do not exist by themselves inside an electronic computer. Instead, we decide that particular voltages correspond to 1 and other voltages correspond to 0, allowing us to say that an electronic computer has a particular computational state. We interpret the electronic computer's physical state as having a certain meaning.
The problem with this is that the obvious interpretation is not the only one. It would be possible to interpret any physical system in any number of ways to get any computation we want from it. Searle calls this multiple realizability or universal realizability and says this:
"The same principle that implies realizability would seem to imply universal realizability. If computation is defined in terms of the assignment of syntax, then everything would be a digital computer, because any object whatever could have syntactical ascriptions made to it. You could describe anything in terms of 0's and 1's." [6?]
We must interpret what a computer is physically doing to get 1s and 0s and an algorithm, but Searle is saying that you can get any algorithm out of anything if you make an appropriate interpretation, a point he makes clear in the following statement:
"For any program and any sufficiently complex object, there is some description of the object under which it is implementing the program." [7]
Searle gives an example:
"…the wall behind my back is right now implementing the Wordstar program, because there is some pattern of molecule movement that is isomorphic with the formal structure of Wordstar. But if the wall is implementing Wordstar, then if it is a big enough wall it is implementing any program, including any program implemented in the brain." [7]
(Note: Wordstar is a word processing program for personal computers.)
You would have to do some of these interpretations in a contrived way and some people would say that the computation really comes from the way you interpret the matter, rather than the matter itself, but - and this is an important point - there is nothing in the strong AI hypothesis which tells us which interpretations of a physical system are reasonable and which are contrived.
I disagree with Searle's conclusion that the strong AI hypothesis needs discarding. It will be obvious now that I think that the strong AI hypothesis needs clarifying. I do, however, accept the basic point made by Searle - that this apparent subjectivity of interpretation is a problem.
We can get an idea of this problem of interpretation from part of Greg Egan's novel, Permutation City [12], in which arbitrariness of interpretation was actually a major plot device. In Permutation City, two characters, who are "copies" made by mind uploading [13,14] , hide in a computer system by having themselves placed in a complex, encrypted way, among the normal processing of that system. An observer may see that system, for example, running a virtual reality simulation of a fountain, but a bit of the fountain simulation is being stolen to do a bit of the job of running these two stowaways and an environment for them. This is similar to steganography, the practice of hiding messages in other data, such as in computer image files.
Suppose we wanted to check for intelligent agents hiding, encrypted, in a system like this. We would need to run some sort of analysis of the computer - some sort of decryption - to show that they were there. If the decryption is not complex enough it may not find them, but if the decryption is too complex then it may find agents just by arbitrarily interpreting the computer hardware in the right way.
As another example, I will give this encrypted message:
AY5NSL Q23 FQK Q33WDN 7UIKDLQPD IKELDP HJIK IEKSL POLSO
I can assure you that there is a way of decrypting that to get "Hello world." Alternatively, you can decrypt it to get "The quick brown fox." Of course, decrypting it in this way may be contrived, and whatever decryption process we use may need to be supplied with more information than we are actually getting out of the message, but any decryption process needs to be supplied with some information. Ultimately, there is no way of saying that one particular interpretation is valid while others are invalid. You might claim that one method of decrypting the message recovers what was encrypted in the first place, but surely the history of the message does not matter: only what we have now should be an issue. You might say that the simplest interpretation that produces a mind is the correct one, but it should be easy to imagine situations with multiple interpretations.
The previous articles [1,2,3,4] described thought experiments in which whether or not a system is running a particular algorithm can affect probabilities. There is no escape from the problem of arbitrariness of interpretation, no matter how we approach probability. When we base probability on counting computers, arbitrariness of interpretation causes a serious problem in deciding which computers are running algorithms that count, but we might try to ignore this by using some "common-sense" approach. Such a common-sense approach would not be very effective, though: the thought experiments about probability in the previous articles are really special cases of the problem of arbitrariness of interpretation. We are more obviously confronted by the general problem when we get probabilities by counting unique algorithms rather than computers, because we must be able to say how many unique algorithms are in a particular system and arbitrariness of interpretation has the potential for generating unlimited numbers of unique algorithms to mess up our counting.
Why the Problem of Arbitrariness of Interpretation Cannot Be Ignored
The Idea of Reference Class
The problem of arbitrariness of interpretation cannot be ignored because, if the strong AI hypothesis or anything like it is correct, the decision about what algorithms are being run by what physical systems determines our reference class of possible situations in thought experiments. There are descriptions of what "reference class" means in terms of Bayesian probability, but to put it simply (and maybe not as rigorously) the reference class is the set of possibilities from which some statistically based selection is thought to be made. As an example, if I think that there is a 10% chance that the next person who talks to me will be carrying a briefcase then this implies that I have in mind some "reference class", some imaginary collection of all the people who are possible candidates for being the next person I meet next, and that 10% of the people in this reference class carry briefcases. I might arrive at my reference class in a number of ways. For example, if I am on a train I might decide that my reference class is everyone on the train, and I might arrive at my 10% probability if I have reason to believe that 10% of the people on the train are carrying briefcases.
The Problem
The first two articles of this series [1,2,3,4], when discussing weak substrate dependence, discussed thought experiments in which you are unsure about the nature of the physical substrate on which your mind is based. Situations like these arise in the novels Permutation City, by Greg Egan [12], and Recursion, by Tony Ballantyne [15]: in these books characters are confronted with the possibility that they may not be based on organic brains, but may be computer simulations of their brains running in a virtual reality. The reference class in these thought experiments depends on what algorithms are running where. For example, if you are asked if you think you could be existing in your own brain, or on some computer, then whether or not that computer is running an appropriate algorithm that could be considered adequate for providing your experiences is relevant. If the computer is not running such an algorithm then it is impossible that you could be running on it and the probabilities for any other situations in which you could be are correspondingly higher. If you know that the computer is running such an algorithm then you have to face the possibility that your experiences could be due to the running of that algorithm, and the probabilities of being in any other situations must decrease. The previous articles ignored this and concentrated on the issues of combining computers and when we say that a computer running in an algorithm is one computer or two computers. That is also, in a way, a reference class issue, but the reference class problem presented by arbitrariness of interpretation is more basic: without being able to say whether or not a given algorithm is running on a computer we cannot even begin to determine our reference class of possible situations in thought experiments like this and any probability is incoherent. The reference class problem resulting from arbitrariness of interpretation is worse than this, however: it now becomes a serious issue outside what may seem to be the contrived circumstances of the thought experiments. This is because, in the absence of any way of firmly saying what algorithms are running on what physical systems every situation becomes like those in the apparently contrived thought experiments. There is no way of saying that any given physical system is not running an algorithm that corresponds to your mind, and which should be considered part of your reference class of possible situations.
If you think that particular physical systems are running particular algorithms, arbitrariness of interpretation will result in incoherency no matter how you choose your reference class. The previous articles [1,2,3,4] mainly discussed the idea that the reference class should be made of each version of the algorithm that is running on a physical system somewhere. Some people, however, disagree with this and say that we should only be interested in each unique algorithm. This distinction will not help with the problem of arbitrariness of interpretation because both approaches, and any conceivable approach, will depend on what algorithms are running in physical systems - and arbitrariness of interpretation makes it incoherent to ask about this.
It may seem that common sense should sort this out and that it is just obvious that a given algorithm is being run by some physical systems and not others, but common sense and human intuition are insufficient: probability is supposed to be something which we can assign using formal methods. There needs to be a formal method of deciding whether or not a given algorithm is running in a given physical system, or at least some idea of how one would work. As we will now see, this causes a big problem.
The Algorithm Detector
Basic Idea of the Algorithm Detector
It will be hard for many people to see that the problem of interpretation exists. So quickly and so easily do we apply computational interpretations to physical systems, we are not aware, unless we think about it, that we even needed to apply an interpretation. For example, when a circuit designer sees the circuit diagram for a calculator it will seem obvious to him/her that he/she is "seeing" something that implements the algorithms that we associate with calculators. Many advocates of the strong AI hypothesis, if they could see the structure of a human brain in sufficient detail, and could make sense of it, would say that they are "seeing" something that implements (albeit in a messy way) an algorithm corresponding to a mind. I want to show that reaching such a conclusion is by no means trivial and that, unless we do something about it, it implies some subjectivity.
To show that the problem of interpretation exists, rather than discussing how human brains "see" algorithms in nature, I will simplify things by introducing a conceptual machine for detecting algorithms. I will call this machine an algorithm detector. An algorithm detector is a stand-in for a human brain, with none of the abilities of a human brain except the ability to "see" algorithms in physical systems.
If there is an obvious way to interpret physical systems as having algorithms then we should be able to formalize that interpretation method as a computer program. Nothing extraordinary is being claimed here. I am merely saying that if we imagine any reliable human knowledge about determining what algorithms a physical system contains, we should be able to formalize that knowledge as a computer program. The algorithm detector is a machine that has such a computer program, together with sensory devices that allow it to examine a computer, a human brain, or any other physical system and determine what algorithms that system is running.
Description of an Algorithm Detector
Here is a more detailed description of how an algorithm detector would work:
The algorithm detector analyzes physical systems and determines what algorithms they are running.
A computer in the algorithm detector runs a detection program. The detection program controls the movement of physical probes to get data about the system that it is analysing. The detection program contains a formally expressible method, or a number of such methods, for determining the presence of algorithms running in a physical system.
Physical probes move around under control of the detection program. Each has a tip that can measure physical values and send the data back to the detection program. For example, a physical probe may measure the local magnetic or electrical field strength. Physical probes are not subject to any conceivable damage from the environment, nor any conceivable interference with their movement. They can be moved, ghost-like, through matter and have no noticeable effect on the physical system on which they are used. For example, a physical probe could move through the casing of a computer to measure physical values inside, or it could be placed inside a human brain - all without the computer being affected or the human subject being aware of the probing.
The number of physical probes available to an algorithm detector should not be an issue, but it should not constrain any philosophical discussion. We may imagine an algorithm detector with a limited number of physical probes available for simultaneous use, or we may assume that the number of available probes exceeds that which is needed in any situation which we are considering. It may be simplest just to imagine the algorithm detector as having an unlimited number of physical probes.
An output device gives the results of the algorithm detector's analysis. We will assume that this is a printer.
To use the algorithm detector it is placed near the physical system that we want to analyze and turned on. The detection program starts to move the physical probes around inside the system, taking readings of physical values in it. The detection program analyzes the data that it obtains from its probes and intermediate results of this analysis may cause it to make further movements of its probes. When it knows enough about what is inside the system the detection program prints a list of algorithms that the physical system is known to be running. This may be done by printing the program listings of all the algorithms, or by assigning each algorithm a short code (which could only be done if the machine is looking for a small subset of all possible algorithms, or if the results are prioritized in some way).
Notes on the Algorithm Detector
The above description only specifies what an algorithm detector does, and not how well, or how generally, it does it.
An algorithm detector may be designed to detect only certain algorithms that interest us or to detect all algorithms. An algorithm detector that detects only a single algorithm may be programmed with the specific details of the algorithm for which it is looking, but this does not mean that an algorithm detector must be programmed with details of every algorithm that it is going to find: it could have some general method for finding any algorithm. If an algorithm detector is not designed to find a particular algorithm, but has some general method for finding any algorithm, then I will call it a general algorithm detector. A general algorithm detector is not necessarily guaranteed success. Being able to find any algorithm is not the same as being able to deal with any way of encoding algorithms in the physical substrate.
Algorithms may be encoded or "hidden" in a physical system in different ways. An algorithm detector may be designed to find those algorithms encoded in only a particular way, or to find algorithms encoded in any way.
I define an omniscient algorithm detector as one capable of detecting any algorithm encoded in a physical system in any way whatsoever. An omniscient algorithm detector would need to contain every "knowably sound" method for detecting any algorithm in a physical system. This has some similarity, in terms of scope, with the conceptual machine proposed in Penrose's "proof" [16] (flawed in my opinion [17]) against computational AI. An omniscient algorithm detector may not necessarily be something that we could actually make, but most of the discussion in this article will involve algorithm detectors that are not necessarily omniscient. Most of the consideration will be about quite realizable algorithm detectors that only contain the methods that anyone has got round to putting into the detection programs.
An algorithm detector might be able to find a large number of different algorithms, but it may be inconvenient to use: if it finds many algorithms then it could produce a huge printout. We might imagine a more user-friendly algorithm detector which can accept an input consisting of the program code of one or more algorithms and which then attempts to find those algorithms only. This might require the algorithm detector already to have the specific capability of finding those algorithms, so that it will only work when certain algorithms are provided as inputs, or it could be a general algorithm detector, able to search for any algorithm that we supply as an input.
The Problem with the Algorithm Detector
If the strong AI hypothesis, as most people understand it, is coherent and well-defined, we should be able to formalize it. Any understanding we have of what constitutes an algorithm running on a physical substrate should be capable of being formalized as an algorithm and used to run an algorithm detector. The strong AI hypothesis in itself is inadequate for this and does not tell us how to make algorithm detectors. It is therefore incomplete.
We will consider an example:
Suppose we want to find algorithms that play chess. We hire a computer programmer to write a detection program to detect algorithms that play chess and we put this in an algorithm detector or similar device. We agree to pay the programmer a bonus for every chess algorithm that his program finds.
We set up our machine next to a chess computer and it moves its physical probes around inside it before giving us a printout indicating that it has found a chess algorithm. This should not surprise us, because we expect to find chess algorithms in chess computers.
We then position the algorithm detector near another machine and it analyzes this machine before reporting that it has found another chess algorithm. This puzzles us, because we do not see any obvious signs, from the machine's behaviour, that it is doing anything related to chess. On looking inside the machine, however, we see that a wire that should send the output to a monitor is broken, so the machine was running a chess algorithm all along inside: we just could not see it, but the algorithm detector's physical probes and detection program had determined its presence from the internal structure of the machine.
Next, we place the algorithm detector near a third machine and it tells us that this machine is running a chess algorithm. As with the previous machine, we cannot find any external evidence that it is running such a thing. When we look inside, unlike with the previous machine, we do not find a simple failure of some output device. When we investigate further, however, we find that the machine is running a chess algorithm. The way in which the hardware is used to encode the algorithm is somewhat complicated. For example, an individual 1 or 0 is not encoded as a single voltage level: instead there are a number of different permutations of voltage levels in a number of different wires that correspond to a "1". Nevertheless, after we have been looking at the insides of the machine for a while, it is obvious to us that it is running a chess algorithm.
We now place the algorithm detector near a fourth machine and it detects a chess algorithm here too. On looking inside the machine, however, we merely see a lot of complicated hardware. We see no evidence of any chess algorithm. The programmer demands his finder's bonus. We are sceptical, however, that there is any chess algorithm in this machine. We challenge the programmer on this and he assures us that the algorithm is there, but that it is very hard to see it. He tells us that he has more experience of finding algorithms than we do and that he has formalized this experience in the detection program that he wrote for us. We wonder if he rigged his program to falsely detect chess algorithms to defraud us of money, so we inspect the detection program's source code. The program is long and so complex that we cannot understand it. We tell the programmer that the great length of his program is suspicious. What if it actually contains the very chess program for which it is supposed to be searching within itself, so that it can just do some irrelevant and complicated computation and then, in some circumstances, print the listing of a chess program to make it appear that it found it? The programmer replies that the program is merely very long because it can detect chess programs that are hidden in very complicated ways and only a long program can have the sophistication needed to find such programs.
This is the problem. How can we ever know what a valid algorithm detection program is and what is not? Without a way of being able to formally describe what constitutes a valid detection program the strong AI hypothesis is incompletely defined, because it asserts that machines running certain algorithms have minds without stating what it means to say that a machine is running an algorithm. As the discussion of reference class and probability pointed out, this is not a trivial consideration. Without a resolution of this the strong AI hypothesis, as understood by most people, with its lack of a formal interpretation method, gives incoherent results when we try to obtain the probabilities that we are in various situations.
Incidentally, another, similar, point might be made - that an algorithm detector, if it existed, would be a physical system, that the problem of interpretation also applies to the algorithm detector's software itself and that it is impossible to say, without interpretation, what detection program an algorithm detector is running, launching us into an infinite regress of interpretation. I will not be making such an argument because, although it may initially appear tempting, it would be weak.
Inputs and Outputs
An algorithm detector need not be limited to providing us with the program listing. We should be able to imagine an algorithm detector that also reports to us the sequence of inputs and outputs associated with that program. If the algorithm itself is hard to find then these will be just as hard to find. For example, in Greg Egan's novel Permutation City [12] and Tony Ballantyne's novel Recursion [15], simulations of human minds run hidden in virtual reality simulations. An algorithm detector could be imagined that found programs corresponding to human minds like this and told us what inputs and outputs were associated with them. As another example, an algorithm detector might find a program that is doing arithmetic and tell us what numbers it is adding. I will call such an algorithm detector an input and output reporting algorithm detector.
We could go further and imagine a special case of an input and output reporting algorithm detector that not only finds inputs and outputs, but actually allows us to interact with the system that it has found. I will call this an interactive algorithm detector. Such an algorithm detector would have physical probes that are not limited to measuring local physical variables, but can change them as well. The detection program would tell us what outputs are coming out of the algorithm and would prompt us to provide inputs, for example using a keyboard. It would then direct the algorithm detector to use its physical probes to alter the physical system to ensure that the algorithm receives those inputs. Depending on how the algorithm is encoded into the physical system, providing a single input of a 1 or 0 from the "outside world" might involve changing many physical variables.
As well as algorithm detectors equipped to tell us about algorithms and their inputs and outputs we should also be able to imagine less powerful devices that lack the capability to tell us what algorithms are being executed, but can merely find them and tell us what input and output values are associated with these algorithms. I will therefore define these types of detector:
An input and output reporter is a device that tells us what input and output values are associated with algorithms in a system. An input and output reporter is just an input and output reporting algorithm detector without the capability of actually finding out with what algorithm it is dealing.
An interactive input and output reporter is a device that tells us what input and output values are associated with algorithms in a system and allows us to interact with the system by changing output values. Such a device is therefore just an interactive algorithm detector without the capability of actually finding out with what algorithm it is dealing.
Both of these devices have some characteristics in common. The same system of physical probes would be used, able to move through matter without restriction under control of a detection program, and able to read physical values. The detection program also applies an interpretation to the data that has been received and tells us what the device has found. In the case of an interactive input and output reporter, the detection program would also prompt us for input values which are then supplied to the algorithm that the system has found by using the physical probes to change physical variables.
Limitations on Interaction
There will be situations in which interaction with the algorithm cannot occur in real-time or in which in cannot happen at all. This could be because there may be some interpretations for which computational states do not correspond to extractions of information from the physical system at different times and in the same order. For example, an algorithm might be detected that goes through computational states 1, 2 and 3, but it may be that Computational State 3 is obtained from the state in the physical system that occurs first, Computational State 2 is obtained from the state in the physical system that occurs next and Computational State 1 is obtained from the physical state that occurs last. That is to say, the physical system is being viewed backwards to make the interpretation. Some interpretations could be independent of time in the physical system, the information about all the computational states being obtained from the physical system at the same time. An example of this would be a hardcopy printout of a computer program. In principle, the right sort of interpretation applied to this could obtain a description of an algorithm moving through computational states. Other interpretations may be so far removed from our intuitive expectations that we could not even categorise them as easily. For example, an interpretation could involve extracting information from the physical system at many different places and times to make a single computational state, so that from our point of view the physical basis of each computational state is delocalized in space and time.
This will need considering later. For now, it merely needs pointing out so I do not appear to have missed it. Although many extreme interpretations will be like this (I will define these as atemporal interpretations.), there will many that will not - interpretations that work as we would intuitively expect, with the physical state of the system corresponding to some computational state and a later physical state of the system, in time, corresponding to the next computational state, and I will define these as temporal interpretations. We could interact with an algorithm found using a temporal interpretation, but not one found using an atemporal interpretation.
This, incidentally, has possible implications for our view of time that will be discussed later.
How I Will Deal With This
An obvious objection to the above is to say that I am saying that no formal interpretation method exists simply because I cannot see one and that the strong AI hypothesis is therefore invalid. I am not trying to destroy strong AI and join the Searle or Penrose camps - or announce some position of my own against strong AI: I think that both Searle and Penrose are incorrect. Rather, I think that our view of the mind has to rely on computation, but that the obvious lack of any formal interpretation method in strong AI means that something needs to be done. I will be trying to show that none of the obvious things that can be done work and that we are left with the strong suggestion that a drastic approach for getting a formal interpretation method is needed. I will be describing this approach later and therefore suggesting what is intended to be a repaired version of strong AI. What I suggest will not reject the relationship between minds and computation. Rather, computation will be shown to be important to any understanding of minds and other things. The way in which computation relates to minds, will, however, be different to what is argued against by Searle. Rather than minds being viewed as computational in themselves, minds will be shown to be correctly considered as associated with computationally based descriptions (descriptions that are expressed formally) and computationally based interpretations (interpretations that are expressed formally) of physical reality.
Objections
Objection 1: Of course you will be able to find arbitrary algorithms when you are allowed to interpret reality, but there is no reason to make any interpretation at all. You should simply take algorithms that you can observe directly, without any interpretation, as existing, and ignore any algorithms that can only be "found" by interpretation.
Answer
This objection shows a misunderstanding of what is involved in "perceiving" an algorithm. We never directly perceive algorithms but instead perceive physical variables in the world around us and interpret these to decide that some algorithm is there. When we read a printed program listing we decide that particular patterns of ink on a page can be interpreted as symbols which in turn can be placed in a particular order and correspond to a sequence of instructions in a language, which can be interpreted as an algorithm according to the rules of a language. When we say that a computer is running a program we have decided that particular patterns of voltage, or other physical variables, correspond to 1s and 0s, according to some rules of interpretation, and further rules of interpretation dictate how these 1s and 0s are arranged to give the computation states through which the computer is supposed to be going.
It becomes obvious that all claims of the presence of algorithms rely on interpretation when we imagine an algorithm detector with no detection program inside it. It would be incapable of telling us what algorithms are there. It is no different when humans are doing the observing. A truly "direct" observation of the presence of an algorithm would require the observation to be made by human sense organs, without the need for any information processing at all. In reality, the brain is needed to process the sensory information and decide that a computer is being observed running an algorithm and, to anyone who subscribes to the strong AI hypothesis, this must be equivalent to an interpretation algorithm.
Objection 2: You are complicating things needlessly. Humans intuitively know when a computer is running an algorithm and when it is not.
Answer
For the strong AI hypothesis to be coherent, the concept of what constitutes an algorithm should not depend on intuition, but should be formally defined.
Objection 3: Some interpretations are just stupid.
Answer
This is just the previous objection stated in a different way. Unless you can formally state how we tell sensible interpretations from stupid ones there is no formal idea of what constitutes the running of an algorithm.
Objection 4: You can make any interpretation you want to find any algorithm you want, but some of these interpretations would need to be made by very long algorithms. For the more extreme interpretations, the detection program might need to be longer than the algorithm being detected and might effectively be pretending to "find" an algorithm that is actually contained within itself. Short detection programs can be accepted, but long ones are contrived and invalid.
Answer
This objection may seem plausible when we consider it just for really extreme cases because in one way it is correct: the objection describes the "mechanics" of how a detection program may be contrived to find any algorithm that we want.
The problem with this objection, however, is that that there is no specific point at which it comes into force and the detection program becomes "contrived". Suppose we are sure that detection programs with a maximum length of L bits are valid. What if we increased L by one bit? It is hard to see any obvious philosophical reason why an addition of one bit in length should make much difference. We could keep increasing L in one bit increments, so that the detection program could make progressively more contrived interpretations. For what value of L should we say the interpretations are permissible? The issue is not trivial because it will affect probabilities in the sorts of thought experiments that we have been considering [1,2,3,4]. The reference class issue means that choosing some arbitrary value of L would not do, because this would make probability values, which should be objective, dependent on this arbitrary choice and we would have no coherent concept of probability.
I do not think there is any obviously correct value of L and this is a problem.
One approach could be to suggest that while there may be no obvious cut-off point, the interpretations become increasingly contrived as L increases. Doing this, however, means accepting that all interpretations are valid - just that some have some have "more validity", in some sense than others. This would strongly imply some kind of measure of minds, as I stated must be the case in Minds, Substrate, Measure and Value - Part 1: Substrate Dependence [1,2]. This is the approach that I will be suggesting for dealing with the problem of arbitrariness of interpretation.
Objection 5: Programs and minds are real only if they interact with reality. The more extreme interpretations made by an algorithm detector's detection program could not possibly interact with reality. As an example, you may be able to make a contrived interpretation of the matter in a wall as running the WordStar word processing program, as described by Searle in his example [7]. This WordStar program could never be used to word process any real documents. A real word processing program is accessible through the keyboard and can send information to the outside world. The program said to exist in the contrived interpretation can never do that. This tells us which programs are real and which are not.
Answer
We should ask what is meant in the objection by "interact with reality". If an appropriate algorithm detector, or similar device, found the WordStar computer program in the atoms of the wall in Searle's example then who is to say that the algorithm is not interacting with reality? The detector could report the inputs and outputs to us and it could be argued that the inputs and outputs do exist, and that we are not intelligent enough to see that they are there. Against this, it could be argued that the system should interact with "macroscopic variables" in reality that are apparent to us, but it could also be argued that there could be other aspects of reality that are not apparent to us - that the algorithm's behaviour changes variables in reality that, just like the algorithm itself, are hidden to us.
Why should we say that the effects of running an algorithm have to be noticeable in the "outside world"? Suppose we applied that standard to your own brain. What if you were so completely paralyzed that your conscious mind could make no noticeable outputs whatsoever? This does not need to be hypothetical: humans can end up in this situation as a result of neurological conditions. An observer applying the standard that "interaction with reality" is required should pronounce you non-existent at this point, yet I think most people should be able to imagine being in such a state and feeling very much real! A simpler example, and one that may appear at first to be deceptively easier to dismiss, is that of a human placed in a sound-proof metal box through a door which is then welded shut. Someone determined to say that a system must make meaningful outputs to be regarded as existing could say that there are no outputs to indicate the presence of any intelligence in the box. We could further use hypothetical mind uploading as an example. If you subscribe to the strong AI hypothesis you must think that an adequately constructed computer program simulating a human mind must be conscious, so you should be able to imagine the situation of being such a program. What if you were in such a situation, interacting with the outside world, and your inputs and outputs were disconnected from the outside world and connected to a program providing a virtual reality? From the point of view of an outside observer, you might seem to be non-existent, without a careful examination of the internal workings of the computer equipment running you, but would it be reasonable to think your own mental experience would cease just because of this? You could claim that these situations are not the same, because you know that there is really a paralysed person there, or that there is really someone in the metal box, or there is really a simulation of someone running in a virtual reality, but after the algorithm detector gives us its results we could equally well say we know that there is an algorithm hidden inside some physical system - even if it is found by some extreme interpretation, because there is no obvious cut-off between reasonable interpretations and extreme ones.
This idea that inputs and outputs may not be necessary to say that an algorithm exists could be argued against by stating that, even if they are not obviously provided, inputs and outputs can always be found for any information processing system. As an example, in the case of a computer running a simulation of a brain and the virtual reality with which it is interacting, some variables in that program (or memory addresses, or registers, or some combination of variables, etc) must correspond to the inputs and outputs that the brain would have if it were connected to the outside world. As another example, if an algorithm, with no obvious inputs or outputs in the "outside world", is adding numbers then some variables or similar things in the program must correspond to the numbers that are being added. In cases like this we should be able to imagine, if we have the physical capability to reach into systems, physically affecting it to change these variables, memory locations, registers, etc and similarly to read the values stored in them. In this way, we could regard these as inputs and outputs that are not yet connected to the outside world and, by our process of reaching into the system we could connect them to the outside world. This should be easy to imagine in the case of the electronic computer. We could imagine having some diagnostic machine with a keyboard and screen, with wires running from it into the computer. Some of these wires would be used to measure the voltages at some places and some of them would be used to allow us to control the voltages at some places, allowing us respectively to read values and change values in the computer. By doing this, we could argue that we have effectively made "hidden" outputs observable in the outside world and gained the capability of affecting "hidden" inputs.
Doing this could allow us to detect an algorithm in a slightly different way to that previously discussed: not by looking in detail at the internal processes of the algorithm itself, but by finding appropriate inputs and outputs in the system and showing that the relationship between them is consistent with the execution of some algorithm. Although different, this approach would be just as valid. In this context, the objection would be about how "apparent" the inputs and outputs of the system are. The idea of the objection would be that, even if a human were paralysed, in principle, by connecting wires to the appropriate parts of the brain corresponding to inputs and outputs we should be able to interact with the brain and show that an algorithm corresponding to a mind is there. Likewise, if a human is locked in a metal box then by reaching into the box we can find appropriate inputs and outputs that tell us a mind is there. With the example of the simulated brain in the virtual reality, by running wires into the computer we can find ourselves interacting with the mind. The idea would be that in all these cases we can find inputs and outputs that are apparent and can be readily measured, whereas in extreme situations this will not be the case.
The objection is unsuccessful, however. Whether we are obtaining the algorithm directly by inspecting the system, or obtaining inputs and outputs that tell us it is there, interpretation is required. For example, if we decide that a paralysed human has a mind, just by "finding" appropriate inputs and outputs we have to apply an interpretation in which particular firing operations in neurons correspond to outputs and particular changes in neurons correspond to inputs. When I described the algorithm detector I described a second type of machine called an input and output reporting algorithm detector which is able to tell us what input and output values are associated with an algorithm that it has found. I also described the input and output reporter and the interactive input and output reporter. These machines would all do for inputs and outputs what the algorithm detector does for algorithms. They would find inputs and outputs associated with an algorithm that may be hidden in a system and tell us what they are. If we decide that we really need to be able to observe the inputs and outputs of some algorithm then the problem merely changes from what interpretation to apply - what detection program to use - in the algorithm detector - to what interpretation to apply - again, what detection program to use - in the device that will be telling us what the inputs and outputs are. All of the same issues now become relevant. It is meaningless to talk about "directly" observing an input and output when some interpretation will always be required. If we decide that only short detection programs are valid then, just as with detecting algorithms, we have problem of there being no obvious cut-off point between acceptable and unacceptable detection programs.
One point could be made about the capability of humans to interact with systems. It could be argued that by a system being apparent in the real world, we should be able to observe its inputs and outputs, but that (at least in principle) we should be able to interact with the system ourselves by means of these inputs and outputs. This is not the case for all extreme interpretations. Those interpretations that I called atemporal interpretations are ones in which the order of computational states does not correspond to physical states of the system occurring in the same order in time (or maybe any order in time). This objection could have merits with respect to some extreme interpretations - those which I called "atemporal interpretations" - although I think it ultimately unfounded even in these cases, as I will explain later. The problem is that it does nothing to invalidate any of the unlimited supply of temporal interpretations. Even if we decide that interpretations have to be made by "playing by the rules" and that each successive computational state must be correspond to a later state of the physical system than the previous one, we could still make temporal interpretations as extreme as desired to obtain any algorithm - or any collection of input and output values as evidence of the running of an algorithm - that we want. The problem of arbitrariness of interpretation is not resolved.
Objection 6: For us to say that something exists, it should be apparent in the real world.
Answer
This is really the previous objection, above, worded slightly differently, and the answer previously given deals with it.
Objection 7: It is not true to say that there is no obvious cut-off point between valid and invalid interpretations. You said that there is no sense in which any observations can be made directly, but there is one sense in which an interpretation can be direct: by not involving any processing outside the human brain. Some interpretations can be made by an unassisted human, but other, more extreme, interpretations are too complex for an unassisted human brain to make, and humans can only find algorithms with such interpretations by using a computer to run some kind of interpretation algorithm. While we might argue about the semantics of the word "direct" there is a different between interpretations that we can make unassisted - interpretations in which we can look at the physical system and just see the algorithm - and extreme, contrived interpretations in which the algorithm only appears unless when we use a computer to run some complex program.
Answer
For probability to be meaningful in the sorts of situations that have been considered there must be an unambiguous, formal way of deciding what a valid interpretation is. The objection does not provide this, instead suggesting a method based on the arbitrary standard of human biology. We may be able to immediately "see" that an algorithm is running in some situations, but what if human brains were much more capable? We may then be able to examine a physical system and "see" that some interpretation can be made corresponding to finding an algorithm in a way that is unknown to us with our more limited brains. We do not even need to imagine brains vastly different to our own: variation between existing human brains would mean that some human brains can make interpretations that others cannot. In fact, we do not need to consider different brains really: one is enough. The objection does not deal with the problem that varying amounts of thought may be needed to make an interpretation. Someone may examine a physical system and immediately "see" that it is running an algorithm. He/she may think about the system for a few minutes before "seeing" that another interpretation of the system "reveals" another algorithm and may think for six months before "seeing" a further algorithm. Where do you draw the line? If you can solve that problem, you then have the issues of how intelligent the person is, how much sleep he/she had the night before, how motivated he/she is to "see" algorithms, whether or not he/she has used any drugs that compromise thinking abilities, etc.
All these are just symptoms of the main problem, which is that interpretations can be made by different algorithms, and there is nothing profound about those algorithms that happen to be accessible to an unaided human brain.
Objection 8: You are saying that there is no way of saying which interpretations are correct, but you cannot know that. Some unknown physics could make particular interpretations valid. Something, going on in matter, could explain why some interpretations are correct and not others and this could coincide with our intuitive ideas of validity of interpretations.
Answer
The strong AI hypothesis, as generally understood, does not claim that some "unknown physics" is responsible for the process of physical systems acquiring meaning and giving rise to algorithms and computations. Instead, the occurrence of certain physical processes is supposed to be associated with computation and that is it. Advocates of such abstraction cannot reasonably seek refuge in unknown physics.
The very idea of specific physics being involved would suggest that the type of physical system was relevant, as the underlying physics may be different in different physical systems. This would suggest some sort of substrate dependence and that some view other that in the strong AI hypothesis, as most people understand it, would be needed.
There is also the issue of why this physics should coincide with our intuitive ideas about the validity of interpretations.
Objection 9: It is impossible to define how the algorithm detector would work because there is no clear concept of what would constitute a "physical variable" to be measured. A physical variable could only be some variable in fundamental physics if we are sure there is a fundamental physics and if we know what it is. If we allow a physical variable to be something in "higher level" physics then we have the problem that this physical variable could itself be the result of interpretation being applied to lower level physical variables. For example, if physical probes measure the local gas pressure, this gas pressure is the result of an averaging over many different molecules in the gas. An interpretation is therefore being applied to the lower level variables associated with the molecules in the gas to obtain the pressure value. This associates the physical probes themselves with a particular interpretation. If this interpretation is valid, why not others? We could extract any physical values we wanted just by specifying a suitable physical probe design that made a sufficiently contrived measurement and it would be impossible to say what a "correct" physical probe is. This makes the algorithm detector itself incoherent, so we need not worry about what would be valid programming for it.
Answer
It could be argued that this problem could be solved by requiring physical probes to measure values that are part of the most fundamental physics currently known. The idea would be that any problems due to interpretation actually being built into the design of the probe are presumably minimized by doing this. We need not worry about whether or not construction of such a machine is practical: the algorithm detector and related machines are just conceptual devices to help us think about philosophical problems. Against this it could be argued that we may not know what that bottom level of physics is, or there may not even be one, and even our best attempt to do this could involve us in some interpretation, just as if we were using physical probes that read abstracted, macroscopic variables. We could reply that the sorts of interpretations used to obtain such variables would be the particular ones associated with physics, which do not seem to have much to do with finding algorithms, but we could be asked to prove this and also to explain what is special about these interpretations that makes them valid. Continuing further with such an argument would get us into the subject of the next objection, so I will leave that here. Instead, I would point out that, if we decide that the specification of the probes requires an arbitrary interpretation, this does not cause my argument problems. It may cause problems in building a useful algorithm detector, but that was never my objective. The objective was to show that if we had an algorithm detector, there would be arbitrariness in the interpretations that we had it make, that arbitrariness being in selection of its software. If it can be reasonably argued that some arbitrariness must be built into the design of the machine itself - if there is arbitrariness not only in the machine's software, but in the choice of physical probe to be used - then that merely adds to the main point: that there must be arbitrariness in the interpretation performed by such a device and that, as the device represents the process of interpretation itself, the concept of particular algorithms being run by hardware rests on arbitrary interpretation.
Objection 10: What you say about algorithms, you could as easily say about physical objects. For example, the concepts of "cat" or "chair" could be expressed as interpretive algorithms that allow us to decide if these objects are there. The problem is that we might imagine making a very complex interpretation that allows us to find a cat or chair, when none is apparent in the real world. Is this supposed to mean that such objects exist in some sense? As this is absurd, and as the same sort of logic leads to this, we should discount this.
Answer
I will be discussing this issue in a later article. For now, I point out that it does not remove the main problem raised by this discussion. There is nothing in this objection that somehow finds a way of declaring some interpretations valid and others valid. The observation that there may be a similar issue when dealing with physical reality in general may be uncomfortable, but it will not make the problem go away. What I have said so far will have given an idea of what, to some, will be the fairly appalling philosophical step that I am prepared to take to resolve this.
Readers of Greg Egan's books [12] might like it though: I will discuss that later.
Searle's objection does not go far enough
I find myself in a strange position, when I am going to disagree with Searle, of saying that one of his objections to the strong AI hypothesis does not go far enough. Nevertheless, this is the case with his "multiple realizability" objection. Here, again is one of the quotes of Searle that I gave:
"…the wall behind my back is right now implementing the Wordstar program, because there is some pattern of molecule movement that is isomorphic with the formal structure of Wordstar. But if the wall is implementing Wordstar, then if it is a big enough wall it is implementing any program, including any program implemented in the brain." [7]
Searle seems to view the idea of "interpreting" a physical system to obtain computational states out of it a bit like making words with alphabet soup: you can put things together in different arrangements and orders to make anything you want. As I have described in this article, however, the concept of interpretation is more general than this. An interpretation is not limited to being a way of "sticking pieces of reality together": an interpretation is any formally describable way of obtaining the algorithm from reality and is any algorithm which can be applied to reality (or more accurately used to control a machine like an algorithm detector, which is applying the interpretation to reality). This means that there is not even the requirement that the wall should be "big enough". A small piece of wall can also be said to be implementing any algorithm if we like, if we are prepared to use a sufficiently complex interpretation. To take this to an extreme, we could even apply any interpretation we wanted to a single electron in an atom in the wall to obtain any algorithm we wanted. This may seem counter-intuitive, and some readers will be thinking that it is obvious in some cases that all the information that we think we are extracting from the wall is just coming from the interpretation itself. As I have pointed out earlier, the problem is that there is no obvious cut-off point at which interpretations suddenly become "cheating" like this.
Actually, we might go further than saying that any interpretation we want could be made from a single electron. Do we even need the electron? That is something for later articles.
I may seem here to have been trying to strengthen Searle's case against the strong AI hypothesis, but that is not a problem for me. I have no problem with the basis of Searle's "multiple realizability" objection and I am using it myself in this article. It is better for me if it is as strong as possible and I certainly want the version of it that I use here to be the strongest one that can be justified. The difference between my position and Searle's will be in how this objection is answered.
Searle's Answer to the Problem of Arbitrariness of Interpretation
I will discuss Searle's answer to this problem now, but I want to emphasize that I disagree it.
What I call "arbitrariness of interpretation" Searle calls "multiple realizability". We may both use slightly different arguments to try to show that the problem exists, but we appear to agree that the problem exists. Searle's answer to it is definite: he states that the strong AI hypothesis must be incoherent, but not in a way that means it needs more clarification. He thinks it is completely incoherent to associate a mind with the running of an algorithm. Instead, Searle thinks that such things as "thought", "minds", "understanding", "intention" and "consciousness" must be associated with specific physical processes and that other processes do not cause these things just by appearing to be processing information in the same way. As far as Searle is concerned, only some processes have the "causal power" needed for minds.
Searle does not say which physical processes have the right causal power: he admits that he does not know. He does say that (obviously) the human brain has the relevant causal power and that there is no reason to presume that a computer would have it, just because it was performing what looked like the correct computation. This does not mean that Searle thinks that intelligent behaviour by a computer is unlikely. Rather, he would interpret such behaviour differently. He would say that processing of symbols counts for nothing and a computer which appeared to be conscious would probably just be a zombie-like simulation of consciousness - something with merely the appearance of a mind.
What this position implies is particularly obvious when we consider mind uploading. The strong AI hypothesis would indicate that if we could scan a human brain with sufficient accuracy to make a good enough computer simulation of the physical processes occurring in that brain, then the computer simulation would itself be conscious. Searle would say that there is no reason to presume this, because what makes us conscious is the particular physical processes occurring in brains and, while a computer simulation may capture the behaviour, it does not capture the actual physical processes themselves. This would apply, as far as Searle is concerned, if the mind uploaded "copy" were pleading for survival, telling jokes and engaging in philosophy. It would apply even if its behaviour could not be distinguished from the original biological person. As far as Searle is concerned, behaviour, algorithms and processing of symbols count for nothing while the physical nature of the system counts for everything.
My Answer to the Problem of Arbitrariness of Interpretation
I think Searle is wrong. I propose a solution that does not discard the strong AI hypothesis, but instead clarifies it. What I am proposing is this:
We should admit all interpretations equally.
In other words, if a correctly functioning algorithm detector can be made to report the existence of an algorithm then the algorithm should be considered to be there - no matter what sort of detection program was needed or how extreme it was. This applies whether we are detecting the algorithm itself or using one of the other devices that detects inputs and outputs and inferring the existence of the algorithm from these: there is no profound difference.
This may appear philosophically extreme: it should appear philosophically extreme, at least with regard to conforming to our intuitive expectations. If true, it means that a chair or a wall, can be said to be running any algorithm imaginable, because any conceivable algorithm could be "found" by interpreting observations made of it in the right way. This "many-interpretations" approach completely deals with the problem of arbitrariness of interpretation, or "multiple realizability" because the arbitrariness is in the selection of the interpretation that is going to be correct. If all interpretations are valid then there is no selection, and no arbitrariness.
The Extra-Strong AI Hypothesis
This is not an attack on the strong AI hypothesis: clarification of it is not a weakening of it. In fact, I would say that what I am proposing here is an extreme form of the strong AI hypothesis - one in which the idea that minds are associated with symbol processing is taken so literally that any interpretation of a system which gives the correct symbol processing is recognized as corresponding to a real mind.
It may seem strange to some readers to regard this as "extra-strong" AI, but consider Searle's view: Searle says that no computational interpretation should be viewed as corresponding to a mind. That is most certainly an attack on the strong AI hypothesis. I am going to the other extreme and admitting not only the particular computational interpretation of a system that strong AI advocates would insist that I should admit as a mind, but all the other interpretations that describe things like minds as well. I am about as far away from Searle's view as I could be.
It may appear, from this, that I am explicitly assuming that minds must be computational. This is not actually the case: things are more involved than that. Some readers will wonder how that can be: after all I am associating minds with the correct algorithms. I will explain later in this article. For now I point out that I said that minds should be viewed as associated with the correct algorithms. This is not the same as giving the algorithms themselves some sort of metaphysical power, and saying that "an algorithm is running" is really an informal way of saying something about reference classes and minds: I will discuss this later in this article.
Many-Interpretations and Measure
Why the Physical Substrate Matters
Considering different interpretations is equivalent to considering different detection programs that could be used in an algorithm detector machine, or similar device, as described previously. I have said that for any physical system we should admit all interpretations - all detection programs - as corresponding to algorithms that are based on the system. This may seem to make the physical nature of a system irrelevant. After all, if a physical system is running all algorithms, then why should it be of any importance at all if, for example, it contains electronic circuits which under one interpretation could be said to be running the WordStar program? This may seem to push us back into incoherence. The thought experiments in the earlier articles [1,2,3,4] suggested that if multiple systems are running something like your mind then the substrate is statistically important, but does this not suggest that the substrate and anything else is irrelevant? It may seem that this implies that your brain can be said to be causing your mind no more or less than a chair could be said to be causing it. According to such a criticism, it would be just as likely that your experiences, including all your perceptions of the external world, are being caused by a chair or table than your by your brain, which would make it dubious that you were even in your own brain! This would seem to over-rule everything said in the earlier articles about substrate and measure. I want to be clear that I am not advocating such an idea.
How should we resolve this issue? Relative measure, a concept from the previous articles, provides the answer. Just because any system can be said to have interpretations in which it runs all algorithms, and just because we treat each of these interpretations equally, it does not follow that all algorithms that can be produced by such interpretations are equal. In any given physical system, some algorithms - the ones we would normally associate with "common-sense" interpretations - will run with particularly high measure, relative to that of others, and the measure with which a physical system runs a particular algorithm, relative to the measure with which it runs some other algorithm, will depend on the physical nature of the system and the two algorithms.
There are two reasons for this:
- Some interpretations will produce the same algorithm, so the measure of some algorithms is higher, relative to that of others, because more interpretations in the set of all possible interpretations produce them.
- Some interpretations will produce algorithms that are very similar, if not identical, so that the measure of some classification, or general type, of algorithms will be higher, relative to that of others.
To see why the physical nature of the system should matter we can use the following argument:
Consider some physical system, P. We are applying an algorithm detector to P.
Suppose we have some detection program I1 to put in the algorithm detector. I1 has length L1 bits and when applied to P applies a particular interpretation of what P "means" and produces algorithm A.
Now, I suggest that the following statement is true:
If algorithm I1 has L1 bits and, in an algorithm detector, produces an algorithm A by interpreting P, if we are considering all detection programs with lengths up to some maximum lengths of L bits, then the number of detection programs which, when used in the algorithm detector, also produce A is largest when we can find an algorithm I1 such that L1 is smallest.
or to put this more simply:
If an algorithm detector can produce some algorithm A by applying a short detection program to some physical system then we should expect many other detection programs also to produce A. If, however, we need a very long detection program to produce some algorithm A when it is applied to the physical system, we should expect interpretations that produce that algorithm to be rare.
A similar argument could be made about algorithms of some classification or general type.
This would explain why "sensible" interpretations should have high measure, relative to that of extreme ones. Considering your brain as an example, the sort of interpretation that produces an algorithmic description of your mind will be a relatively "sensible" interpretation: it would not need a very long program in an algorithm detector or similar device, because the physical system itself is arranged in the right way to facilitate this sort of interpretation. Such an interpretation will therefore have high measure relative to some of the absurd interpretations that we might imagine.
Infinite Sets and Counting
I have just been talking about counting interpretations, but that is an informal way of viewing things. We cannot really count interpretations or algorithms. There is no absolute number of detection programs that produce A because such a number can only be given for some maximum detection program length of N bits and N is finite in the above reasoning, but unlimited in reality. That is to say, the set of all possible detection programs is infinite, but here we are considering some finite set of detection programs
This is resolved, by using the same sort of technique used in my article Occam's Razor Part 4: An Overview of How Occam's Razor Works [18] of letting N tend to infinity. We can make N progressively larger, determining the number of all interpretations that produce an algorithm A1 and another algorithm A1, and we can obtain the number of all interpretations that produce A1 as a proportion of those that produce A2 as N tends to infinity. This can be taken as the measure of A1, relative to that of A2.
This is the closest we can get to "counting" instances of an algorithm produced by different interpretations. Although we might informally think about "how many" instances of an algorithm are produced by all the different interpretations, the fact that the set of all interpretations is infinite means that it is only technically correct to consider relative measure. This does not really change what was said above about how the physical substrate affects numbers of algorithms. To be technically correct we just need to talk about measure without reference to absolute numbers of interpretations or algorithms. An algorithm, or some general type of algorithm, is obtained from some physical system, by interpretation, with a high measure, relative to that of some other algorithm or type of algorithm, when the shortest detection program that can be put in an algorithm detector to obtain the algorithm, or an algorithm of that general type, is as short as possible.
This sort of reasoning encompasses the "common-sense" view of particular algorithms being run by physical systems. When we examine a physical system we might easily "see" that it is "obviously" running some algorithm, but that really means that some uncontrived interpretation can produce that algorithm from the physical system. This uncontrived interpretation means that the algorithm runs with high measure, relative to the measure with which other algorithms, associated with more contrived interpretations, could be said to be running.
What the "Other" Interpretations Really Mean
Yes, your brain, or a table or chair might be said to be running the WordStar computer program, but the extremely contrived interpretation needed to obtain such an algorithm from these systems means that the measure will be extremely small relative to that with which other algorithms resulting from more "sensible" interpretations are being run by various systems. On the other hand, the physical nature of your brain makes it simple to apply an interpretation that produces an algorithm corresponding to your mind, so the measure of that kind of "sensible" interpretation will be higher relative to that of other, more contrived interpretations. As was discussed in the previous articles [1,2,3,4], relative measure is an indication of probability, so it is much more likely that your experiences are due to the algorithm resulting from the obvious, reasonable interpretation of your brain than it is that your experiences are due to some extreme interpretation of the matter in, for example, a wall.
Many-Interpretations and Weak Substrate Dependence
In the previous two articles in this series [1,2,3,4] an argument was made, with thought experiments, attempting to show that minds exist with different relative measure and that the measure with which a mind exists depends on the physical nature of its substrate - the physical nature of the system running the relevant algorithm. The thought experiments involved situations in which there are multiple physical systems running algorithms that could correspond to your situation, so that there is uncertainty about your situation, and considered the statistical consequences of combining these systems. It was stated that the measure with which a computer runs an algorithm is greater when the redundancy, or inefficiency in use of matter to run the algorithm, is greater. At the time it was stated that this seemed to be true from the thought experiments, but that a deeper understanding was needed to understand how the physical nature of a computer would affect measure. It may seem that we digressed from this with the discussion of arbitrariness of interpretation in this article, but this is actually relevant to the issues discussed earlier. We now have the deeper understanding that I said was needed and are we are in a position to reconsider the thought experiments in the earlier articles in the context of many-interpretations.
In the thought experiments I took the common-sense view that there is a "preferred" algorithm - the one which seems to be run in a reasonable interpretation of the physical system. For example, in the case of an electronic computer this would be the algorithm that appears to be associated with the arrangement, and states, of the electrical components in the system. Now, in this article I have proposed that instead we should accept multiple interpretations, but we will ignore most of these here. We will instead focus on those interpretations which match "common sense" - which give the "preferred" algorithm that most people would reasonably say is running. As has been stated, this algorithm is run with particularly high relative measure anyway, because of the low amount of information needed to construct such interpretations. If we have two physical systems, and for each of them the same algorithm is the "preferred" one run with very high relative measure, then we can compare the relative measures for this preferred algorithm in each case. Suppose both systems are electronic computers of the same general design, but one has much thicker wires. If we imagine an algorithm detector or similar device moving its physical probes around in this system, then there are many more ways in which it can obtain the sorts of values needed to make an interpretation leading to the "preferred" algorithm than there are in the other system with the thinner wires: there is more matter in which the physical probes can be moved around.
This deals with a number of possible objections to the earlier articles. Some objections would use the idea that the previous proposal that inefficient use of matter in computing causes algorithms to run with higher measure is not a formally expressed idea. Such objections could be that different kinds of splitting could be conceptualized or that "inefficient use of matter" is a vague concept. Objections like this are easily dealt with now by saying that the argument about "redundancy" and "inefficient use of matter" was just an informal version of the argument in this article. To be technically correct, we should not be looking at "redundancy" or "inefficient use of matter", but at the measure with which the algorithm is running. Even selecting a single algorithm as the "preferred" algorithm would be a simplification. It would just be one of an infinite set of algorithms which are all running with some measure. Many-interpretations is the real explanation of weak substrate dependence.
Dealing with Groups of Algorithms
Uncertainty About Yourself
The sort of reasoning here would apply to individual algorithms, but also to algorithms of some similar type. In the thought experiments in the previous two articles [1,2,3,4], it was assumed that you know that various computers are running specific algorithms that correspond to your mind. A problem with this scenario is that you are unlikely to know what algorithm corresponds to your mind. Some readers may have noted that with the previous articles this was not dealt with and, in the thought experiments in those two articles, you know that specific systems are running versions of the algorithm that corresponds to you in virtual realities that may be the same or different.
Dealing with Uncertainty
How do we deal with situations like this if there is uncertainty about what algorithm corresponds to your situation? As an informal simplification, the idea in the first article [1,2], that an algorithm tends to be run with greater measure when inefficient use of hardware is involved, still applies, the real explanation of this being that in such situations the algorithm can be produced by more interpretations (using the word "more" in an informal way, given what was said about no absolute numbers being countable), so we can discuss things in terms of the concepts in the first article, to some degree. If the ideas about measure from the first article are valid for thought experiments involving different, identical versions of you, they will hardly go away when we have different versions of you. Whatever we can say about the measure with which a single version of you is run in some computer will still apply whether it is the same or slightly different as other versions of you in other computers.
Let us consider the kind of situation in the first article [1,2] where there is uncertainty about what substrate you are on. The difference now is that you do not know exactly what algorithm would correspond to your mental experiences. You are confronted with a number of different algorithms, running on different computers, all of which are possible candidates for your mental experiences. Each of these slightly different versions of you is running in a virtual reality and the virtual realities are not necessarily the same. They could be very different; however they all involve the simulated version of you being in a situation indistinguishable (based on what you know) from your current situation now (even though what happens to you in ten minutes may be a different matter). You know about the physical details of the computers running each such version of you, and about how the software is implemented in each version. In short, you know all of the details about how all a number of non-identical versions of you are implemented in non-identical virtual realities on non-identical hardware.
How do we deal with this? You cannot rule out any of these versions, because they are all close enough to what you know about your current experiences - your own mental state - to be possible candidates for it. Let us further assume that you do not know enough to declare any of these situations a "better fit" for your current situation than any other. In this kind of situation it should be clear that any distinction between different versions of you is irrelevant in terms of how we assign probabilities. It is pointless to talk about differences in versions, with regard to probabilities, if we cannot distinguish between them. What has been said about substrate, measure and interpretation applies to each individual algorithm that could be found, by interpretation, in any of these systems, and because we cannot distinguish between them we can treat them all as if they are identical versions of you, on different substrates and in possibly different virtual realities, with different futures. It is not necessary to have a collection of identical versions of you to assign probabilities.
Most of the measure will come from "obvious" interpretations in the various computing systems on which you could be running, so we should not need to deal with the contrived versions, found by longer detection programs, to assign probabilities. We should be able to deal just with those interpretations that find algorithms with measure that is high, relative to that of algorithms found by other interpretations. These interpretations will have high measure because of the nature of the physical substrate. For example, if the physical substrate is a human brain, then the "obvious" interpretations will be those that pull things like a specific human mind out of the physical system because less information will be needed to do this. Many such reasonable interpretations will be possible, for example by obtaining physical variables from slightly different points in a neuron. There is not just one reasonable interpretation. There will be many-interpretations that can be made by relatively short detection programs and which are almost the same. In practice, however, we would not need to worry about this when assigning probabilities to different situations as in the thought experiments that have been considered. By looking at a human brain, we could determine what the most "obvious" algorithm resulting from interpretation is. We would not need to work out all the individual interpretations which result in such an algorithm, or minor variations of them. Instead, we could say that a relatively high measure group of interpretations that are compatible with our experiences is implied by this single "obvious" interpretation. We could then use our knowledge about the physical structure of the brain to get an idea what the "measure" of algorithms in such a group is, relative to measures obtained in a similar way in other physical systems.
Two Separate Issues
There are two separate issues relevant to the many-interpretations view and it is easy to mix them up:
- the issue of which interpretations should be considered valid for saying that a given algorithm is running, or that some computation is proceeding through given computational states in some physical system.
- the issue of what algorithms, computational states or sequences of computational states should be regarded as corresponding to minds.
These are separate issues. To state whether or not a given physical system produces a mind we must be able to deal with both issues. This series of articles will mainly deal with the first issue. I have not yet decided whether or not any later articles in this series will deal with the second issue at all.
Both of these can be considered as issues of possible sets which determine the set of possible situations - the reference class - for an observer. This series of articles has considered thought experiments in which you are unsure about which substrate you are on and when considering this kind of uncertainty about your situation, and the chances of being in different situations, we are really considering things in terms of reference class. In probability calculations the reference class is the set of possibilities from which something is chosen and in this sort of consideration the reference class is the set of all possible situations in which you could be existing. A consideration of the above two issues, in reference class terms, is be as follows:
The first issue, the main subject of this article, is the issue of what formally expressed interpretations, or descriptions of your situation based on external reality, should be considered as forming the set of possible candidates for your situation. I will call this the interpretation set.
The second issue is the issue of what sort of information structure, algorithm or computational state, produced or found by an interpretation, should be considered a member of the set of candidates for your mental state. I will call this set the abstraction set.
Each member of the abstraction set has its own interpretation set of formally describable interpretations of the physical world that relate the physical world to that mind/mental state. This means that the situation of any thinking being is described by a pair of items chosen from the two sets: the abstract description of the mental experience comes from the abstraction set and the interpretation that relates these experiences to the physical world comes from the interpretation set corresponding to that particular abstract description. Each member of the reference class of possible situations in which a thinking being could exist is therefore an abstract, formal description of a mental state from the abstraction set and a formal description of how that mental state relates to the physical world (that is to say, a formally expressed interpretation of the physical world) from the interpretation set for that mental state.
It should be noted that mental states do not all occur with the same frequency in the reference class of possible situations. A mental state with more (using that word informally) interpretations in its corresponding interpretation set will feature in more situations in the reference class of possible situations, because every possible combination of a mental state and an interpretation that relates the physical world to that mental state is a member of the reference class of possible situations.
Here is an example of how the abstraction and interpretation sets would determine the reference class of possible situations. Rather than consider infinite, or even very large sets, let us suppose that we have small sets:
There is an abstract, formally described mental state, M1. M1 is related to physical reality by formally described interpretations I1, I2 and I3.
There is a mental state M2, related to physical reality by formally described interpretations I4 and I5.
There is a mental state M3, related to physical reality by formally described interpretation I6.
The reference class of possible situations involves every possible mental state and interpretation pair as follows:
M1I1, M1I2, M1I3, M2I4, M2I5, M3I6.
In this example, different mental states have different numbers of interpretations. Some readers might ask how anything can be counted, given the true infinite sets, but this is dealt with by considering descriptions with lengths that tend to, but do not reach, infinity.
This may be unwieldy to handle statistically, or even to think about, so I can make one suggestion about how it could be simplified for convenience in situations in which we are much more interested in the different possible interpretations than in the different possible mental states: we could just assume that one plausible mental state (that is to say, a mental state from the abstraction set) is the mental state (whether it is or not) and ignore the rest of the abstraction set, considering just the interpretation set corresponding to that mental state.
In the example just given, if you selected M1 as your mental state then your reference class of possible situations would be:
M1I1, M1I2, M1I3
This is likely be an effective simplification technique, because in most cases it will probably be hard to determine any significant difference between the interpretation sets for different mental states in your abstraction set.
Whether these issues are viewed in terms of the descriptions that I initially gave or in reference class terms is partly a matter of convenience, but the second approach has an advantage. A problem of the first approach is that it is philosophically abstract. Knowing what position to take on these issues would mean knowing what a mind is, and what is needed to make one exist, but it is hard to see what empirical questions would be answered by possession of such information and an argument might therefore be made that no questions would be answered. Alternatively, considering these issues in reference class terms means that the probabilities of you being in various situations that you obtain in thought experiments like those discussed earlier [1,2,3,4] are dependent on the approach that you take: considering things in terms of reference class actually has ramifications for your possible situation. This makes the reference class view preferable. A similar reference class view can be taken of the strong AI hypothesis itself, as will be discussed later.
The second issue - that of the abstraction set - is about what formal descriptions of your mind are valid. To some readers, this may seem nonsensical: they may point out that there is only one valid description of your mind - the one that happens to be correct. We do not know, however, what this description is. In fact, your knowledge about your own current state of mind is limited, long before we get as far as computational states or reference classes. What does happiness feel like? Do you know exactly how happy you are now? If you are about to do something can you say exactly how strong that motivation is? If you had to make a formal description of your current state of mind, assuming that you could attempt such a thing (and I am not pretending that I could), limitations would be imposed by your lack of knowledge about own mental state. At best, all you could do is state which kinds of formal descriptions are valid and which kinds are invalid. These valid descriptions may vary in terms of the detail of how they describe your emotions and motivations, your perception of something that they found beautiful, your opinions of this article, and so on, and limitations in your own self-knowledge would mean that you have a lot of valid descriptions. Every possible, formally describable situation that you could be in involves one of these formal descriptions of your mental state and a formal description of how that mental state relates to reality (the interpretation). The "correct", formal description of your situation would be one of these, but you would not know which it is after you have already removed all those which contain invalid mental states or interpretations. Assuming that all valid formal descriptions of your situation are equally likely (we could introduce complications by getting different probabilities involved for different formal descriptions here, but I will not do that), the "correct" formal description of your situation is randomly chosen from one of these, so the set of possible formal descriptions of your situation is a reference class. A reference class of possible, valid formal descriptions of your situation results then from limitations in self-knowledge as well as uncertainty about the correct interpretation.
Some of this self-knowledge may come from limitations of human biology. We might imagine, for example, a machine which can know exactly how happy it is by running some software subroutine which tells it. What if a machine could make similar enquiries about every aspect of its functioning? Could it "know" a formal description of its own computational state and avoid the need for a reference class at all? I think many readers will intuitively suspect that some kinds of limitations about self-knowledge would apply in any situation like this and, indeed, there is reason to think that this is the case and that no thinking entity would be able to avoid the need for a reference class of possible situations involving various formal descriptions of its mental state to deal with limitations in its own self-knowledge. This issue of limitations on self-knowledge has been discussed by Hofstadter [19].
Is this a computational view of the mind?
Ways in which it is and ways in which it is not
The many-interpretations view may appear obviously computational, but things are not that simple. The answer to this question is "yes" and "no" depending on what the term "a computational view of mind" means.
First, let us consider what is computational about the view:
Any mental state is computational to the extent that an algorithm is used to describe it. An observer who is unsure about his/her/its mental state will have a set of possible mental states, each such mental state being expressed as an algorithm.
Any interpretation that means a mind exists, based on physical reality, is computational to the extent that an algorithm is used to describe that interpretation. Any observer who is unsure about his/her/its situation in reality (for example, on what substrate he/she/it really exists) will have a set of possible interpretations, each such interpretation being expressed as an algorithm, implying that it is a computational interpretation.
There is another, obvious way in which the many-interpretations view could be regarded as computational: it allows computers to have minds.
This may appear very computational, so let us consider what is not computational about it:
Although the interpretations and the mental experiences/mental states in the reference class of possible situations are described computationally, this is not the same as stating that the mind results from computation. The many-interpretations position is merely assuming that things in the physical world must relate to the state of any mind that exists, and that this relationship and the mental state must have a formal description. Algorithms are used for that formal description, because of the apparent generality of algorithms, and the many-interpretations view assumes that any interpreting algorithm that can describe the relationship between a mental state and variables in the physical world is associated with a mind.
Does this mean that the mind is being assumed to be "computational" or "an algorithm"? Not really: the same sort of computation could be said to apply to anything else currently recognised by science. For example, let us consider fire engines instead of minds. Any existence of the type of object that we call "fire engine" is based on what exists in a physical system. All of our basic knowledge of the physical system could be described in terms of various physical variables. There would be some relationship between these physical variables and the various features that a fire engine might have, and this physical relationship could be described by an algorithm. There would be an algorithm corresponding to any fire engine, describing how it relates to the underlying physical variables that cause its existence, but would it be reasonable to say that this is equivalent to claiming that fire engines are computational? If we accept these semantics we should recognize that such an idea of "computational" is so general as to allow anything to be computational if it happens to have features that relate to reality in a formally describable way. If we decide that this does not make fire engines computational then we similarly have no reason for saying that minds are computational, just because a mental state is formally - algorithmically - describable and has a formally - algorithmically - describable relationship with physical variables.
Although the many-interpretations view allows computers to have minds, this is not a profound feature of it. According to the many-interpretations view, minds would be associated, with varying degrees of measure, with all physical systems, and described computationally in each case. If a computer is doing the right things needed to have a mind it merely means that the physical arrangement of the matter in the computer makes some interpretations have particularly high measure: those that describe how the mental states of that sort of mind are related to physical variables - how the mental states can be produced by interpretation of physical variables.
What this all means is that the many-interpretations view has some elements of a computational view of the mind in it, which are just products of a formal view of nature in general and the idea that algorithms are particularly well suited for expressing formal descriptions. Relating a mental state to physical reality by means of an algorithm may seem computational but the computation is in the way that the relationship between the mental state and physical reality is expressed rather than the mental state itself.
Some readers might point out that this leaves us with no view of what a mind is. If a mind is associated with the underlying physicality, but is not that, and if it can be described computationally, but is not computation, what is it supposed to be? The answer, and it will not please some people, is that it is a mind, in the same way that a fire engine is a fire engine. We might say that a fire engine's existence is contingent on some underlying matter, but a fire engine is generally regarded as being a fire engine in its own right.
What this all comes down to is that whether or not this is a computational view of the mind depends on what you mean by "computational view". What is important, however, is that there is no explicit assumption that "the mind is computational". The many-interpretations view merely takes the position that the relationship between mental states and the underlying physicality should be capable of formal description.
We can, of course, still talk about how algorithms are "running" in a physical system. In fact, I do so in some parts of this article. We should be aware, however that this is a simplification. Saying that some algorithm is running according to some interpretation is really saying that the interpretation describes a relationship between the physical system and some entity that is formally described by that algorithm.
What this means for the strong AI hypothesis
The strong AI hypothesis is that minds are associated with processing of symbols by an algorithm running in some physical system. Searle says that this idea is rendered incoherent by what he calls "multiple realizability" and I call "arbitrariness of interpretation", as any symbols and any processing can be obtained with the appropriate interpretation. The many-interpretations position deals with the problem of arbitrariness of interpretation by associating any valid interpretation with a mind and giving consideration to the measures of different kinds of minds, which will depend on the physical system. I think the many-interpretations position deals with Searle's multiple realizability objection against computers having minds, but a question that should be asked is whether the many-interpretations view vindicates the strong AI hypothesis or shows that it was wrong all along.
Someone trying to defend Searle's position from the many-interpretations view could argue that the many-interpretations view does not support the strong AI hypothesis because the strong AI hypothesis assumes that a particular "computational" interpretation of what a physical system is doing is the "correct" one, while the many-interpretations view denies that any interpretation is the correct one. Furthermore, the many-interpretations view does not really claim that minds are caused by processing of symbols, but rather accepts computation in description of mental experiences and the interpretation with which a mind is associated. Also, the many-interpretations view involves the concept of measure, whereas the strong AI hypothesis does not involve any such consideration.
This objection cannot make Searle's position correct. Searle clearly does not expect computers to have minds. He may have qualified this by saying that he is not saying that computers do not have minds, but he is clearly sceptical. The many-interpretations view accepts that a computer could have a mind, and in that important respect it is much closer to what the strong AI advocates are saying than what Searle is saying.
Should the strong AI hypothesis be thrown out? I do not think so, but it needs clarifying. If it seems obvious to us that a system is doing the right sort of processing then we might apply the strong AI hypothesis to say that the system has a mind. If, on the other hand, we applied the many-interpretations view to the same system, then (if our initial view that the system was doing the right sort of processing was reasonable) we would be likely to find that the physical substrate was such that interpretations leading to that sort of mind had a particularly high measure. The reason for this is that any reasonable person claiming that a system is doing a particular kind of computation must have made a relatively simple interpretation of the system, suggesting that a high measure interpretation relating the system to a mind is available. If an advocate of the strong AI hypothesis explained why some system had a mind and an advocate of the many-interpretations position explained why the same system had a mind with particularly high measure, then the two explanations would not be very much different with regard to what they said about the system. The semantics of one explanation would be about computation in the system, while the other explanation would be associating the same sort of computation with a high measure interpretation of the system.
It is obvious that the strong AI hypothesis needs some clarification to agree with the many-interpretations view. The most obvious need for such clarification is that the strong AI hypothesis might tell us that some system has a mind, on account of performing some computation, but the many-interpretations view would say that any system has that mind, even though it may be with very low measure. The strong AI hypothesis and the many-interpretations view are therefore not talking about the same things.
From the many-interpretations viewpoint, the strong AI hypothesis is a reasonable, informal statement of what a physical system needs to be doing to have minds with relatively high measure, based on observation of what that system appears to be doing.
The Strong AI Hypothesis in Reference Class Terms
The Reference Class Justification of the Strong AI Hypothesis
I have discussed the two different issues of valid formal descriptions of a mind and valid formal descriptions of interpretations and how these could be viewed in reference class terms. A similar approach can be taken to the strong AI hypothesis itself. The strong AI hypothesis states that minds are associated with the processing of symbols. This is not about the abstraction set. It may seem that accepting the idea of the abstraction set means accepting the strong AI hypothesis, but the abstraction set is just the set of formal descriptions of your mental state that could be valid. Some people like Searle say that specific physical processes are involved - however, a consideration of the many-interpretations view shows this to be a flawed position. People making these sorts of argument treat the idea of "substrate independence" as if it is an assumption that needs some sort of support. I would say, however, that substrate independence (at least weak substrate independence) is arrived at naturally, simply by avoiding prejudice. A justification for this works by turning things around. Instead of looking from outside a mind and asking what would be sufficient for a mind to exist, you look out from your own mind, and ask what would be sufficient to constitute an explanation of your mind. The argument is as follows:
- Any real knowledge that we have should be capable of being formally stated, at least in principle. This should seem a reasonable starting point. If we cannot have this we should ask why we are even trying to do philosophy, and even people like Searle and Penrose have not claimed that the workings of minds are beyond formal description. Theists might.
- Your mental state - the subjective experiences that you are having now - should be formally describable, at least in principle. This follows from (1).
- The correct formal description of your mental state is one description from a set of possible descriptions of your mental state. The size of this set depends on how much you know about your mental state. If you have complete knowledge about your mental state then there will only be one member of this set. If you have incomplete knowledge of your mental state then this set will contain many different possible, descriptions. This set is the abstraction set to which I referred earlier in this article.
- Your mental state exists because of something in the "outside world". By "outside world" all I mean is that part of reality that is not your mental state. Whatever happens in the outside world, there must be some relationship between events in the outside world and your mental state. This relationship is capable of formal description. That is to say, there must be some way of formally describing how events or things in reality "map onto" your mental state - how various characteristics of reality correlate with various characteristics of your mental state. If you know what the nature of reality is, but do not know how events in physical reality correlate with your mental state then an unprejudiced view is one in which, for any given mental state, you consider every such possible formal description of a way in which reality could relate to the formal description of that mental state as being a member of the set of possible, formal descriptions of how reality relates to the description of your mental state. If you have no idea what the nature of physical reality is then you should take a similar sort of position, except that your reference class is now every possible, formal description of how reality relates to the description of your mental state in every possible reality that can be formally described. In either case, each of these formal descriptions of how reality relates to the formal description of your mental state is nothing more or less than an interpretation of the kind we have been considering in this article! The set of such formal descriptions is therefore the interpretation set to which I referred earlier. Simply by not making assumptions about how minds relate to physical reality, we end up with every possible interpretation in the interpretation set.
- If a particular interpretation - a particular formal description of how your mental state relates to the outside world - could conceivably be considered by you to be part of your interpretation set for some possible mental state then, unless you are a solipsist, you should also consider it reasonably possible that any similar interpretations of reality are associated with other minds. This is not dependent on it being likely that you could find yourself in such situations: the mere possibility of including them in your reference class means that you have no strong philosophical argument against them. Nor does it mean that you need to be sure that minds are associated with such interpretations: that would be a matter for further philosophical discussion. It just means that all you can know about a particular formal description of how a mental state relates to physical reality is your decision about whether or not you would admit it into your interpretation set for that mental state, and therefore into your reference class of possible situations. If you do admit it then you have every reason to view such situations as involving minds. If you do not admit it then you should have some good philosophical reason for this prejudice in selection of your interpretation set.
- We can go further than the previous step. It is possible to make an unlimited number of formal descriptions of how your mind could relate to physical reality and, without a strong philosophical argument stating otherwise, you should admit these into your interpretation set for any corresponding, possible mental state. From (5) you should view it as reasonably possible that other such formal descriptions imply the existence of minds. Further, you cannot possess any other evidence that will tell you more about the existence or otherwise of minds associated with such formal descriptions: all you can ever have is knowledge of physical reality and knowledge of what you regard as constituting your reference classes of possible situations.
At this stage, we have ended up with the many-interpretations view with regard to the interpretation set for any mental state that could be involved in your own situation and a strong suggestion of it with regard to the existence of other minds. What is important about how we ended up with it is that we did not assume anything like "substrate independence": instead this followed from the approach of not having any prejudice about how the interpretation set should be constructed. We did not assume a "computational" paradigm. No assumption was made that minds are caused by "the correct computation". Instead, the only assumptions in the first four steps, about our minds, were that minds can be formally described, that minds are caused to exist by something in reality, that the relationship between this "something" and the mind being caused to exist can be formally described, and that in the absence of any information about which such formal description of the relationship between the physica