Disagreeable Me: Putnam, Searle and Bishop: The Failure of Physicalist Computationalism

Wednesday, 24 February 2016

Putnam, Searle and Bishop: The Failure of Physicalist Computationalism

I wanted to come out of blog dormancy to write up my thoughts on what I feel is a very important argument against computationalism. The argument advances the view that there is no objective fact of the matter about which computations a physical system is computing, and if this is the case it would certainly seem to problematise computationalism (the view that what it is to be a conscious mind is just to perform the right kind of computation).

In this post I will explain the argument and some of the common responses to it. I'll reserve my own response (which is quite different from that of most computationalists) for a future post.

The basic idea is not so new, going back at least to the late 80's and early 90's with arguments from Putnam[1] and Searle[2]. Searle neatly captures it as follows:

On the standard textbook definition of computation, it is hard to see how to avoid the following results:

For any object there is some description of that object such that under that description the object is a digital computer.

For any program and for any sufficiently complex object, there is some description of the object under which it is implementing the program.

Thus for example the wall behind my back is right now implementing the Wordstar program, because there is some pattern of molecule movements that is isomorphic with the formal structure of Wordstar. But if the wall is implementing Wordstar, then if it is a big enough wall it is implementing any program, including any program implemented in the brain. (pp. 208-9)

Searle's treatment of the problem is left rather vague. Perhaps many computationalists would be happy to dismiss such an idea as preposterous or absurd. However, Putnam justified in detail a very similar but more precise claim: that any open physical system can be interpreted as implementing any finite state automaton (FSA), where FSA is presumably more or less Searle's textbook definition of computation.

Because Putnam's treatment is so rigorous, let's focus on that for now and begin with an introduction to Finite State Automata (feel free to skip ahead if you are already familiar with the concept).

Finite State Automata

Firstly, I think it's important to point out that the FSA model is strictly less powerful than that of the Turing machine, but only because it has a finite set of possible states while the Turing machine has an infinite amount of tape to work with. As such, it ought to be possible to model any realizable digital computation with an FSA, and indeed it could be said to be a more realistic model than the Turing machine precisely because the set of states is finite.

An FSA is always in precisely one state at a time, which means that right from the outset it diverges quite significantly from how programmers would tend to think of algorithms and computation, where state is really a complex vector composed of variables and data structures. Conversely, the state of an FSA doesn't really have any content, only simple rules regarding how and when it transitions to the next state and what output to produce in each state.

Let's make this a little more concrete, with an unrealistic toy example of a shop system where if the value of the float in the till is less than £20 and the till takes cash, then we need to note that we are short of cash in that till in order to pay out change. A programmer might say something like "IF $till.float < 20 AND $till.type = 'CASH' THEN SET $till.shortOfCash = true", where the dollar signs denote variables being read from and written to. However, in an FSA there are no such variables. Something like this logic would instead be "state A goes to state B", where state A corresponds to the state where the till has less than £20 and accepts cash but we have not yet noted that the till is short of cash, while state B is the state where the till has less than £20 and accepts cash but we have recorded that the till is short of cash.

Suppose that when a till is short of cash a warning light flashes. In the programmer's model, this light would be associated with a variable such as $till.shortOfCash. In the FSA model we would just associate this output with certain states (such as B) and not others (such as A).

The number of variables programmers use in any reasonably complex software is quite large, and the number of possible values each variable could hold is frankly enormous. When you consider the number of combinatorial possibilities we have for all these variables, the number of possible states quickly expands to ridiculous proportions. So in most computations, the number of actual states you would need to account for in your FSA state transition tables is vast (to put it mildly). This is why even though the FSA is useful as a mathematical abstraction of computation it is worthless as a programming paradigm.

All the same, it is not hard to see that any computation can be described as an FSA. At any given time, a physical computer is in a particular state, that is all of its registers and memory addresses have particular values, and the ensemble of values constitute an identifiable state we can label. Forgetting input for a moment, the next state the computer visits and whatever output it might produced is completely determined by the current state. An FSA model would just list all possible states by their labels and what successor states and output they produce. This ought to be enough to capture anything a computer could ever do. Even input can be incorporated if we simply treat it as part of the state. As such, if any algorithm can pass the Turing Test and do all the information processing tasks a human brain can do, then so could an FSA.

(Aside: Searle is often ridiculed for implying that his Chinese Room thought experiment could be implemented with something like a big lookup table matching questions with responses, but consideration of the FSA model implies that in principle he could. With the right FSA table, every interaction with the room would put the FSA in a novel state and so he could mimic the ability of a conventional algorithm to learn and change and give different responses to the same input at different times. Right away we start to see that conventional computationalist attitudes such as faith in the Turing Test as a detector of consciousness are in trouble, because it is very hard to see how something as simple as a big lookup table could produce consciousness. All Searle would need to do to answer a question is to look up a big dictionary for his current state (he would have one such dictionary for each state) mapping input Chinese to output Chinese and identifying the next state. It is often assumed that Searle would need to take a long time simulating neurons and so on, but if he has these dictionaries then that is not so. There is a tradeoff of time versus memory, though, because the number of dictionaries he would need and the size of each dictionary is frankly absurd.)

Putnam's Rock

I've discussed how we can interpret a computer as implementing an FSA. The problem for computationalists is that we can perform much the same kind of interpretation on any open physical system (e.g. a rock) and map this to any FSA we like! Like a computer, the state of any physical system is defined as the ensemble of microstates we can define in the system (instead of registers, we might use the disposition and charges and momenta of all the atoms in the system). And like a computer, each state causally depends on preceding states. If we call the state at time 0 A, and the state at time 1 B, then we have shown that the physical system implements the FSA "A goes to B" and so it can be said to perform the example till-related computation we had above. All that's missing is the output, however if we had the ability to scan the microstate of the physical system then producing the output corresponding to that logical FSA state would not be difficult.

Besides, I'm not convinced that either input or output need be of much concern. We can imagine a computer simulation of a rich virtual environment hosting putatively conscious AIs but which has no input or output. If we can map the states of this computation to the states of a physical system such as a rock, how can we justify our intuition that the computer really is running the computation (and so hosting conscious entities) but the rock is not?

There are a number of angles one could use to question Putnam's result, some more successful than others. I'll get to those a little later, but for now I'll just note that, superficially at least, Putnam would appear to be correct. If to instantiate a computation is just to be describable as the right kind of FSA, then it would appear that every physical system is performing any computation you could care to mention.

Bishop's Pixies

John Mark Bishop has published a number of papers ([3],[4],[5],[6]) which expound on the implications of Putnam's result and related arguments, also answering some challenges that have been raised. They are well worth a read. If you don't have the time, the video above may be of interest.

Bishop interprets Putnam's result to mean that computationalism demands that every physical system is host to a multitude of conscious minds (which he humorously refers to as 'pixies'), by simple virtue of their natural evolution through a succession of distinct states. Since a computationalist believes that to be a conscious mind is just to implement the right kind of computation, and since any physical system is implementing any and all computations simultaneously (depending only on how you interpret it), then all possible conscious minds must be instantiated simultaneously in every rock. For Bishop, this is the most absurd kind of panpsychist position imaginable and so demonstrates that computationalism must be false.

I'm not sure that Bishop really adds much to Putnam's original argument, but he has done a great job of explaining it and advocating it in recent times, as well as taking an interest in addressing objections. For these contributions he is to be commended.

Bishop concludes with Searle and Putnam that computation is very much in the eye of the beholder, that there is no objective fact of the matter about which computations a system may or may not be implementing. To illustrate this point, Bishop raises the example of a truth table in digital logic such as the following.

A	B	Output
0	0	0
0	1	0
1	0	0
1	1	1

Bishop asks: which logical operation does this truth table correspond to? If you know anything about digital logic, you might be inclined to say that this is the truth table for AND. But this assumes that 0 represents FALSE and 1 represents TRUE, which is only a convention. We could equally interpret it the other way around, and now this would no longer be the table for AND but for OR. When it comes to interpreting what physical logic circuits are doing, we don't see TRUE or FALSE anywhere, only conventions of this nature regarding which voltages (or potentially other physical properties) are interpreted as TRUE and which are interpreted as FALSE. Since we can't even say what logical primitive each gate is implementing, it seems hopeless to suggest that there could be a fact of the matter for the system as a whole.

However, though computation seems to be in the mind of the observer, it is almost universally agreed that there is an objective fact of the matter about whether certain physical systems (such as human beings) are conscious or not.

I should emphasise here that we are not talking about subtle gradations between categories. Many of us are comfortable with the idea that there is a grey area in the spectrum from trivial unconscious information processing in bacteria to the complex conscious information processing in humans. The point is whether we can look at a system and objectively place it somewhere along the spectrum. For example, it is usually assumed that a healthy alert adult human is definitively at the conscious end of that spectrum.

An objective fact cannot be explained by a subjective interpretation. It is not plausible, for instance, that a virus could evolve which would kill only beautiful people and leave ugly people alone. Neither is it plausible that it would take less energy to broadcast good TV than bad TV. The very idea is a kind of category error, because people may legitimately disagree on who is beautiful or ugly, what TV is good or bad, but who lives or dies or how much energy is required for broadcasting are objective facts about which well-informed people should not disagree. In the same way, it is not plausible that we can explain the objective fact of consciousness by appeal to subjective interpretations of systems as computers.

Computationalist Responses

A number of responses from computationalists have attempted to resolve the problem, some more successfully than others, in my view. I will mention some of those I find interesting. The first three of these come from conversation with a computationalist friend of mine (Mike Smith over at SelfAwarePatterns). To be honest, I don't think they represent tenable philosophical answers to the problem but they probably are representative of attitudes that may be common among the lay community of computationalists and perhaps explain why there are still so many computationalists around despite the DwP argument.

Accepting the pixies

Mike at one point suggested that the best thing to do might be to accept the existence of the pixies. After all, there is little reason to suspect that our intuition in such matters is likely to be very useful. If it seems absurd, so what? Nature is under no obligation to be sensible. It certainly doesn't seem to be very sensible when it comes to fields such as quantum mechanics, so why should this be any different?

Bishop for his part is happy to let that speak for itself. If you are really willing to defend a view so bizarre, good luck to you! But most computationalists (and I with them) do not think that this is acceptable. If Bishop's "Dancing With Pixies" (DwP) argument is correct, not only are we surrounded by pixies, but the vast majority of conscious experience is realised in pixies, and so we ourselves are almost certainly pixies. Furthermore, though there must be a real world, we can say almost nothing about it. There is no reason to believe it bears any resemblance at all to our apparent environment, and indeed it may be something as trivial as a pair of particles separating from each other forever. Any system which has identifiable non-repeating states is a candidate.

Though we can't absolutely rule this out as a possibility, it is my view so absurd as to overrule any possible reasons for clinging to computationalism.

Rejecting the pixies

Mike was also happy to suggest that it might be best to reject the pixies out of hand, on the basis that the interpretation of a natural system as implementing a particular algorithm is too absurd. On this view, it is really the interpretation that is doing all the work and so it is not too worrying to suppose that we can force such interpretations if we wish -- the act of forcing the unbelievably complex interpretation would be what realises the computation and brings the pixies to life.

I don't think this is a satisfactory answer because the absurdity of the interpretation shouldn't matter if we never have to instantiate it for the pixies to exist, and according to the logic of the DwP argument they should exist even without such an interpretation. Actually building the interpretation itself is just to build a very complex input/output apparatus, but the computation should be taking place regardless.

If you want to say otherwise, then you should be able to draw a sharp distinction between what a rock is doing and what a computer without input/output is doing. The point of the DwP argument is that no such sharp distinctions are possible. There are only degrees of absurdity/complexity versus naturalness/simplicity in our interpretations.

Without an objective way to quantify or measure absurdity of interpretation, and without a natural threshold to mark the border between actual computations and Bishop's absurd phantom computations, it seems we're in trouble. Again, if we take the existence of conscious minds to be objective fact, and if we rule out the idea that a given conscious mind (one like yours or mine) could half-exist, then it's hard to see how something like a computation which exists only to a certain degree (according to the naturalness of an interpretation or the usefulness of regarding it as a computation) can account for the absolute existence of a mind.

The subjective existence of mentality

Another approach mooted by Mike was to adopt the view that since a conscious mind only really exists from its own point of view, it is wrong to assume there is an objective fact of the matter about whether a conscious mind exists in a given physical system. If the existence of a mind were subjective, then it might not be such an issue that the existence of a computation is observer-relative.

I don't think many computationalists will be satisfied with this approach. David Chalmers (perhaps echoing Descartes and "I think therefore I am") likes to point out that the one thing any of us can really be confident in is the fact of our own conscious experience. For this reason, it is not plausible to suggest that we don't really objectively exist or that our consciousness is an illusion. Conversely, if all it is for a mind to exist is for it to exist from its own perspective, then we must accept the first-class existence of fictional minds such as those of Han Solo or Mickey Mouse, something few computationalists would be willing to do.

But perhaps we should interpret "subjective" in this sense to mean only private. There is only one perspective that can ultimately observe a conscious mind to exist, and that is that conscious mind itself. But there is still a (hidden, presumably) objective fact of the matter that the mind exists, even if this fact is not public. But now we're back where we started, with a private objective fact depending for its existence on a public subjective interpretation, an impossible scenario which simply doesn't work.

Douglas Hofstadter - A posteriori cheating

Moving on to what other academics have said on these issues, Douglas Hofstadter (a man I admire quite a bit) has suggested [7] that the kind of approach Putnam takes to mapping FSAs onto a physical system is cheating because it requires a priori knowledge of the evolution of the algorithm, something which in fact can only be known a posteriori after running the algorithm. This is not a real mapping of physical states to logical states, according to Hofstadter. A real mapping would be one we could produce a priori as we do for computers. without having to run the algorithm. This points to a potential difference between Putnam's mappings and the kind of mapping we naturally adopt for computers, a difference that could account for consciousness, perhaps.

Or at least that's what I feel Hofstadter is trying to say. I don't think Bishop interprets him quite right because Bishop's response is to argue that there is nothing stopping us from running an algorithm twice on the same input. The first time, we don't know what the algorithm will do and so we might suppose that a particular algorithm produces consciousness. The second time we run it, we do know exactly what it will do, but of course it must still be conscious -- the mere fact that we know what it will do cannot change this, surely. If knowing what it will do in advance doesn't rob the computer simulation of consciousness, why should it rob Searle's wall of the same?

But of course, running the algorithm twice is only admitting Hofstadter's problem. We can determine mappings for computers without having to do that, but not for walls or other natural objects. On the other hand, it would seem that the logic of the DwP only depends on the logical necessity of the existence of such a mapping, so whether we are in a position to tell what it is in advance may not be important. Even so, if this epistemological difference could be made precise, it might constitute an objective difference in the two kinds of mapping, and that's all we need as grounds for distinguishing "real" computations from "pixie" computations.

Even though this feels to me like it might be the start of a legitimate objection, perhaps Bishop and I are giving Hofstadter too much credit, as his argument is really not very clear. Indeed, the particular passage this criticism comes from doesn't give much in the way of actual argument at all. What we find instead is rhetoric, little more than a list of incredulous comparisons to reading works of literature or classical music in the random structure of natural objects around us, intended to ridicule the very idea of interpreting natural systems as instantiating arbitrary computations. (In fairness to Hofstadter, he is responding to Searle's vague rhetorical argument rather than Putnam's more precise formal argument, so this kind of response is not unreasonable). In this respect, I think Hofstadter's criticism misses the mark because no objective metaphysical fact hinges on whether "the holes in pieces of Swiss cheese code for the entire history of the United States". Yes, a mapping does exist to make this true, and yes, this mapping is absurd and arbitrary, but nobody thinks the instantiating of a representation of the history of the United States objectively brings something qualitatively new (such as a mind) into existence, so this is not a fair comparison. We are not troubled by our inability to say definitively when and where an account of the history of the United States is instantiated, but we ought to be troubled by our inability to say definitively when and where conscious algorithms are instantiated.

However Hofstadter does point the way to a more substantive objection when he says "minds worth calling minds exist only where sophisticated representational systems exist, and no describable mapping that remains constant in time will reveal a self-updating representational system in a car engine or a liver".

This idea of a stable mapping that remains constant in time is perhaps a plausible angle to investigate. I think what he's getting at here is that we should not admit mappings that are so arbitrary as to only map particular runs of an algorithm on particular inputs to a particular period in the evolution of a physical system. Legitimate mappings should be general and extend to any possible inputs and for an indefinite period of time in the evolution of a physical system (and not just a bracketed temporal window). This kind of objection is taken up in more detail by David Chalmers.

David Chalmers - counterfactuals and CSAs

David Chalmers wrote a very detailed and thoughtful response to Putnam in his paper Does a Rock Implement Every Finite-State Automaton? [8]. The paper covers a lot of ground and is highly recommended, but for our purposes the most interesting idea is that a Putnam style mapping fails to be a genuine computation for one reason or another.

One important reason is that an actual digital computer is adaptive and dynamic. It is capable of performing computations not on only on the input it actually receives, but it would also have performed sensible computations on counter-factual input that it didn't receive. Putnam's mappings, on the other hand, are brittle. They only cater for one particular series of inputs, one particular run of a program, and have undefined behaviour in other circumstances.

Another side of this coin is that Putnam's mappings are brittle with respect to the physical circumstances of the system. Putnam mappings can only be made retrospectively on evolutions of systems that have already happened and been recorded. Counterfactually, had the system's state diverged even a little from that specified in Putnam's mapping, then Putnam's mapping would fall apart. The mappings we have for actual computers are robust in that we can say things like "had the voltage in this register been such and such, then that would have corresponded to such and such a logical state".

In other words the relationship between physical states and logical states for computers is lawful and robust, in that they cater for a very wide number of logical and physical circumstances (as long as the computer remains intact, at least) but Putnam mappings are brittle and depend entirely on happenstance. This being the case, it would seem we have an objective difference between the two and so perhaps Putnam's phantom computations should not be regarded as genuine.

To answer this objection, Bishop cleverly makes use of a version of Chalmers' own Fading Qualia Argument (FQA). The original FQA was deployed to illustrate the absurdity of biological chauvinism by postulating that brain cells might be replaced iteratively by electronic functional analogues. If we assume that electronics cannot be conscious, we start with a conscious being and we end with an unconscious being that behaves in precisely the same way. It would seem that during this process, qualia somehow fade out gradually, so that at the halfway point the being is only half conscious, only half-perceiving sensory qualia and so on, while being unaware that anything untoward is happening. Chalmers (correctly, in my view) takes this to be absurd, and concludes that consciousness must be a functional phenomenon.

Bishop's version of the FQA considers a different transition, that of a robot or simulated entity transitioning from full dynamic implementation of an algorithm to one scripted to proceed deterministically and inevitably from one state to the next (just like a Putnam FSA in a rock). At each step in the transition, we simply replace a single conditional branch with a hardcoded state change, so that the transition is perfectly smooth. Again, the behaviour of the algorithm run on the same input is unchanged, and again we are apparently left with the absurd conclusion that the half-way point has a being that is half conscious, half perceiving qualia and so on.

For what it's worth, I don't think this version of the FQA is quite as absurd as Chalmers' original. I can imagine the half way point might consist of an entity that flickers between consciousness and zombiehood as it alternates between executing conditional and hardcoded state transitions. It's still strange, but not quite as weird as imagining what it would be like to half-perceive qualia while being fully functionally aware of them.

Chalmers (and also Chrisley [10]) points out that this need not be so surprising. Deleting these conditionals will inevitably correspond to a physical change in the system, and this difference might make the difference in considering whether consciousness is brought forth.

To this point, Bishop answers that we can instead imagine leaving all the conditional statements in place and instead simply delete (or replace with null operations) the code that will not execute. Now, Bishop argues, the code that actually executes is the same and so we can no longer appeal to a physical change in the system to explain why consciousness might fade.

I'm not sure this argument succeeds for a couple of reasons. Firstly, I don't think it is possible to delete code without having a physical difference, and as long as there is a physical difference in the system then it is possible to point to that as accounting for the difference between conscious and unconscious systems. Secondly, it's not clear to me that a system with conditionals but with deleted code really corresponds very well to a Putnam style FSA which has no conditionals at all. On the other hand, however, it's hard to credit that the physical presence of code never executed is crucial for consciousness.

All in all, I'm left with the impression that this whole line of argument is inconclusive. Despite the loose ends, my sympathy actually lies with Bishop but I think reasonable people might with some justification disagree on whether he has proven his case.

However, Chalmers also presents a related argument which to the best of my knowledge Bishop has not yet addressed. This is the argument that the FSA is just one model of a computation and not necessarily the best one for our purposes. That is, to implement an FSA may not really be all that is required to perform a genuine computation (despite assumptions to the contrary apparently dating back to Turing). To justify this, Chalmers appeals to that difference I noted earlier between the abstract FSA and how computer programs are implemented in practice -- that is that the state of an actual computer has fine-grained content, typically divided into variables and stack pointers and the like, and that there exist meaningful, lawful and causal relations between these sub-states. This is not the case for an FSA where state is associated only with a label and transitions to other states. Chalmers points out that it is possible to build an abstract model of computation that respects not only state transitions but also the content of particular states, and labels this model the Combinatorial State Automaton (CSA). In contrast with FSAs, it is not clear at this time that it is possible to build a mapping between natural physical systems and arbitrary CSAs. If it is not possible, then the computationalist can claim that what it is to be a conscious entity is to implement the right kind of CSA, and that brains and the right kind of AI might do so while rocks would not.

Chrisley [9] has made similar points, arguing that Putnam's account of causality is too weak, and that unlike a real computer, the state transitions in his projected FSAs are not strongly causal, that is, the physical system being in logical state A does not really cause its transition into logical state B. Again, the gist seems to be that simply implementing (or being interpretable as) an FSA is not enough to be a genuine computation.

I think these kinds of argument are plausible, but, as noted, it does depend on the impossibility of mapping natural systems to arbitrary CSAs or better accounting for causality in such mappings. In my quote from Searle, he doesn't assume an FSA mapping, and he doesn't assume that counterfactuals are ignored. He only assumes (albeit without justification) that there is some mapping between the state of the system and the operation of an algorithm. It's entirely possible that he is right, no matter what model of computation we adopt or how much stock we place in the mapping of counterfactuals. Searle and his followers take it for granted that his assumption is correct and most computationalists would seem to assume that it is false. Again, my sympathies on this one probably lie with Searle, but it's far from a knock-down argument against computationalism.

Conclusion

On balance, I think Putnam, Searle and Bishop have a point. I think there is a problem with computationalism as usually conceived. And yet I still call myself a computationalist! There is a way to accept all these arguments and reconcile them with computationalism without (quite!) accepting the existence of pixies in rocks and walls. I'll explain that on my next post, whenever I get time to write it!

[That post is now here]

References

Putnam, Hilary (1987). Representation and Reality. MIT Press.
Searle, John R. (1992). The Rediscovery of the Mind. MIT Press.
Bishop, John Mark (2003). Dancing with pixies: Strong artificial intelligence and panpsychism. In John M. Preston & Michael A. Bishop (eds.), Views Into the Chinese Room: New Essays on Searle and Artificial Intelligence. Oxford University Press
Bishop, John Mark (2002). Counterfactuals cannot count: A rejoinder to David Chalmers. Consciousness and Cognition 11 (4):642-52.
Bishop, John Mark (2009). Why computers can't feel pain. Minds and Machines 19 (4):507-516.
Bishop, John Mark (2009). A Cognitive Computation Fallacy? Cognition, Computations and Panpsychism. Cognitive Computation 1 (3): 221-33
Hofstadter, D.R. & Dennett, D.C. (eds.) (1981). The Mind's I: Fantasies and Reflections on Self and Soul. New York, Basic Books (Chapter 22).
Chalmers, David J. (1996). Does a rock implement every finite-state automaton? Synthese 108 (3):309-33.
Chrisley R. Why everything doesn’t realize every computation. Minds Mach. 1995; 4:403–20.
Chrisley R. Counterfactual computational vehicles of consciousness. Toward a science of consciousness. April 4–8 2006. Tucson Convention Center, Tucson, AZ, USA; 2006.

143 comments:

SelfAwarePatterns24 February 2016 at 16:38
Hi DM,
I think this is an excellent explanation of Putnam’s, Bishop’s, and Chalmers’s views. A far easier read than most of the source material.

I'm honored that you highlighted my views, but I think presenting them as a series of isolated and exclusive responses isn't quite fair. (Although to be honest, I would have struggled to articulate my overall position until the later stages of our discussion, which allowed me to sharpen my thinking on it.)

Just to clarify:
1. From what I can see, the pixies can only come into existence through the existence of an enormously complex description. That description is far more complex than the relatively trivial ones we apply to engineered computing platforms, such as a binary 1=true and 0=false, etc. The description (we called it an “interpretation” in our discussion) is so complex that it would, in all practicality, require its own computing platform.

In essence, the state machine we're interpreting to exist in the rock isn't implementing the pixies (or Wordstar or whatever). It’s the states of the rock plus the implementation of the description that actually implements the pixies. So, the pixies don't come into existence until the description is developed. That's why I said in our discussion that the description amounts to an implementation of an AI that we then blame on the rock, wall, or whatever physical object is under discussion.

2. I’m puzzled by the type coupling you insist must exist between computationalism and the objective existence of a mind. I fully understand many people might find disturbing the idea of that existence being open to same level of interpretation that a putative implementation of Wordstar, but it seems like an inherent and inescapable aspect of the computational theory of mind, indeed of any physical / functionalist understanding of the mind that obeys the normal laws of physics.

(Of course, if the mind obeys its own special laws of physics, per Penrose et al, then you get out of it. But there is currently no evidence for it.)

3. Given 1 and 2, I have no trouble accepting the existence of the pixies, although it’s probably more accurate to say I’m accepting their potential existence. I don’t consider what I’m accepting to be particularly meaningful or troubling.

I do agree that, once we accept the interpretations / descriptions as meaningful, the challenges that Chalmers and others make, fail.

All that said, it’s worth noting again that my computationalism is pragmatic and completely dependent on its usefulness in interpreting neuroscience and psychology. Bizarre unfalsifiable consequences of that theory are interesting, but I’m not inclined to abandon it until it loses its pragmatic usefulness, at which point I would drop it like a hot rock.
ReplyDelete
Replies
Disagreeable Me25 February 2016 at 20:27
Hi Mike,

> The possibility of every computer program exists, but we have both have made a living because it is necessary to actually bring them into existence.

What we do for a living is rearrange electrons so that a particular desired computation is realised according to a pre-existing interpretation. The idea with pixies is not that the possibility of that computation exists (we're not talking about Platonic existence) but that that computation concretely exists according to a possible (not necessarily realised) interpretation.

There is nothing in physics to say that the value in a bit register is a one or a zero. We only agree by convention that it is a one if the voltage is such and such and a zero otherwise. The same goes for everything that happens in a computation. It's all only by convention that it can be viewed as a computation at all. It is useful to regard it as a convention because that convention is pretty sensible and straightforward. But one can imagine conventions that are slightly less straightforward. It's a one if the voltage is such and such as long as today is not Wednesday. And so on. With a little effort, it is perfectly possible to make use of systems with more and more awkward conventions, even if all you're doing is manually inspecting the internal state and not relying on convenient input/output. Eventually you get to conventions as absurd as Putnam's mappings. And what point in this continuum does consciousness disappear and why?

The fact that we are interpreting a system as a computation can have no bearing on whether it is conscious. My consciousness does not depend on you believing that I am conscious. The same would go for a pixie in a rock. The interpretation doesn't have to be realised in order for the pixie to exist, at least if the argument is sound. Chalmers in particular has offered good reasons to question this, and if you want to defend computationalism I would urge to to go down that road rather the one you have been following which seems to me to be a dead end.

> I would say that you are a definite system, a pattern that holds a sub-pattern

I think this makes sense from a third person perspective, a consideration of me as a physical object. But I don't think it works when asking if this physical object hosts a mind or not. Again, from my perspective (and remember, the question is whether I even have a perspective), my existence (according to some notion of existence anyway) is not really subject to doubt.

If you really want to be blasé about whether people really have minds or not then it's very hard to say why you should care about them. But if you do think people have minds, and you don't think rocks have minds in the same way, then you need to be able to say why people have minds and rocks do not. Pragmatic considerations (you can't interact with pixies) don't really do enough to combat the absurdity as I argue in the post.

> My understanding was that they were arguing that the interpretations weren’t logically valid.

To say something isn't logically valid is usually to say that there's some syllogism where the premises don't entail the conclusion. "All men have two legs; Donald the duck has two legs; Donald the duck is a man". I don't see anything like that. Rather, what Chalmers in particular is saying is to concede the point that you can find a mapping to show that rocks are implementing particular runs of FSAs but to argue for various reasons that this is insufficient to show that they are really performing computations.

> The problem with these kinds of metaphysical assertions is, how do we ever determine whether they are true or false?

We can't! This is why this is philosophy rather than science.
ReplyDelete
Replies
Richard Wein28 February 2016 at 15:27
Hi DM. Thanks for drawing my attention to this post.

Searle's argument seems to rely on his belief that we can put any computational interpretation we like on any state of a system. As you know, I've explained at some length why this isn't so, in the appendices of my response to Searle's argument from syntax and semantics (https://barbedsextant.wordpress.com/2015/10/14/searles-argument-from-syntax-and-semantics/).

Your own argument depends on your claim that "any computation can be described as an FSA". It's not clear what you mean by that. The question we should be considering is whether every algorithm has an equivalent FSA. The computationalist claim which is being challenged is that any system that implements the right algorithm will be conscious. In challenging that claim, it does no good to establish some result about all FSAs unless you can show that all algorithms (or at least all the algorithms that might be claimed to be sufficient for consciousness) are equivalent to FSAs. And by "equivalent" I mean that they are the same algorithm in every way that matters, not that they produce the same results. You seem to focus on whether two systems produce the same output, or go through the same states. That's not the right question.

Let me illustrate by means of an example. Let device A be a simple computer with a single ordered memory space, which we'll call RAM. The processor has no memory, and any data which would normally be stored in a register (such as the program counter) is stored in RAM instead. RAM consists of N binary flip-flops, so we can characterise the state of RAM (and therefore the state of the computer) by a sequence of N binary digits, which we can also interpret as a number (m) from 0 to (2^N)-1. There are potentially 2^N states that the computer can be in. (For a particular program the number may be less, since some states may be unreachable.) We will also assume that the computer has no further inputs once the RAM has been set to an initial state and execution has begun. Let's say the computer has been programmed to play both sides in a game of chess, i.e. to play chess against itself. If the computer is run repeatedly, starting each time from the same initial state, it will repeatedly play exactly the same game (or sequence of games). If we wanted it to play different games, we could include a random number seed in the program, and vary that seed with each run, but that would mean that the system was starting in a slightly different state each time. Let A be initialised with one such program. Now I ask you, how can we construct an FSA that would be equivalent to this program?

DM: An FSA model would just list all possible states by their labels and what successor states and output they produce. This ought to be enough to capture anything a computer could ever do.

It seems you think the following type of FSA would do. We could have an FSA with 2^N states, one for each potential state of device A. And for each of these states we could specify the state that comes next during program execution. According to Putnam, any FSA can be realized by any open physical system. But I'm going to describe a device (B) that is uncontroversially a realization of the FSA, and which can be conveniently compared with A. To construct B we take A and replace its processor with a simple integrated circuit and a read-only-memory holding a list with 2^N entries. Each entry has N digits, corresponding to a state of A. Entry number m contains A's state at step m. So the first entry in the table is A's initial state. The second entry is A's state after executing one instruction. And so on. At each step, the integrated circuit uses the current state number (m) as an index into the list, reads off the next state, and puts the RAM into that state.

ReplyDelete
Replies
Richard Wein28 February 2016 at 15:29
<...continued>

B's RAM goes through exactly the same series of states as A's RAM (at the level of binary flip-flops). But are they realizing the same algorithm? No, because they're using a different algorithm for deciding which state comes next. B's integrated circuit has to read the entirety of RAM at each step. A's processor just has to read the program counter, the corresponding instruction, and maybe a few variables. Both devices have a series of chess positions represented in RAM over the course of a run, collectively representing the progress of a chess game. But A is doing the work of deciding what moves to make, while B is just reading off a pre-calculated list of chess positions (plus other stuff). We would need to run A (or something like it) to produce B's look-up list.

More generally, looking something up in a table is not the same algorithm as calculating it, even if both get you to the same state. But sometimes we may not care, because we're only interested in a higher level of abstraction. Suppose you write a computer program that calls a library function to calculate sine values. After running the program for a while you install a new function library which uses a different algorithm for calculating sine (perhaps even looking it up in a table), but which still gives the same results. Are you running a different program now? A different algorithm? At a low enough level of abstraction, you are. But for most purposes we wouldn't care, and we would normally say we're still running the same program, and probably that we're still using the same algorithm. We need to consider which level of abstraction is relevant in a particular context.

DM: As such, if any algorithm can pass the Turing Test and do all the information processing tasks a human brain can do, then so could an FSA.

An FSA (like Ned Block's "Block Head") could in principle pass any Turing test. In practice it wouldn't be able to pass a strong enough test, because of practical limits on the size of the look-up table. In any case, it wouldn't be using the same algorithm as an AI program (or full-brain simulation), so the fact that the algorithm of Block Head is insufficient for consciousness does not refute the claim that those other algorithms are insufficient for consciousness.
ReplyDelete
Replies
Richard Wein28 February 2016 at 15:37
Oops. Those last 3 words should have been "sufficient for consciousness".
ReplyDelete
Replies
Richard Wein28 February 2016 at 18:45
P.S. I may have been misusing the term FSA. I've been using it to refer to an algorithm or model of a computation which employs a look-up table (or equivalent) to decide which state to move to. But now I think the term usually refers to a model that omits any information about how the system decides which state to move to. In that case, it may be correct to say that any computation can be described by an FSA. But such a description may omit crucial information about the computation, as in my example. The computations of devices A and B can both be described by the same FSA, but they are very different computations. And a system that "realizes the FSA" need only realize one of those computations. Therefore we could accept Putnam's conclusion that any open system realizes every possible FSA, without accepting that it realizes every possible computation.
ReplyDelete
Replies
Richard Wein29 February 2016 at 05:07
I've just realized I made a mistake in my example. I wrote (about the look-up table of device B):

> Entry number m contains A's state at step m. So the first entry in the table is A's initial state. The second entry is A's state after executing one instruction. And so on. At each step, the integrated circuit uses the current state number (m) as an index into the list, reads off the next state, and puts the RAM into that state.

The last sentence was correct, but the previous sentences are inconsistent with it. I should have written:

> Think of the list as a one-dimensional array, L. For i=0 to 2^(N-1), L[i] contains the state that A would go into next if its current state is i. B's RAM is initially in the same state as A's. Thereafter, at each step, the integrated circuit reads the current state of RAM, m, looks up L[m], and copies that number into RAM.

Note that in this example there's no need to store any labels (as you suggested), because the contents of RAM can themselves be treated as an ID number, which can be used directly as an index into the list. However, as an alternative, each entry in the list could be a pair: (current state number, next state number). In that case the entries could be stored in any order, and the integrated circuit would have to search the list to find the entry with the current state number.

I hope that's clearer now.
ReplyDelete
Replies
Coel29 February 2016 at 19:35
Hi Mark,
This was a good read, thanks for writing it. This is a topic that I've never really thought about much. I look forward to the followup post.

I have a hard time accepting the idea that there is nothing objective about what a computation is, and thus that a rock or a wall is doing every possible computation -- obviously I need to think about this.

I'm also finding myself not assenting with full-blown computationalism. I conceive of consciousness as some sort of process, but I can also conceive of an equivalent computation, say as emulated by a Turing machine, having a sufficiently different process as to not be conscious.

Unfortunately I can't say much more as I don't feel that I have any convincing account of consciousness to offer in place of computationalism. (And the above may just be defects of my intuition.) I've always considered it way too hard a topic!
Cheers, Coel.
ReplyDelete
Replies
Coel5 March 2016 at 21:23
Hi DM,
I'm trying to think through computationalism and whether I agree with it. Assuming that your previous blog posts make the case for it, can you point me at what you think are the best arguments for computationalism?
Cheers, Coel.
ReplyDelete
Replies
Jochen19 October 2016 at 13:00
Hi DM,
I'm grossly late to the party, but I wanted to say that this is one of the best expositions of the problems of computationalism that I have yet come across---well written and accessible, while still sufficiently detailed so as not to weaken the arguments.

Regarding the counter-arguments, however, I think there's a strategy that can be used so as to make the arguments immune to (at least most of) them. The basic idea of Chalmers' counterfactuals, semantic or syntactic mappings, and the like, is to attack the 'simple mapping account' in one way or another---i.e. to rebuke the idea that all it takes for a physical process to instantiate (or implement) a computation is that there must exist a mapping between its states and the logical states of the computation.

But the argument against computationalism can in fact be formulated without recourse to any account of implementation in particular: basically, the idea is to show that if we accept that a system S implements computation C1, then we ought to accept, on the same grounds (whatever they be), that it implements computation C2. This sidesteps the issue of what, exactly, it means for a system to implement a computation, and leaves us with the conclusion that either the system does not implement any computation at all, or we can equally well associate (at least) two different computations to it.

The argument runs roughly as follows. Let's take a somewhat more simple process than instantiating a mind, e.g. the addition of single-digit binary numbers with carry. That's our computation C1. The formal structure of that is that we have two (single-digit) binary variables, x1 and x2, and two (likewise binary single-digit) outputs, y1 and y2.

Now, the outputs are generated from the inputs as follows: x1 XOR x2 = y1, and x1 AND x2 = y2. Hence, y1 is the binary sum of both bits, and whenever y2 = 1, we know that a carry has occurred.

Now we implement this computation with a physical system S. This system needs two inputs I1 and I2, and two outputs O1 and O2. How we instantiate them is immaterial; we could simply have a system that we can initialize in one of four distinguishable states, then 'switch on' its evolution, which then eventually settles into one of three distinguishable 'output' states (since only the cases y1y2 = 00, 01, and 10 can occur as a result of the computation). We may also imagine a device such that we can apply either high or low voltage to two wires designated I1 and I2, and receive either high (h) or low (l) voltage at wires designated O1 and O2.

Let's go with the latter implementation for concreteness. Then, we wire things up such that whenever a high voltage is present at either I1 or I2, O1 is at h, while if both inputs are either h or both are l, O1 is at l. O2 is at h if and only if both I1 and I2 are at h.

Now, clearly, if we choose high voltage to mean the binary digit 1, and low voltage to consequently mean 0, then I can use this system to perform addition. Moreover, if we chain many such devices, we can perform addition of arbitrary binary numbers---thus including also nontrivial computations where we don't necessarily know the answer beforehand.
ReplyDelete
Replies
Disagreeable Me24 October 2016 at 10:44
This comment has been removed by the author.
ReplyDelete
Replies
Jochen24 October 2016 at 15:19
Hi DM,

I think we're stuck. To me, the issue is perfectly clear: if I have two devices, and I enter the same commands into both, and they react differently, then they don't instantiate the same computation. It might be the case that two devices computing the same function count as, in some sense, different computations, but it is surely the case that two devices computing different functions are different computations.

(Actually, the more I think about it, the less I believe normal computationalism would even accommodate the difference between two different instantiations of the same function: an argument for computationalism is that you could replace neurons by a silicon chip showing the same behavior as the original neuron; on your conception, it would matter how this behavior is arrived at, i.e. whether it does quicksort or bubblesort, which leads to the whole fading qualia issue---i.e. functionalism is usually thought to entail a certain sort of modularity that seems absent in your version.)

And, not that it matters much, but the script for the Quantum Computation class I'm TA-ing exactly characterized computations as the functions from n- to m-bit binary strings; and thus, as the two functions performed by the adder and schmadder differ, they're different computations.

Perhaps somewhat more weightily, Kleene defines algorithms in terms of general recursion (see the wikipedia article), seemingly saying that different recursive functions are different algorithms. In that case, too, addition and schmaddition differ; so I can at least say that using Kleene's definition, both computations differ.

In fact, I haven't found the idea that two devices could produce different output on the same inputs, and yet still implement the same computation, anywhere---but that, of course, is just argument to authority.

What's more persuasive to me is that the sum of two inputs simply is not the sum+1 of two inputs: they're different mathematical objects, different Platonic realities, if you will. Likewise, a negative is different from its positive, and my left hand different from my right. Certainly, I could change the rest of the universe in such a way as to eliminate the difference---but I would have to change it! Usually, one does not consider identity to depend on the state of the rest of the universe.

So I don't think your arguments to identify the two are really conclusive; certainly, it's at least possible for two minds to differ, even if they are related by inversion (since after all, the two functions, related by inversion, do differ, and thus, some things related by inversion differ).

You're saying that because they instantiate the same structure, there's no difference between my left and right hands; I'm saying, because they clearly differ, structure apparently doesn't suffice to capture all of their properties. Perhaps we just have to leave it at that.
ReplyDelete
Replies
Disagreeable Me28 October 2016 at 20:40
Hi Jochen,

> Now, call the indistinguishable objects of your putative five-element set e

That is not how I conceive of indistinguishable objects.

What I mean is more like {A,B,C,D,E}. Their labels are not properties of the objects, they are just the names by which we refer to them. I could represent the same structure as {F,G,H,I,J}. What it means for them to be indistinguishable is that they are truly interchangeable. I can switch the labels of any two of these objects and I will have the same structure. I can swap any two of these objects and I will have the same structure.

But the set of numbers {1,2,3} is not the same structure as the set of numbers {4,5,6} because numbers have properties which distinguish them from each other.

> If by semantics, you mean their identity as greatest and least elements of B2, then yes

OK, well for the structure I care about, this is not important. We are just looking at two different levels of analysis, as I said.

> The crucial point is that AND is a function defined as taking elements from the two-element Boolean algebra;

I accept that. But what I'm saying is that I don't care if a computational system implements AND or OR. I'm not the one who insists on interpreting it as either AND or OR, you are, so it is perfectly reasonable for me to disregard things that are crucial to the definition of AND. If I interpret something as implementing AND, it is only because this is a convenient way to describe the structure I care about. But I don't care that it is actually AND. I only care that it is a function of the form f, which is defined not in terms of 0 and 1 but A and B, where A and B have no intrinsic properties but for the roles they play in function f. A is not the minimum value and B is not the maximum value.

>then you're not talking about AND, but about some function defined on that set of symbols.

Yes, yes, a thousand times yes! By jove, I think you've got it! This is what I have been saying. I'm not talking about AND. I'm talking about function f, which is the function defined by the truth table of AND when interpreted as a function of abstract, meaningless, interchangeable symbols.

> I agree that for x^2, it's immaterial if I call it x^2, y^2, a^2, or whatever else. But that's not what you're doing.

That is exactly what I'm doing. It is not what you are doing. We are doing different things. We are conceiving of the computation differently. For me, it is a manipulation of meaningless binary (in the sense that there are two of them) symbols. For you, it is a manipulation of binary digits.

> That doesn't follow: just because there's no unique right way to do it, doesn't mean there are no wrong ways.

OK, but you have been insisting on your interpretation over mine. That certainly looks like you think there is a best way to look at it. So if you instead think there are many ways to interpret, and that yours just happens to be a correct way, while mine happens to be an incorrect way, then you had better justify that.
ReplyDelete
Replies
Jochen2 November 2016 at 13:43
Hi DM,

first of all, I'm happy to see that we seem to finally be making some progress---it seems you now accept that AND and OR are bona fide distinct structures, even though you're still arguing that it doesn't matter if we forget about what differentiates them, where earlier on, you seemed to argue that they just flat out have the same structure. So let's see if we can build on that.

>It seems really trivial to me but I'd be interested to see what you think.

So as I said earlier, you need to take into account the full structure of a given computation in order to implement it physically. This, for functions not symmetric in their arguments---i.e. where f(x,y)=/=f(y,x)---includes the identity of the inputs: whether a implies b, or b implies a, are simply different questions. Thus, forgetting about which input is which yields, again, a different computation---if you wanted to know whether a implies b, and you get an answer that 'either a implies b or b implies a', you haven't managed to compute an answer to your question, because you haven't instantiated the right computation.

>Neither is there an obvious point at which to start.

There is, though. There is always a minimum amount of structure you need to take into account for each computation---so, if you have an abstract five-state FSA, you need minimally five distinguishable physical states to implement it. You can do this with a system that has a great many more different physical states, but those differences then simply don't map to distinctions in the logical space of the computation---so effectively, if you have six physical states, then at least two map to the same logical state, and so on.

This harkens back to the argument I made regarding whether it's important to have the representation of some computation's output be represented as white text on black, or the other way around---it is important if, and only if, that distinction maps to a distinction in the logical space, i.e. if it 'means' something different for the output to be in white on black, or black on white.

However, what you can't do is represent the five-state FSA with four physical states---then, you necessarily loose structure of the computation, and effectively instantiate a different one, that can be generated from the original by identifying two states of the five-state FSA. This is what you're proposing to do, if you say that we can forget about the difference between 1 and 0.

Now, it may be, in some special case, that the four-state FSA implements the same function as the five-state one; and thus, that functional properties remain invariant under this change. In this case, it might likewise be that performing this change on a computation instantiating a conscious mind does not yield any appreciable difference. But certainly, in the general case, this is false, and forgetting about structure yields a computation that differs from the one you set out to instantiate---and in particular, may no longer correspond to the same mind, or any mind at all.

So there really isn't a possibility to go 'pointlessly fine-grained': there exists a sort of minimum resolution at which you can ensure that the system implements the computation you care about.
ReplyDelete
Replies
Wyrd Smythe17 April 2019 at 23:49
** "If you want to say otherwise, then you should be able to draw a sharp distinction between what a rock is doing and what a computer without input/output is doing."

How about this: In the rock, all computations are occurring, and no given computation is preferred by the system itself. In the computer, all computations are also occurring (they're also all occurring in the computer's case!), but one is very clearly preferred by the system itself.

If you consider all the computations the computer is supposedly performing, there is only one clear winner. In the case of the rock, nothing other than interpretation selects a given computation.

This suggests the rock, if anything, is generating a kind of "white noise" of computation whereas the computer has a very clear, strong signal that rises above its similar noise.

One might also look at the energy or complexity required to extract the computation. In the computer that involves looking or listening to it. In the rock, looking and listening don't do much.

Perhaps that last works as an argument against pixies in that conscious experience may depend on the (putative) conscious computation being easy to extract from the background. Brains, as with computers, have a very strong signal.

I also think the idea that, in a rock or wall, one state doesn't *cause* the next is a powerful argument. In a "real" computation, they are casually linked.

** "This idea of a stable mapping that remains constant in time is perhaps a plausible angle to investigate."

One thing that comes up in the "dust" idea (which is essentially the pixies idea) is that the series of states need not be temporally ordered. The mere existence of such states somewhere in time, in any order, should lead to a linear consciousness.

This is clear from the idea of imagining an FSA for consciousness and putting time delays between steps. This shouldn't change the (putative) experience from within the FSA.
ReplyDelete
Replies