The gardener's problem: from simulation to theory

One conceptual difficulty that sometimes afflicts discussions of the role of modeling techniques is the conflation of the computer program with the theory. Some authors have gone as far as claiming that "Theories can be stated as computer programs" (Simon, 1992, p. 152). In contrast, we consider it crucial to insist on the distinction and complementarity between the simulation system and the accompanying theoretical gloss. Computer simulations complement rather than replace verbal descriptions. A clear statement of this complementarity appeared in Palmer & Kimchi (1986), who argue against the notion that the computer program as such constitutes a psychological theory, and insist on the importance of the accompanying description:

a running simulation is only an IP [information processing] theory by virtue of the fact that it too can be described by a flow diagram plus mini-mapping theories of its components (p. 57).

Their major argument is that a computer program can be described at various levels of specification, and that it may be difficult, without a verbal account, to decide which levels of description are psychologically relevant. This is the problem of mapping hypothetical constructs in the model onto their psychological counterparts. There is also, however, a related but distinct difficulty, which we call the redescription problem. Modelers must specify the properties and characteristics underlying the model's functioning at a level of abstractness that permits useful and appropriate generalizations.

4.5.1. The mapping problem

The first point may seem obvious. A model is a metaphor, and a metaphor is illuminating only insofar as one clarifies the relevant features that the metaphorical object shares with the target system, or better, the relevant level(s) of analysis at which a correspondence may be established between the two systems. Yet, in practice, making explicit and understanding the relationship between a simulation model and the corresponding human process is far from trivial. A major cause of this difficulty is that both human cognitive processes and computer programs are complex objects that allow for a multiplicity of levels of description.

A well-known reference on the issue of description levels is the proposal by David Marr (1982), which identifies three levels of analysis of information processing tasks. The three levels correspond to the computational description of the system (the input-output mapping that the system realizes), its algorithmic description (the algorithm used to perform the mapping) and its hardware implementation. Marr's discussion makes it clear that all three levels may contribute to the understanding of the observed phenomena: some are explicable through hardware properties (afterimages), while others (the Necker cube) require consideration of both hardware properties and algorithmic description. Furthermore, the notion of algorithmic description masks the fact (known to everyone who has engaged in any sort of computer programming project) that an algorithm can be described at various grains, independently of the hardware specifications (cf. Palmer and Kimchi's notion of recursive decomposition).

Given the multiplicity of potential algorithmic descriptions, a simulation model at the algorithmic level could in principle be constructed to match the real function at many different levels, from the most abstract level of the input-output mapping (as happens, for instance, when a regression technique is used to derive a mathematical function) to the finest-grained level of elementary processes, with all intermediate possibilities. One intermediate case is Massaro's (1989a) Fuzzy Logical Model of Perception, which assumes three stages of perceptual processing (evaluation of perceptual features, integration, and decision) but restricts the simulation to an abstract mathematical description of the integration and decision operations. As for evaluating models, it seems obvious that a (hypothetical) simulation model in which the correspondence goes down to the most elementary level is better, in scope and power, than a model restricted to the most abstract level of mapping. Nevertheless, this does not mean that starting at the most detailed level is the best research strategy.

As Marr suggested, it may be easier to start from a broad abstract characterization of the function, and gradually focus the microscope.

These issues pertain not only to symbolic approaches to modeling, but also to the artificial neural networks framework. Willshaw (1995) describes a formal technique through which sets of symbolic and subsymbolic algorithms may be organized hierarchically in terms of their level of abstraction and implementation, and concludes that "symbolic and subsymbolic algorithms are not neatly divided into two distinct classes, with the one being at a 'higher' level than the other" (p. 16).

4.5.2. The redescription problem

The problem of redescription, that is, of extracting from simulation results and knowledge of the model's design a description of its functioning that allows useful generalizations, may appear more acute if one adopts the gardener's approach, though in no way would we argue that it is specific to that strategy. As we have repeatedly stated, any reasonably complex model may at some point produce unexpected behavior. Indeed, our recent results with TRACE illustrate one case in which the behavior of the system did not correspond to the description given by its designers. It is the job of the designers (or, for that matter, of any serious user of the model) to explore the details of the system's performance and the way it changes with variations of the stimulus set or parameter values, and to provide principled and accurate accounts of how and why the system behaves the way it does.

The gardener's approach may, with much know-how and perhaps a bit of luck, lead to an outcome that matches the empirical observations. Still, that is only the beginning of the hard work. Simulations are not explanations. If we do not understand the simulation process any more than we understand the real one, having a running simulation of a given function is of little help. To borrow a judicious analogy introduced by Forster (1994), this would be no more helpful than having a next-door neighbor capable of predicting, without explaining how, the outcome of any experiment that we might design and run. To some extent, the problem is similar to the use of statistical data-fitting techniques: a mathematical equation may provide a descriptively and predictively adequate account of some regularity, but not an explicit description of the process that produces the regularity itself, and this strongly restricts possible generalizations.
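The contrast between a fitted equation and a process description can be made concrete with a small sketch. It is only an illustration under stated assumptions: the "process" below (a hypothetical learner that closes a fixed share of the remaining gap to a ceiling of 1.0 on each trial), the function names and all numerical values are invented for the example and are not taken from any of the models discussed here. A straight line fitted to the first five trials describes those trials closely, yet it says nothing about the ceiling built into the process, so its extrapolation to trial 30 is badly off.

```python
def run_process(n_trials, rate=0.2):
    """Hypothetical process: each trial closes a fixed share of the gap to 1.0."""
    strength = 0.0
    outcomes = []
    for _ in range(n_trials):
        strength += rate * (1.0 - strength)
        outcomes.append(strength)
    return outcomes


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# "Observed" data: the first five trials of the hypothetical process.
trials = [1, 2, 3, 4, 5]
observed = run_process(5)

# Purely descriptive account: a line fitted to the observed range.
slope, intercept = fit_line(trials, observed)
in_range_error = max(
    abs(slope * t + intercept - y) for t, y in zip(trials, observed)
)

# Generalization beyond the observed range.
predicted_30 = slope * 30 + intercept   # what the fitted line predicts
actual_30 = run_process(30)[-1]         # what the process actually does

print(f"largest error within trials 1-5: {in_range_error:.3f}")
print(f"trial 30: line predicts {predicted_30:.2f}, process gives {actual_30:.2f}")
```

The fitted line is adequate within the range it was fitted to, but only the explicit update rule tells us that performance cannot exceed 1.0, which is the kind of generalization a process description affords.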

This issue has arisen in recent years in the context of the assessment of the distributed artificial neural networks framework, and the discussion has centered on Seidenberg and McClelland's (1989) model of visual word recognition and naming, and its more recent derivatives (Plaut & McClelland, 1993; Plaut et al., 1996).

Note that the issue is not whether any of these models is empirically adequate, but rather whether they provide or even lead to adequate theories of cognitive functions.

McCloskey (1991) argued that the theoretical claims formulated by Seidenberg and McClelland are vague and too general, and that the theoretical elaboration fails to describe how the network accomplishes its task, because of our limited understanding of complex connectionist networks. Yet, such a description of processing is certainly no less appropriate or informative than any other type of model currently available. As noted by Seidenberg (1993), "there is a rich theory here: it has only to be acknowledged" (p. 233). Granted, the description leaves many details unspecified, it may be incomplete, the mechanics of the model are based on new and unfamiliar notions, it is implausible in some respects, and many aspects of its performance could be further explored. However, similar remarks could be made about any other modeling effort.

McCloskey (1991) concluded by arguing that the design (or, for that matter, the growing) of connectionist networks should be viewed more as analogous to the use of animal models than as simulations of theories of human cognitive functions.

He further stated that, just like animal models, connectionist systems are objects of study in themselves, which may aid in developing theories of cognitive systems thanks to their similarity to the human system. We would simply add that one difference between animal models and computer models is that the availability of the former is limited and constrained by natural selection, whereas the latter are afforded through design principles and constrained by preexisting theoretical hypotheses. From that perspective, the study of artificial simulation systems may be the only way to examine the implications of a set of computational principles and assess their validity in accounting for human information processing.

Conclusions

We started our discussion by asking some simple questions: why use computer modeling in cognitive psychology? In what ways does the exercise of computer modeling techniques modify the nature of psychological research?

We consider that a defining characteristic of cognitive psychology is the search for a particular kind of scientific explanation, one that accounts for the behavioral characteristics of human performance in terms of the organization and mechanisms of mental functions. Thus, empirical regularities observed in performance are used to draw a number of conclusions regarding a hypothetical mental function, the architecture and components it requires and its probable mode of operation, so that the empirical observations can be reduced to logical and necessary consequences of the characteristics of that mental machinery.

In this framework, it seems to us that a useful heuristic (perhaps even the only heuristic) is to create models, that is, to produce theoretical elaborations that describe the relevant characteristics of the function, and to explore how well they account for the empirical observations. The use of computer modeling is a natural and obvious extension of this endeavor. Rather than limiting themselves to a verbal description of an imaginary mechanism, designers of computer models attempt to concretize the mechanism as a computer program.

Is this modeling enterprise worth the effort? We have analyzed several types of difficulties encountered in current empirical research, and have argued that computer modeling provides appropriate tools to confront these problems.

One basic problem stems from the great complexity of our object of study, that is, the graded and multidimensional nature of mental functions. Computer modeling provides a good way of dealing with this intrinsic complexity and with the dynamic nature of information processing systems. In contrast, verbal models can make only simple processing predictions, and our capacity to grasp these predictions is even more limited. Very limited, indeed: those readers who have tried to present in any detail the subtleties of the dual-route model of visual word recognition to their students may know how limited our capacity to compute mentally the logical consequences of a (very) simple architecture may be. Few of us can imagine without external help the combined evolution of more than two elementary differential equations over time.
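A minimal sketch may make this limitation tangible. It assumes three hypothetical units, each driven by a constant input, decaying toward zero and inhibited by the positive activation of its rivals, and it integrates the three coupled differential equations with a simple Euler scheme. The equations, the function name and every parameter value are assumptions chosen for illustration; they are not the equations of any model cited in this paper.

```python
def simulate(inputs, inhibition=0.8, decay=1.0, dt=0.01, steps=2000):
    """Euler integration of a small competitive system (illustrative only).

    For each unit i:
        da_i/dt = -decay * a_i + input_i - inhibition * sum_{j != i} max(a_j, 0)

    Returns the list of activation vectors, one per time step (plus the start).
    """
    acts = [0.0] * len(inputs)
    history = [list(acts)]
    for _ in range(steps):
        # Only positive activations inhibit other units.
        positive = [max(a, 0.0) for a in acts]
        total = sum(positive)
        updated = []
        for a, p, x in zip(acts, positive, inputs):
            rivals = total - p  # summed positive activation of the other units
            da = -decay * a + x - inhibition * rivals
            updated.append(a + dt * da)
        acts = updated
        history.append(list(acts))
    return history


if __name__ == "__main__":
    history = simulate([1.0, 0.9, 0.5])
    # Print a few snapshots of the three trajectories.
    for step in (0, 50, 200, 1000, 2000):
        a0, a1, a2 = history[step]
        print(f"t={step * 0.01:5.2f}  a0={a0:+.3f}  a1={a1:+.3f}  a2={a2:+.3f}")
```

Even in this toy case the trajectories are not easy to anticipate by inspection: the weakest unit first rises with the others and is then pushed below zero, while the gap between the two strongest units opens much more slowly than their initial rise would suggest. Running the equations makes such consequences visible.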

We have also argued that modeling forces researchers to elaborate more detailed and fully specified accounts. Although any implemented model involves many arbitrary decisions, the "full specification constraint" is a positive pressure that may drive scientific progress. In fact, any arbitrary implementation choice hides a potential empirical issue: it suffices that another designer suggest a different solution, and that the resulting models perform differently or lead to distinct predictions.

We have also suggested that the use of modeling techniques helps delimit the space of potential explanations by enlarging the scope of theoretical accounts and also by referring to general principles of processing. By exploring the intrinsic characteristics of the model, psychologists may be led toward accounts that are more strongly motivated theoretically. Models are also concrete objects, which lend themselves to further study. Access to computational models gives psychologists collections of hypothetical devices that may be constructed, deconstructed, and manipulated at will. We have illustrated how the elaboration of computer models leads to the identification of new research issues, and how the exploration and the systematic study of their performance may help us understand the behavioral characteristics of hypothetical processing systems.

Finally, we have claimed that to be useful from a psychological viewpoint, computer programs should be accompanied by an appropriate description. Jointly these make it possible to establish how the elements of the designed system map onto the real function, and how the behavioral characteristics of the system emerge from its design features. In our view, the major change that computer models introduce into psychological research is that they allow a dyadic confrontation between empirical observations and verbal models to be transformed into a triadic and interactive confrontation between data, theories and implemented simulation systems.

References

Bever, T. (1992). The demons and the beast: modular and nodular kinds of knowledge. In R. G. Reilly & N. E. Sharkey (Eds.), Connectionist approaches to natural language processing, (pp. 213-252). Hillsdale, NJ: Lawrence Erlbaum.

Broadbent, D. (1987). Simple models for experimentable situations. In P. Morris (Ed.), Modelling cognition, (pp. 169-185). London: Wiley.

Chomsky, N. (1965). Aspects of the theory of syntax. Cambridge, Ma: MIT Press.

Content, A., & Sternon, P. (1994). Modelling retroactive context effects in spoken word recognition with a simple recurrent network, Proceedings of the 16th Annual Conference of the Cognitive Science Society, 207-212. Hillsdale, NJ: Lawrence Erlbaum.

Cutler, A. (1980). Making up materials is a confounded nuisance, or: Will we be able to run any psycholinguistic experiments at all in 1990? Cognition, 10, 65-70.

Dijkstra, A., & de Smedt, K. (1996). Computer models in psycholinguistics: an introduction. In A. Dijkstra & K. de Smedt (Eds.), Computational psycholinguistics: AI and Connectionist Models of Human Language Processing. (pp. 1-23) London: Taylor & Francis.

Ellis, A. W., & Young, A. W. (1988). Human cognitive neuropsychology. Hove: Lawrence Erlbaum.

Elman, J. L., & McClelland, J. L. (1984). Speech perception as a cognitive process: the interactive activation model. In N. Lass (Ed.), Speech and language: Advances in basic research and practice, (Vol. 10, pp. 337-374). New York: Academic Press.

Estes, W. K. (1993). Mathematical models in psychology. In G. Keren & C. Lewis (Eds.), A handbook for data analysis in the behavioral sciences: methodological issues, (pp. 3-19). London: Lawrence Erlbaum.

Forster, K. I. (1994). Computational modeling and elementary process analysis in visual word recognition. Journal of Experimental Psychology: Human Perception and Performance, 20, 1292-1310.

Frauenfelder, U. (1996). Computational modelling of spoken word recognition. In A. Dijkstra & K. de Smedt (Eds.), Computational psycholinguistics: AI and Connectionist Models of Human Language Processing. (pp. 114-138) London: Taylor & Francis.

Frauenfelder, U. H., Content, A. & Scholten, M. (1995). Activation and deactivation in spoken word recognition. Paper presented at the 36th annual meeting of the Psychonomics Society, Los Angeles, November 1995.

Frauenfelder, U. H., & Peeters, G. (1990). On lexical segmentation in TRACE: an exercise in simulation. In G. T. M. Altmann (Ed.), Cognitive Models of Speech Processing, (pp. 50-86). Cambridge, Ma: M.I.T. Press.

Gibson, E. J. (1994). Has psychology a future? Psychological Science, 5, 69-76.

Grainger, J., & Jacobs, A. M. (in press). Orthographic processing in visual word recognition: a multiple read-out model. Psychological Review.

Jacobs, A. M., & Grainger, J. (1994). Models of visual word recognition: Sampling the state of the art. Journal of Experimental Psychology: Human Perception and Performance, 20, 1311-1334.

Johnson-Laird, P. N. (1983). Mental models. Cambridge: Cambridge University Press.

Johnson-Laird, P. N. (1988). The computer and the mind. Cambridge, Ma.: Harvard University Press.

Lachter, J., & Bever, T. G. (1988). The relation between linguistic structure and associative theories of language learning: A constructive critique of some connectionist learning models. Cognition, 28, 195-247.

Ling, C., & Marinov, M. (1993). Answering the connectionist challenge: a symbolic model of learning the past tenses of English verbs. Cognition, 49, 235-290.

Loftus, G. (1993). Computer simulation: some remarks on theory in psychology. In G. Keren & C. Lewis (Eds.), Data analysis in the behavioral sciences: Statistical issues, (pp. 477-491). Hillsdale, NJ: Lawrence Erlbaum.

MacKay, D. G. (1988). Under what conditions can theoretical psychology survive and prosper? Integrating the rational and empirical epistemologies. Psychological Review, 95, 559-565.

MacKay, D. G. (1993). The theoretical epistemology: a new perspective on some long-standing methodological issues in psychology. In G. Keren & C. Lewis (Eds.), A handbook for data analysis in the behavioral sciences: methodological issues, (pp. 229-255). London: Lawrence Erlbaum.

MacWhinney, B., & Leinbach, J. (1991). Implementations are not conceptualizations: Revising the verb learning model. Cognition, 40, 121-157.

Marr, D. (1982). Vision. New York: Freeman.

Marslen-Wilson, W. D. (1987). Functional parallelism in spoken word recognition. Cognition, 25, 71-102.

Marslen-Wilson, W. D., & Welsh, A. (1978). Processing interactions and lexical access during word recognition in continuous speech. Cognitive Psychology, 10, 29-63.

Massaro, D. W. (1988). Some criticisms of connectionist models of human performance. Journal of Memory and Language, 27, 213-234.

Massaro, D. W. (1989a). Multiple book review of Speech perception by ear and by eye: a paradigm for psychological inquiry. Behavioral and Brain Sciences, 12, 741-794.

Massaro, D. W. (1989b). Testing between the TRACE model and the Fuzzy Logical model of speech perception. Cognitive Psychology, 21, 398-421.

Massaro, D. W., & Cowan, N. (1993). Information processing models: microscopes of the mind. Annual Review of Psychology, 44, 383-426.

McClelland, J. L. (1988). Connectionist models and psychological evidence. Journal of Memory and Language, 27, 107-123.

McClelland, J. L. (1991). Stochastic interactive processes and the effect of context on perception. Cognitive Psychology, 23, 1-44.

McClelland, J. L. (1993). Toward a theory of information processing in graded, random and interactive networks. In D. E. Meyer & S. Kornblum (Eds.), Attention and Performance, (Vol. XIV, pp. 655-688). Hillsdale, NJ: Lawrence Erlbaum.

McClelland, J. L., & Elman, J. L. (1986). The TRACE model of speech perception. Cognitive Psychology, 18, 1-86.

McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: Part 1. An account of basic findings. Psychological Review, 88, 375-405.

McCloskey, M. (1991). Networks and theories: the place of connectionism in cognitive science. Psychological Science, 2, 387-395.

Miller, G. A., Galanter, E., & Pribram, K. H. (1960). Plans and the structure of behavior. New York: Holt, Rinehart, & Winston.

Moore, E. F. (1956). Gedanken-experiments on machines. In C. E. Shannon & J. McCarthy (Eds.), Automata studies. Princeton, NJ: Princeton University Press.

Morais, J., Alegria, J., & Content, A. (1987). The relationship between segmental analysis and alphabetic literacy: An interactive view. Cahiers de Psychologie Cognitive, 7, 415-438.

Morton, J. (1969). The interaction of information in word recognition. Psychological Review, 76, 165-178.

Morton, J. (1980). The logogen model and orthographic structure. In U. Frith (Ed.), Cognitive processes in spelling, (pp. 117-135). London: Academic Press.

Neisser, U. (1976). Cognition and reality. San Francisco: Freeman.

Newell, A. (1973). You can't play 20 questions with nature and win. In W. G. Chase (Ed.), Visual information processing, (pp. 283-308). New York: Academic Press.

Newell, A. (1990). Unified theories of cognition. Cambridge, Ma: Harvard University Press.

Norman, D. A. (1980). Twelve issues for cognitive science. In D. A. Norman (Ed.), Perspectives on cognitive science, (pp. 265-295). Hillsdale: Lawrence Erlbaum.

Norris, D. (1990). A Dynamic-net model of human speech recognition. In G. T. M. Altmann (Ed.), Cognitive Models of Speech Processing, (pp. 87-104). Cambridge, Ma: M.I.T. Press.

Palmer, S. E., & Kimchi, R. (1986). The information processing approach to cognition. In T. J. Knapp & L. C. Robertson (Eds.), Approaches to cognition: contrast and controversies, (pp. 37-77). Hillsdale: Lawrence Erlbaum.

Parisi, D., & Burani, C. (1988). Observations on theoretical models in neuropsychology of language. In F. Denes, C. Semenza, P. Bisiacchi, & E. Andreewsky (Eds.), Perspectives on cognitive neuropsychology. London: Lawrence Erlbaum.

Plaut, D., & McClelland, J. L. (1993). Generalization with componential attractors: word and nonword reading in an attractor network. Paper presented at the 15th Annual Conference of the Cognitive Science Society, Hillsdale.

Plaut, D., & Shallice, T. (1993). Deep dyslexia: a case study of connectionist neuropsychology. Cognitive Neuropsychology, 10, 377-500.

Plaut, D. C., McClelland, J. L., Seidenberg, M. S., & Patterson, K. (1996). Understanding normal and impaired word reading: computational principles in quasi-regular domains. Psychological Review, 103, 56-115.

Plunkett, K., & Marchman, V. (1989). Pattern association in a back-propagation network: implications for child language acquisition (Technical Report 8902): Center for Research in Language, University of California at San Diego.

Plunkett, K., & Marchman, V. (1990). From rote learning to system building (Technical Report 9020): Center for Research in Language, University of California at San Diego.

Port, R. F., & van Gelder, T. (1995a). It's about time: an overview of the dynamical approach to cognition. In R. F. Port & T. van Gelder (Eds.), Mind as motion, (pp. 1-44). Cambridge, Ma.: MIT Press.

Port, R. F., & van Gelder, T. (1995b). Mind as motion. Cambridge, Ma.: MIT Press.

Pylyshyn, Z. W. (1984). Computation and cognition. Cambridge, Ma: MIT Press.

Rohl, M., & Pratt, C. (1995). Phonological awareness, verbal working memory and the acquisition of literacy. Reading and Writing, 7, 327-360.

Rumelhart, D. E., & McClelland, J. L. (1986). On learning the past tenses of English
