• Aucun résultat trouvé

Review: Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness

N/A
N/A
Protected

Academic year: 2022

Partager "Review: Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness"

Copied!
6
0
0

Texte intégral

(1)

Article

Reference

Review: Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness

SCHERER, Klaus R., SANDER, David

SCHERER, Klaus R., SANDER, David. Review: Tone of Voice and Mind: The Connections

between Intonation, Emotion, Cognition, and Consciousness. Music Perception , 2005, vol. 23,

no. 1, p. 87-90

(2)

Cognition, and Consciousness by NORMAN D COOK Review by: KLAUS R. SCHERER and DAVID SANDER

Source: Music Perception: An Interdisciplinary Journal, Vol. 23, No. 1 (September 2005), pp. 87-90

Published by: University of California Press

Stable URL: http://www.jstor.org/stable/10.1525/mp.2005.23.1.87 Accessed: 05-10-2017 13:19 UTC

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://about.jstor.org/terms

University of California Press is collaborating with JSTOR to digitize, preserve and extend access to Music Perception: An Interdisciplinary Journal

(3)

Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness By NORMAND. COOK

Amsterdam: John Benjamins, 2002

Reviewed by KLAUSR. SCHERER& DAVIDSANDER

Geneva Emotion Research Group, University of Geneva

W

ITH HIS PREVIOUS BOOK, The Brain Code, Norman Cook established himself as a thoughtful and provocative author in the surging domain of the neurosciences, preferring to explore bold new avenues rather than to follow the trodden paths. It is thus with high expectation that the averted reader opens the author’s new book, Tone of Voice and Mind. The introductory passage confirms the author’s tactical approach—intending to blaze new trails into the jungles of music perception, voice intona- tion, and consciousness, adopting a neuroscience per- spective and giving much attention to emotion and affective experience. While Cook strives to highlight the links between the chapters in the book and to iden- tify a thread of argument, the reader will find that many chapters can be read on their own, offering little in terms of systematic cross-references or systematic development. For all practical purposes, the book con- sists of a collection of essays organized loosely around the issue of “mind” in an almost philosophical sense.

(The title is a bit misleading as the tone of voice is of central interest in only one chapter.) In consequence, this review will deal with individual chapters and the (clearly announced) messages they convey, with a par- ticular emphasis on issues related to music perception.

Wondering what it means to be a human being, Cook argues that a key to the answer lies in the investigation of functional asymmetry of the human cerebral cortex.

In Chapter 1, the author reminds us of one of his major fields of expertise, functional hemispheric asymmetry, and discusses the fact that the two human cerebral hemispheres are differentially specialized for music processing. He interprets the literature as supporting the claim that the right hemisphere is particularly involved in perceiving pitch, harmony, and melody, while the left hemisphere is specialized in processing the rhythm or tempo of musical tones. Keeping with the issue of cerebral specialization, Chapter 2 is dedicated to what Cook for almost 20 years has referred to as the

“central dogma” of human neuropsychology (Cook, 1986). In this chapter, the case of language is used as a confirmation of the view developed in Chapter 1 that the two cerebral hemispheres interactively cooperate and demonstrate complementary talents. Conceptualiz- ing functional hemispheric asymmetry as being hemi- spheric differentiation plus hemispheric collaboration, Cook suggests that, in language production, the right and the left hemispheres first interact during processing and that the left hemisphere consecutively generates motor commands for the striate musculature. Accord- ing to Cook, this central dogma (i.e., RH ⬍- -⬎LH -⬎

striate musculature, p. 40) would be fundamental for language and handedness and is critical when consider- ing the equipotentiality but functional specialization of cerebral hemispheres in humans. Although the pro- posal that human cerebral hemispheres cooperate is uncontroversial, the reader might wonder why func- tional asymmetry should be a key issue in understand- ing the specificity of human nature. Indeed, while Cook strongly advocates that “the functional asymmetry of the human cerebral cortex implies a form of brain acti- vation probably never experienced by animals of other species” (p. 22), the reader may disagree with the view of a qualitative radical difference between hemispheric asymmetry principles in human beings and other ani- mals. For instance, certain species of birds, including canaries, seem to show a lateralization of song control toward their left hemisphere (Hutsler, Gillespie, &

Gazzaniga, 2002), and it was suggested that some birds show a right hemisphere advantage for harmonic per- ception, as seems to be the case for human beings (Cynx, Williams, & Nottebohm, 1992). Therefore, hemispheric asymmetry research appears critical not only to under- stand what it means to be a human being but also in order to understand what it means to be a canary.

Chapters 3 and 4 are linked by the author’s stated intention of reviewing, in a “musical interlude”

(Chapter 3), the literature on music perception to iden- tify the neurobiological mechanism proposed as the basis of affective meaning of intonation contours in speech. Cook professes a strong belief in this link, even arguing that “anyone with an interest in speech intonation and prosody simply must come to grips with the basic phenomenon of harmony theory (three- tone effects)” (p. 56). Unfortunately, the reader has some difficulties in understanding why that should be the case. In the first part of the chapter, the author reviews some very basic issues of general and musical acoustics, then examines interval dissonance and con- sonance as well as tension and resolution in harmonic

Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness 87

BOOK REVIEWS

06.MUSIC.23_087-102.qxd 10/6/05 11:09 AM Page 87

(4)

chords, and then turns to potential evolutionary origins of the affective impressions produced by major/minor mode differences. While there are occasional references to speech, there is no systematic argument on the direct relevance of the issues discussed to the understanding of affect expression in speech prosody.

While much of the material covered in this chapter may be, as the author acknowledges, too basic for the expert in music and too complex for the novice, the point of view advocated and the data from a perceptual experiment and an fMRI study provide an interesting contribution to the ongoing debates about the issues of consonance vs. dissonance and tension vs. resolution, particularly with respect to the issue of the underlying units (intervals vs. three-tone effects). The author makes a strong argument for the specificity of three- tone effects (as compared to the combination of interval effects) for harmonic tension.

Cook works up to argue that the “psychology of major and minor harmonies” needs to be understood as

“the tonal directionof movement away from unresolved chordal tension” and thus that minimal changes in pitch produce the bright and dark moods associated with the modes (p. 83). What is the origin of this perceptual ten- dency? Cook suggests that it is the sound symbolism in animal calls: “[A]n increase in vocal pitch is indicative of inferior social status and submission, whereas a decrease in vocal pitch is indicative of superior social status and dominance” (p. 85). While this is a juicier explanation than the assumption of historically grown preferences, and while there may well be evolutionary mechanisms at work, Cook’s tendency to resort gener- ous associations and analogies does disservice to the cause. In addition, there seems to be some contradic- tion between the arguments. At several points in the book the author claims that it is inadmissible to resort to simple measures of mean tendency of pitch or funda- mental frequency and its variation and that it is only through consideration of three-tone chords and har- monic tension that pitch movements and their effects can be understood. When using sound symbolism to explain mode effects, he no longer makes that distinc- tion, resorting to simple pitch levels by citing Ohala (1994), whose frequency code hypothesis focuses on vocal pitch height. As in other parts of the book, experts in several of the specialized domains involved will not fully agree with his interpretation of the litera- ture. Thus, when citing Morton’s (1977) motivation- structure hypothesis in the context of linking pitch level with modes, Cook glosses over the fact that cross- species similarities in affect and status are based on a combination of F0, energy, and spectral quality or

timbre (Hauser, 1996, p. 478; Scherer, 1985). In addi- tion, dominance and threat will not necessarily result in bright, major-like sounds but may sound quite gruff.

Similarly, a shrill high-pitched alarm call does not convey a dark, minor mood. Admittedly, Cook warns that his analogy only holds “provided that we consider what a major or minor chord is in relation to a state of harmonic ambiguity or tension” (p. 89). But he does not provide the mechanism that could explain the discrepancy between the musical effect of brightness/

happiness produced by a very slight decrease in pitch of one of the tones of a chord and the not so bright/happy effect of a massive, physiologically based decrease in F0 when an animal threatens another (making itself appear bigger, see Fitch, 2002).

Cook starts Chapter 4 (“The Coding of Human Emotions”) by saying that the coding principle (for speech prosody) invoked in this chapter is based on the pitch coding mechanism for major and minor modes in diatonic music. Given the confusion created by the ref- erence to the vocal pitch height/animal vocalization phenomenon in the preceding chapter, the reader does not quite know what to expect. The confusion is deep- ened by a rather incomplete review of work on vocal affect expression and the role of F0 therein. Things become a bit clearer when Cook uses the notion of harmonic tension, developed in the preceding chapter, to claim four intonational universals: (a) interval disso- nance, (b) the movement away from dissonance, (c) harmonic tension, and (d) the movement away from tension (toward major and minor resolution), with the latter supposedly providing the link to human emo- tions. The reader now starts to understand that Cook proposes a musical analysis of intonation.

Cook then criticizes the intonation literature for lack of attention to small pitch changes (with semitone reso- lution) and likens current prosodic notation systems to the primitive level of music notation in the 12th cen- tury. While there are some pertinent observations and appropriate criticisms in the author’s review of the state of the art, he does seem to throw the baby out with the bathwater. While it is true that there is no consensual system for intonation description and that many sys- tems are motivated by fairly narrow concerns, one does have to acknowledge, which Cook fails to do, that all of these systems are driven by the attempt to describe linguisticnot affectivestructures, an activity that centers around the identification of relative accent heights and placements (for both syntactic, semantic, and pragmatic purposes). Again, one has the feeling that by trying to make his point forcefully, Cook deprives the reader of a more comprehensive understanding of a complex

(5)

Tone of Voice and Mind: The Connections between Intonation, Emotion, Cognition, and Consciousness 89

phenomenon and the history of its scientific analysis.

The reader interested in a more in-depth understanding of the issues might want to consult some more extensive overviews (for language, see Cruttenden, 1997; Hirst

& DiCristo, 1998; Ladd, 1996; for affect, see Bänziger &

Scherer, in press).

However, this should not detract from Cook’s intriguing argument that, whether in music or in speech, pitch configurations in three-tone combinations have intrinsic affective meaning, especially with respect to bright/major and dark/minor moods. The claim is that emotion coding prosody can only be understood by analyzing the pitch intervals with respect to the harmonic tension of both simultaneous and sequential chords. In speech, these must necessarily be sequences of three sequential F0 values. With respect to the latter, there is an interesting discussion, unfortunately a bit short, of the “psychological now,” that is, the moving sub- jective time window that serves to unify and integrate psychological impressions (see also Stern, 2004).

What is the evidence for this provocative claim?

Unfortunately, no theoretical or functional argument is provided (except the general assertion made in Chapters 1 and 2 that speech and music are similarly processed by shared brain mechanisms). Rather, Cook relies on an empirical analysis of the F0 distribution found in 24 (Japanese) speakers who were asked to portray happy, unhappy, and ambiguous emotions (using semantically appropriate short sentences for each emotion). F0 values were extracted, transformed into semitones, and plotted over time. In the next step, the F0 data for all sentences and speakers were averaged within each emotion, based on the argument that “macroscopic regularities of pitch substructure due to affect” are to be identified and that summation is permissible and meaningful as long as semitone conversion is used. Then, a set of Gaussian curves is empirically fitted to the peaks in the summated empirical distribution and selected triads are interpreted with respect to major and minor modes, as well as tension. Despite the acknowledgement that “clear triads were only infrequently found,” it is claimed that “the group data revealed harmonic phenomena that are clearly suggestive of expression of mood through musical mode.”

Unfortunately, it cannot be excluded that affect- appropriate sentences had semantically driven accent structures, making summation problematic. In addi- tion, the data are clearly insufficient to draw any conclusions, as no statistical analyses are possible given the small N of clear triads. Thus, Cook’s claim that the emotional effect of intonation needs to be analyzed with respect to harmonic tension and mode remains a

provocative but unsubstantiated speculation. Unfor- tunately, Cook does not provide a well-argued and documented story as an underpinning to his claim that could sufficiently motivate others to empirically exam- ine the suggestion, which, prima facie, is interesting in the light of the many patterns of similarity between the expression of emotion in speech and music (Juslin &

Laukka, 2003). On the other hand, Cook’s judgment that simple pitch height measures are unlikely to bear any relation to affective meaning in intonation is con- tradicted by strong empirical data showing that most of the variance in intonation contours (in actor portrayals) is due to F0 level differences rather than contour shapes (Bänziger & Scherer, in press).

Chapter 5 presents an intriguing but controversial theory on the nature of the code by which the brain processes exteroceptive information and stores and transmits emotion. Returning to the notion of “central dogma” of human neuropsychology, Cook examines a series of issues related to the form of cooperation between the two hemispheres at the neuronal level given that, according to the author, language processing involves conventional syntax/semantics in the left hemisphere and the sound-symbolism of intonation in the right hemisphere. Specifically, Cook introduces a new concept designed by a neologism—“intonatopic”—

in order to describe the topological organization of auditory information related to the production and perception of intonation in speech (p. 138). Such a proposal has implications for Cook’s theory concerning the brain code; in particular, the author offers two theo- retical predictions relative to emotion: (a) The “posi- tive/negative” valence of voice intonation is due to the patterns of tone combinations in intonatopic maps, and (b) the motivational force of emotion is due to the need to resolve unresolved harmonic tension. Given that understanding processes involved in valence evaluation and in motivational aspects of emotion represent two of the most important issues in emotion research for more than a century, it is with great interest that the reader awaits empirical evidence consistent with these two proposals. The second part of the book, “Consciousness and Cognition,” starts with a physiological account of cognition: Chapter 6 is mainly aimed at describing the proposal that ion-exchanges during action potentials are not “informational” but are a means of directly “feeling”

the electrostatic state of the environment by allowing the free diffusion of ions into and out of the cell. Chapter 7 represents an impressive attempt to describe “cognition”

and “awareness” of animal organisms in terms of neuronal synchronization. Chapter 8 consists of a tech- nical presentation on how a computational simulation of

06.MUSIC.23_087-102.qxd 10/6/05 11:09 AM Page 89

(6)

the cerebral hemispheres can lead to functional asym- metry in object recognition tasks. In his conclusion, Chapter 9, Cook summarizes the major claims pre- sented in his book and addresses a number of philo- sophical issues concerning the human mind. In closing, he ventures that modern neuroscience has provided us with all major pieces to solve the puzzles of the human mind. Not all readers will agree with this very optimistic statement and many will close the book with

the feeling that although the neurosciences have provided a profusion of challenging ideas, many of these ideas still need support from more conclusive empirical evidence.

Address correspondence to: Prof. Klaus R. Scherer, Department of Psychology, University of Geneva, 40.

Bd. du Pont d’Arve, CH-1205 Geneva. E-MAIL

Klaus.Scherer@pse.unige.ch

References

BÄNZIGER, T., & SCHERER, K. R. (in press). The role of intonation in emotional expressions. Speech Communication.

Special issue on Quantitative prosody modeling for natural speech description and generation.

CRUTTENDEN, A. (1997). Intonation. New York: Cambridge University Press.

CYNX, J., WILLIAMS, H., & NOTTEBOHM, F. (1992).

Hemispheric differences in avian song discrimination.

Proceedings of the National Academy of Sciences, USA, 89, 1372-1375.

FITCH, T. (2002). The evolution of spoken language: A comparative approach. Proceedings of the 2002 International Conference on Spoken Language Processing (ICSLP), Denver, Colorado: Center for Spoken Language Research, pp. 1-8.

HAUSER, M. D. (1996). The evolution of communication.

Cambridge: MIT Press.

HIRST, D., & DICRISTO, A. (Eds.) (1998). Intonation systems:

A survey of twenty languages. Cambridge: Cambridge University Press.

HUTSLER, J. J., GILLESPIE, M. E., & GAZZANIGA(2002).

The evolution of hemispheric specialization. In E. Bizzi, P. Caliassano, & V. Volterra (Eds.), Frontiers of life:

Vol. III. The intelligent systems(pp. 37-49). New York:

Academic Press.

JUSLIN, P. N., & LAUKKA, P. (2003). Communication of emotions in vocal expression and music performance:

Different channels, same code? Psychological Bulletin, 129, 770-814.

LADD, D. R. (1996). Intonational phonology. Cambridge:

Cambridge University Press.

MORTON, E. W. (1977). On the occurrence and significance of motivation—structural rules in some bird and mammal sounds. American Naturalist, 111, 855-869.

OHALA, J. J. (1994). The frequency code underlies the sound-symbolic use of voice pitch. In L. Hinton, J. Nichols, &

J. J. Ohala (Eds.), Sound symbolism(pp. 325-347). New York:

Cambridge University Press.

SCHERER, K. R. (1985). Vocal affect signalling: A comparative approach. In J. Rosenblatt, C. Beer, M.-C. Busnel, &

P. J. B. Slater (Eds.), Advances in the study of behavior: Vol. 15 (pp. 189-244). New York: Academic Press.

STERN, D. N. (2004). The present moment in psychotherapy and everyday life. New York: Norton.

Références

Documents relatifs

Other interesting fields related to AVT are automatic sign language translation and signing avatars (Wolfe et al. Still, very often this type of image-related research is

Dieudonn´e’s account deals briskly with the first half of the paper where Dedekind and Weber drew out the analogy between number fields and function fields. Ideals in this ring

The jurisprudence of United Nations treaty bodies, as well as regional human rights courts, provide authoritative and valuable interpretation of the human rights obligations that

When a child is the victim of a criminal offence, appropriate measures to protect the rights and best interests of the child shall be adopted at all stages of the criminal

There is a causal explanation of the solidity of the table, but of course the causal explanation explains a very feature of the system whose behavior causes that feature?.

According to visual intentionalism, unless such visual properties as shape, orientation, color, texture, motion had been instantiated by mind-independent objects in the envir- onment

What the literature has often been silent on, however, is how those who lost kith and kin during slave raids within the interior of Africa, where the bulk of slaves were

This section attempts to check whether the teachers are aware of intensive and extensive reading techniques and if they use any skills and strategies in