Categorical perception

Categorical perception is the perception of different sensory phenomena as being qualitatively, or categorically, different. It is opposed to "continuous perception", the perception of different sensory phenomena as being located on a smooth continuum.

Categorical perception (CP) can be inborn or can be induced by learning. Formerly thought to be peculiar to speech and color perception, CP turns out to be far more general, and may be related to how the neural networks in our brains detect the features that allow us to sort the things in the world into their proper categories, "warping" perceived similarities and differences so as to compress some things into the same category and separate others into different categories.

Categorization

A category [cite encyclopedia |author=Harnad, Stevan |year=2005 |title=To Cognize is to Categorize: Cognition is Categorization |editor=C Lefebvre & H. Cohen |encyclopedia=Handbook of Categorization in Cognitive Science |location=New York |publisher=Elsevier Press |url=http://cogprints.org/11725/] , or kind, is a set of things. Membership in the category may be (1) all-or-none, as with "bird": Something either is a bird or it isn't a bird; a penguin is 100% bird, a platypus is 100% not-bird. In this case we would call the category "categorical." Or membership might be (2) a matter of degree, as with "big": Some things are more big and some things are less big. In this case the category is "continuous" (or rather, degree of membership corresponds to some point along a continuum). There are range or context effects as well: elephants are relatively big in the context of animals, relatively small in the context of bodies in general, if we include planets.

Many categories, however, particularly concrete sensori-motor categories (things we can see and touch), are a mixture of the two: categorical at an everyday level of magnification, but continuous at a more microscopic level. Color categories are an example: Central reds are clearly reds, and not shades of yellow. But in the orange region of the spectral continuum, red/yellow is a matter of degree; context and contrast effects can also move these regions around somewhat. Perhaps even with "bird," an artist or genetic engineer could design intermediate cases whose "birdness" was only a matter of degree.

Resolving the "blooming, buzzing confusion"

Categories are important because they determine how we see and act upon the world. As William James noted, we do not see a continuum of "blooming, buzzing confusion" but an orderly world of discrete objects. Some of these categories are "prepared" in advance by evolution: The frog's brain is born already able to detect "flies"; it needs only normal exposure rather than any special learning in order to recognize and catch them. Humans have such innate category-detectors too: The human face itself is probably an example. So too are our basic color categories, although according to the "Whorf Hypothesis" (Whorf 1956; also called the "linguistic relativity" hypothesis), colors are determined by how culture and language happen to subdivide the spectrum.

But if one opens up a dictionary at random and picks out a content word, chances are that it names a category we have learned to detect, rather than one that our brains were innately prepared in advance by evolution to detect. The generic human face may be an innate category for us, perhaps even the various basic emotions it can express, but surely all the specific people we know and can name are not. "Red" and "yellow" may be inborn, but "scarlet" and "crimson"?

The motor theory of speech perception

And what about the very building blocks of the language we use to name categories: Are our speech-sounds -- ba, da, ga -- innate or learned? The first question we must answer about them is whether they are categorical categories at all, or merely arbitrary points along a continuum. It turns out that if one analyzes the sound spectrogram of ba and pa, for example, both are found to lie along an acoustic continuum called "voice-onset-time." With a technique similar to the one used in "morphing" visual images continuously into one another, it is possible to "morph" a ba gradually into a pa and beyond by gradually increasing the voicing parameter.

Alvin Liberman and colleagues [cite journal |author=Liberman, A. M., Harris, K. S., Hoffman, H. S. & Griffith, B. C. |year=1957 |title=The discrimination of speech sounds within and across phoneme boundaries |journal=Journal of Experimental Psychology |volume=54 |pages=358–368 |doi=10.1037/h0044417] reported that when people listen to sounds that vary along the voicing continuum, they hear only ba's and pa's, nothing in between. This effect -- in which a perceived quality jumps abruptly from one category to another at a certain point along a continuum, instead of changing gradually -- Liberman dubbed "categorical perception" (CP). He suggested that CP was unique to speech, that CP made speech special, and, in what came to be called "the motor theory of speech perception," he suggested that CP's explanation lay in the anatomy of speech production.

According to the (now abandoned) motor theory, the reason we perceive an abrupt change between ba and pa is that the way we hear speech sounds is influenced by the way we produce them when we speak. What is varying along this continuum is voice-onset-time: the "b" in ba is voiced and the "p" in pa is not. But unlike the synthetic "morphing" apparatus, our natural vocal apparatus is not capable of producing anything in between ba and pa. So when I hear a sound from the voicing continuum, my brain perceives it by trying to match it with what it would have had to do to produce it. Since the only thing I can produce is ba or pa, I will perceive any of the synthetic stimuli along the continuum as either ba or pa, whichever it is closer to. A similar CP effect is found with ba/da; these too lie along a continuum acoustically, but vocally, ba is formed with the two lips, da with the tip of the tongue and the alveolar ridge, and our anatomy does not allow any intermediates.

The motor theory of speech perception explained how speech was special and why speech-sounds are perceived categorically: sensory perception is mediated by motor production. Wherever production is categorical, perception will be categorical; where production is continuous, perception will be continuous. And indeed vowel categories like a/u were found to be much less categorical than ba/pa or ba/da.

Acquired distinctiveness

If motor production mediates sensory perception, then one assumes that this CP effect is a result of learning to produce speech. Eimas et al. (1971), however, found that infants already have speech CP before they begin to speak. Perhaps, then, it is an innate effect, evolved to "prepare" us to learn to speak. [cite journal |author=Eimas, P.D., Siqueland, E.R., Jusczyk, P.W., & Vigorito, J. |year=1971 |title=Speech perception in infants |journal=Science |volume=171 |pages=303–306 |doi=10.1126/science.171.3968.303] But Kuhl (1987) found that chinchillas also have "speech CP" even though they never learn to speak, and presumably did not evolve to do so. [cite encyclopedia |author=Kuhl, P. K. |year=1987 |title=The Special-Mechanisms Debate in Speech Perception: Nonhuman Species and Nonspeech Signals |editor=S. Harnad |encyclopedia=Categorical perception: The groundwork of Cognition |location=New York |publisher=Cambridge University Press] Lane (1965) went on to show that CP effects can be induced by learning alone, with a purely sensory (visual) continuum in which there is no motor production discontinuity to mediate the perceptual discontinuity. [cite journal |author=Lane, H. |year=1965 |title=The motor theory of speech perception: A critical review |journal=Psychological Review |volume=72 |pages=275–309 |doi=10.1037/h0021986] He concluded that speech CP is not special after all, but merely a special case of Lawrence's classic demonstration that stimuli to which you learn to make a different response become more distinctive and stimuli to which you learn to make the same response become more similar.

It also became clear that CP was not quite the all-or-none effect Liberman had originally thought it was: It is not that all pa's are indistinguishable and all ba's are indistinguishable. We can hear the differences, just as we can see the differences between different shades of red. It is just that the within-category differences (pa1/pa2 or red1/red2) sound/look much smaller than the between-category differences (pa2/ba1 or red2/yellow1), even when the size of the underlying physical differences (voicing, wavelength) is actually the same.
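As a rough illustration of this point, the sketch below passes equal physical steps along a voicing continuum through an S-shaped mapping and compares the resulting "perceived" differences. The logistic form, the 25 ms boundary, and the slope value are assumptions chosen for illustration only; they are not taken from the studies cited here.

```python
import math

def perceived(vot_ms, boundary_ms=25.0, slope=0.3):
    """Map a physical voice-onset-time value (ms) onto a 0..1 perceptual scale.

    The logistic shape, boundary, and slope are illustrative assumptions.
    """
    return 1.0 / (1.0 + math.exp(-slope * (vot_ms - boundary_ms)))

# Two pairs of stimuli with the same 10 ms physical difference.
within = abs(perceived(10.0) - perceived(0.0))    # both on the "ba" side
between = abs(perceived(30.0) - perceived(20.0))  # straddles the assumed boundary

print(f"within-category perceived difference:  {within:.3f}")
print(f"between-category perceived difference: {between:.3f}")
```

Under these assumptions the within-category pair is still distinguishable (its difference is small but not zero), while the pair that straddles the boundary is perceived as far more different.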

The modern definition of categorical perception

This evolved into the contemporary definition of CP, which is no longer peculiar to speech or dependent on the motor theory: CP occurs whenever perceived within-category differences are compressed and/or between-category differences are separated, relative to some baseline of comparison. The baseline might be the actual size of the physical differences involved, or, in the case of learned CP, it might be the perceived similarity or discriminability within and between categories before the categories were learned, compared to after.
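The definition above can be made concrete with a small sketch that compares pairwise distances against a baseline. The function names (`mean_distance`, `cp_effect`) and the toy distance matrices are hypothetical, invented here for illustration; real studies would use measured similarity or discriminability data.

```python
def mean_distance(dist, labels, same_category):
    """Average pairwise distance over pairs that do (or do not) share a category."""
    values = [
        dist[i][j]
        for i in range(len(labels))
        for j in range(i + 1, len(labels))
        if (labels[i] == labels[j]) == same_category
    ]
    return sum(values) / len(values)

def cp_effect(baseline, observed, labels):
    """Return (compression, separation) relative to the baseline distances.

    Positive compression: within-category pairs are closer than at baseline.
    Positive separation: between-category pairs are further apart than at baseline.
    """
    compression = (mean_distance(baseline, labels, True)
                   - mean_distance(observed, labels, True))
    separation = (mean_distance(observed, labels, False)
                  - mean_distance(baseline, labels, False))
    return compression, separation

# Toy data: four stimuli evenly spaced at baseline, two per category.
labels = ["A", "A", "B", "B"]
baseline = [[0, 1, 2, 3], [1, 0, 1, 2], [2, 1, 0, 1], [3, 2, 1, 0]]
observed = [[0, 0.5, 2.5, 3], [0.5, 0, 2, 2.5], [2.5, 2, 0, 0.5], [3, 2.5, 0.5, 0]]

compression, separation = cp_effect(baseline, observed, labels)
print(compression, separation)
```

Because the definition says "and/or", either value being reliably positive would count as CP under this reading; a zero or negative value on one measure does not rule it out.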

The typical learned CP experiment would be the following: A set of stimuli is tested (usually in pairs) for similarity or discriminability. In the case of similarity, multidimensional scaling might be used to scale the rated pairwise similarity of the set of stimuli. In the case of discriminability, same/different judgments and signal detection analysis might be used to estimate the pairwise discriminability of a set of stimuli. Then the same subjects or a different set are trained, using trial and error and corrective feedback, to sort the stimuli into two or more categories. After the categorization has been learned, similarity or discriminability is tested again, and compared against the untrained data. If there is significant within-category compression and/or between-category separation, this is operationally defined as CP. [cite book |author=Harnad, S. (ed.) |year=1987 |title=Categorical Perception: The Groundwork of Cognition |location=New York |publisher=Cambridge University Press |url=http://cogprints.org/1571/]
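For the discriminability branch of such an experiment, one common signal detection measure is the sensitivity index d'. The sketch below uses the basic formula d' = z(hit rate) - z(false-alarm rate) as a simplification; same/different designs are often analyzed with more elaborate models, and the response rates shown are made-up numbers, not data from any cited study.

```python
from statistics import NormalDist

def d_prime(hit_rate, false_alarm_rate):
    """Sensitivity index: z(hits) - z(false alarms).

    hit_rate: proportion of "different" responses on physically different pairs.
    false_alarm_rate: proportion of "different" responses on identical pairs.
    Both rates must lie strictly between 0 and 1.
    """
    z = NormalDist().inv_cdf
    return z(hit_rate) - z(false_alarm_rate)

# Hypothetical response rates for one stimulus step size.
before_training = d_prime(0.60, 0.40)        # same baseline for every pair
after_within = d_prime(0.55, 0.45)           # pair inside one learned category
after_between = d_prime(0.85, 0.15)          # pair straddling the learned boundary

print(f"before training:       {before_training:.2f}")
print(f"after, within pair:    {after_within:.2f}")
print(f"after, between pair:   {after_between:.2f}")
```

With these invented rates, discriminability falls for the within-category pair and rises for the between-category pair after training, which is the pattern the operational definition looks for.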

The Whorf Hypothesis

According to the Sapir-Whorf Hypothesis (of which Lawrence's acquired similarity/distinctiveness effects would simply be a special case), colors are perceived categorically only because they happen to be named categorically: Our subdivisions of the spectrum are arbitrary, learned, and vary across cultures and languages. But Berlin & Kay (1969) showed that this was not so: Not only do most cultures and languages subdivide and name the color spectrum the same way, but even for those who don't, the regions of compression and separation are the same. [cite book |author=Berlin, B. & Kay, P. |year=1969 |title=Basic color terms: Their universality and evolution. |publisher=Berkeley: University of California Press] We all see blues as more alike and greens as more alike, with a fuzzy boundary in between, whether or not we have named the difference. So there is no Whorfian learning effect with colors: Or is there?

Evolved CP

First, back to vowels. The signature of CP is within-category compression and/or between-category separation. The size of the CP effect is merely a scaling factor; it is this compression/separation "accordion effect" that is CP's distinctive feature. In this respect, the "weaker" CP effect for vowels, whose motor production is continuous rather than categorical, but whose perception is by this criterion categorical, is every bit as much of a CP effect as the ba/pa and ba/da effects. But, as with colors, it looks as if the effect is an innate one: Our sensory category detectors for both color and speech sounds are born already "biased" by evolution. Our perceived color and speech-sound spectrum is already "warped" with these compression/separations.

Learned CP

The Lane/Lawrence demonstrations, lately replicated and extended by Goldstone (1994), showed that CP can be induced by learning alone. [cite journal |author=Goldstone, R. L. |year=1994 |title=Influences of categorization on perceptual discrimination |journal=Journal of Experimental Psychology: General |volume=123 |pages=178–200] And there are also the countless categories cataloged in our dictionaries that could not possibly be inborn (though nativist theorists such as Fodor [1983] have sometimes seemed to suggest that all of our categories are inborn). [cite book |author=Fodor, J. |year=1983 |title=The modularity of mind |publisher=MIT Press] There are even recent demonstrations that although the primary color and speech categories are probably inborn, their boundaries can be modified or even lost as a result of learning, and weaker secondary boundaries can be generated by learning alone. [cite journal |author=Roberson, D., Davies, I. & Davidoff, J. |year=2000 |title=Color categories are not universal: Replications and new evidence from a stone-age culture |journal=Journal of Experimental Psychology: General |volume=129 |pages=369–398]

Perhaps CP performs some useful function in categorization? In the case of innate CP, our categorically biased sensory detectors pick out their prepared color and speech-sound categories far more readily and reliably than if our perception had been continuous. Could something similar be the case for our repertoire of learned categories too?

Computational and neural models

Computational modeling (Tijsseling & Harnad 1997; Damper & Harnad 2000) has shown that many types of category-learning mechanisms (e.g. both back-propagation and competitive networks) display CP-like effects. [cite journal |author=Damper, R.I. & Harnad, S. |year=2000 |title=Neural Network Modeling of Categorical Perception |journal=Perception and Psychophysics |volume=62(4) |pages=843–867 |url=http://cogprints.org/1620/] [cite encyclopedia |author=Tijsseling, A. & Harnad, S. |year=1997 |title=Warping Similarity Space in Category Learning by Backprop Nets |editor=Ramscar, M., Hahn, U., Cambouropoulos, E. & Pain, H. |encyclopedia=Proceedings of SimCat 1997: Interdisciplinary Workshop on Similarity and Categorization |publisher=Department of Artificial Intelligence, Edinburgh University |pages=263-269 |url=http://cogprints.org/1608/] In back-propagation nets, the hidden-unit activation patterns that "represent" an input build up within-category compression and between-category separation as they learn; other kinds of nets display similar effects. CP seems to be a means to an end: Inputs that differ among themselves are "compressed" onto similar internal representations if they must all generate the same output; and they become more separate if they must generate different outputs. The network's "bias" is what filters inputs onto their correct output category. The nets accomplish this by selectively detecting (after much trial and error, guided by error-correcting feedback) the invariant features that are shared by the members of the same category and that reliably distinguish them from members of different categories; the nets learn to ignore all other variation as irrelevant to the categorization.
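The compression/separation that such nets develop can be seen even in a drastically reduced stand-in. The sketch below is not one of the cited back-propagation or competitive networks: it trains a single logistic unit by gradient descent with error-correcting feedback on eight stimuli along a continuum, and treats the unit's activation as its "internal representation". The stimulus spacing, learning rate, and number of epochs are arbitrary assumptions.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Eight stimuli evenly spaced along a continuum, centred on the category boundary.
stimuli = [(i - 3.5) / 7.0 for i in range(8)]
targets = [0.0 if x < 0 else 1.0 for x in stimuli]

def separation_ratio(w, b):
    """Between-category step divided by the mean within-category step,
    measured on the unit's activations for adjacent stimuli."""
    acts = [sigmoid(w * x + b) for x in stimuli]
    steps = [abs(acts[i + 1] - acts[i]) for i in range(len(acts) - 1)]
    between = steps[3]                 # the only adjacent pair crossing the boundary
    within = steps[:3] + steps[4:]
    return between / (sum(within) / len(within))

w, b = 0.1, 0.0                        # nearly linear response before learning
ratio_before = separation_ratio(w, b)

learning_rate = 1.0
for _ in range(2000):                  # batch gradient descent on cross-entropy error
    errors = [sigmoid(w * x + b) - t for x, t in zip(stimuli, targets)]
    w -= learning_rate * sum(e * x for e, x in zip(errors, stimuli)) / len(stimuli)
    b -= learning_rate * sum(errors) / len(stimuli)

ratio_after = separation_ratio(w, b)
print(f"between/within ratio before learning: {ratio_before:.2f}")
print(f"between/within ratio after learning:  {ratio_after:.2f}")
```

Before learning, adjacent stimuli are about equally spaced in the unit's activation; after learning, the step that crosses the boundary is stretched and the steps inside each category are squeezed. A net with hidden units would show the analogous effect in its hidden-unit patterns, which this single-unit sketch does not model.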

Very little is known yet about the brain mechanisms of category perception and learning. The computational models are really causal hypotheses about what the brain might be doing. Neural data provide correlates of CP and of learning. [cite journal |author=Sharma, A. & Dorman, M.F. |year=1999 |title=Cortical auditory evoked potential correlates of categorical perception of voice-onset time |journal=Journal of the Acoustical Society of America |volume=106(2) |pages=1078–1083 |doi=10.1121/1.428048] Differences between event-related potentials recorded from the brain have been found to be correlated with differences in the perceived category of the stimulus viewed by the subject. Neural imaging studies have shown that these effects are localized and even lateralized to certain brain regions in subjects who have successfully learned the category, and are absent in subjects who have not. [cite journal |author=Seger, Carol A.; Poldrack, Russell A.; Prabhakaran, Vivek; Zhao, Margaret; Glover, Gary H.; Gabrieli, John D. E. |year=2000 |title=Hemispheric asymmetries and individual differences in visual concept learning as measured by functional MRI |journal=Neuropsychologia |volume=38(9) |pages=1316–1324 |doi=10.1016/S0028-3932(00)00014-2] [cite journal |author=Raizada, RDS; Poldrack, RA |year=2007 |title=Selective Amplification of Stimulus Differences during Categorical Processing of Speech |journal=Neuron |volume=56 |pages=726–740 |doi=10.1016/j.neuron.2007.11.001]

Language-induced categorical perception

Both innate and learned CP are sensorimotor effects: The compression/separation biases are sensorimotor biases, and presumably had sensorimotor origins, whether during the sensorimotor life-history of the organism, in the case of learned CP, or the sensorimotor life-history of the species, in the case of innate CP. The neural net I/O models are also compatible with this fact: Their I/O biases derive from their I/O history. But when we look at our repertoire of categories in a dictionary, it is highly unlikely that many of them had a direct sensorimotor history during our lifetimes, and even less likely in our ancestors' lifetimes. How many of us have seen a unicorn in real life? We have seen pictures of them, but what had those who first drew those pictures seen? And what about categories I cannot draw or see (or taste or touch): What about the most abstract categories, such as goodness and truth?

Some of our categories must originate from another source than direct sensorimotor experience, and here we return to language and the Whorf Hypothesis: Can categories, and their accompanying CP, be acquired through language alone? Again, there are some neural net simulation results suggesting that once a set of category names has been "grounded" through direct sensorimotor experience, they can be combined into Boolean combinations (man = male & human) and into still higher-order combinations (bachelor = unmarried & man) which not only pick out the more abstract, higher-order categories much the way the direct sensorimotor detectors do, but also inherit their CP effects, as well as generating some of their own. Bachelor inherits the compression/separation of unmarried and man, and adds a layer of separation/compression of its own. [cite journal |author=Cangelosi, A. & Harnad, S. |year=2001 |title=The Adaptive Advantage of Symbolic Theft Over Sensorimotor Toil: Grounding Language in Perceptual Categories. |journal=Evolution of Communication |volume=4(1) |pages=117–142 |url=http://cogprints.org/2036/] [cite journal |author=Cangelosi A., Greco A. & Harnad S. |year=2000 |title=From robotic toil to symbolic theft: Grounding transfer from entry-level to higher-level categories |journal=Connection Science |volume=12(2) |pages=143–162 |url=http://cogprints.org/1647/ |doi=10.1080/09540090050129763]
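The Boolean composition described above (man = male & human; bachelor = unmarried & man) can be sketched as follows. The feature dictionaries and detector functions are hypothetical placeholders for "grounded" sensorimotor detectors, and the sketch shows only how higher-order categories are composed from lower-order ones; it does not model the inherited CP effects reported in the simulations.

```python
# Hypothetical grounded detectors: each returns True or False for a described thing.
def is_male(thing):
    return thing["male"]

def is_human(thing):
    return thing["human"]

def is_unmarried(thing):
    return not thing["married"]

def both(first, second):
    """Compose two category detectors into their Boolean conjunction."""
    return lambda thing: first(thing) and second(thing)

# Higher-order categories defined purely by combining existing category names.
is_man = both(is_male, is_human)
is_bachelor = both(is_unmarried, is_man)

examples = {
    "unmarried man": {"male": True, "human": True, "married": False},
    "married man": {"male": True, "human": True, "married": True},
    "unmarried woman": {"male": False, "human": True, "married": False},
    "stallion": {"male": True, "human": False, "married": False},
}

for name, thing in examples.items():
    print(f"{name}: man={is_man(thing)}, bachelor={is_bachelor(thing)}")
```

Note that `is_bachelor` never inspects raw features directly except through the detectors it is built from, which mirrors the idea that the new category is acquired from already-grounded names rather than from fresh sensorimotor experience.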

These language-induced CP effects remain to be directly demonstrated in human subjects; so far only learned and innate sensorimotor CP have been demonstrated. [cite encyclopedia |author=Pevtzow, R. & Harnad, S. |year=1997 |title=Warping Similarity Space in Category Learning by Human Subjects: The Role of Task Difficulty |editor=Ramscar, M., Hahn, U., Cambouropolos, E. & Pain, H. |encyclopedia=Proceedings of SimCat 1997: Interdisciplinary Workshop on Similarity and Categorization |publisher=Department of Artificial Intelligence, Edinburgh University |pages=189-195 |url=http://cogprints.org/1607/] [cite journal |author=Livingston, K. Andrews & Harnad, S. |year=1998 |title=Categorical Perception Effects Induced by Category Learning |journal=Journal of Experimental Psychology: Learning, Memory, and Cognition |volume=24(3) |pages=732–753 |url=http://cogprints.org/2574/ |doi=10.1037/0278-7393.24.3.732] The latter shows the Whorfian power of naming and categorization in warping our perception of the world. That is enough to rehabilitate the Whorf Hypothesis from its apparent failure on color terms (and perhaps also from its apparent failure on Eskimo snow terms [cite journal |author=Pullum, G. K. |year=1989 |title=The great Eskimo vocabulary hoax |journal=Natural Language and Linguistic Theory |volume=7 |pages=275–281] ), but to show that it is a full-blown language effect, and not merely a vocabulary effect, it will have to be shown that our perception of the world can also be warped, not just by how things are named but by what we are told about them.

References

Bibliography

*This article is based on material from the article "Categorical Perception" in the "Encyclopedia of Cognitive Science," used here with permission of the author, S. Harnad.
*cite journal |author=Burns, E. M.; Campbell, S. L. |year=1994 |title=Frequency and frequency-ratio resolution by possessors of absolute and relative pitch: Examples of categorical perception? |journal=Journal of the Acoustical Society of America |volume=96 |pages=2704–2719 |doi=10.1121/1.411447
*cite paper |author=Belpaeme, Tony |year=2002 |title=Factors influencing the origins of colour categories |publisher=Artificial Intelligence Lab, Vrije Universiteit Brussel |url=http://arti.vub.ac.be/~tony/phd/index.htm
*cite journal |author=Bimler, D & Kirkland, J. |year=2001 |title=Categorical perception of facial expressions of emotion: Evidence from multidimensional scaling. |journal=Cognition & Emotion |volume=15 |pages=633–658 |doi=10.1080/02699930143000077
*cite journal |author=Calder, A.J., Young, A.W., Perrett, D.I., Etcoff, N.L. & Rowland, D. |year=1996 |title=Categorical perception of morphed facial expressions |journal=Visual Cognition |volume=3 |pages=81–117 |doi=10.1080/713756735
*cite journal |author=Campanella, S., Quinet, O., Bruyer, R., Crommelinck, M. & Guerit, J.M. |year=2002 |title=Categorical perception of happiness and fear facial expressions : an ERP study |journal=Journal of Cognitive Neuroscience |volume=14 (2) |pages=210–227 |doi=10.1162/089892902317236858
*cite journal |author=Goldstone, R. L, Lippa, Y., & Shiffrin, R. M. |year=2001 |title=Altering object representations through category learning |journal=Cognition |volume=78 |pages=27–43 |doi=10.1016/S0010-0277(00)00099-8
*cite encyclopedia |author=Goldstone, R. L. |year=1999 |title=Similarity |editor=R.A. Wilson & F. C. Keil |encyclopedia=MIT encyclopedia of the cognitive sciences |pages=763-765 |location=Cambridge, MA |publisher=MIT Press
*cite journal |author=Guest, S. & Van Laar, D. |year=2000 |title=The structure of colour naming space |journal=Vision Research |volume=40 |pages=723–734 |doi=10.1016/S0042-6989(99)00221-7
*cite journal |author=Harnad, S. |year=1990 |title=The Symbol Grounding Problem |journal=Physica D |volume=42 |pages=335–346 |url=http://cogprints.soton.ac.uk/documents/disk0/00/00/06/15/index.html |doi=10.1016/0167-2789(90)90087-6
*cite journal |author=Kotsoni, E; de Haan, M; Johnson, MH. |year=2001 |title=Categorical perception of facial expressions by 7-month-old infants |journal=Perception |volume=30 |pages=1115–1125 |doi=10.1068/p3155
*cite journal |author=Lawrence, D. H. |year=1950 |title=Acquired distinctiveness of cues: II. Selective association in a constant stimulus situation |journal=Journal of Experimental Psychology |volume=40 |pages=175–188 |doi=10.1037/h0063217
*cite journal |author=Rossion, B., Schiltz, C., Robaye, L., Pirenne, D. & Crommelinck, M. |year=2001 |title=How does the brain discriminate familiar and unfamiliar faces ? A PET study of face categorical perception |journal=Journal of Cognitive Neuroscience |volume=13 |pages=1019–1034 |doi=10.1162/089892901753165917
*cite journal |author=Schyns, P. G., Goldstone, R. L, & Thibaut, J. |year=1998 |title=Development of features in object concepts |journal=Behavioral and Brain Sciences |volume=21 |pages=1–54 |doi=10.1017/S0140525X98000107
*cite journal |author=Steels, L. |year=2001 |title=Language games for autonomous robots |journal=IEEE Intelligent Systems |volume=16(5) |pages=16–22 |doi=10.1109/5254.956077
*cite encyclopedia |author=Steels, L. and Kaplan, F. |year=1999 |title=Bootstrapping Grounded Word Semantics |editor=Briscoe, T. |encyclopedia=Linguistic evolution through language acquisition: formal and computational models |location=Cambridge UK |publisher=Cambridge University Press
*cite book |author=Whorf, B. L. |year=1956 |title=Language, thought and reality |location=Cambridge, MA |publisher=MIT Press

See also

* Color
* Language
* Learning
* Motor theory
* Neural nets
* Phonemes
* Symbol grounding
* Sapir-Whorf hypothesis


Wikimedia Foundation. 2010.
