Categorical Perception: Definition & Explanation Video

An error occurred trying to load this video.

Try refreshing the page, or contact customer support.

Coming up next: Extrasensory Perception: Definition, Types & Examples

You're on a roll. Keep up the good work!

Take Quiz Watch Next Lesson
Your next lesson will play in 10 seconds
  • 0:00 An Example Of…
  • 1:25 Definition Of…
  • 1:45 Categorical Perception…
  • 4:45 Categorical Perception…
  • 5:15 How It Affects Perception
  • 5:40 Lesson Summary
Save Save Save

Want to watch this again later?

Log in or sign up to add this lesson to a Custom Course.

Log in or Sign up

Speed Speed

Recommended Lessons and Courses for You

Lesson Transcript
Instructor: Alyson Froehlich

Ali teaches college courses in Psychology, a course on how to teach in higher education, and has a doctorate degree in Cognitive Neuroscience.

In this lesson, you will learn about a phenomenon called categorical perception and how it actually changes the way you hear and see certain things. We will focus on the perception of speech, where categorical perception was first discovered.

An Example of Categorical Perception

When someone speaks a language you don't speak, you of course don't understand the meaning of what they say. But did you know that you actually hear different sounds than what a native speaker of that language hears? Take for example the /l/ and /r/ sounds of the English language. While native English speakers easily distinguish these two sounds, Japanese speakers are generally unable to hear the difference. The English words 'lag' and 'rag' would sound the same to them. Likewise, there are sounds that English speakers are unable to distinguish, such as the /t/ and /T/ sounds in Hindi. Why is that? The reason has to do, in part, with a trick of the brain called categorical perception.

Categorical perception helps us to detect the differences between things when we need to be able to, and it masks the differences when we need to treat things as the same. The Japanese language doesn't differentiate between the /l/ and /r/ sounds, so through categorical perception, the brains of native Japanese speakers have learned to treat the two sounds as the same by actually hearing the same sound when each is spoken. However, because those sounds are differentiated in English (changing the l in 'lag' to an r will change the meaning of the word), native English speakers have learned to hear a difference between the two.

Definition of Categorical Perception

Categorical perception occurs when items that range along a continuum are perceived as being either more or less similar to each other than they really are because of the way they are categorized. For example, if items falling within a certain range along that continuum belong to a single category, they will be perceived as being more similar to each other than items outside of that range.

Categorical Perception of Speech Sounds

Categorical perception was first demonstrated with speech sounds (such as /ga/, /pa/, and /du/). Researchers were trying to make sense of the fact that humans understand speech at all when there is so much variability in how different speakers pronounce speech.

For example, when your friend says the word 'dog', it is going to be pronounced slightly differently from how your mom says the word 'dog'; yet each time, you will still hear the same 'dog' despite the variability. What's even more interesting is that you likely never even notice the variability.

Some speech sounds differ only by voice onset time (VOT). VOT is the time it takes for the vocal cords to start vibrating after the release of certain consonant sounds. For example, the sound /da/ differs from /ta/ only in the amount of time it takes for the vocal chords to vibrate after making the consonant sound. Give it a try. Make the sound /da/ and then the sound /ta/. Did you notice how it takes longer for the 'ahh' sound to come out with /ta/ than it does with /da/? In fact, the VOT for /da/ ranges from about 0 to 30 milliseconds (msec), and the VOT for /ta/ runs from about 50 to 80 msec. Take a look at Figure 1. You can see that if the VOT is between 0 to 30 msec, people report that they hear /da/, but if the VOT is 50 to 80 msec, they report hearing /ta/.

Figure 1. When the voice onset time (VOT) of the speech sounds range from 0 to 30 milliseconds (msec), listeners report hearing the sound /da/. When the VOT falls within 50 to 80 msec, listeners hear /ta/.
Voice onset time for /da/ and /ta/.

Here in Figure 1, notice how there is a range of msec for each speech sound. What researchers find is that it doesn't really matter what the VOT is within that range; listeners will hear the same sound regardless. /da/ with a VOT of 10 msec is going to sound like the same /da/ with a VOT of 30 msec. But, increase that VOT to 50 msec, and suddenly you have /ta/.

To unlock this lesson you must be a Member.
Create your account

Register to view this lesson

Are you a student or a teacher?

Unlock Your Education

See for yourself why 30 million people use

Become a member and start learning now.
Become a Member  Back
What teachers are saying about
Try it risk-free for 30 days

Earning College Credit

Did you know… We have over 200 college courses that prepare you to earn credit by exam that is accepted by over 1,500 colleges and universities. You can test out of the first two years of college and save thousands off your degree. Anyone can earn credit-by-exam regardless of age or education level.

To learn more, visit our Earning Credit Page

Transferring credit to the school of your choice

Not sure what college you want to attend yet? has thousands of articles about every imaginable degree, area of study and career path that can help you find the school that's right for you.

Create an account to start this course today
Try it risk-free for 30 days!
Create an account