Have you ever had this experience?
You can read emails in English. When you see the script, you think, "Oh, that's simple." Yet, when you hear it spoken, you can't understand what's being said. When native speakers start talking in a meeting, you suddenly feel left behind.
Many learners at an intermediate level (e.g., TOEIC 600-800) hit this wall. Even after listening to audio materials or podcasts for six months or a year, they struggle to break through.
Why is that?
The reason is simple: you keep listening aimlessly, without knowing what exactly you're missing.
Dictation (the practice of listening to English and writing it down word-for-word) is the most reliable way to solve this problem. It reveals your specific listening weaknesses in a tangible way.
In other words, it's a health check-up for your listening skills.
This article, based on know-how from the English Language Teaching (ELT) field, explains everything from the benefits of dictation to the correct method and common pitfalls.
The Single Reason Why Passive Listening Doesn't Improve Your Skills
First, let's clarify why passive listening alone isn't enough.
When you listen to English, your brain is performing two tasks simultaneously.
- Decoding Sounds — The task of identifying, "What word was that sound?"
- Comprehending Meaning — The task of understanding, "So, what is this about?"
These two tasks compete for the same "working space" in your brain. If you're struggling to decode the sounds, you won't have the mental capacity to process the meaning. Conversely, if you rely too much on guessing the meaning, you'll overlook the finer details of the sounds.
The problem with passive listening is that you don't realize which sounds you've missed.
For example, imagine you hear the sentence, "I've been waiting for an hour." Even if your ears only catch "I...been waiting...hour," you can still guess the meaning from the context: "Ah, they've been waiting for an hour."
The meaning is correct. But you didn't hear three words: 've (have), for, and an.
This state of "understanding the meaning without hearing the sounds" is tricky. You remain unaware of the problem and repeat the same listening mistakes hundreds of times.
Dictation forces these "invisible gaps" into the open. The moment you try to write it down, the parts you didn't hear appear as blank spaces right in front of you.
The 3 Key Benefits of Dictation
Dictation is more than just a transcription exercise because of these three benefits.
Benefit 1: Pinpoint Exactly Which Sounds You're Missing
This is the greatest value of dictation.
When you compare your written text with the script, it becomes immediately obvious what you missed. Moreover, you'll notice that you repeatedly make mistakes in the same areas. This is what we call "patterning your weaknesses."
Once you know the pattern, you can take action. It's far more efficient than just "listening a lot" without a clear goal.
Benefit 2: Reduce the Number of Words You Can Read but Not Hear
Do you know the word "comfortable"? Most people do.
But when you hear it spoken, native speakers pronounce it as /ˈkʌmftəbl/ (a three-syllable word closer to "kumf-tuh-bl"), not the four-syllable "com-for-ta-ble" that some learners might expect. Words where the pronunciation in your head doesn't match the actual sound are impossible to catch, no matter how many times you hear them.
When you experience being unable to write down a word you thought you knew during dictation, you are forced to notice this discrepancy. This becomes an opportunity to relearn the word with its correct pronunciation.
Benefit 3: Your Ears Get Used to the Rhythm of English
English has "strong" words and "weak" words. Nouns, verbs, and adjectives are stressed, while articles, prepositions, and pronouns are often unstressed.
Many languages do not have this stress-timed rhythm, which can make unstressed words sound like they're not even there. Through repeated dictation, you develop an ear that can predict, "Ah, there should be a weak sound here."
Which Type Are You? 5 Common Listening Mistake Patterns for English Learners
This is the core of this article.
Based on analysis of dictation results in the ELT field, the listening mistakes of many English learners can be grouped into the following five patterns. Identifying which type you are is the first step toward the most efficient improvement.
Pattern 1: Small Words Disappear
Symptoms: You can't write down the "'ve" in "I've been." You miss the "for" or "an" in "for an hour."
In English, short words like "a," "the," "of," "for," "to," "him," and "her" are pronounced incredibly softly and quickly in conversation (these are called "weak forms").
For example, the /h/ in "him" often disappears within a sentence, making it sound like "im." This is why "Can you tell him?" can sound like "Can you tell-im?"
In particular, many learners find it difficult to catch the pattern where the /h/ in him / her / his disappears.
Solution: First, understand the principle that small words are pronounced weakly. Then, practice consciously listening for words like "a," "the," "of," and "for" in short sentence dictations.
Pattern 2: Word Boundaries Become Unclear
Symptoms: "pick it up" sounds like one single block of sound, and you can't tell how many words there are.
In English, when a word ends in a consonant and the next word starts with a vowel, the sounds are linked together (linking).
- "pick it up" → sounds like "pi-ki-tup"
- "turn it on" → sounds like "tur-ni-ton"
- "check it out" → sounds like "che-ki-tout"
In dictation, this appears as a "boundary error," where you write one word too many or too few.
Solution: Learn the common linking patterns (consonant + vowel connections) as knowledge, and then repeatedly practice dictation with materials that include these patterns.
Pattern 3: Final Sounds of Words Are Inaudible
Symptoms: "asked" sounds like "ask." The final /s/ in "months" disappears.
Many languages have syllables that almost always end in a vowel. English, on the other hand, has a vast number of words that end in consonants. Furthermore, in conversation, these final consonants can be weakened or changed by the sound of the next word.
- "asked" → The final /t/ sound is almost inaudible.
- "next day" → The /t/ disappears, sounding like "nex day."
- "ten people" → The /n/ changes to /m/, sounding like "tem people."
Solution: Short sentence dictation focusing on final -s, -t, -d, and -ed is effective. Get into the habit of listening with the expectation that "there should be a sound at the end of the word."
Pattern 4: Unable to Recognize Known Words
Symptoms: "water" doesn't sound like "wah-ter." "better" sounds like a different word.
In American English, when a /t/ sound is between two vowels, it often changes to a sound similar to a soft 'd' (this is called flapping).
- "water" → sounds closer to "wadder"
- "better" → sounds closer to "bedder"
- "get it" → sounds closer to "geddit"
If you have a fixed, non-native pronunciation in your head, it won't match the actual sound, leading to the phenomenon of not being able to recognize a word you actually know.
Solution: Check the actual pronunciation of the word you couldn't hear and overwrite the incorrect version in your mind. The pronunciation feature in dictionary apps is very useful for this.
Pattern 5: You Can Transcribe It, but You Don't Understand the Meaning
Symptoms: You can manage to write down each word, but if someone asks, "So, what was it about?" you can't answer.
This happens when you expend all your mental energy on decoding the sounds, leaving no capacity to think about the meaning. Unlike Patterns 1-4, the cause here is "slow processing speed" for listening.
Solution: At this stage, shadowing is more effective than dictation. By increasing your listening speed and "automating" the process, you create mental space to think about the meaning (more on this later).
It will become immediately clear which pattern applies to you once you actually try dictation. The next section explains the correct way to do it.
How to Do Dictation Effectively: A 6-Step Guide
The biggest drawback of dictation is that it's time-consuming. Trying to transcribe a long piece of audio can take over 30 minutes per session.
Here, we introduce a "diagnostic-focused" method that can be completed in 15-20 minutes per session.
Step 1: Choose Your Material
Select a short audio clip, about 30-60 seconds long. There are three key points when choosing:
- A level where you can understand about 70-80% (if it's too difficult, the whole page will be blank, which isn't diagnostic).
- Material that comes with a script (transcript) (it's pointless if you can't check your answers).
- A topic that interests you (this helps with motivation to continue).
Here are some recommended materials by level.
Approx. Level | Recommended Material | Features |
|---|---|---|
Beginner-Intermediate (TOEIC 500-650) | VOA Learning English | Slower speed, full scripts available |
Intermediate (TOEIC 650-800) | BBC Learning English / TED-Ed | Near-natural speed. Wide range of topics |
Intermediate-Advanced (TOEIC 800+) | BBC 6 Minute English / Podcast | Natural native conversation |
Test Preparation | TOEIC Part 1-2 / IELTS Listening Section 1-2 | Short sentences, ideal for pinpointing weaknesses |
Step 2: Listen Once Without Writing
First, put your pen down and listen to the entire clip. The goal here is just to get a general idea of who is talking and what they are talking about. If you try to write from the start, you'll get bogged down in details and lose the big picture.
Step 3: Pause and Write, Sentence by Sentence
On the second listen, pause the audio after each sentence and write it down.
For any parts you can't understand, leave a blank space (__). If you think you hear something but aren't sure, you can jot it down phonetically. However, treat these phonetic notes as temporary placeholders and be sure to replace them with the correct English words later.
Step 4: Listen One More Time to Fill in the Blanks
On the third listen, try to fill in the blank spaces.
Important rule: Stop after three listens. No matter how many times you repeat it, you won't hear what you can't hear. It's far more efficient to move on, look at the script, and confirm what you missed than to keep trying.
Step 5: Check Against the Script and "Categorize" Your Mistakes
This is the most important step. Most people look at the script, say, "Oh, so that's what it was," and stop there. That's like getting a health check-up and going home without looking at the results.
Categorize the parts you missed according to the five patterns introduced in the previous section.
- Missed a / the / for → Pattern 1 (Disappearing small words)
- Mistook word boundaries → Pattern 2 (Linking)
- Couldn't write final -s / -ed → Pattern 3 (Final consonants)
- Heard a known word as a different sound → Pattern 4 (Pronunciation gap)
- Wrote it down but didn't understand the meaning → Pattern 5 (Processing speed)
If you keep a record in a notebook or memo app, like "Today: 3 instances of Pattern 1, 2 instances of Pattern 3," you'll start to see the trends in your weaknesses.
Step 6: Say the Parts You Couldn't Hear Out Loud
Finally, read the parts you couldn't hear out loud while looking at the script.
You can learn to hear the sounds you can produce yourself. Conversely, sounds you can't hear are often sounds you can't pronounce correctly yourself. Use reading aloud to internalize "so this is how that sound is supposed to be."
If you have time, doing shadowing (repeating the audio just after you hear it, without looking at the script) will further solidify your learning.
Dictation vs. Shadowing: Which Should You Do First?
We often get the question, "Which is better, dictation or shadowing?" The answer isn't "one or the other"; the correct approach is to use them at different stages.
These two methods have completely different roles.
Dictation = A health check-up. Its purpose is to reveal "what you can't hear" and identify your weaknesses.
Shadowing = Strength training. Its purpose is to increase your listening speed and build your sound-processing power.
The criteria for choosing which to use are simple.
"I can understand it when I read the script, but not when I listen." → Start with dictation to identify what you're not hearing.
"I can mostly understand it, but I can't keep up with the speed." → You're at the stage for shadowing to improve your processing speed.
We recommend alternating them in a cycle every 2-3 weeks.
- Identify weaknesses with dictation (e.g., "I always miss the weak forms of 'for' and 'to'").
- Shadow with the same material (to drill the weak sounds into your ears and mouth).
- Do dictation again with new material (to check if the weakness has improved).
By following this "diagnose → train → re-diagnose" cycle, you will improve much more reliably than by simply doing a large volume of unfocused practice.
→ For a detailed guide on how to do shadowing, click here.
Effective Shadowing Techniques: How Prosody Shadowing Automates Auditory Perception
Summary
The biggest reason for stalled progress in listening is not knowing what you can't hear.
Dictation is a health check-up for your listening skills that makes these "invisible weaknesses" visible.
The process is simple: listen to a short audio clip three times, write it down, check it against the script, and categorize your listening mistakes. Just by doing this, you'll be able to decide for yourself what you need to practice next.
A single session of dictation can teach you far more than a year of passive listening.



