How to Use This Chapter
This chapter targets predictable beginner errors with short, repeatable correction routines. Work on one contrast at a time. For each contrast: (1) set the mouth shape, (2) produce the sound in isolation, (3) add a vowel, (4) use a minimal pair, (5) put it into a short sentence, (6) check meaning with a quick question.
| Practice order (recommended) | Time | Goal |
|---|---|---|
| Mouth-shape checkpoint | 20–30 sec | Stop the wrong sound before it starts |
| Isolated sound → syllable | 1–2 min | Make the contrast audible |
| Minimal pairs | 2–3 min | Stabilize the difference |
| Sentence + comprehension | 2–3 min | Make the contrast functional |
(1) Confusable Consonants: One Contrast at a Time
A. j/q/x vs zh/ch/sh (front vs retroflex)
Typical error: producing j/q/x with a curled-back tongue (retroflex), making them sound like zh/ch/sh.
Mouth-shape checkpoints
- j/q/x: tongue tip stays down (near lower teeth); the front of the tongue rises toward the hard palate; lips often slightly spread.
- zh/ch/sh: tongue tip curls back (retroflex); lips may round slightly; the sound feels farther back.
Fix routine (2 minutes)
- Freeze the tongue tip: lightly touch the tongue tip to the back of the lower front teeth. Keep it there.
- Make the “front” friction: produce
x(like a soft hiss) while keeping the tip down. - Add vowels:
xi,xue,xiao(slow, then normal). - Contrast drill: alternate
xi↔shiwhile watching that the tongue tip never curls forxi.
Minimal pairs (say A-B-A-B)
- Listen to the audio with the screen off.
- Earn a certificate upon completion.
- Over 5000 courses for you to explore!
Download the app
xivsshijiavszhaqianvschanxuevsshuo(not identical finals, but useful for “front vs back” feel)
B. z/c/s vs zh/ch/sh (dental/alveolar vs retroflex)
Typical error: curling the tongue for z/c/s, turning them into zh/ch/sh.
Mouth-shape checkpoints
- z/c/s: tongue tip points forward toward the ridge behind the upper teeth (no curl); airflow is sharp and forward.
- zh/ch/sh: tongue tip curls back; sound is darker and farther back.
Fix routine (90 seconds)
- Smile slightly: this helps keep the tongue forward.
- Tap test: say
sssand feel air hit the teeth area; if it feels “hollow/back,” you’re drifting tosh. - Build syllables:
sa,se,si; then contrast withsha,she,shi.
Minimal pairs
sivsshisanvsshancevschezongvszhong
C. r vs l (frictiony “r” vs clear “l”)
Typical error: replacing r with l (or making r too English-like).
Mouth-shape checkpoints
- l: tongue tip touches the ridge behind upper teeth; air flows around the sides (clear, open).
- r: tongue tip is near (not pressing firmly); the sound has friction and feels slightly farther back; lips may round a bit.
Fix routine (2 minutes)
- Lock in “l” first: hold
llllwith the tongue tip firmly touching. - Release to “r” position: move the tongue tip slightly back and off the ridge (do not touch), add gentle friction:
rrrr. - Add vowels:
la↔ra,li↔ri,lou↔rou.
Minimal pairs
livsrilanvsranluvsruleivsrei(rare syllable; use as a mouth drill if helpful)
D. h vs f (common in certain accents)
Typical error: swapping h and f, especially before u or in fast speech.
Mouth-shape checkpoints
- f: upper teeth touch lower lip; clear lip friction.
- h: no teeth-on-lip contact; friction comes from the throat/back of mouth.
Fix routine (60–90 seconds)
- Contact check: say
fffwhile feeling teeth on lip. Then sayhhhwith lips relaxed and open. - Pair drill:
fa↔ha,fei↔hei,fu↔hu. - Speed ladder: slow (clear contact/no-contact) → normal → slightly fast without losing the checkpoint.
(2) Confusable Finals: Targeted Fixes
A. e vs o (avoid “uh” drift and rounding mistakes)
Typical error: rounding e into something like o, or making o too open/flat.
Mouth-shape checkpoints
- o: lips rounded and forward.
- e: lips more relaxed/neutral; tongue feels slightly higher and flatter; no strong rounding.
Fix routine (2 minutes)
- Silent shape: form
o(round lips) for 2 seconds; then switch toe(relax lips) for 2 seconds. Repeat 5 times. - Syllable pairs:
bo↔be,po↔pe,mo↔me. - Record-check: if your
esounds rounded, redo with a slight smile.
B. “i” after z/c/s/zh/ch/sh/r: apical vowel vs regular i
Typical error: pronouncing zi/ci/si like ji/qi/xi, or pronouncing zhi/chi/shi/ri with a clear “ee” vowel.
Key idea: in zi/ci/si and zhi/chi/shi/ri, the written i is not the regular “ee”-like vowel. It’s an apical vowel: the tongue position is driven by the consonant, and the vowel is more like a sustained continuation of that consonant’s resonance.
Mouth-shape checkpoints
- Regular i (as in
mi): lips slightly spread; tongue high and front; clear vowel quality. - Apical “i” (as in
si/shi): keep the consonant shape; do not “switch” into a brightee.
Fix routine (2 minutes)
- Hold the consonant: sustain
ssssthen gently “voice” intosiwithout changing tongue shape. - Retroflex set: sustain
shhhhthen move intoshiwithout adding a bright vowel. - Contrast with regular i: alternate
si↔xi; thenshi↔xi. Aim for clearly different vowel color.
Minimal pairs
sivsxicivsqizhivsjishivsxi
C. ü omission after j/q/x (keep the front-rounded vowel)
Typical error: saying ju/qu/xu like joo/choo/shoo (losing the front-rounded quality), or replacing ü with u.
Mouth-shape checkpoints
- ü: lips rounded like
u, but tongue is front/high likei. - u: lips rounded; tongue is more back.
Fix routine (90 seconds)
- Two-step build: say
i(tongue front) → keep tongue position → round lips intoüwithout moving the tongue. - Add initials:
jü,qü,xü(slow, then normal). In standard pinyin you’ll seeju/qu/xu, but the vowel is stillü. - Contrast drill: alternate
ju(ü) ↔zhu(u) to force front vs back.
Minimal pairs
juvszhuquvschuxuvsshu
D. -n vs -ng (front nasal vs back nasal)
Typical error: merging -n and -ng so an/ang or in/ing sound the same.
Mouth-shape checkpoints
- -n: tongue tip goes to the ridge behind upper teeth (front closure).
- -ng: back of tongue lifts to the soft palate (back closure); tongue tip stays more relaxed.
Fix routine (2 minutes)
- Feel the closure: say
anand hold the nasal; feel tongue tip contact. Then sayangand hold; feel the back of tongue lift. - Open-close test: for
-n, you can often release into a clear vowel quickly; for-ng, the release feels farther back. - Minimal-pair ladder:
an/ang→en/eng→in/ing.
Minimal pairs
anvsangenvsenginvsingwanvswang
(3) Tone Pitfalls: Fix Drills (Humming → Syllable)
A. Tone 4 becomes “stress” instead of a clean fall
Typical error: making tone 4 loud/tense (like emphasis) rather than a controlled, quick fall in pitch.
Fix drill (3 steps, 2 minutes)
- Tone-only hum: hum a short, clean falling pitch (no extra force). Keep volume steady.
- Add a neutral syllable: hum-fall on
maasmà(focus on pitch movement, not punch). - Contrast with tone 1: alternate
mā(steady) ↔mà(fall). Ifmàsounds like “shouting,” reduce volume and shorten duration.
B. Tone 2 gets flattened (rising disappears)
Typical error: tone 2 becomes mid-level, especially in longer phrases.
Fix drill (3 steps, 2 minutes)
- Tone-only hum: hum a clear rise (start mid, end higher). Keep it smooth, not jumpy.
- Add syllable:
máwith the same rise; avoid adding extra length at the start. - record yourself: the end of tone 2 should be noticeably higher than the start. If it’s flat, exaggerate the rise for 10 reps, then reduce to natural.
C. Tone 3 is over-dipped (too low, too long)
Typical error: producing a full deep dip every time, causing slow, heavy speech and unstable pitch.
Fix drill (3 steps, 2–3 minutes)
- Tone-only hum: hum a low, compact tone 3 target without a dramatic scoop. Think “low and short.”
- Add syllable:
mǎas a low, contained tone; avoid dropping into vocal fry. - Pair contrast: alternate
má(rise) ↔mǎ(low) ↔mà(fall). Your tone 3 should not be the longest one.
Extra control drill (optional): whisper the syllable while keeping the pitch contour in your head; then voice it softly. This reduces the urge to “dig” downward.
(4) Contextual Minimal-Pair Sentences + Quick Comprehension
Read each pair aloud. Your job is to make the sound contrast so clear that the meaning difference is obvious. After each pair, answer the question without looking back.
A. z/c/s vs zh/ch/sh (sound changes meaning)
Pair 1
Wǒ yào sì bēi shuǐ.Wǒ yào shí bēi shuǐ.
Question: In which sentence does the speaker want ten cups of water?
Pair 2
Tā zài sān lóu.Tā zài shān lǐ.
Question: Which sentence says the person is in the mountains?
B. j/q/x vs zh/ch/sh
Pair 1
Qǐng gěi wǒ yī bēi chá.Chǐng gěi wǒ yī bēi chá.
Question: Which one sounds like a polite request (qǐng) rather than a mispronounced syllable?
Pair 2
Tā xiǎng qù xī'ān.Tā xiǎng qù shī'ān.
Question: Which sentence contains a real place name pronunciation (Xī'ān)?
C. r vs l
Pair 1
Wǒ yào liǎng bēi kāfēi.Wǒ yào rǎng bēi kāfēi.
Question: Which sentence clearly means “two cups of coffee” (liǎng)?
Pair 2
Tā shì lǎoshī.Tā shì rǎoshī.
Question: Which sentence clearly says “teacher” (lǎoshī)?
D. h vs f
Pair
Wǒ hěn hǎo.Wǒ fěn fǎo.
Question: Which sentence is a natural way to say “I’m very good” (hěn hǎo)?
E. e vs o
Pair
Wǒ hē chá.Wǒ hōu chá.
Question: Which sentence uses a normal verb “drink” pronunciation (hē)?
F. apical “i” vs regular i
Pair
Tā shì lǎoshī.Tā xì lǎoshī.
Question: Which sentence contains the correct syllable for “teacher” with apical-vowel shi?
G. ü after j/q/x vs u (contrast with zh/ch/sh)
Pair
Wǒ xiǎng qù xuéxiào.Wǒ xiǎng qù shuǐxiào.
Question: Which sentence clearly says “go to school” (xuéxiào)?
H. -n vs -ng
Pair 1
Wǒ zài Běijīng.Wǒ zài Běijīn.
Question: Which sentence correctly pronounces “Beijing” with -ng?
Pair 2
Tā yào mǎi yī běn shū.Tā yào mǎi yī bēng shū.
Question: Which sentence clearly means “buy one book” (běn)?
I. Tone contrasts in meaning (tone-only → syllable → sentence)
Step 1 (tone-only): hum tone 2 vs tone 4: rise ↔ fall. Step 2 (syllable): má ↔ mà. Step 3 (sentence pairs):
Wǒ yào mǎi.vsWǒ yào mài.Tā lái le.vsTā lài le.
Questions: Which sentence means “I want to buy” (mǎi)? Which one means “I want to sell” (mài)?