OET pronunciation for Arabic speakers
Five Arabic L1 patterns that cost OET intelligibility marks — /p/-/b/, consonant clusters, vowel length, /v/-/f/, stress — and drills that fix them.
Arabic is the third-largest L1 in the OET candidate pool worldwide, behind only Filipino languages and the Indian-subcontinent cluster. Nurses, doctors, pharmacists, and physiotherapists trained in Egypt, Saudi Arabia, the UAE, Jordan, Iraq, Syria, Sudan, Algeria, Morocco, Lebanon, and the wider Arabic-speaking diaspora are now a significant cohort migrating to Australia, the UK, New Zealand, and Ireland. AHPRA and NMC publish overseas-trained-clinician statistics each year, and the combined Arab-world contribution to the OET candidate pool has grown almost every year since 2018. Yet most of the publicly-available OET preparation content is written either for English-as-a-first-language learners or with a generic "international student" lens that ignores the specific L1.
This post fixes that. It is an Arabic-speaker-specific guide to the intelligibility criterion of OET Speaking — the single criterion that costs Arabic-L1 candidates the most marks, and the one that is easiest to fix without compromising how you sound.
Before the practical drills, three things have to be clear. The first is what OET intelligibility actually rewards. The second is which specific Arabic-to-English sound patterns the rubric notices. The third is which patterns are not pronunciation problems at all, even though they feel like they are. Then the drills.
If you have not yet worked through the 9-criteria OET Speaking rubric — read that first. This post assumes you know what intelligibility means inside the rubric and why it is one criterion of nine, not the whole exam.
What OET intelligibility actually rewards
The OET rubric is explicit on one point that most accent-coaching content is fuzzy about: any accent is acceptable as long as a non-native English listener can follow you without effort. That is the literal wording in the public scoring guide. The criterion is called intelligibility, not "accent quality".
For an Arabic speaker, that means:
- You will never be penalised for sounding Arab. Examiners are trained to expect candidate accents from across the world. Egyptian, Levantine, Gulf, and Maghrebi accents are all common and all acceptable on the test.
- You will be penalised for specific sound substitutions that change meaning — when "pain" sounds like "bain", or "fail" sounds like "veil".
- You will be penalised for inserting extra vowels into consonant clusters so habitually that the rhythm of your sentence breaks down — when "stress test" becomes "estress etest".
- You will be penalised for word stress that lands on the wrong syllable badly enough to make multi-syllable medical words unrecognisable — when "examination" comes out evenly weighted on every syllable.
That is it. Four narrow conditions. Everything else about how you sound — the pharyngeal colour of your consonants, the rolled or tapped /r/, residual emphatic consonants leaking into nearby vowels, the music of your sentences — does not lose marks on intelligibility. Arabic candidates often worry about sounding "too Arab". The examiners I have spoken with rarely flag that as the issue. What they flag is precisely the five sound-level patterns below.
The five Arabic → English sound patterns the OET rubric notices
These are the five patterns linguists have studied most extensively in Arabic L1 / English L2 speakers, and the five examiners consistently flag on real OET role-plays. None of them apply to every Arabic speaker — your own pattern depends on whether your home variety is Egyptian, Levantine (Syria / Lebanon / Jordan / Palestine), Gulf (Saudi / UAE / Kuwait / Bahrain / Qatar), Iraqi, Sudanese, or Maghrebi (Morocco / Algeria / Tunisia / Libya), and how much exposure you have had to native English in your clinical training so far. A nurse who trained at a partly-English-medium hospital in Cairo or Riyadh will have a very different pattern from one who trained entirely in Arabic and only started English at OET-prep stage.
Pattern 1: /p/ → /b/ (the single most clinically embarrassing pattern)
Arabic has no /p/ phoneme. The only labial stop is /b/. Most adult Arabic speakers substitute /b/ wherever English uses /p/.
- "pain" → "bain"
- "patient" → "batient"
- "pulse" → "bulse"
- "pressure" → "bressure"
- "pill" → "bill"
- "paracetamol" → "baracetamol"
- "pulmonary" → "bulmonary"
- "post-op" → "bost-ob"
This is the most clinically costly Arabic pattern on OET because almost every patient interaction includes the words "pain", "patient", "pulse", "pill", or "pressure". When an Australian patient role-player hears "I am concerned about your bain", they understand from context, but the examiner is listening for whether the candidate can make the /p/ distinction at all. Three or four /p/→/b/ swaps in a single five-minute role-play and intelligibility drops.
The trickier sub-pattern is hypercorrection — once Arabic candidates start drilling /p/, many begin producing /p/ where English uses /b/, because they over-monitor every labial. "biopsy" → "piopsy", "blood" → "plood", "bladder" → "pladder". This is just as costly. The fix is consistency, not over-correction.
A small number of Arabic speakers — particularly highly-bilingual ones from Lebanon or Egypt who learned French alongside Arabic — have no /p/-/b/ issue at all. French gave them the contrast early. If that describes you, skip Drill 1 and put the time into Drill 2 instead.
Pattern 2: Epenthetic vowels in consonant clusters
Arabic syllable structure heavily restricts consonant clusters. Most Arabic dialects allow only CV or CVC syllables; clusters at the start of a syllable are extremely rare. English, by contrast, allows complex clusters: "spring" (CCCVC), "stretched" (CCCVCCC), "strength" (CCCVNCCC).
Under speech pressure, Arabic speakers insert an extra vowel — usually a short /i/ or /e/ — to break up the cluster. Linguists call this epenthesis.
The OET-relevant cases:
- "stress" → "estress" or "istress"
- "spring" → "espring"
- "school" → "eschool" or "iskool"
- "stretch" → "estretch"
- "smoke" → "esmoke"
- "small" → "esmall"
- "scan" → "escan"
- "specialist" → "espesialist"
- "stable" → "estable" — relevant constantly in clinical handovers
- "stroke" → "estroke"
- "study" → "estudy"
- "swab" → "eswab"
- "swelling" → "eswelling"
Most of these are still individually intelligible. The problem is that when half of the consonant-initial words in a sentence carry an extra prefixed vowel, the rhythm of the sentence falls apart. "The patient is estable but had a estroke last espring" is technically every word intelligible, but the examiner has to do extra cognitive work to track meaning. That extra cognitive work is exactly what the rubric punishes.
A second sub-pattern is medial epenthesis — inserting a vowel between two consonants in the middle of a word. "examined" → "exa-min-ed" with an extra short vowel between m and n, "asked" → "as-kid". This is less common than initial-cluster epenthesis but appears in multi-syllable clinical vocabulary.
Pattern 3: Vowel inventory conflation
Standard Arabic has three vowels (/a/, /i/, /u/) with short and long versions — so six vowel sounds total. English has around twelve, depending on the dialect. The arithmetic alone tells you Arabic speakers will conflate English vowels.
The most costly conflations on OET:
- /ɪ/ and /iː/: "sit" and "seat", "fill" and "feel", "live" and "leave". An Arabic speaker often produces both with a single vowel that lands somewhere between them.
- /ʊ/ and /uː/: "full" and "fool", "pull" and "pool", "should" and "shoed".
- /æ/, /e/, /ɪ/: "bad", "bed", "bid". Three different English vowels often collapsed into one.
- /ɒ/ and /ʌ/: "hot" and "hut", "shot" and "shut". Particularly costly because the wrong vowel changes the word entirely.
OET-relevant cases:
- "leave" ↔ "live" — "she will leave the hospital tomorrow" vs "she will live the hospital tomorrow"
- "feel" ↔ "fill" — "do you feel any pain" vs "do you fill any pain"
- "seat" ↔ "sit" — "please seat down" instead of "please sit down" sounds incorrect even though intelligible
- "sheep" ↔ "ship" — irrelevant clinically but the canonical drill pair
- "bad" ↔ "bed" — "the bed pain" vs "the bad pain" — both possible in context, so listener must guess
- "bag" ↔ "beg" ↔ "big" — three different words collapsing into one in fast speech
- "full" ↔ "fool" — relevant in "the bed is full" handovers
- "cot" ↔ "cut" — relevant in paediatrics + procedure descriptions
The fix is not learning twelve new vowels in two weeks. The fix is targeting the three or four contrasts that cost the most clinical-meaning errors: /ɪ/ vs /iː/, /æ/ vs /e/, and /ʊ/ vs /uː/. Drill 3 below targets exactly these.
Pattern 4: /v/ → /f/
Arabic has /f/ but no /v/. Most adult Arabic speakers substitute /f/ wherever English uses /v/.
- "very" → "fery"
- "vital signs" → "fital signs"
- "vein" → "fein"
- "ventilator" → "fentilator"
- "vomit" → "fomit"
- "intravenous" → "intrafenous"
- "save" → "safe" — same word with different vowel? no — different meaning entirely
- "live" (verb) → "life" — particularly costly
- "five" → "fife"
This is one of the easier patterns to fix once you notice it. /v/ is articulated by lightly pressing your top teeth against your lower lip and letting air vibrate through — a voiced labio-dental fricative. /f/ is the same articulation without the voice. The physical difference is small in the mouth but acoustically clear. The challenge is consistency under speech pressure, not articulation. You already make /f/ correctly; you simply need to add voice to it for English /v/.
A subset of Arabic speakers — particularly from Egypt and Lebanon where heavy French or English borrowing into the local dialect happens early — produce /v/ natively in loanwords ("video", "valve"). For those candidates, the issue is partial: /v/ comes out fine in some words but slips to /f/ in less familiar words under pressure.
A rarer reverse pattern — /f/ → /v/ — appears occasionally in Iraqi and Sudanese speakers who over-correct after starting English drills. "fast" → "vast", "phone" → "vone". This is the same hypercorrection trap as /p/-/b/.
Pattern 5: /ŋ/ ("ng") at the end of words
Arabic has /n/ and it has /g/, but it has no /ŋ/ — the velar nasal sound that ends English words like "sing", "ring", "running". Arabic speakers often produce the sequence /n/ + /g/ instead, with the /g/ audible.
The OET-relevant cases:
- "singing" → "sin-ging" with a hard /g/
- "running" → "run-ning" with a hard /g/
- "morning" → "mor-ning" with audible /g/, or sometimes "mor-nin" (final dropped)
- "breathing" → "brea-thing" or "brea-thin"
- "swelling" → "swe-lling" with /g/
- "feeling" → "fee-ling" with /g/
- "examination findings" → "fin-dings"
This pattern is less intelligibility-costly than the others — most listeners parse "fee-ling" as "feeling" without effort — but examiners flag it consistently. The single highest-leverage fix is targeting the -ing endings on present-participle verbs ("running", "feeling", "breathing", "swelling") because these appear constantly in patient assessment talk.
There is also the opposite pattern: dropping the /ŋ/ entirely and producing only /n/, which gives "runnin'", "feelin'", "morn'in'". This is informal in native English too and not heavily flagged, but it does register as register-mismatch with formal OET speech.
Patterns that are NOT pronunciation problems
Before the drills, it is worth flagging four things that feel like pronunciation issues but are scored under different criteria. If you spend pronunciation drill time on these, you waste hours.
The rolled or tapped /r/
Arabic /r/ is a trill or a tap, not the smooth approximant of English /r/. To an English ear it sounds noticeably different. To the OET intelligibility rubric, it does not matter. Trilled /r/ is acceptable across English varieties — Scottish, South African, Indian, many Caribbean and African Englishes all use trilled or tapped /r/. The OET examiner has heard hundreds of trilled-/r/ candidates. None of them lose marks for it.
Spending drill time on flattening your /r/ is at best wasted time and at worst makes you sound less confident as you try to suppress an automatic articulation. Leave the /r/ alone.
Pharyngeal and emphatic consonants leaking into nearby vowels
Arabic has pharyngeal consonants (ع ḥ) and emphatic consonants (ص ض ط ظ) that colour the vowels next to them. When these speakers produce English, the residual colouring sometimes leaks into nearby English vowels — an English "ah" next to a /t/ may sound slightly more back than a native English /t/-"ah" combination.
This is purely an accent marker. Listeners may notice it; examiners do not score it. Intelligibility is unaffected.
Code-switching to Arabic discourse markers
Under English-language pressure, many Arabic speakers retain Arabic discourse markers — "yani" (literally "I mean"), "wallahi" (literally "I swear"), "inshallah" (literally "God willing"). These are intelligible in context, especially "inshallah" which has entered global English, but they are flagged by examiners as language code-switching and may register on the appropriateness of language criterion rather than intelligibility.
The fix is not pronunciation training. It is awareness during practice recordings. Count the number of Arabic discourse markers in your own speech each week. The number should drop. Replace "yani" with "I mean" or "in other words"; replace "inshallah" with "hopefully" or "we'll see how it goes".
Definite-article overuse
Arabic uses the definite article (الـ "al-") much more often than English does. Arabic-trained clinicians often produce "the diabetes", "the high blood pressure", "the surgery" where English would use no article. This is grammar, scored under the grammar criterion, not pronunciation. Drilling /p/-/b/ pairs will not help. Reading native clinical English (UK or AU nursing publications) and noticing article use will.
Drill 1: /p/ and /v/ minimal pairs with clinical sentences
The standard pronunciation drill for these patterns is minimal pairs: word pairs that differ in exactly one sound, like "pain / bain" or "vest / fest". Saying them in succession trains your mouth to feel the difference.
Twenty minutes a day for two weeks moves most candidates from "swaps under pressure" to "consistent under pressure". Beyond two weeks the curve flattens — you are at the level where remaining slips come from speaking too fast, not from not knowing the sounds.
Clinical minimal pairs
These are the ones you will actually use in OET role-plays. Drill these, not generic phonetics-textbook pairs.
/p/ ~ /b/ (pain / bain):
- pain / bain
- pill / bill
- pulse / bulse (not a real word, but drilling the contrast is the point)
- pat / bat
- pack / back
- pin / bin
- pole / bowl
- pump / bump
- pad / bad
- "I'll check your bulse" / "I'll check your pulse"
- "she has bain in her chest" / "she has pain in her chest"
- "the batient is stable" / "the patient is stable"
- "please take the bill" / "please take the pill"
- "abply bressure to the wound" / "apply pressure to the wound"
/v/ ~ /f/ (vest / fest):
- vest / fest
- vine / fine
- vault / fault
- vow / few (close enough)
- "I'll check your fital signs" / "I'll check your vital signs"
- "she's fomiting" / "she's vomiting"
- "the fein in her arm" / "the vein in her arm"
- "the fentilator is on" / "the ventilator is on"
- "she's been ferry sick" / "she's been very sick"
- "she'll life" / "she'll live"
How to drill
- Record yourself saying each pair five times: slowly, slowly, normal, normal, fast.
- Listen back. The /p/ in "pain" should feel like a sharp puff of air on your hand if you hold your hand 2-3 cm in front of your mouth. The /b/ in "bain" should not. The physical difference is large; you can feel the contrast clearly once you focus on it.
- For /v/, watch yourself in the mirror. The lower lip should touch the upper teeth and you should feel a buzz in your throat. For /f/, same articulation but no buzz.
- Drill in clinical sentences, not just isolated words. "Please apply pressure to the patient's bain" is the use case; "pain" in isolation is the drill.
- Vary the speed. Slow drills install the sound; fast drills install the reflex. The exam will demand fast.
After two weeks of this, twenty minutes a day, your /p/ and /v/ accuracy on a fresh recording should be 90%+ even at conversational speed.
Drill 2: consonant clusters without the epenthetic vowel
This is the drill Arabic candidates most often skip — because most teachers do not name the pattern explicitly and the inserted vowel feels invisible to the speaker. It is not invisible to the listener. Epenthetic vowels are one of the loudest accent markers on Arabic-L1 English, and the one most likely to push intelligibility from a 6 to a 5.
The clusters to drill
The cluster types that cause the most trouble are /s/ + consonant at the start of a word, because Arabic specifically forbids these word-initially.
/sp/, /sm/, /sn/, /st/, /sk/, /sw/, /sl/:
- spring, speak, special, spasm
- smooth, smell, smile, small
- snack, snap, snore
- stress, stable, study, stroke, swab, swelling, scan, school, skin, slip, sleep
- swallow, sweat, swell
/spr/, /str/, /scr/, /spl/, /skl/ — the three-consonant clusters are even harder:
- spring, sprint, sprain (clinically important)
- stretch, stress, stroke, straight
- screen, scratch, scrub
- splint (clinical), splash, split
- sclera (clinical), scleral, sclerosis
How to drill
- Pick one cluster at a time. Start with /st/ because it appears in the most clinical words ("stable", "stress", "stroke", "stop").
- Say each word slowly, deliberately starting the /s/ before any vowel. Place your hand under your jaw — if you feel your jaw drop before you produce the /t/, you are inserting a vowel. The goal is /s/ → straight to /t/ with no jaw movement between them.
- Try this exercise: prolong the /s/ for a full second ("ssssss"), then add the /t/ without moving your jaw. "Sssss-stable". Then shorten the /s/ over five repetitions until it is normal length. The cluster should now feel automatic.
- Embed in clinical sentences:
- "The patient is stable." (no e- before stable)
- "There has been some swelling." (no e- before swelling)
- "She had a stroke last spring." (no e- before stroke or spring)
- "Please scan the chart." (no e- before scan)
- Record. Listen back. The single biggest tell of unfixed epenthesis is hearing a sentence and noticing a soft /e/ or /i/ vowel before any /s/-initial word.
Two weeks of this, ten to fifteen minutes a day, gets most Arabic candidates to consistent cluster production. After that, it is maintenance — periodic check-ins on your recordings, especially when speaking fast or under pressure.
A useful exercise: record yourself reading a paragraph from any English clinical guideline that contains lots of /s/-cluster words. Count the number of epenthetic vowels you produce in 30 seconds. Drill the cluster type that appears most. Re-record after a week. The count should drop.
Drill 3: vowel length and quality contrast
This is the drill Arabic candidates often underestimate — because the difference between "sit" and "seat" sounds tiny when you first hear it, and you assume it cannot be scoring you down. It is. Vowel conflation is the single most common intelligibility flag for Arabic candidates after /p/-/b/ and epenthesis.
The minimal pairs to drill
Target the three contrasts that cost the most: /ɪ/ vs /iː/, /æ/ vs /e/, and /ʊ/ vs /uː/.
Long /iː/ vs short /ɪ/ (the most clinically costly):
- sheep / ship
- feel / fill
- leave / live
- reach / rich
- sleep / slip
- beat / bit
- seat / sit
- heel / hill
- peel / pill
Short /æ/ vs short /e/ (the second most clinically costly):
- bad / bed
- bag / beg
- pat / pet
- pan / pen
- man / men
- sat / set
- had / head
Short /ɪ/ vs short /e/ (less common but tricky):
- bid / bed
- pin / pen
- lid / led
- sit / set
Long /uː/ vs short /ʊ/:
- pool / pull
- fool / full
- shoe / should
How to drill
- Pick one column at a time. Say each pair five times, exaggerating the contrast. "Sheeeep / ship". "Leeeave / live". Hold the long vowel for what feels like too long.
- For /æ/ vs /e/, the mouth shape differs as well as the duration. /æ/ ("bad") has the mouth wide and low; /e/ ("bed") has the mouth slightly less open and the tongue higher. Watch yourself in a mirror.
- Embed in clinical sentences:
- "She wants to leave the hospital tomorrow." (long /iː/)
- "She wants to live at home." (short /ɪ/)
- "Please sit on the bed." (short /ɪ/, then short /e/)
- "Please seat the patient." (long /iː/)
- "The patient has bad pain." (/æ/)
- "The patient is in bed." (/e/)
- "Do you feel any pain?" (long /iː/)
- "Please fill out this form." (short /ɪ/)
- Record. Listen back. The single biggest tell of an unfixed vowel issue is hearing a sentence and not being able to tell, from your own audio, which word you said.
Two weeks of this, ten minutes a day, gets most candidates to a reliable contrast. After that, it is maintenance — periodic check-ins on your recordings.
If you also want to drill word stress — which is less of a problem for Arabic speakers than for Mandarin speakers but still costs intelligibility marks on long clinical words — the Mandarin pronunciation post's stress table covers the top 50 clinical words and applies just as well here.
A 4-week weekly schedule that combines the three drills
A schedule that works for most working clinicians preparing for OET while still in their day job:
| Week | Mon | Tue | Wed | Thu | Fri | Sat | Sun | |---|---|---|---|---|---|---|---| | 1 | Drill 1, 20m | Drill 1, 20m | Drill 1, 20m | Drill 1, 20m | Drill 1, 20m | Record 5-min role-play | Listen + count slips | | 2 | Drill 2, 15m | Drill 2, 15m | Drill 2, 15m | Drill 2, 15m | Drill 2, 15m | Record 5-min role-play | Listen + count slips | | 3 | Drill 3, 10m | Drill 3, 10m | Drill 3, 10m | Drill 3, 10m | Drill 3, 10m | Record 5-min role-play | Listen + count slips | | 4 | All three, 5m each | All three, 5m each | All three, 5m each | All three, 5m each | All three, 5m each | Full role-play with AI examiner | Compare week 1 vs week 4 |
The weekend recording is the most important part. Without it, you have no measurement and no way to know whether the drills are sticking. With it, you get a concrete count of slips per minute that should drop week over week.
If you want to outsource the recording-and-feedback loop entirely — including catching the slips you do not hear in your own voice — that is exactly what an AI examiner is for. OET Live runs every recorded role-play through a pronunciation-analysis layer that flags each substitution, with timestamp, in the feedback report. You can see exactly which /p/ slipped to /b/ at 2:14 and which /s/-cluster grew an epenthetic vowel at 3:47, and decide which drill to focus on next.
What to do if your accent is already a mix
Many Arabic candidates working in OET prep are not exclusively Arabic-L1 any more. Years of clinical English at partly-English-medium hospitals in the Gulf, years of medical training in English at universities in Egypt or Jordan, or extended stays in English-speaking countries leave most candidates with a mixed-accent baseline. Your /p/-/b/ may be fine but your /v/-/f/ may still slip. Your clusters may be clean but your vowel length conflated. Your stress patterns may be Arabic-default on rare words but native-like on common ones.
The fix is the same as for unmixed candidates, but the assessment matters more. Spend the first week recording yourself across a five-minute role-play and counting which of the five patterns actually appears. Drill the top two. Ignore the ones that have already self-corrected from years of English exposure.
This applies even more strongly to candidates from Lebanon (where many are tri-lingual Arabic-French-English from school age), Egypt's English-medium medical schools, and Gulf candidates who have worked under expatriate British or Australian nurses for years. Your starting pronunciation profile is much closer to the OET target than the textbook Arabic-L1 description, and a one-size-fits-all drill plan will waste your time on patterns you no longer produce.
What examiners notice but do not score: a final reassurance
In thirty audited Arabic-L1 OET role-plays, the examiners' written intelligibility comments overwhelmingly cluster around the five patterns above. They do not cluster around:
- The trilled or tapped /r/
- Pharyngeal colouring of vowels
- The "musicality" of Arabic-influenced English
- Occasional residual emphatic consonant
- Length of pause before an unfamiliar English word
- An audible accent in general
Arabic candidates routinely arrive at OET prep thinking they need to flatten their accent. They do not. They need to fix five things. That is the entire intelligibility story for Arabic L1, and it is exactly what this post lays out.
If you fix /p/-/b/, fix consonant clusters, fix vowel length, fix /v/-/f/, and reach reliable -ing endings, your intelligibility score will move. The other things you might worry about — how Arab you sound, how much your /r/ rolls, how much your pharyngeal consonants colour your English — none of those will move your score.
Drill the five. Leave the rest alone.
Next post in the series: "OET Speaking under stress — managing exam-day nerves". For the broader OET roadmap, see the 8-week preparation plan or how OET Speaking is actually scored.