Polish pronunciation
This is my attempt to describe Polish phonology and phonotactics. The Wikipedia article has a lot more detail, though.
This mainly applies to contexts where there are native words. Loanwords that are unchanged from the original spelling are pronounced regarding the origin language's pronunciation rules. Thus, weekend is pronounced like in English, but Waszyngton (Washington) is pronounced with a V sound instead of a W sound.
IPA transcriptions for phonemes (annotated /.../) are provided, but I tried to avoid going into phones (annotated [...]) as much as possible. Graphemes are annotated ⟨...⟩.
Alphabet
The Polish alphabet extends the Latin alphabet, and has 32 letters. Q, V, and X are only in loanwords. Providing it makes it easier to understand the phonology.
A Ą B C Ć D E Ę F G H I J K L Ł M N Ń O Ó P R S Ś T U W Y Z Ź Ż
a ą b c ć d e ę f g h i j k l ł m n ń o ó p r s ś t u w y z ź ż
Fun fact! X actually used to be used in native words, e.g. xiądz ("priest"). X was removed from the core alphabet at some point, and was replaced with ks or gz, depending on the pronunciation, such as in ksiądz, egzamin ("exam").
Stress
Most words are stressed on the penultimate syllable (as in, the 2nd syllable, but counting from the end). However, there are cases where the stress is on the last syllable, or on the 3rd or 4th syllable from the end.
- 2nd syllable from the end:
- single words not indicated below
- in prepositional phrases with a personal pronoun: do niej ('to her'), na nas ('on us'), przeze mnie ('because of me')
- last syllable:
- many interjections: hura ("hooray")
- initialisms: RP (er-pe, "Republic of Poland")
- unadapted borrowings from French: faux pas
- 3rd syllable from the end:
- words from classical languages:
- words ending with -ika, -yka: rytmika ("rhythmics"), muzyka ("music")
- some longer words (only in the nominative singular, and no derived words with native affixes): uniwersytet ("university"), but uniwersytecki ("relating to a university")
- 1st- and 2nd-person plural past forms: czytaliśmy ("to read": 1-PL-PAST), czytaliście ("to read": 2-PL-PAST)
- 3rd-person plural conditional forms: czytaliby ("to read": 3-VIR-PL-COND), czytałyby ("to read": 3-NVIR-PL-COND)
- words from classical languages:
- 4th syllable from the end:
- 1st- and 2nd-person plural conditional forms: czytalibyśmy ("to read": 1-VIR-PL-COND), czytałybyście ("to read": 2-NVIR-PL-COND)
Vowels
Tables
Modern Polish's oral vowel system is similar to languages like Ukrainian or Indonesian, having two front, two central, and two back vowel phonemes. These vowel positions alone are significant, other factors like length or tone are not.
The close central vowel can be transcribed /ɘ/, /ɨ/, or (more rarely) /ɪ/. This article will use /ɘ/.
| Front | Center | Back | |
|---|---|---|---|
| Close | i | ɘ | u |
| Mid | ɛ | ɔ | |
| Open | a |
While it is debated, there are two nasal vowel phonemes. Some sources support the below table, but others (especially newer ones) don't and consider nasal vowels as combinations of two sounds. In most positions, they would end with /ŋ/ or /w̃/. This article will transcribe those with /w̃/.
| Front | Center | Back | |
|---|---|---|---|
| Mid | (ɛ̃) | (ɔ̃) |
These are the graphemes used to represent each vowel sound:
| Sound | Graphemes |
|---|---|
| /a/ | a |
| /ɛ/ | e |
| /i/ | i |
| /ɔ/ | o |
| /u/ | ó, u |
| /ɘ/ | y |
| (/ɛ̃/) | ę |
| (/ɔ̃/) | ą |
Fun fact! Polish is one of the few modern Slavic languages to preserve nasal vowels in some way. In other Slavic languages, they have merged with other vowels.
Fun fact! The U/Ó split is a relic of when Ó represented the long O sound. At some point, when Polish stopped distinguishing vowels by length, Ó represented /o/, and eventually /u/. Some words have been respelled with the other letter of the two, such as bruzda ("furrow, groove") and ogórek ("cucumber"). The split still exists in some local dialects, such as the Podhale dialect.
In Ą and Ę, the consonant may be reduced or shifted into another one.
| Shift | ą | ę | Condition |
|---|---|---|---|
| denasalization | wziął → /vʑɔw/ (he) took | wzięła → /vʑɛwa/ (she) took | before l or ł |
| addition of /m/ | dąb → /dɔmp/ oak (n) | dębowy → /dɛmbɔvɘ/ oak (adj) | before b/p |
| addition of /n/ | piątka → /pjɔntka/ five (n) | wędka → /vɛntka/ fishing rod | before t/d, c/dz, cz/dż |
| addition of /ɳ/ | wziąć → /vʑɔɲt͡ɕ/ take (v) | pięć → /pjɛɲt͡ɕ/ five (num) | before ś/ź, ć/dź |
| addition of /ŋ/ | mąka → /mɔŋka/ flour | ręka → /rɛŋka/ hand (n) | before k/g |
| no change | wąsy → /vɔw̃sɘ/ mustache | węch → /vɛw̃x/ (sense of) smell | before f/w, s/z, sz/ż/rz, h/ch in ę (name of the letter Ę) |
| no change (Ą) denasalization (Ę) | są → /sɔw̃/ "they are", be-3.PL | się → /ɕɛ/ reflexive pronoun | word-finally |
In careful speech, Ę is pronounced /ɛw̃/ word-finally, except in ę, the name of the letter Ę. Otherwise, it is pronounced /ɛ/.
Because this exception doesn't apply to verbs, it may be confusing if a verb has two present tense forms that end with -ę (1st person singular) and -e (3rd person singular). Despite the fact that Polish is a pro-drop language, they are nonetheless easy to distinguish: dropping pronouns mostly only happens to 1st- and 2nd-person contexts. /piʂɛ tɔ/ by itself is generally understood to be in 1st person: "Piszę to."
Consonants
| Labial | Alveolar | Retroflex | Palatal | Velar | |
|---|---|---|---|---|---|
| Nasal | m | n | ɳ | ŋ | |
| Nasal approximant | (w̃) | ||||
| Plosive | p b | t d | k ɡ | ||
| Affricate | t͡s d͡z | t͡ʂ d͡ʐ | t͡ɕ d͡ʑ | ||
| Fricative | f v | s z | ʂ ʐ | ɕ ʑ | x ɣ |
| Approximant | l | j | w | ||
| Trill | r |
All consonants use the same graphemes as the IPA, except:
| Sound | Graphemes |
|---|---|
| /ɳ/ | ń, n (before ⟨i⟩ + consonant), ni (before vowels) |
| /v/ | w |
| /w/ | ł |
| /t͡s/ | c |
| /t͡ʂ/ | cz |
| /d͡ʐ/ | dż |
| /ʂ/ | sz |
| /ʐ/ | ż, rz |
| /x/ | h, ch |
| /t͡ɕ/ | ć, c (before ⟨i⟩ + consonant), ci (before vowels) |
| /d͡ʑ/ | dź, dz (before ⟨i⟩ + consonant), dzi (before vowels) |
| /ɕ/ | ś, s (before ⟨i⟩ + consonant), si (before vowels) |
| /ʑ/ | ź, z (before ⟨i⟩ + consonant), zi (before vowels) |
The sounds /ŋ/, /w̃/, /ɣ/ don't have separate graphemes. Instead, they can only happen in certain circumstances:
- /ŋ/ represents ⟨n⟩, and it only appears before /k/, /ɡ/, /x/.
- It also appears after /ɔ/ and /ɛ/, before /k/ and /ɡ/, to represent ⟨ą⟩ and ⟨ę⟩.
- /w̃/ only appears:
- after /ɔ/ before /f/, /v/, /s/, /z/, /ʂ/, /ʐ/, /x/, and word-finally to represent ⟨ą⟩.
- after /ɛ/ before /f/, /v/, /s/, /z/, /ʂ/, /ʐ/, /x/ to represent ⟨ę⟩.
- In careful speech, it also appears after /ɛ/ word-finally.
- /ɣ/ only appears before voiced obstruents (see below).
Fun fact! Polish used to use the /ɫ/ sound (same as the dark L in English, but as a separate phoneme), but as delateralization took its course, it shifted to /w/. Despite this, it is still written Ł, a letter suggesting similarity to L.
We have a name for this shift: wałczenie. This is a fairly old phenomenon, which started with 16th-century peasants. It was rejected by the upper class, and the stigma lasted until around the 1950s. It was described by realizing Ł as a "non-syllabic U". Today, pronouncing Ł as [ɫ] is considered conservative.
Serbo-Croatian (except the Croatian dialects) also underwent L-vocalization, but only for syllable-final Ls, which changed to Os, cf. Београд Beograd ("Belgrade"). Some other Slavic languages, such as Bulgarian, are reportedly having the same L-vocalization that affected Polish.
Fun fact! RZ used to be a fricative R (/r̻/), but has merged into Ż phonemically. This is another leftover of old Slavic languages, preserved in Czech and Slovak. In other Slavic languages, it has become R instead.
In modern Polish, this is preserved in inflection and derivation, where RZ can become R and vice-versa, e.g. powietrze ("air") and wiatr ("wind").
In dialects with mazuration, chiefly the Mazovian and Lesser Poland dialects, ⟨ż⟩ is pronounced /z/, but ⟨rz⟩ is pronounced /ʐ/. Therefore, morze ("sea") and może ("maybe"), which are homophones in standard Polish, are not homophones in those dialects.
Phonotactics
The Polish syllable structure appears to be (C)(C)(C)(C)V(C)(C)(C), as exemplified by the words wstrząs ("shock, impact") and rejestr ("register" [noun]).
Final obstruents are devoiced: dług ("debt") → /dwuk/. Inflection explains the spelling: dług-∅ → dług-u.
Some words have difficult consonant clusters in the middle that get simplified: sześćset ("six hundred") → /ʂɛɕ.sɛt/ or /ʂɛj.sɛt/, jabłko ("apple") → /jap.ko/. Inflection and derivation often explains the spelling of those consonant clusters: sześćset → sześciuset, jabłko → jabłuszko.
Assimilation
- Voiced obstruents right before voiceless consonants become voiceless: książka (book) → /kɕɔw̃ʂka/.
- In the onset, voiced obstruents right after voiceless consonants become voiceless: przemiana (transformation) → /pʂɛ.mja.na/.
- In the middle of a word, voiceless obstruents immediately before voiced obstruents become voiced: prośba (request) → /prɔʑ.ba/.
- This is the only instance when a phonemic /ɣ/ may appear: klechda (legend) → /klɛɣ.da/.
- Unlike in English, this kind of assimilation does not happen before sonorants: tandetny (trashy) → /tan.dɛt.nɘ/.
- While rz and ż normally represent the same phoneme, rz cannot cause regressive assimilation and will always be subject to progressive assimilation, including in the middle of a word. Compare pokrzywa ("nettle") → /pɔ.kʂɘ.va/ and także ("also, too") → /taɡ.ʐɛ/.
Onset consonant clusters
Idiosyncracies like rdza ("rust"), rtęć ("mercury, quicksilver"), or źdźbło ("blade [of a grass or cereal]") are mostly preserved for historical reasons. For this reason, Poles often insert a very short vowel between the consonants in everyday speech. This shows a fossilized trace of otherwise reduced yers, very short vowels that existed in Proto-Slavic, marked with *ь/*ĭ and *ъ/*ŭ.
| Modern Polish | Gloss | Old Polish | Proto-Slavic | Cognates in other Slavic languages |
|---|---|---|---|---|
| rdza | rust | (unknown) | *rъďa | Bulgarian ръжда́ rǎždá Ukrainian іржа́ iržá |
| rtęć | mercury | rtęć | *rьtǫtь | Czech rtuť Slovak ortuť |
| srebro | silver | śrzebro | *sьrebro | Russian серебро́ serebró Silesian strzybło |
| źdźbło | blade of grass or cereal | śćbło | *stьblo | Ukrainian стебло́ stebló ("stalk, stem") Serbo-Croatian ста́бло stáblo ("tree, trunk") |
In Czech and Slovak, r and l may be syllabic, which led to the tongue twister strč prst skrz krk. In Polish and most other Slavic languages, only vowels may be syllabic.