Jump to content

C: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Undid good-faith revision 1147350964 by Félix An (talk): already covered in the previous hatnote which also includes C♯ (musical note), etc.
 
(40 intermediate revisions by 26 users not shown)
Line 1: Line 1:
{{short description|Third letter of the Latin alphabet}}
{{short description|3rd letter of the Latin alphabet}}
{{about|the letter|the programming language|C (programming language)|other uses}}
{{about|the letter|the programming language|C (programming language)|other uses}}
{{pp-semi-indef|small=yes}}
{{pp-semi-indef|small=yes}}
{{pp-move-indef}}
{{pp-move-indef}}
{{technical reasons|C#|uses of C#|C-sharp (disambiguation){{!}}C-sharp}}
{{Hatnote|For [[WP:NCTR|technical reasons]], "C#" and ":C" redirect here. For uses of C#, see [[C-sharp (disambiguation)|C-sharp]]. For the keyboard symbols, see [[List of emoticons]].}}
{{Infobox grapheme
{{Infobox grapheme
|name = C
|name = C
|letter = C c
|letter = C c
|variations=([[C#Related characters|See below]])
|variations=
|image=File:C cursiva.gif
|imagesize=200px
|imagealt=Writing cursive forms of C
|script=[[Latin script]]
|script=[[Latin script]]
|type=[[Alphabet]]
|type=[[Alphabet]]
Line 27: Line 24:
|fam7=[[Gamma|Γ γ]]
|fam7=[[Gamma|Γ γ]]
|fam8=[[𐌂]]
|fam8=[[𐌂]]
|sisters={{grid list|[[Г]]|[[G]]|[[wikt:Գ|Գ]] [[wikt:գ|գ]]|[[Gimel|𐡂 ࠂ
}}
ג ܓ ج]]}}
|image=File:Latin_letter_C.svg}}
{{Latin letter info|c}}
{{Latin letter info|c}}

[[File:Copyright.svg|thumb|right|100px|'''C''' in [[copyright symbol]]]]
[[File:Copyright.svg|thumb|right|100px|'''C''' in [[copyright symbol]]]]
'''C''', or '''c''', is the third [[Letter (alphabet)|letter]] of the [[Latin alphabet]], used in the [[English alphabet|modern English alphabet]], the alphabets of other western European languages and others worldwide. Its name in English is [[English alphabet#Letter names|''cee'']] (pronounced {{IPAc-en|'|s|iː}}), plural ''cees''.<ref>"C" ''Oxford English Dictionary,'' 2nd edition (1989); ''Merriam-Webster's Third New International Dictionary of the English Language, Unabridged'' (1993); "cee", ''op. cit.''</ref>

'''C''', or '''c''', is the third [[Letter (alphabet)|letter]] in the [[Latin alphabet]], used in the [[English alphabet|modern English alphabet]], the alphabets of other western European languages and others worldwide. Its name in English is [[English alphabet#Letter names|''cee'']] (pronounced {{IPAc-en|'|s|iː}}), plural ''cees''.<ref>"C" ''Oxford English Dictionary,'' 2nd edition (1989); ''Merriam-Webster's Third New International Dictionary of the English Language, Unabridged'' (1993); "cee", ''op. cit.''</ref>


==History==
==History==
Line 38: Line 37:
! Egyptian
! Egyptian
! Phoenician <br />''[[gimel|gaml]]''
! Phoenician <br />''[[gimel|gaml]]''
! Greek<br />''[[Gamma]]''
! Western Greek<br />''[[Gamma]]''
! Etruscan<br />C
! Etruscan<br />C
! Old Latin<br />C (G)
! Old Latin<br />C (G)
Line 45: Line 44:
| {{align|center|<hiero>T14</hiero>}}
| {{align|center|<hiero>T14</hiero>}}
|[[File:Phoenician gimel.svg|30px|Phoenician gimel]]
|[[File:Phoenician gimel.svg|30px|Phoenician gimel]]
|[[File:Gamma uc lc.svg|55px|Greek Gamma]]
|[[File:Greek Gamma archaic 1.svg|30px|Greek Gamma]]
|[[File:EtruscanC-01.svg|30px|Etruscan C]]
|[[File:EtruscanC-01.svg|30px|Etruscan C]]
|[[File:Old Latin G.svg|20px|Old Latin]]
|[[File:Old Latin G.svg|30px|Old Latin]]
| [[File:Capitalis monumentalis C.svg|x20px|Latin C]]
|[[File:Capitalis monumentalis C.svg|x30px|Latin C]]
|}
|}
"C" comes from the same letter as "G". The [[Semitic people|Semites]] named it [[gimel]]. The sign is possibly adapted from an [[Egyptian hieroglyphs|Egyptian hieroglyph]] for a [[Staff-sling|staff sling]], which may have been the meaning of the name ''gimel''. Another possibility is that it depicted a camel, the Semitic name for which was ''gamal''. [[Barry B. Powell]], a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".<ref>{{cite book|last=Powell|first=Barry B.|title=Writing: Theory and History of the Technology of Civilization|date=27 Mar 2009|publisher=Wiley Blackwell|isbn=978-1405162562|page=182|url=https://books.google.com/books?id=PZ2Gr3d9X2UC&q=Gimel+shaped+like+a+camel%27s+neck&pg=PA182}}</ref>
"C" comes from the same letter as "G". The [[Semitic people|Semites]] named it [[gimel]]. The sign is possibly adapted from an [[Egyptian hieroglyphs|Egyptian hieroglyph]] for a [[Staff-sling|staff sling]], which may have been the meaning of the name ''gimel''. Another possibility is that it depicted a camel, the Semitic name for which was ''gamal''. [[Barry B. Powell]], a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".<ref>{{cite book|last=Powell|first=Barry B.|title=Writing: Theory and History of the Technology of Civilization|date=27 Mar 2009|publisher=Wiley Blackwell|isbn=978-1405162562|page=182|url=https://books.google.com/books?id=PZ2Gr3d9X2UC&q=Gimel+shaped+like+a+camel%27s+neck&pg=PA182}}</ref>


In the [[Etruscan language]], [[plosive consonant]]s had no contrastive [[phonation|voicing]], so the [[Greek language|Greek]] '[[Gamma|Γ]]' (Gamma) was adopted into the [[Old Italic script#Etruscan alphabet|Etruscan alphabet]] to represent {{IPA|/k/}}. Already in the [[Western Greek alphabet]], Gamma first took a '[[File:Early Etruscan C.svg|frameless|17x17px]]' form in Early Etruscan, then '[[File:Classical Etruscan C.gif|15px]]' in Classical Etruscan. In Latin it eventually took the '{{smallcaps|c}}' form in Classical Latin. In the earliest [[Latin]] inscriptions, the letters '{{smallcaps|c k q}}' were used to represent the sounds {{IPA|/k/}} and {{IPA|/ɡ/}} (which were not differentiated in writing). Of these, '{{smallcaps|q}}' was used to represent {{IPA|/k/}} or {{IPA|/ɡ/}} before a rounded vowel, '{{smallcaps|k}}' before '{{smallcaps|a}}', and '{{smallcaps|c}}' elsewhere.<ref>{{cite book
In the [[Etruscan language]], [[plosive consonant]]s had no contrastive [[phonation|voicing]], so the [[Greek language|Greek]] '[[Gamma|Γ]]' (Gamma) was adopted into the [[Etruscan alphabet]] to represent {{IPA|/k/}}. Already in the [[Western Greek alphabet]], Gamma first took a '[[File:Early Etruscan C.svg|frameless|17x17px]]' form in Early Etruscan, then '[[File:Classical Etruscan C.gif|15px]]' in Classical Etruscan. In Latin it eventually took the '{{smallcaps|c}}' form in Classical Latin. In the earliest [[Latin]] inscriptions, the letters '{{smallcaps|c k q}}' were used to represent the sounds {{IPA|/k/}} and {{IPA|/ɡ/}} (which were not differentiated in writing). Of these, '{{smallcaps|q}}' was used to represent {{IPA|/k/}} or {{IPA|/ɡ/}} before a rounded vowel, '{{smallcaps|k}}' before '{{smallcaps|a}}', and '{{smallcaps|c}}' elsewhere.<ref>{{cite book
|title=New Comparative Grammar of Greek and Latin
|title=New Comparative Grammar of Greek and Latin
|first=Andrew L.
|first=Andrew L.
Line 65: Line 64:
}}</ref> During the 3rd century BC, a modified character was introduced for {{IPA|/ɡ/}}, and '{{smallcaps|c}}' itself was retained for {{IPA|/k/}}. The use of '{{smallcaps|c}}' (and its variant '{{smallcaps|g}}') replaced most usages of '{{smallcaps|k}}' and '{{smallcaps|q}}'. Hence, in the classical period and after, '{{smallcaps|g}}' was treated as the equivalent of Greek gamma, and '{{smallcaps|c}}' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'ΚΑΔΜΟΣ', 'ΚΥΡΟΣ', and 'ΦΩΚΙΣ' came into Latin as '{{smallcaps|cadmvs}}', '{{smallcaps|cyrvs}}' and '{{smallcaps|phocis}}', respectively.
}}</ref> During the 3rd century BC, a modified character was introduced for {{IPA|/ɡ/}}, and '{{smallcaps|c}}' itself was retained for {{IPA|/k/}}. The use of '{{smallcaps|c}}' (and its variant '{{smallcaps|g}}') replaced most usages of '{{smallcaps|k}}' and '{{smallcaps|q}}'. Hence, in the classical period and after, '{{smallcaps|g}}' was treated as the equivalent of Greek gamma, and '{{smallcaps|c}}' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'ΚΑΔΜΟΣ', 'ΚΥΡΟΣ', and 'ΦΩΚΙΣ' came into Latin as '{{smallcaps|cadmvs}}', '{{smallcaps|cyrvs}}' and '{{smallcaps|phocis}}', respectively.


Other alphabets have letters [[homoglyph]]ic to 'c' but not analogous in use and derivation, like the [[Cyrillic script|Cyrillic]] letter [[Es (Cyrillic)|Es]] (С, с) which derives from the lunate [[sigma (letter)|sigma]], named due to its resemblance to the crescent moon.
Other alphabets have letters [[homoglyph]]ic to 'c' but not analogous in use and derivation, like the [[Cyrillic script|Cyrillic]] letter [[Es (Cyrillic)|Es]] (С, с) which derives from the lunate [[sigma (letter)|sigma]].


==Later use==
===Later use===
When the Roman alphabet was introduced into Britain, {{angbr|c}} represented only {{IPA|/k/}}, and this value of the letter has been retained in loanwords to all the [[insular Celtic languages]]: in [[Welsh language|Welsh]],<ref>{{Cite web|url=http://www.mit.edu/people/dfm/canol/chap29.html|title=Reading Middle Welsh -- 29 Medieval Spelling|website=www.mit.edu|access-date=2019-11-19}}</ref> [[Irish language|Irish]], and [[Scottish Gaelic language|Gaelic]], {{angbr|c}} represents only {{IPA|/k/}}. The [[Old English Latin alphabet|Old English Latin-based writing system]] was learned from the Celts, apparently of Ireland; hence {{angbr|c}} in Old English also originally represented {{IPA|/k/}}; the Modern English words ''kin, break, broken, thick'', and ''seek'' all come from Old English words written with {{angbr|c}}: {{lang|ang|cyn, brecan, brocen, þicc}}, and {{lang|ang|séoc}}. However, during the course of the Old English period, {{IPA|/k/}} before front vowels ({{IPA|/e/}} and {{IPA|/i/}}) was [[Palatalization (sound change)|palatalized]], having changed by the tenth century to {{IPA|[tʃ]}}, though {{angbr|c}} was still used, as in {{lang|ang|cir(i)ce, wrecc(e)a}}. On the continent, meanwhile, a similar phonetic change before the same two vowels had also been going on in almost all modern romance languages (for example, in [[Italian language|Italian]]).
When the Roman alphabet was introduced into Britain, {{angbr|c}} represented only {{IPA|/k/}}, and this value of the letter has been retained in loanwords to all the [[insular Celtic languages]]: in [[Welsh language|Welsh]],<ref>{{Cite web |title=Reading Middle Welsh -- 29 Medieval Spelling |url=https://www.mit.edu/people/dfm/canol/chap29.html |access-date=2019-11-19 |website=www.mit.edu}}</ref> [[Irish language|Irish]], and [[Scottish Gaelic language|Gaelic]], {{angbr|c}} represents only {{IPA|/k/}}. The [[Old English Latin alphabet|Old English Latin-based writing system]] was learned from the Celts, apparently of Ireland; hence {{angbr|c}} in Old English also originally represented {{IPA|/k/}}; the Modern English words ''kin, break, broken, thick'', and ''seek'' all come from Old English words written with {{angbr|c}}: {{lang|ang|cyn, brecan, brocen, þicc}}, and {{lang|ang|séoc}}. However, during the course of the Old English period, {{IPA|/k/}} before front vowels ({{IPA|/e/}} and {{IPA|/i/}}) was [[Palatalization (sound change)|palatalized]], having changed by the tenth century to {{IPA|[tʃ]}}, though {{angbr|c}} was still used, as in {{lang|ang|cir(i)ce, wrecc(e)a}}. On the continent, meanwhile, a similar phonetic change before the same two vowels had also been going on in almost all modern romance languages (for example, in [[Italian language|Italian]]).


In Vulgar Latin, {{IPA|/k/}} became palatalized to {{IPA|[tʃ]}} in Italy and Dalmatia; in France and the Iberian peninsula, it became {{IPA|[ts]}}. Yet for these new sounds {{angbr|{{sm|c}}}} was still used before the letters {{angbr|e}} and {{angbr|i}}. The letter thus represented two distinct values. Subsequently, the Latin phoneme {{IPA|/k<sup>w</sup>/}} (spelled {{angbr|{{smallcaps|qv}}}}) de-labialized to {{IPA|/k/}} meaning that the various Romance languages had {{IPA|/k/}} before front vowels. In addition, [[Norman language|Norman]] used the letter {{angbr|k}} so that the sound {{IPA|/k/}} could be represented by either {{angbr|k}} or {{angbr|c}}, the latter of which could represent either {{IPA|/k/}} or {{IPA|/ts/}} depending on whether it preceded a front vowel letter or not. The convention of using both {{angbr|c}} and {{angbr|k}} was applied to the writing of English after the [[Norman Conquest]], causing a considerable re-spelling of the Old English words. Thus while Old English {{lang|ang|candel, clif, corn, crop, cú}}, remained unchanged, {{lang|ang|Cent, cǣᵹ (cēᵹ), cyng, brece, sēoce}}, were now (without any change of sound) spelled {{lang|enm|Kent, keȝ, kyng, breke}}, and {{lang|enm|seoke}}; even {{lang|ang|cniht}} ('knight') was subsequently changed to {{lang|enm|kniht}} and {{lang|ang|þic}} ('thick') changed to {{lang|enm|thik}} or {{lang|enm|thikk}}. The Old English {{angbr|cw}} was also at length displaced by the French {{angbr|qu}} so that the Old English {{lang|ang|cwēn}} ('queen') and {{lang|ang|cwic}} ('quick') became [[Middle English]] {{lang|enm|quen}} and {{lang|enm|quik}}, respectively. The sound {{IPA|[tʃ]}}, to which Old English palatalized {{IPA|/k/}} had advanced, also occurred in French, chiefly from Latin {{IPA|/k/}} before {{angbr|a}}. In French it was represented by the digraph {{angbr|ch}}, as in ''champ'' (from Latin {{lang|la|camp-um}}) and this spelling was introduced into English: the Hatton Gospels, written {{circa|1160}}, have in Matt. i-iii, {{lang|enm|child, chyld, riche, mychel}}, for the {{lang|ang|cild, rice, mycel,}} of the Old English version whence they were copied. In these cases, the Old English {{angbr|c}} gave way to {{angbr|k}}, {{angbr|qu}} and {{angbr|ch}}; on the other hand, {{angbr|c}} in its new value of {{IPA|/ts/}} appeared largely in French words like {{lang|xno|processiun, emperice}} and {{lang|xno|grace}}, and was also substituted for {{angbr|ts}} in a few Old English words, as {{lang|ang|miltse, bletsien}}, in early Middle English {{lang|enm|milce, blecien}}. By the end of the thirteenth century both in France and England, this sound {{IPA|/ts/}} de-affricated to {{IPA|/s/}}; and from that time {{angbr|c}} has represented {{IPA|/s/}} before front vowels either for [[etymology|etymological]] reasons, as in ''lance, cent'', or to avoid the ambiguity due to the "etymological" use of {{angbr|s}} for {{IPA|/z/}}, as in ''ace, mice, once, pence, defence''.
In Vulgar Latin, {{IPA|/k/}} became palatalized to {{IPA|[tʃ]}} in Italy and Dalmatia; in France and the Iberian peninsula, it became {{IPA|[ts]}}. Yet for these new sounds {{angbr|{{sm|c}}}} was still used before the letters {{angbr|e}} and {{angbr|i}}. The letter thus represented two distinct values. Subsequently, the Latin phoneme {{IPA|/k<sup>w</sup>/}} (spelled {{angbr|{{smallcaps|qv}}}}) de-labialized to {{IPA|/k/}} meaning that the various Romance languages had {{IPA|/k/}} before front vowels. In addition, [[Norman language|Norman]] used the letter {{angbr|k}} so that the sound {{IPA|/k/}} could be represented by either {{angbr|k}} or {{angbr|c}}, the latter of which could represent either {{IPA|/k/}} or {{IPA|/ts/}} depending on whether it preceded a front vowel letter or not. The convention of using both {{angbr|c}} and {{angbr|k}} was applied to the writing of English after the [[Norman Conquest]], causing a considerable re-spelling of the Old English words. Thus while Old English {{lang|ang|candel, clif, corn, crop, cú}}, remained unchanged, {{lang|ang|Cent, cǣᵹ (cēᵹ), cyng, brece, sēoce}}, were now (without any change of sound) spelled {{lang|enm|Kent, keȝ, kyng, breke}}, and {{lang|enm|seoke}}; even {{lang|ang|cniht}} ('knight') was subsequently changed to {{lang|enm|kniht}} and {{lang|ang|þic}} ('thick') changed to {{lang|enm|thik}} or {{lang|enm|thikk}}. The Old English {{angbr|cw}} was also at length displaced by the French {{angbr|qu}} so that the Old English {{lang|ang|cwēn}} ('queen') and {{lang|ang|cwic}} ('quick') became [[Middle English]] {{lang|enm|quen}} and {{lang|enm|quik}}, respectively. The sound {{IPA|[tʃ]}}, to which Old English palatalized {{IPA|/k/}} had advanced, also occurred in French, chiefly from Latin {{IPA|/k/}} before {{angbr|a}}. In French it was represented by the digraph {{angbr|ch}}, as in ''champ'' (from Latin {{lang|la|camp-um}}) and this spelling was introduced into English: the Hatton Gospels, written {{circa|1160}}, have in Matt. i-iii, {{lang|enm|child, chyld, riche, mychel}}, for the {{lang|ang|cild, rice, mycel,}} of the Old English version whence they were copied. In these cases, the Old English {{angbr|c}} gave way to {{angbr|k}}, {{angbr|qu}} and {{angbr|ch}}; on the other hand, {{angbr|c}} in its new value of {{IPA|/ts/}} appeared largely in French words like {{lang|xno|processiun, emperice}} and {{lang|xno|grace}}, and was also substituted for {{angbr|ts}} in a few Old English words, as {{lang|ang|miltse, bletsien}}, in early Middle English {{lang|enm|milce, blecien}}. By the end of the thirteenth century both in France and England, this sound {{IPA|/ts/}} de-affricated to {{IPA|/s/}}; and from that time {{angbr|c}} has represented {{IPA|/s/}} before front vowels either for [[etymology|etymological]] reasons, as in ''lance, cent'', or to avoid the ambiguity due to the "etymological" use of {{angbr|s}} for {{IPA|/z/}}, as in ''ace, mice, once, pence, defence''.
Line 74: Line 73:
Thus, to show etymology, English spelling has ''advise'', ''devise'' (instead of ''*advize'', ''*devize''), while ''advice, device, dice, ice, mice, twice'', etc., do not reflect etymology; example has extended this to ''hence, pence, defence'', etc., where there is no etymological reason for using {{angbr|c}}. Former generations also wrote ''sence'' for ''sense''. Hence, today the [[Romance languages]] and [[English language|English]] have a common feature inherited from [[Vulgar Latin]] spelling conventions where {{angbr|c}} takes on either a [[Hard and soft C|"hard" or "soft"]] value depending on the following letter.
Thus, to show etymology, English spelling has ''advise'', ''devise'' (instead of ''*advize'', ''*devize''), while ''advice, device, dice, ice, mice, twice'', etc., do not reflect etymology; example has extended this to ''hence, pence, defence'', etc., where there is no etymological reason for using {{angbr|c}}. Former generations also wrote ''sence'' for ''sense''. Hence, today the [[Romance languages]] and [[English language|English]] have a common feature inherited from [[Vulgar Latin]] spelling conventions where {{angbr|c}} takes on either a [[Hard and soft C|"hard" or "soft"]] value depending on the following letter.


==Use in writing systems==
==Pronunciation and use==
{{See also|Hard and soft C}}
{{See also|Hard and soft C}}
{| class="wikitable sortable mw-collapsible"
{| class="wikitable sortable mw-collapsible"
|+ Pronunciation of {{angbr|c}} by language
|+Pronunciations of Cc
!Orthography
! colspan="5" |Most common pronunciation: {{IPAslink|k}}
!Phonemes
''Languages in italics do not use the [[Latin alphabet]]''
|-
!Language
!Dialect(s)
!Pronunciation ([[International Phonetic Alphabet|IPA]])
!Environment
!Environment
!Notes
|-
|-
! colspan="2" |[[Albanian language|Albanian]]
! [[Albanian orthography|Albanian]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
![[Arabic language|''Arabic'']]
![[Cypriot Arabic]]
!''[[Cypriot Arabic]]''
|{{IPAslink|ʕ}}
|{{IPAslink|ʕ}}
|
|
|Latinization
|-
|-
! colspan="2" |[[Azerbaijani language|Azeri]]
![[Azerbaijani language|Azeri]]
|{{IPAslink|dʒ}}
|{{IPAslink|dʒ}}
|
|
|
|-
|-
! colspan="2" |[[Berber languages|''Berber'']]
![[Berber Latin alphabet|Berber]]
|{{IPAslink|ʃ}}
|{{IPAslink|ʃ}}
|
|
|Latinization
|-
|-
! colspan="2" |[[Bukawa language|Bukawa]]
![[Bukawa language|Bukawa]]
|{{IPAslink|ʔ}}
|{{IPAslink|ʔ}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Catalan language|Catalan]]
! rowspan="2" |[[Catalan orthography|Catalan]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i
|Before e, i
|-
!{{nwr|[[Standard Chinese]]}} ([[Pinyin]])
|{{IPAslink|tsʰ}}
|
|
|-
|-
! colspan="2" |[[Crimean Tatar language|Crimean Tatar]]
![[Crimean Tatar language|Crimean Tatar]]
|{{IPAslink|dʒ}}
|{{IPAslink|dʒ}}
|
|
|
|-
|-
! colspan="2" |[[Cornish language|Cornish]]
![[Cornish language|Cornish]] {{nwr|([[Standard Written Form]])}}
|{{IPAslink|s}}
|{{IPAslink|s}}
|
|
|Standard Written Form
|-
|-
! colspan="2" |[[Czech language|Czech]]
![[Czech orthography|Czech]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Danish language|Danish]]
! rowspan="2" |[[Danish orthography|Danish]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y, æ, ø
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y, æ, ø
|Before e, i, y, æ, ø
|
|-
|-
! colspan="2" rowspan="3" |[[Dutch language|Dutch]]
! rowspan="3" |[[Dutch orthography|Dutch]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y
|Before e, i, y
|
|-
|-
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|Before e, i,y
|Before e, i in loanwords from Italian
|in loanwords from Italian
|-
|-
! colspan="2" rowspan="3" |[[English language|English]]
! rowspan="3" |[[English orthography|English]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y
|Before e, i, y
|
|-
|-
|{{IPAslink|ʃ}}
|{{IPAslink|ʃ}}
|Before ea, ia, ie, io, iu
|
|
|-
|-
! colspan="2" |[[Esperanto]]
![[Esperanto]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Fijian language|Fijian]]
![[Fijian language|Fijian]]
|{{IPAslink|ð}}
|{{IPAslink|ð}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Filipino language|Filipino]]
! rowspan="2" |[[Filipino orthography|Filipino]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i
|Before e, i
|
|-
|-
! colspan="2" rowspan="2" |[[French language|French]]
! rowspan="2" |[[French orthography|French]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y
|Before e, i, y
|
|-
|-
! colspan="2" |[[Fula language|Fula]]
![[Fula language|Fula]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
|-
! colspan="2" |[[Gagauz language|Gagauz]]
![[Gagauz language|Gagauz]]
|{{IPAslink|dʒ}}
|{{IPAslink|dʒ}}
|
|
|
|-
|-
! colspan="2" rowspan="3" |[[Galician language|Galician]]
! rowspan="2" |[[Galician alphabet|Galician]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
|{{IPAslink|θ}}
|{{IPAslink|θ}} or {{IPAslink|s}}
|Before e, i
|Before e, i
|
|-
|-
! rowspan="2" |[[German orthography|German]]
|{{IPAslink|s}}
|Before e, i
|in [[seseo]] zones
|-
! colspan="2" rowspan="2" |[[German language|German]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before ä, e, i, ö, ü, y in loanwords and names
|
|Only in loanwords and names
|-
|-
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|Before ä, e, i, ö, ü, y
|Before ä, e, i, ö, ü, y in loanwords and names
|Only in loanwords and names
|-
|-
! colspan="2" |[[Hausa language|Hausa]]
![[Hausa language|Hausa]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
|-
! colspan="2" |[[Hungarian language|Hungarian]]
![[Hungarian orthography|Hungarian]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Indonesian language|Indonesian]]
![[Indonesian language|Indonesian]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Irish language|Irish]]
! rowspan="2" |[[Irish orthography|Irish]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i; or after i
|
|
|-
|-
|{{IPAslink|c}}
|{{IPAslink|c}}
|Before e, i; or after i
|Before e, i; or after i
|
|-
|-
! colspan="2" rowspan="2" |[[Italian language|Italian]]
! rowspan="2" |[[Italian orthography|Italian]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|Before e, i
|Before e, i
|

|-
|-
! colspan="2"|''[[Khmer language|Khmer]]''
![[Khmer language|Khmer]] ([[Romanization of Khmer#ALA-LC Romanization Tables|ALA-LC]])
|{{IPAslink|c}}
|{{IPAslink|c}}
|
|
|[[Romanization of Khmer#ALA-LC Romanization Tables|ALA-LC]] latinization
|-
|-
![[Kurdish languages|Kurdish]]
![[Kurmanji]] ([[Kurdish alphabets|Hawar]])
![[Kurmanji]]
|{{IPAslink|dʒ}}
|{{IPAslink|dʒ}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Latin language|Latin]]
![[Latin language|Latin]]
|{{IPAslink|k}}
|{{IPAslink|k}} (and {{IPAslink|g}} in early Latin)
|
|
|
|-
|-
![[Latvian orthography|Latvian]]
|{{IPAslink|g}}
|Early Latin
|
|-
! colspan="2" |[[Latvian language|Latvian]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Malay language|Malay]]
![[Malay orthography|Malay]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
![[Mandarin Chinese|Mandarin]]
![[Standard Chinese|Standard]]
|{{IPAslink|tsʰ}}
|
|[[Pinyin]] latinization
|-
|-
! colspan="2" |[[Manding languages|Manding]]
![[Manding languages|Manding]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Norwegian language|Norwegian]]
! rowspan="2" |[[Norwegian orthography|Norwegian]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y, æ, ø in loanwords and names
|
|Only in loanwords and names
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y, æ, ø
|Before e, i, y, æ, ø in loanwords and names
|Only in loanwords and names
|-
|-
! colspan="2" |[[Polish language|Polish]]
! rowspan="2" |[[Polish orthography|Polish]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|Except before i
|
|
|-
|{{IPAslink|tɕ}}
|Before i
|-
|-
! colspan="2" rowspan="2" |[[Portuguese language|Portuguese]]
! rowspan="2" |[[Portuguese orthography|Portuguese]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y
|Before e, i, y
|
|-
|-
! colspan="2" rowspan="2" |[[Romanian language|Romanian]]
! rowspan="2" |[[Romanian orthography|Romanian]]
|{{IPAslink|k}}
|Except before e, i
|-
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|Before e, i
|Before e, i
|
|-
|-
! rowspan="2" |[[Romansh language|Romansh]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
! colspan="2" rowspan="2" |[[Romansh language|Romansh]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|Before e, i
|Before e, i
|
|-
|-
! rowspan="2" |[[Scottish Gaelic]]
|{{IPAslink|k}}
|
|
|-
! colspan="2" rowspan="2" |[[Scottish Gaelic]]
|{{IPAslink|kʰ}}
|{{IPAslink|kʰ}}
|Except before e, i; or after i
|
|
|-
|-
|{{IPAslink|kʰʲ}}
|{{IPAslink|kʰʲ}}
|Before e, i; or after i
|Before e, i; or after i
|
|-
|-
! colspan="2" |[[Serbo-Croatian]]
![[Serbo-Croatian]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Slovak language|Slovak]]
![[Slovak orthography|Slovak]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Slovene language|Slovene]]
![[Slovene orthography|Slovene]]
|{{IPAslink|ts}}
|{{IPAslink|ts}}
|
|
|
|-
|-
! colspan="2" |[[Somali language|Somali]]
![[Somali language|Somali]]
|{{IPAslink|ʕ}}
|{{IPAslink|ʕ}}
|
|
|
|-
|-
! rowspan="3" |[[Spanish language|Spanish]]
! rowspan="2" |[[Spanish orthography|Spanish]]
!All
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y
|
|
|-
|-
|{{IPAslink|θ}} or {{IPAslink|s}}
!Most of [[Peninsular Spanish|European]]
|{{IPAslink|θ}}
|Before e, i, y
|Before e, i, y
|
|-
|-
! rowspan="2" |[[Swedish orthography|Swedish]]
![[Spanish language in the Americas|American]], [[Andalusian Spanish|Andalusian]], [[Canarian Spanish|Canarian]]
|{{IPAslink|s}}
|Before e, i, y
|
|-
! colspan="2" rowspan="2" |[[Swedish language|Swedish]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i, y, ä, ö
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i, y, ä, ö
|Before e, i, y, ä, ö
|
|-
|-
! colspan="2" |[[Tatar language|Tatar]]
![[Tatar language|Tatar]]
|{{IPAslink|ʑ}}
|{{IPAslink|ʑ}}
|
|
|
|-
|-
! colspan="2" |[[Turkish language|Turkish]]
![[Turkish alphabet|Turkish]]
|{{IPAslink|dʒ}}
|{{IPAslink|dʒ}}
|
|
|
|-
|-
! colspan="2" rowspan="2" |[[Valencian language|Valencian]]
! rowspan="2" |[[Valencian orthography|Valencian]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except before e, i
|
|
|-
|-
|{{IPAslink|s}}
|{{IPAslink|s}}
|Before e, i
|Before e, i
|
|-
|-
! colspan="2" rowspan="3" |[[Vietnamese language|Vietnamese]]
! rowspan="2" |[[Vietnamese orthography|Vietnamese]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|Except word-finally
|
|
|-
|-
|{{IPAslink|k̚}}
|{{IPAslink|k̚}}
|Word-final
|Word-finally
|
|-
|[{{IPAlink|kp}}]
|Word-final after u, ô, o
|
|-
|-
! colspan="2" |[[Welsh language|Welsh]]
![[Welsh orthography|Welsh]]
|{{IPAslink|k}}
|{{IPAslink|k}}
|
|
|
|-
|-
! colspan="2" |[[Xhosa language|Xhosa]]
![[Xhosa alphabet|Xhosa]]
|{{IPAslink|ǀ}}
|{{IPAslink|ǀ}}
|
|
|
|-
|-
! colspan="2" |[[Yabem language|Yabem]]
![[Yabem language|Yabem]]
|{{IPAslink|ʔ}}
|{{IPAslink|ʔ}}
|
|
|
|-
|-
! colspan="2" |[[Central Alaskan Yup'ik language|Yup'ik]]
![[Central Alaskan Yup'ik language|Yup'ik]]
|{{IPAslink|tʃ}}
|{{IPAslink|tʃ}}
|
|
|
|-
|-
! colspan="2" |[[Zulu language|Zulu]]
![[Zulu alphabet|Zulu]]
|{{IPAslink|ǀ}}
|{{IPAslink|ǀ}}
|
|
|
|}
|}
Line 474: Line 374:
In the Romance languages [[French language|French]], [[Spanish language|Spanish]], [[Italian language|Italian]], [[Romanian language|Romanian]] and [[Portuguese language|Portuguese]], {{angbr|c}} generally has a "hard" value of {{IPA|/k/}} and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan and Spanish from Latin America and some places in Spain, the soft {{angbr|c}} value is {{IPA|/s/}} as it is in English. In the Spanish spoken in most of Spain, the soft {{angbr|c}} is a [[voiceless dental fricative]] {{IPA|/θ/}}. In [[Italian language|Italian]] and [[Romanian language|Romanian]], the soft {{angbr|c}} is {{IPA|[t͡ʃ]}}.
In the Romance languages [[French language|French]], [[Spanish language|Spanish]], [[Italian language|Italian]], [[Romanian language|Romanian]] and [[Portuguese language|Portuguese]], {{angbr|c}} generally has a "hard" value of {{IPA|/k/}} and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan and Spanish from Latin America and some places in Spain, the soft {{angbr|c}} value is {{IPA|/s/}} as it is in English. In the Spanish spoken in most of Spain, the soft {{angbr|c}} is a [[voiceless dental fricative]] {{IPA|/θ/}}. In [[Italian language|Italian]] and [[Romanian language|Romanian]], the soft {{angbr|c}} is {{IPA|[t͡ʃ]}}.


Germanic languages usually use c for Romance loans or digraphs, such as {{angbr|ch}} and {{angbr|ck}}, but the rules vary across languages. [[Dutch language|Dutch]] uses {{angbr|c}} the most, for all Romance loans and the digraph {{angbr|ch}}, but unlike English, does not use {{angbr|c}} for native Germanic words like ''komen'', "come". German uses {{angbr|c}} in the digraphs {{angbr|ch}} and {{angbr|ck}}, and the trigraph {{angbr|sch}}, but only by itself in unassimilated loanwords and proper names. [[Danish language|Danish]] keeps soft {{angbr|c}} in Romance words but changes hard {{angbr|c}} to {{angbr|k}}. Swedish has the same rules for soft and hard {{angbr|c}} as Danish, and also uses {{angbr|c}} in the digraph {{angbr|ck}} and the very common word ''och'', "and". [[Norwegian language|Norwegian]], [[Afrikaans language|Afrikaans]], and [[Icelandic language|Icelandic]] are the most restrictive, replacing all cases of {{angbr|c}} with {{angbr|k}} or {{angbr|s}}, and reserving {{angbr|c}} for unassimilated loanwords and names.
Germanic languages usually use {{angbr|c}} for Romance loans or digraphs, such as {{angbr|ch}} and {{angbr|ck}}, but the rules vary across languages. Of all the Germanic languages, only English uses initial {{angbr|c}} in native Germanic words like ''come''. Other than English, [[Dutch language|Dutch]] uses {{angbr|c}} the most, for most Romance loans and the digraph {{angbr|ch}}. German uses {{angbr|c}} in the digraphs {{angbr|ch}} and {{angbr|ck}}, and the trigraph {{angbr|sch}}, but by itself only in unassimilated loanwords and proper names. [[Danish language|Danish]] keeps soft {{angbr|c}} in Romance words but changes hard {{angbr|c}} to {{angbr|k}}. Swedish has the same rules for soft and hard {{angbr|c}} as Danish, and also uses {{angbr|c}} in the digraph {{angbr|ck}} and the very common word ''och'', "and". [[Norwegian language|Norwegian]], [[Afrikaans language|Afrikaans]], and [[Icelandic language|Icelandic]] are the most restrictive, replacing all cases of {{angbr|c}} with {{angbr|k}} or {{angbr|s}}, and reserving {{angbr|c}} for unassimilated loanwords and names.


All [[Balto-Slavic languages]] that use the Latin alphabet, as well as [[Albanian language|Albanian]], [[Hungarian language|Hungarian]], [[Pashto language|Pashto]], several [[Sami languages]], [[Esperanto]], [[Ido]], [[Interlingua]], and [[Americanist phonetic notation]] (and those [[Indigenous languages of North America|aboriginal languages of North America]] whose practical orthography derives from it) use {{angbr|c}} to represent {{IPA|/t͡s/}}, the [[voiceless alveolar sibilant affricate|voiceless alveolar]] or [[voiceless dental sibilant affricate]]. In [[Hanyu Pinyin]], the standard romanization of [[Standard Chinese|Mandarin Chinese]], the letter represents an aspirated version of this sound, {{IPA|/t͡s<sup>h</sup>/}}.
All [[Balto-Slavic languages]] that use the Latin alphabet, as well as [[Albanian language|Albanian]], [[Hungarian language|Hungarian]], [[Pashto language|Pashto]], several [[Sami languages]], [[Esperanto]], [[Ido]], [[Interlingua]], and [[Americanist phonetic notation]] (and those [[Indigenous languages of North America|aboriginal languages of North America]] whose practical orthography derives from it) use {{angbr|c}} to represent {{IPA|/t͡s/}}, the [[voiceless alveolar sibilant affricate|voiceless alveolar]] or [[voiceless dental sibilant affricate]]. In [[Hanyu Pinyin]], the standard romanization of [[Standard Chinese|Mandarin Chinese]], the letter represents an aspirated version of this sound, {{IPA|/t͡s<sup>h</sup>/}}.
Line 489: Line 389:


As in English, {{angbr|ck}}, with the value {{IPA|/k/}}, is often used after short vowels in other [[Germanic languages]] such as German and [[Swedish language|Swedish]] (other Germanic languages, such as Dutch and [[Norwegian language|Norwegian]], use {{angbr|kk}} instead). The digraph {{angbr|cz}} is found in Polish and {{angbr|cs}} in Hungarian, representing {{IPA|/t͡ʂ/}} and {{IPA|/t͡ʃ/}} respectively. The digraph {{angbr|sc}} represents {{IPA|/ʃ/}} in Old English, Italian, and a few languages related to Italian (where this only happens before [[front vowel]]s, while otherwise it represents {{IPA|/sk/}}). The trigraph {{angbr|sch}} represents {{IPA|/ʃ/}} in German.
As in English, {{angbr|ck}}, with the value {{IPA|/k/}}, is often used after short vowels in other [[Germanic languages]] such as German and [[Swedish language|Swedish]] (other Germanic languages, such as Dutch and [[Norwegian language|Norwegian]], use {{angbr|kk}} instead). The digraph {{angbr|cz}} is found in Polish and {{angbr|cs}} in Hungarian, representing {{IPA|/t͡ʂ/}} and {{IPA|/t͡ʃ/}} respectively. The digraph {{angbr|sc}} represents {{IPA|/ʃ/}} in Old English, Italian, and a few languages related to Italian (where this only happens before [[front vowel]]s, while otherwise it represents {{IPA|/sk/}}). The trigraph {{angbr|sch}} represents {{IPA|/ʃ/}} in German.

==Other uses==
{{main article|C (disambiguation)}}
* In the [[hexadecimal]] (base 16) numbering system, C is a number that corresponds to the number 12 in [[decimal]] (base 10) counting.
* In the [[Roman numeral]] system, C represents 100.
* [[Unit prefix]] c, meaning one hundredth.


==Related characters==
==Related characters==
Line 525: Line 431:
*Ꜿ ꜿ : [[Middle Ages|Medieval]] abbreviation for [[Latin]] syllables con- and com-, [[Portuguese language|Portuguese]] -us and -os<ref>{{Cite web|url=https://www.unicode.org/L2/L2006/06027-n3027-medieval.pdf|title=L2/06-027: Proposal to add Medievalist characters to the UCS|date=2006-01-30|first1=Michael|last1=Everson|first2=Peter|last2=Baker|first3=António|last3=Emiliano|first4=Florian|last4=Grammel|first5=Odd Einar|last5=Haugen|first6=Diana|last6=Luft|first7=Susana|last7=Pedro|first8=Gerd|last8=Schumacher|first9=Andreas|last9=Stötzner}}</ref>
*Ꜿ ꜿ : [[Middle Ages|Medieval]] abbreviation for [[Latin]] syllables con- and com-, [[Portuguese language|Portuguese]] -us and -os<ref>{{Cite web|url=https://www.unicode.org/L2/L2006/06027-n3027-medieval.pdf|title=L2/06-027: Proposal to add Medievalist characters to the UCS|date=2006-01-30|first1=Michael|last1=Everson|first2=Peter|last2=Baker|first3=António|last3=Emiliano|first4=Florian|last4=Grammel|first5=Odd Einar|last5=Haugen|first6=Diana|last6=Luft|first7=Susana|last7=Pedro|first8=Gerd|last8=Schumacher|first9=Andreas|last9=Stötzner}}</ref>


==Other representations==
==Code points <span class="anchor" id="Computing codes"></span>==
===Computing <span class="anchor" id="Computing codes"></span>===
These are the [[code point]]s for the forms of the letter in various systems
The Latin letters {{angbr|C}} and {{angbr|c}} have [[Unicode]] encodings {{unichar|0043|Latin letter capital C}} and {{unichar|0063|Latin small letter c}}. These are the same [[code point]]s as were used in [[ASCII]] and [[ISO 8859]]. There are also [[precomposed character]] encodings for {{angbr|C}} and {{angbr|c}} with diacritics, for most of those listed [[#Related characters|above]]; the remainder are produced using [[combining diacritic]]s.
{{charmap
| 0043 | 0063 | name1 = Latin Capital Letter C | name2 = Latin Small Letter C
| map1 = [[EBCDIC]] family | map1char1 = C3 | map1char2 = 83
| map2 = [[ASCII]] <sup>1</sup> | map2char1 = 43 | map2char2 = 63
}}
: <sup>1</sup> {{midsize|Also for encodings based on ASCII, including the DOS, Windows, ISO-8859 and Macintosh families of encodings.}}


Variant forms of the letter have unique code points for specialist use: the [[Mathematical Alphanumeric Symbols|alphanumeric symbols set]] in mathematics and science, [[Voiceless palatal stop|voiceless palatal]] sounds in linguistics, and [[halfwidth and fullwidth forms]] for legacy CJK font compatibility. The Cyrillic [[homoglyph]] of the Latin {{angbr|C}} has a separate encoding: {{unichar|0421|Cyrillic capital letter Es|nlink=Es (Cyrillic)}}.
In Unicode, C is also encoded in various font styles for mathematical purposes; see [[Mathematical Alphanumeric Symbols]].


==Other representations==
===Other===
{{Letter other reps
{{Letter other reps
|NATO=Charlie
|NATO=Charlie
Line 545: Line 446:
}}
}}


==Use as a number==
In the [[hexadecimal]] (base 16) numbering system, C is a number that corresponds to the number 12 in [[decimal]] (base 10) counting.

In the [[Roman numeral]] system, C represents 100.
{{clear}}
{{clear}}


Line 559: Line 456:


==External links==
==External links==
{{EB9 Poster|C}}
{{Wikisource1911Enc|C}}
{{Wikisource1911Enc|C}}
*{{Commons-inline|C}}
*{{Commons-inline}}
*{{Wiktionary-inline|C}}
*{{Wiktionary-inline|C}}
*{{Wiktionary-inline|c}}
*{{Wiktionary-inline|c}}

Latest revision as of 17:32, 9 May 2024

C
C c
Usage
Writing systemLatin script
TypeAlphabetic
Language of originLatin language
Phonetic usage
Unicode codepointU+0043, U+0063
Alphabetical position3
Numerical value: 100
History
Development
Sisters
Other
Associated numbers100
This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and ⟨ ⟩, see IPA § Brackets and transcription delimiters.
C in copyright symbol

C, or c, is the third letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is cee (pronounced /ˈs/), plural cees.[1]

History

Egyptian Phoenician
gaml
Western Greek
Gamma
Etruscan
C
Old Latin
C (G)
Latin
C
T14
Phoenician gimel Greek Gamma Etruscan C Old Latin Latin C

"C" comes from the same letter as "G". The Semites named it gimel. The sign is possibly adapted from an Egyptian hieroglyph for a staff sling, which may have been the meaning of the name gimel. Another possibility is that it depicted a camel, the Semitic name for which was gamal. Barry B. Powell, a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".[2]

In the Etruscan language, plosive consonants had no contrastive voicing, so the Greek 'Γ' (Gamma) was adopted into the Etruscan alphabet to represent /k/. Already in the Western Greek alphabet, Gamma first took a '' form in Early Etruscan, then '' in Classical Etruscan. In Latin it eventually took the 'c' form in Classical Latin. In the earliest Latin inscriptions, the letters 'c k q' were used to represent the sounds /k/ and /ɡ/ (which were not differentiated in writing). Of these, 'q' was used to represent /k/ or /ɡ/ before a rounded vowel, 'k' before 'a', and 'c' elsewhere.[3] During the 3rd century BC, a modified character was introduced for /ɡ/, and 'c' itself was retained for /k/. The use of 'c' (and its variant 'g') replaced most usages of 'k' and 'q'. Hence, in the classical period and after, 'g' was treated as the equivalent of Greek gamma, and 'c' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'ΚΑΔΜΟΣ', 'ΚΥΡΟΣ', and 'ΦΩΚΙΣ' came into Latin as 'cadmvs', 'cyrvs' and 'phocis', respectively.

Other alphabets have letters homoglyphic to 'c' but not analogous in use and derivation, like the Cyrillic letter Es (С, с) which derives from the lunate sigma.

Later use

When the Roman alphabet was introduced into Britain, ⟨c⟩ represented only /k/, and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh,[4] Irish, and Gaelic, ⟨c⟩ represents only /k/. The Old English Latin-based writing system was learned from the Celts, apparently of Ireland; hence ⟨c⟩ in Old English also originally represented /k/; the Modern English words kin, break, broken, thick, and seek all come from Old English words written with ⟨c⟩: cyn, brecan, brocen, þicc, and séoc. However, during the course of the Old English period, /k/ before front vowels (/e/ and /i/) was palatalized, having changed by the tenth century to [tʃ], though ⟨c⟩ was still used, as in cir(i)ce, wrecc(e)a. On the continent, meanwhile, a similar phonetic change before the same two vowels had also been going on in almost all modern romance languages (for example, in Italian).

In Vulgar Latin, /k/ became palatalized to [tʃ] in Italy and Dalmatia; in France and the Iberian peninsula, it became [ts]. Yet for these new sounds c was still used before the letters ⟨e⟩ and ⟨i⟩. The letter thus represented two distinct values. Subsequently, the Latin phoneme /kw/ (spelled qv) de-labialized to /k/ meaning that the various Romance languages had /k/ before front vowels. In addition, Norman used the letter ⟨k⟩ so that the sound /k/ could be represented by either ⟨k⟩ or ⟨c⟩, the latter of which could represent either /k/ or /ts/ depending on whether it preceded a front vowel letter or not. The convention of using both ⟨c⟩ and ⟨k⟩ was applied to the writing of English after the Norman Conquest, causing a considerable re-spelling of the Old English words. Thus while Old English candel, clif, corn, crop, cú, remained unchanged, Cent, cǣᵹ (cēᵹ), cyng, brece, sēoce, were now (without any change of sound) spelled Kent, keȝ, kyng, breke, and seoke; even cniht ('knight') was subsequently changed to kniht and þic ('thick') changed to thik or thikk. The Old English ⟨cw⟩ was also at length displaced by the French ⟨qu⟩ so that the Old English cwēn ('queen') and cwic ('quick') became Middle English quen and quik, respectively. The sound [tʃ], to which Old English palatalized /k/ had advanced, also occurred in French, chiefly from Latin /k/ before ⟨a⟩. In French it was represented by the digraph ⟨ch⟩, as in champ (from Latin camp-um) and this spelling was introduced into English: the Hatton Gospels, written c. 1160, have in Matt. i-iii, child, chyld, riche, mychel, for the cild, rice, mycel, of the Old English version whence they were copied. In these cases, the Old English ⟨c⟩ gave way to ⟨k⟩, ⟨qu⟩ and ⟨ch⟩; on the other hand, ⟨c⟩ in its new value of /ts/ appeared largely in French words like processiun, emperice and grace, and was also substituted for ⟨ts⟩ in a few Old English words, as miltse, bletsien, in early Middle English milce, blecien. By the end of the thirteenth century both in France and England, this sound /ts/ de-affricated to /s/; and from that time ⟨c⟩ has represented /s/ before front vowels either for etymological reasons, as in lance, cent, or to avoid the ambiguity due to the "etymological" use of ⟨s⟩ for /z/, as in ace, mice, once, pence, defence.

Thus, to show etymology, English spelling has advise, devise (instead of *advize, *devize), while advice, device, dice, ice, mice, twice, etc., do not reflect etymology; example has extended this to hence, pence, defence, etc., where there is no etymological reason for using ⟨c⟩. Former generations also wrote sence for sense. Hence, today the Romance languages and English have a common feature inherited from Vulgar Latin spelling conventions where ⟨c⟩ takes on either a "hard" or "soft" value depending on the following letter.

Use in writing systems

Pronunciation of ⟨c⟩ by language
Orthography Phonemes Environment
Albanian /ts/
Cypriot Arabic /ʕ/
Azeri //
Berber /ʃ/
Bukawa /ʔ/
Catalan /k/ Except before e, i
/s/ Before e, i
Standard Chinese (Pinyin) /tsʰ/
Crimean Tatar //
Cornish (Standard Written Form) /s/
Czech /ts/
Danish /k/ Except before e, i, y, æ, ø
/s/ Before e, i, y, æ, ø
Dutch /k/ Except before e, i, y
/s/ Before e, i, y
// Before e, i in loanwords from Italian
English /k/ Except before e, i, y
/s/ Before e, i, y
/ʃ/ Before ea, ia, ie, io, iu
Esperanto /ts/
Fijian /ð/
Filipino /k/ Except before e, i
/s/ Before e, i
French /k/ Except before e, i, y
/s/ Before e, i, y
Fula //
Gagauz //
Galician /k/ Except before e, i
/θ/ or /s/ Before e, i
German /k/ Except before ä, e, i, ö, ü, y in loanwords and names
/ts/ Before ä, e, i, ö, ü, y in loanwords and names
Hausa //
Hungarian /ts/
Indonesian //
Irish /k/ Except before e, i; or after i
/c/ Before e, i; or after i
Italian /k/ Except before e, i
// Before e, i
Khmer (ALA-LC) /c/
Kurmanji (Hawar) //
Latin /k/ (and /g/ in early Latin)
Latvian /ts/
Malay //
Manding //
Norwegian /k/ Except before e, i, y, æ, ø in loanwords and names
/s/ Before e, i, y, æ, ø in loanwords and names
Polish /ts/ Except before i
// Before i
Portuguese /k/ Except before e, i, y
/s/ Before e, i, y
Romanian /k/ Except before e, i
// Before e, i
Romansh /k/ Except before e, i
/ts/ Before e, i
Scottish Gaelic // Except before e, i; or after i
/kʰʲ/ Before e, i; or after i
Serbo-Croatian /ts/
Slovak /ts/
Slovene /ts/
Somali /ʕ/
Spanish /k/ Except before e, i, y
/θ/ or /s/ Before e, i, y
Swedish /k/ Except before e, i, y, ä, ö
/s/ Before e, i, y, ä, ö
Tatar /ʑ/
Turkish //
Valencian /k/ Except before e, i
/s/ Before e, i
Vietnamese /k/ Except word-finally
// Word-finally
Welsh /k/
Xhosa /ǀ/
Yabem /ʔ/
Yup'ik //
Zulu /ǀ/

English

In English orthography, ⟨c⟩ generally represents the "soft" value of /s/ before the letters ⟨e⟩ (including the Latin-derived digraphs ⟨ae⟩ and ⟨oe⟩, or the corresponding ligatures ⟨æ⟩ and ⟨œ⟩), ⟨i⟩, and ⟨y⟩, and a "hard" value of /k/ before any other letters or at the end of a word. However, there are a number of exceptions in English: "soccer" and "Celt" are words that have /k/ where /s/ would be expected.

The "soft" ⟨c⟩ may represent the /ʃ/ sound in the digraph ⟨ci⟩ when this precedes a vowel, as in the words 'delicious' and 'appreciate', and also in the word "ocean" and its derivatives.

The digraph ch most commonly represents //, but can also represent /k/ (mainly in words of Greek origin) or /ʃ/ (mainly in words of French origin). For some dialects of English, it may also represent /x/ in words like loch, while other speakers pronounce the final sound as /k/. The trigraph ⟨tch⟩ always represents //.

The digraph ⟨ck⟩ is often used to represent the sound /k/ after short vowels, like "wicket".

C is the twelfth most frequently used letter in the English language (after E, T, A, O, I, N, S, H, R, D, and L), with a frequency of about 2.8% in words.

Other languages

In the Romance languages French, Spanish, Italian, Romanian and Portuguese, ⟨c⟩ generally has a "hard" value of /k/ and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan and Spanish from Latin America and some places in Spain, the soft ⟨c⟩ value is /s/ as it is in English. In the Spanish spoken in most of Spain, the soft ⟨c⟩ is a voiceless dental fricative /θ/. In Italian and Romanian, the soft ⟨c⟩ is [t͡ʃ].

Germanic languages usually use ⟨c⟩ for Romance loans or digraphs, such as ⟨ch⟩ and ⟨ck⟩, but the rules vary across languages. Of all the Germanic languages, only English uses initial ⟨c⟩ in native Germanic words like come. Other than English, Dutch uses ⟨c⟩ the most, for most Romance loans and the digraph ⟨ch⟩. German uses ⟨c⟩ in the digraphs ⟨ch⟩ and ⟨ck⟩, and the trigraph ⟨sch⟩, but by itself only in unassimilated loanwords and proper names. Danish keeps soft ⟨c⟩ in Romance words but changes hard ⟨c⟩ to ⟨k⟩. Swedish has the same rules for soft and hard ⟨c⟩ as Danish, and also uses ⟨c⟩ in the digraph ⟨ck⟩ and the very common word och, "and". Norwegian, Afrikaans, and Icelandic are the most restrictive, replacing all cases of ⟨c⟩ with ⟨k⟩ or ⟨s⟩, and reserving ⟨c⟩ for unassimilated loanwords and names.

All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages, Esperanto, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it) use ⟨c⟩ to represent /t͡s/, the voiceless alveolar or voiceless dental sibilant affricate. In Hanyu Pinyin, the standard romanization of Mandarin Chinese, the letter represents an aspirated version of this sound, /t͡sh/.

Among non-European languages that have adopted the Latin alphabet, ⟨c⟩ represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of /t͡ʃ/. In Azeri, Crimean Tatar, Kurmanji Kurdish, and Turkish ⟨c⟩ stands for the voiced counterpart of this sound, the voiced postalveolar affricate /d͡ʒ/. In Yabem and similar languages, such as Bukawa, ⟨c⟩ stands for a glottal stop /ʔ/. Xhosa and Zulu use this letter to represent the click /ǀ/. In some other African languages, such as Berber languages, ⟨c⟩ is used for /ʃ/. In Fijian, ⟨c⟩ stands for a voiced dental fricative /ð/, while in Somali it has the value of /ʕ/.

The letter ⟨c⟩ is also used as a transliteration of Cyrillic ⟨ц⟩ in the Latin forms of Serbian, Macedonian, and sometimes Ukrainian, along with the digraph ⟨ts⟩.

Other systems

As a phonetic symbol, lowercase ⟨c⟩ is the International Phonetic Alphabet (IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital ⟨C⟩ is the X-SAMPA symbol for the voiceless palatal fricative.

Digraphs

There are several common digraphs with ⟨c⟩, the most common being ch, which in some languages (such as German) is far more common than ⟨c⟩ alone. ⟨ch⟩ takes various values in other languages.

As in English, ⟨ck⟩, with the value /k/, is often used after short vowels in other Germanic languages such as German and Swedish (other Germanic languages, such as Dutch and Norwegian, use ⟨kk⟩ instead). The digraph ⟨cz⟩ is found in Polish and ⟨cs⟩ in Hungarian, representing /t͡ʂ/ and /t͡ʃ/ respectively. The digraph ⟨sc⟩ represents /ʃ/ in Old English, Italian, and a few languages related to Italian (where this only happens before front vowels, while otherwise it represents /sk/). The trigraph ⟨sch⟩ represents /ʃ/ in German.

Other uses

  • In the hexadecimal (base 16) numbering system, C is a number that corresponds to the number 12 in decimal (base 10) counting.
  • In the Roman numeral system, C represents 100.
  • Unit prefix c, meaning one hundredth.

Related characters

Ancestors, descendants and siblings

A curled C in the coat of arms of Porvoo
  • 𐤂 : Semitic letter Gimel, from which the following symbols originally derive
    • Γ γ : Greek letter Gamma, from which C derives
      • G g : Latin letter G, which is derived from Latin C
        • Ȝ ȝ : Latin letter Ȝ, which is derived from Latin G
  • Phonetic alphabet symbols related to C:
    • ɕ : Small c with curl
    • ʗ : Stretched c
    • 𝼏 : Stretched c with curl - Used by Douglas Beach for a nasal click in his phonetic description of Khoekhoe[5]
    • 𝼝 : Small letter c with retroflex hook - Para-IPA version of the IPA retroflex tʂ[6]
    • ꟲ : Modifier letter capital c - Used to mark tone for the Chatino orthography in Oaxaca, Mexico; Used as a generic transcription for a falling tone; Used in para-IPA notation[7]
  •  : Modifier letter small c[8]
  •  : Modifier letter small c with curl[8]
  • ᴄ : Small capital c is used in the Uralic Phonetic Alphabet.[9]
  • Ꞔ ꞔ : C with palatal hook, used for writing Mandarin Chinese using the early draft version of pinyin romanization during the mid-1950s[10]

Add to C with diacritics

Derived ligatures, abbreviations, signs and symbols

Other representations

Computing

The Latin letters ⟨C⟩ and ⟨c⟩ have Unicode encodings U+0043 C LATIN CAPITAL LETTER C and U+0063 c LATIN SMALL LETTER C. These are the same code points as were used in ASCII and ISO 8859. There are also precomposed character encodings for ⟨C⟩ and ⟨c⟩ with diacritics, for most of those listed above; the remainder are produced using combining diacritics.

Variant forms of the letter have unique code points for specialist use: the alphanumeric symbols set in mathematics and science, voiceless palatal sounds in linguistics, and halfwidth and fullwidth forms for legacy CJK font compatibility. The Cyrillic homoglyph of the Latin ⟨C⟩ has a separate encoding: U+0421 С CYRILLIC CAPITAL LETTER ES.

Other

See also

References

  1. ^ "C" Oxford English Dictionary, 2nd edition (1989); Merriam-Webster's Third New International Dictionary of the English Language, Unabridged (1993); "cee", op. cit.
  2. ^ Powell, Barry B. (27 Mar 2009). Writing: Theory and History of the Technology of Civilization. Wiley Blackwell. p. 182. ISBN 978-1405162562.
  3. ^ Sihler, Andrew L. (1995). New Comparative Grammar of Greek and Latin (illustrated ed.). New York: Oxford University Press. p. 21. ISBN 0-19-508345-8.
  4. ^ "Reading Middle Welsh -- 29 Medieval Spelling". www.mit.edu. Retrieved 2019-11-19.
  5. ^ Miller, Kirk; Sands, Bonny (2020-07-10). "L2/20-115R: Unicode request for additional phonetic click letters" (PDF).
  6. ^ Miller, Kirk (2021-01-11). "L2/21-041: Unicode request for additional para-IPA letters" (PDF).
  7. ^ Miller, Kirk; Cornelius, Craig (2020-09-25). "L2/20-251: Unicode request for modifier Latin capital letters" (PDF).
  8. ^ a b Constable, Peter (2004-04-19). "L2/04-132 Proposal to add additional phonetic characters to the UCS" (PDF).
  9. ^ Everson, Michael; et al. (2002-03-20). "L2/02-141: Uralic Phonetic Alphabet characters for the UCS" (PDF).
  10. ^ West, Andrew; Chan, Eiso; Everson, Michael (2017-01-16). "L2/17-013: Proposal to encode three uppercase Latin letters used in early Pinyin" (PDF).
  11. ^ Everson, Michael (2005-08-12). "L2/05-193R2: Proposal to add Claudian Latin letters to the UCS" (PDF).
  12. ^ Everson, Michael; Baker, Peter; Emiliano, António; Grammel, Florian; Haugen, Odd Einar; Luft, Diana; Pedro, Susana; Schumacher, Gerd; Stötzner, Andreas (2006-01-30). "L2/06-027: Proposal to add Medievalist characters to the UCS" (PDF).

External links

  • Media related to C at Wikimedia Commons
  • The dictionary definition of C at Wiktionary
  • The dictionary definition of c at Wiktionary