welsh – A Box Of Chocolates

Is Gàidhlig Exceptional?

Posted on 27/05/2025 by zerothly

We had a conversation in a café in Dail Bheithe (Dalbeattie) yesterday which rather surprised us. We got into a conversation with someone at the next table who was learning Welsh. She was clearly thoroughly Scottish, so this was an interesting choice, although one also made by another acquaintance living here. It then turned out that a family at another table were actually Welsh and fluent in the language, and all of us ended up discussing various things about the surviving Celtic languages, particularly Gàidhlig and Welsh. One of the things they said about Welsh is that it’s “the hardest language”. This reminded me of how Japanese speakers are fond of saying that their language is hard and that the learner’s polite response to this is said to be to acknowledge it and say one is working hard on learning it. People seem to be proud of the idea that their native language, or one they associate with their heritage, is in some way difficult or special.

My knowledge of Welsh is very limited, more so than my knowledge of Gàidhlig, substantially because it isn’t really part of my heritage. There’s an argument that Welsh is the heritage of most White people with English ancestry because in a sense Welsh used to be the main language spoken in what would become England and being from Kent, there are plenty of “Welsh” placenames in the area such as Dover and Wye. However, one reason for me not even trying very much with Welsh is that when I read the chapter on it in ‘Language Made Plain’, Anthony Burgess pointed out that the initial consonants often changed according to their setting and grammatical context, and I found that so off-putting that I didn’t pursue it any further. In fact it’s quite common for languages to do this, particularly the surviving Celtic languages, which as far as I know all do it. From a practical perspective it makes it difficult to look words up in the dictionary when one hears them spoken.

Regarding Welsh in particular, English speakers in England and parts of Wales are often quite hostile to it, making the following comments:

It hasn’t got any vowels. This is obviously nonsense and refers to the spelling using letters used for consonants in English, W and Y, being used for vowels in Welsh.
It involves a lot of phlegm and spitting. I’m guessing this is about the voiceless consonants which are unusual in modern English, namely CH, LL and RH.
It’s spelt impenetrably. This is deeply unfair. Welsh spelling is actually quite close to Old English spelling and is very close to being phonetic compared to English nowadays. One would be hard-pressed to find another Western European language with better spelling. Spanish, Basque and German are about on the same level. English, French, Faroese and the continental Scandinavian languages are decidedly not.

Earlier that day, we’d been wandering about what might be called the machair, although I’m not sure it counts as true machair as we live in the southwest, and it reminded me of the concept of the ionad as it was one of several examples of a border between two things being considered as a place in itself. Whether or not it was actually machair, it was definitely that, and I’ll come back to that. There were also what initially appeared to be large, smooth pebbles stranded along the shoreline among the plants, and we quickly realised they were jellyfish, Aurelia in fact, which gave me to wonder if they were osmotic conformers who couldn’t cope with the low salinity of the water. Coincidentally, the Welsh people in the café brought up the Welsh way of referring to jellyfish, which opened up a can of tentacles, because one nickname used appears to be “pysgod wibli-wobli”, which incidentally illustrates how very phonetic their spelling is. I’m not entirely convinced this is true because of the “popty ping” thing, which they also mentioned, but apparently that too is a nickname and not just a joke to be played upon the Sassenach (I’m vaguely aware of a Welsh word “Saesneg” but I’ve probably got it wrong so you’re stuck with the Gàidhlig there). The term for microwave oven is probably not problematic in Welsh, but it turns out one term for jellyfish definitely does need a euphemism because it’s “cont y môr”, which translates literally as “C-word of the sea”. Makes me think of vagina dentata, which in turn I usually associate with sea urchins myself.

Becoming curious about the Gàidhlig for jellyfish, I find the word “sgoldrach” and a further term “muir-tèachd” – “sea gel” – and “sgeith-ròin”, which seems to mean “puke hair”, and that does make sense to some degree, and even edges into offensive territory, which I’m guessing “cont y môr” is supposed to be as well. The Welsh term seems to have a double connotation in that it refers to the animal’s shape and probably also to its threatening character to humans, although that presumes that words for genitals are considered pejorative in Welsh, since apparently they aren’t in Gàidhlig, so I’m told.

All of that, then, has led to highly fruitful cogitation over the use of these words in Celtic languages, and their unfamiliarity to a Sassenach is very stimulating in this respect. However, I’m not quite finished with Welsh yet. One of the remarkable things about Welsh is that the oldest written records in the language are not from Wales but apparently from around here, what became southwest Scotland. They comprise poems attributed to Taliesin and Aneirin. Wikipedia provides the following picture of a page from the ninth century CE book, Llyfr Aneirin:

Impressed upon me as a non-speaker of Welsh, this looks very Welsh indeed to me, which I find rather surprising. I’d assumed that the Brittonic language spoken along much of the western part of Great Britain varied more than that, but to my eye this just looks like Welsh, even in its spelling, which I mention because I’ve seen archaic spellings of Welsh using the likes of LH instead of LL. However, maybe it isn’t Welsh. Cornish and Breton aren’t Welsh and my understanding was that Cumbric was the Brittonic language spoken here immediately before Gàidhlig arrived from Ireland and Man. Whatever the language was, it survives in the well-known counting rhymes used by shepherds, which resemble Welsh quite closely.

Before I leave the topic of Welsh for now, I want to point out that neither of the two people I know living locally who are learning it were aware that it used to be spoken here. They just decided to learn it anyway, and I wonder if that means there’s more to it than it seems. Is there something about this region which hints that it used to be Welsh-speaking? It’s hard not to be doubtful about that idea as it sounds supernatural. It is genuinely the case, though, that the placenames around here are sometimes Welsh, a notable example being Cummertrees, whose first two syllables may in fact have originally been something like “Cymru”. There also used to be a Welsh-speaking kingdom here called Rheged, which as far as I understand it stretched from Ayrshire to Ynys Môn (I’m probably wrong about that). Awareness of its existence made me feel slightly better when we went to Carlisle recently, which incidentally is Caerliwelydd in Welsh. Nonetheless, this is all ancient, well mediaeval, history.

Turning to Goidelic, i.e. Q-Celtic for the purposes of modern times, languages and their possible exceptionalism, there’s a second issue. Apparently some Goidelic speakers regard their speech as part of a continuum of the same language which Sassenachs perceive as separate. I’ve seen, and unfortunately forgotten, evidence that the Gàidhlig spoken in Galloway was close to Manx, which wouldn’t be surprising, but as few traces remain the version we’re learning is that of the Gaidhealtacht (sp?), insofar as that’s unified. I’ve also heard that Enya’s Irish is closer to Gàidhlig than more southerly Irish speakers’ because she’s from Donegal in Ulster. It actually seems like a pity to me that we’re not learning a reconstructed Galwegian Gàidhlig, as that would to some extent restore the continuum, but presumably resources and the reasons people learn it preclude this. So when I ask whether it’s exceptional, what do I mean by that? Am I talking about the extinct Galwegian Gàidhlig, the almost extinct but very similar Manx, Gàidhlig as spoken in the Gàidhealtachd (which I can’t spell, apparently) or the entire gamut of living and recently extinct Goidelic languages?

I suppose I mean three things, as applied to the style of speech I’m learning: vocabulary, phonetics and grammar. One of the issues with living Celtic languages appears to be an historical academic bias towards classical European languages, mainly Latin and Ancient Greek, and later to what’s been called “Standard Average European”, or SAE. This was apparently originally Benjamin Whorf’s idea, and it involves the hypothesis that European languages have tended to become more like each other due to being in constant close physical proximity, leading to common features in syntax, grammar, vocabulary and usage. Two examples of how this can perhaps surprisingly manifest itself are found in many terms in Tok Pisin, an English-based creole in New Guinea, which uses “gras bilong hed” to refer to “hair”, when this way of thinking of hair would never occur to most Europeans – “head grass”, and in dvandva compounds, where two words are combined with equal force and type of reference such as “mother-father” meaning “parents” rather than something like “maternal grandfather” as occurs in Swedish. We don’t even know we’re doing it. This leads us to become less aware of other possibilities or features in languages which are unlike these, which is basically a form of ethnocentrism. Another aspect of this, from Swedish, is /ɧ/, a sound almost unique to that language and with its own phonetic symbol, but at the same time the click sounds of Khoisan languages were for a long time analysed into separate letters when they were actually single sounds rightly represented by single, but different, letters.

And the situation with Celtic languages as spoken today is that they’re on the fringe of Europe, not in a pejorative way but in terms of being less influenced by other European languages and less likely to be pulled towards them. To a limited extent this is also seen with Icelandic, which along with Faroese is an outlier with respect to other Nordic languages in spite of being the one which has changed least in the last millennium or more. In both cases, social forces have failed to pull these languages into the general homogeneity. It’s also alleged that the standard system for representing speech sounds, the IPA or International Phonetic Alphabet, is actually not very adequate for transcribing Goidelic languages. Other stuff is going on with their pronunciation which is hard to set down on paper because the notation is missing. This is clearly manifested in the apparently perverse spelling, but there is a reason why it’s like this. My personal opinion is that it should be written using Cyrillic script, the alphabet used for Russian and various other languages, but this is obviously never going to happen. It is, though, notable that the Slavic languages too, particularly Russian, are on the margins of Europe and therefore less driven towards SAE.

Thus in this case it does actually seem to make sense that Gàidhlig, or whatever it is I’m learning, is an exceptional language and that the thoughts and experiences of the Gaels are less forced into a SAE filter. Goidelic languages are from “outside the empire” and the fact that they were the languages of barbarians is a potential point in their favour. There is also a paradoxical sense in which these ultimately Western European languages, Irish, Gàidhlig, Manx, Faroese and Icelandic, are among the least Standard Average European languages in Europe! The well-known Balkan Sprachbund in southeastern Europe also tends to differ radically from more centrally located languages, perhaps for similar reasons.

I can’t say I have sufficient grasp of the grammar to make sweeping and accurate statements about it, but one thing which is notable is that the older terminology used to describe it is often inappropriate. The words “infinitive”, “dative”, “subjunctive” and “aspiration” are all used to describe features of the grammar and in the last case also phonetics which really don’t apply to the language at all. They may for all I know represent some kind of historical feature in a much earlier stage of the Celtic languages, but they really don’t make sense any more and it’s probably better not to use them. As I’ve said though, my understanding of the grammar is quite poor.

Vocabulary is slightly better. I’ve already written about “ionad”, the word for place or centre (in the sense, for instance, of “leisure centre”), and due to that word being used in maths, in itself it’s enough to revolutionise my perception of reality. It also very much anchors the word in my mind: I’m always going to notice when someone says “ionad” from now on. It’s a very inefficient way to learn a language of course, but the words get properly learnt this way. This also means you have to be careful to get it at least roughly right, in order not to have to make a greater effort to unlearn them otherwise. But many words can be treated as meditations from which one attempts to distil the water of life. That doesn’t make Gàidhlig exceptional, except in the usual sense that every language is exceptional, but adopting that attitude towards it may help one get the most out of it.

Somewhere else on the internet, some guy running a Gàidhlig group in Baltimore Maryland has singled out a number of words for the learner’s consideration. I can’t help thinking that this is rather precious in a distinctly American way which attempts to romanticise the language inappropriately, but it’s still interesting. His post can be found here on WordPress. Nine words are involved: cèilidh, slàn, dùthchas, cliù, aiteal, smùirnean, crith, lannair and deò.

“Cèilidh” was familiar to me from the late 1970s CE, when my English teacher and year head was in a band called the Oyster Cèilidh Band, now just known as the Oysterband and closely connected to Fiddler’s Dram and their one-hit wonder ‘Day Trip To Bangor’. I had no idea it was Gàidhlig at the time, and it has of course been extensively used in English since. It actually came to me after my first attempt to learn the language was over. The word is linked to “visit”, although I think this has now been replaced by another word. The precise image it brings up in my mind is the very English but also universal experience of a social gathering in a village square in Chilham, Kent, with gaily attired folk dancing round a maypole, getting drunk, playing instruments and singing songs. It’s actually quite “hey nonny-nonny” for me, and although Morris dancing is also a Scottish tradition – Dannsa na clag as it’s known – the strong association I make with that is there too for me. I’m a Sassenach, what do you expect? Nonetheless, I think it makes perfect sense to transport the spirit of such an occasion to Scotland and make the appropriate associations. Cèile also means “partner” or “spouse”, so although that may be coincidence, the idea of “partnering”, perhaps in a dance or accompanying each other in music is suggested to me here. It also implies belonging.

“Slàn” is the root of slàinte or “health”, and this actually feels like a fairly prosaic and boring word. It isn’t that it doesn’t have interesting associations, this time with wholeness, unbrokenness and the wholesome, but that kind of association exists between words linked to health and completeness in all sorts of languages, so it’s interesting and it helps one remember it, but it doesn’t seem special to me.

“Dùthchas” provides more food for thought in this respect. It has various reasons such as “place of origin”, “homeland”, “heritage”, “heredity” and also refers to a legal form of inherited land tenure. Hence the connotation of “place” exists with this word in a similar way to the connection between that concept and “ionad”. I find this a little worrying because I can see it lending itself to fascistic and negatively nationalistic urges, but I also hope that’s not so big a problem here and there are positive ways of being patriotic, such as admiring tolerance and care for the vulnerable if those are strong traditions in a place. My own dùthchas is of course partly Scottish, as is that of the whole diaspora. I wouldn’t presume to be as Scottish as an actual Scot but this place is keen on adopting others as their own and I’m hoping to be able to aspire fairly to that. In legal terms, links have been made between the right of indigenous peoples to their land in countries with colonial history such as Australia and the US and rights to clan lands here. It’s also been described as expressing the idea that people belong to places rather than places to people.

“Cliù” means “reputation” and “renown”, also “praise”, and therefore also “character”. It expresses the idea that one is proud of being useful to a community. It’s apparently summed up in Iain Crichton’s essay on being a Gael, ‘Real People In A Real Place’. It seems to be about being perceived as more than a “character”, and escaping from the, I don’t know, mysticisation and mistification of Na h-Eileanan an Iar as a kind of misty and mysterious place at the end of the Earth, and the romanticisation of the Gael. It’s a little like the essay ‘For All Those Who Were Indian In A Former Life‘ in that it makes the outsider question how they approach the cultures of others. I fervently want to be an asset to this community, not in a proud way but just to have a good idea, and the motivation, to know how I can be the most value to the people who live here. I suppose that’s cliù.

Next comes a bit of a departure from the communal and social in the form of “aiteal”, which I can’t help noting is a lot easier to type than most of the other words here. This reveals the absence of a proper Gàidhlig keyboard layout for computers in general. On this Chromebook I have a dozen keyboard layout options, none of which are suitable for the language. Leaving all that aside though, “aiteal” can be translated as “glimpse”, “sprinkle”, “slight breeze”, “ray of sunshine” and “smidgeon”. Maybe a “breath”. It’s a hint of things to come or the hidden, a bit like the idea of an iceberg being mainly hidden below the surface.

“Smùirnean” also refers to a small thing, like an atom or a mote of dust, and it’s at this point that I feel the guy in Baltimore has kind of gone off on one. I think it’s okay to engage in this kind of thing to fix the concepts in one’s mind, but this is also an everyday, working language spoken by real people who share in the universals which are also part of us. Anyway, “smùirnean” has a more figurative meaning of an initial inkling about something, the start of a realisation. But then he goes on about “the interconnected nature of life” and I start to be reminded of ‘Zen And The Art Of Motorcycle Maintenance’ and ‘Jonathan Livingstone Seagull’, but maybe it’s I who am failing to appreciate the authenticity of this take.

He does the same thing again with “crith”: tremble, be in a tremor, the ague, shaking, quaking, shivering. This to him conveys the transient nature of an apparently solid and reliable order of things, the way an earth tremor reveals that this apparently sturdy, steady and monumental land we all live on is ultimately anything but, and that we’re almost here on suffrance, or rather that the world is indifferent to us. It can also refer to the shimmering of a mirage, something I perhaps ignorantly assumed is not a common experience in the Gaidhealtachd, although there are certainly mirages here. Maybe the “Fata Morgana” is a frequent experience here. This is the appearance of a distorted, distant object beyond the horizon, often seen above ice, and it does in fact refer to Morgan Le Fay of the tales of King Arthur, although we’re not in the Brittonic realm here, or at least not any more. It could refer to a deceptive thin layer of moss floating on a bog which makes the ground appear more solid than it really is, or possibly the misleading nature of a heath with hidden crevices into which one’s foot might slip and be trapped, as is apparently the case in Clan McIntyre country, and I’m sure many other places. The world is in fact less stable and substantial than we tend to think of it, although it can be hard to live confidently without assuming that. This is true both in terms of the illusion of security many people have and more literally so in the fact that matter is almost entirely empty space, as is the Universe beyond this planet.

Next there’s “lannair”: “radiance”, “gleam”, the glitter of fish scales or swords. Again I feel the author, whose name is Richard Gwynallen incidentally, is somewhat romanticising things in referring to the glint of swords. It can refer to the inner gleam of light that appears in someone’s eyes which presages something to come, once again. There’s a definite theme in many of these words and it’s hard to tell whether this says more about our Richard or more about the language.

His last word is “deò”: breath, vital spark, ghost, spark of fire or ray of light, once again. He speaks of it as the moment where the water of a river slips into the sea and we humans become part of a greater whole.

I don’t know what to make of all this. It’s a personal view and it seems rather fanciful and poetic. Then again, maybe the language is primarily poetic. I’ve felt for a while that it’s almost meant to be sung rather than spoken, so maybe he has a point with all this. Comparing German, because of reading Heidegger and other philosophers in German, pretty much prosaic, ordinary things expressed in that language come across to me as somewhat philosophical in nature and in fact Heidegger was himself quite attached to the idea that the common folk’s use of terms made them more significant, therefore preferring terms like “Geworfenheit” and “zuhanden” to rather more classically-based language. It’s not advisable to go too far with this because he was a Nazi, something I’d like to come to terms with. Even so, the poetry and song composed in Gàidhlig could be a demonstration of how the language also tends to go in ordinary conversation, so maybe Richard Gwynallen is right.

In the end, then, maybe Gàidhlig and her sisters really are exceptional, substantially because they’ve not been squashed down into the homogeneity of SAE. Maybe Gàidhlig is the “Anti-Esperanto”, not only due to its small number of native speakers but also because there’s no rule to force it to follow. It seems quirky to the outsider, to the Sassenach and no doubt even to many Scots, but it has a freedom and spontaneity, the foam, froth and white water of a mountain burn or waves breaking on the white sand of a Hebridean beach, rather than the flatness and standardisation of the speech of central Europe. Maybe this is what we can get from it. And we need to let it flow over us and our ears, trying to swim in its relentless flood, and one day we will succeed.

Crotchety Old Grandpa Of A Language

Posted on 07/03/2025 by zerothly

I’ve blogged about Gàidhlig on here before, but I’m guessing also that there are entire blogs on the language, so I think it’s okay for me to talk about it again. Though I don’t recall exactly what I said in that post, it amounts to the highly idiosyncratic nature of the surviving Celtic languages, which is unfair because back in the day they were very similar to Latin, even uncannily so. But something happened to the languages the Celts spoke once they reached these isles, and oddly it happened to both lots. The Celtic languages spoken in what Gàidhlig calls Na Eileanan Bhreatainn didn’t stem from a single language or even two closely related languages, but from two languages which had already been evolving apart from each other for many centuries, so the peculiarities of the six surviving or revived languages are quite mysterious. But I’ll come back to that in a bit.

I think I first started to try to learn the language in 1978 CE. I know it was quite some time before studying O-levels was an imminent prospect, so it can’t have been after 1981. I remember this because the library book I was using had school exam past papers at the back and I found this quite intimidating and expected never to get any qualifications at the time. The actual book I didn’t find at all daunting though. This contrasts with my almost simultaneous attempt to learn Russian where for some reason I got stuck on a single world, «идёт», which was enough to make me give up (apparently it means “is coming” according to Google Translate). However, since the book was due back after a few weeks and I didn’t renew it, I only got a few lessons in.

Welsh television was somewhat widely available in the ’70s, so if I’d tried to learn that it would’ve been easier to expose myself to the spoken world, and to a limited extent I did in fact do this, but as far as I know there was absolutely no Gàidhlig television or radio in England. In Kent, it was easier to pick up Dutch television than Scottish, so it was a bit of a struggle to learn it. There was a TV series called ‘Can Seo’ whose books were available, but I don’t think it was available on the telly in Kent. Over that time, it didn’t seem difficult, although it was unusual. I tried a few more times in the ’80s and was really put off when I read Anthony Burgess’s description of the Welsh language and realised that the initial letters of words changed rather than just the endings. This is actually not unusual outside Indo-European languages – Bantu languages such as Swahili do it for example – but at the time it considerably freaked me out and discouraged me because it meant that if I didn’t know a word, I couldn’t necessarily just look it up in the dictionary because it often wouldn’t be there. The answer, of course, is to learn the changes, but you have to do that even before you get to look it up, so it immediately becomes a slog.

The Q-Celtic languages – okay, I’ll explain that in a bit but I mean Irish, Gàidhlig and Manx, also known as the Goidelic – have the added issue of spelling. Welsh is, despite what English people say, very phonetic and actually somewhat similar to Old English in its orthography. Irish and Gàidhlig are decidedly not, for various reasons, and Manx has a different problem because it’s been squozen into English spelling without actually having English-like pronunciation, with the result that although it looks to an English reader like a breath of fresh air, what you see on the page doesn’t actually correspond that closely to what you hear. I always say that they should be written in Cyrillic, because that script also has to deal with languages whose letters are doubled as broad and slender a lot of the time and does so by using extra vowel letters and the soft and hard signs, sometimes, as with Serbian, introducing extra letters. I actually suspect the intimidating-looking spelling is a factor in it not spreading more widely. That said, according to our teacher yesterday the number of speakers in Dumfries & Galloway, or should I say Dùn Phrìs is Gall-Ghaidhealaibh, has doubled in the last decade, and there are moves to use the Gàidhlig version of “Welcome to Dumfries & Galloway” on the signs into the region.

It isn’t just that either. It’s been said that of all the Indo-European languages, the Celtic branch has deviated furthest from its roots. Were it not for the fact that most of the vocabulary has been inherited relatively cleanly from its ancestors, nobody would guess that they were even in the same family. I was rather surprised to hear from an actual Scot a few months ago that she’d heard that it was close to Sanskrit! The existence of a phenomenon known as sandhi does link them, to be sure, but this is not unique to the two by any means. It’s also been said that the unit of the Gàidhlig language is not the word but the sentence, and I basically agree with that. I’ll illustrate the issue with a phrase we learnt yesterday: Tha mi air m’obair a leigeil dhiom. This means “I’m retired”. However, literally it means “Am I my work a-laying aside from me”, and switching the syntax from the VSO of Celtic, “I am laying aside my work from me”. There is another way of saying it – tha mi retireadh – “I am retired” – but where’s the fun in that? This does, however, illustrate one of the weirdnesses of the language: it’s entirely possible to use it as if it were a more conventional European language and literally say “I am retired”, but on the whole, it just isn’t done! It’s all circumlocutory. I used to find this exasperating, but no longer.

So, more personally and experientially, the class is good. Despite the fact that I started to learn forty-six years ago, most of the other students are ahead of me even though some of them have only been doing it a few weeks. The teacher is great, although she sounds like a Geordie. I don’t know if this is because she’s from the Borders and they sound like that over there. There are other Sassanaich in the class than Sarada and I, thankfully, although unsurprisingly most are Scots. Lessons are an hour long, and as Sarada noted, this is quite long enough. Some of what one hears in Scottish and even more so in Hibernian English reflects Celtic idioms, so for example I’d expect to hear “the hair on her” rather than “her hair” around here and likewise in Gàidhlig “a’ ghruag oirre” – “the hair on her”. This is one of the things that spares the Scots language itself from just being another Germanic language like Frisian or Danish – it has both Celtic borrowings and idiomatic influences. It doesn’t have the breeziness, and therefore the boredom, of learning most other European languages because it is so very different. More different in many ways, in fact, even than Finnish, which isn’t even Indo-European but at least has the decency to behave as if it is. The highly idiosyncratic nature of expression does in fact tend to infect the way I express myself in English and it actually seems to mess with your head to some extent, making fluency in English harder.

Although the teacher suggested to me that I go up to the next level, me having told her that I’d been studying it since 1978 and knew fifteen languages (not that well though, in either case) and her having heard me speak, I am seriously not ready to do so. As I put it to her, the principles are clear in my head but the practice is another matter entirely. Maybe at some point it’ll happen.

To come back to the points I mentioned earlier, there’s an hypothesis out there called “Italo-Celtic” which is about the idea that Celtic and Italic, nowadays Romance as the others became extinct during the Roman period although they did influence some dialects of Italian, stem more recently from the same language than the rest of Indo-European, or alternatively that they were spoken in close proximity to and influenced each other. I personally honestly suspect the former. The Italic and Celtic languages really were spoken in adjacent areas, the former on the Italian peninsula and the latter somewhat further North in today’s Austria, and this gives Gàidhlig the redeeming feature of having numerous cognates with Latin which really helps. For instance, “feasgar” means “afternoon/evening” and clearly corresponds to the Latin vesper, and “fear” – man – similarly reflects “vir”, and in fact also the English “were” as in “werewolf” – “man-wolf”. “Obair” is very clearly linked to opus, but calls to mind the later plural become singular opera, and I imagine this is because in some positions S becomes R but I’m just guessing. Therefore it does help and I’m not completely at sea.

Just to compare the numbers from one to ten in Proto-Celtic and Proto-Italic:

Italic	Celtic
oinos	oinos
duos	dwau
tres	tris
kwettuor	kwettwares
kwenkwe	kwenkwe
seks	swekhs
septem	sekhtam
okto	okhtu
nowem	nawan
dekem	dekam

If this pattern is repeated across the two proto-languages, it seems to me that they were basically two dialects of the same language, and the same definitely does not apply to the other big groups of European languages such as Germanic and Balto-Slavic compared to Italo-Celtic.

There’s also the division between Q- and P-Celtic, which also occurs in Italic but only Q-Italic languages have survived. Romanian does use P sometimes but this is a later development. Celtic is distinctive in having lost its original /p/ sound, so any such sounds which occur in it today are later developments. Basically, the Q-Celtic languages, also known as Goidelic and now including Manx, Gàidhlig and Irish, retained the “kw” sound into historical times, so for example the Primitive Irish word for “son”, “mac” in Gàidhlig, was actually “MAQ”, i.e. “makw”, but in the P-Celtic languages, alias Brittonic languages and now including Welsh, Cornish and Breton this became a new /p/ or /b/ sound, so the Welsh word for “son” is “mab“. This is just my opinion, but I find it interesting that the same division existed in Italic, with Latin and Faliscan Q-Italic languages and Oscan and Umbrian P-Italic, and I think this really means that the Italic and Celtic languages were originally not just close but were actually in the same branch to the extent that this variation divides the entire set of languages into two halves, with the Q-Celtic languages closer to Latin and the P-Celtic closer to Oscan.

There seem to have been two incursions of Celtic speakers into these isles, one Q-Celtic and from Galicia in what became Spain, and the other P-Celtic and from transalpine Gaul, i.e. most of today’s France. The division already existed before they came over, and then the mysterious thing happened to change the languages into what they are today. I’ve talked about this before because up until recently I thought it was just me, but the odd thing is that they share features with the completely unrelated Semitic languages such as Hebrew and Arabic. In fact I even tried to use the Arabic relative Maltese as a way of helping me with Gàidhlig, but that language has gone in exactly the opposite direction and been strongly influenced by Italian, making it pointless. It was therefore quite interesting that our teacher pointed out that the phrase for “I am disabled” is “tha ciorrarm orm” – “there is a disability on me” – observing that it sounded quite Biblical. Similarly, the Gàidhlig for “humanity” or “humankind” is “an mac an duine” or “the son of man” as it were. The King James Version of the Bible is notable in being nearer to a literal translation of the Hebrew than many other or newer versions are and I’d be almost prepared to bet that many of the idioms in English inherited from the KJV are also found in Gàidhlig, not because of the influence of the Bible but due to the language’s baffling similarities with Hebrew. In fact, recently I’ve found that Arabic, which I don’t understand, is notably easier to disentangle than it used to be, possibly for this reason. As I say, I’ve covered this before but there are three schools of thought regarding this: it’s coincidence; it’s due to implicational universals (the languages may share just a couple of features which force the rest of them to be similar because otherwise they wouldn’t work or the human mind wouldn’t be able to use the languages); it’s due to an Afro-Asiatic substrate in these isles before the Celts got here which influenced the languages.

So I’ve tried to be brief, I’ve labelled various objects around the room: uinneag, doras, balla, sòfa, bòrd, sporan, telebhisean, leabhar, baidhsagal, cuisean, briosgaidhean, staidhrichean, pink for feminine, blue for masculine and alternating pink and blue letters for nouns which are either, and I’m about to watch some telebhisean Gàidhlig (that’s probably wrong), so progress will be made!

Two Flowerings And A Cousin

Posted on 02/06/2022 by zerothly

Here at chez zerothly, the mornings are currently filled with the bongs and bings of Duolingo as Sarada and I vainly and not so vainly, not respectively, attempt to learn two distantly-related languages. Sarada is having a lot more luck than I am, or rather, she’s making progress much more rapidly. I am plodding, and as I advance through the lessons the number of mistakes I make grows steadily. I feel in no way on top of my learning, and that’s unusual for language learning for me, although not in any way a surprise in this case. Sarada in the meantime has a degree in French and is learning a closely-related language which she’s already made progress in through evening classes. As far as I can tell, the sum total of her efforts with respect to learning this language right now consist of Duolingo. My efforts consist of listening to radio stations in my language and watching the TV news in it. There probably isn’t much difference in our degree of motivation, but whereas she’s not putting as much time into it as I am, she’s getting a lot further a lot faster.

By antmoose – [1], CC BY 2.0, https://commons.wikimedia.org/w/index.php?curid=562371

Both languages are Western European Indo-European KENTUM languages. I’ve been into the classification of IE languages before on this blog, but to cut a long story short, here’s a quick summary. The Indo-European language family is the largest and best-researched language family and consists of languages originating in Eurasia from Ireland and Portugal through to Bangladesh, extending into the Arctic Circle and across Russia. Particularly in Europe, there are only a handful of indigenous tongues which are not members. The family consists of three divisions, one much smaller than the other two and completely extinct, namely the Anatolian languages which include Hittite and were so ancient they were often written in cuneiform. Of the other two, one is probably a more genuine division than the other: KENTUM and SATEM, based on their words for “hundred”, which reflect sound changes in the two main branches. Although KENTUM languages tend to be more western in origin than the SATEM ones, the easternmost subfamily of all, completely extinct now for over a millennium, is Tocharian and is KENTUM. The SATEM group is probably not closely bound and likely reflects the languages which simply didn’t descend from the one which underwent the KENTUM changes rather than having a common ancestor beyond Proto-IE itself

I’ve covered all of this before. Among the KENTUM languages, as I count them, and this is not actually the official way they’re counted nowadays but I do this based strictly on the word for “hundred” which may have been altered by other influences, the branches are Illyrian, Tocharian, Celtic, Germanic and Italic. The sole surviving Illyrian language is Albanian, but there are likely to have been many others spoken in the Balkans which were never written down and just died out. I don’t include Greek because its word for “hundred” isn’t like ours, and I seem to remember that linguists often group it and Armenian together. However, the Armenian word for “hundred” is “հարյուր”, and I think you’ll agree that doesn’t look much like ours, beautifully written though it be. I am, incidentally, aware of the peculiar history of Armenian but I don’t want to get too sidetracked.

I’ve taken old written examples of each branch of the KENTUM languages and compared the vocabulary. I found, perhaps surprisingly, that the two closest seemed to be Latin and Gothic. This is a little misleading as history is, literally in this case, written by the winners, and the Albanians, Tocharians and Celts definitely didn’t turn out to be the victors in the long run. The Tocharians were so long gone and utterly eradicated that nobody even remembered they’d existed and they were only unearthed because they lived in a desert area of Chinese Turkestan where their documents were preserved by the conditions. The Albanians are the sole survivors who seem to have clung on because of living in an isolated mountain kingdom which nobody wanted and was in any case pretty inaccessible. As for the Celts, well . . .

On the SATEM side of things, Baltic and Slavic are evidently rather close to each other, and also influenced Germanic because the people speaking these languages didn’t have much respect for what philologists were going to do in fifteen centuries’ time, and therefore didn’t realise they weren’t supposed to speak to their KENTUM neighbours. In the KENTUM case, two subgroups are particularly close to each other, or rather, they’re closer than the others are, and also closer to each other than they are to the others. These are Italic and Celtic. In a late nineteenth century edition of Cassell’s Etymological Dictionary, a book I studied very closely as a child before I got a copy of Skeat as an Xmas present, Italo-Celtic was considered a single branch on the family tree rather than two.

Both the languages Sarada and I are currently learning, or in my case trying to learn rather unsuccessfully, are Italo-Celtic. Sarada is picking up Italian quickly and I am slogging away unfruitfully at that nasty grandparent language known as Gàidhlig. Sarada has on a number of occasions asked me why I’m bothering, considering it’s such a huge effort and such a minor language, to which my answer is that it’s an endangered language and part of my heritage. I’ve been into this before on this blog. I’m not going to pretend it’s easy, wonderful, beautiful or anything else, but the fact is that it’s on its last legs and deserves to be preserved. To compare, there are more than three dozen native American languages with more speakers than Gàidhlig and I’d never suggest that they should be allowed to die out, so here I am learning this language whose term for spider translates for some reason as “wild stag”, and in other circumstances I’d find that picturesque and charming but to be honest my immediate reaction is “Just why on Earth‽”. But this unlovely tongue is the one in which my surnames are, and what kind of rootless fool would I be if I couldn’t even pronounce my own name? So I’m stuck with it, and as you’ll know if you’ve been reading this blog, I consider myself obliged to learn the sodding thing. This is in no way a slight on the Gaels, which would be weird and self-hating to some extent anyway, though I consider myself a White Northwestern Eurasian above everything else (or a NW European if you want to be parochial about it). To be fair, I’m not a huge fan of English either due to things like its weird vowels and diphthongs, overuse of the word “do” and only having one word for “you”, but I honestly can’t say I actually prefer Gàidhlig as a language.

There’s considerable doubt about the validity of Celtic as an identity, but whatever is true, there are a maximum a few thousand speakers of each surviving spoken Celtic language, which are divided into two halves, P-Celtic and Q-Celtic according to how they treated proto-IE “KW”. P-Celtic survivors comprise Welsh, Cornish and Breton, plus a few words adopted into a now-extinct Siouan language called Mandan. The Q-Celtic languages are Gàidhlig, Irish and Manx, which form a linguistic continuum interrupted by the ingress of Scots and English into the southwestern part of what became Scotland, although a P-Celtic language was also spoken there. It must also be mentioned that all surviving Celtic languages have mysterious similarities to Afro-Asiatic languages such as Arabic and Hebrew which have never been explained, and again I’ve been into this before.

All that said, this isn’t what I’m going to focus on today. Rather, the activity of learning Gàidhlig just as Sarada is learning Italian has highlighted the possibility of Italo-Celtic as a division of the KENTUM branch of IE, and in fact if you go back far enough they’re remarkably similar, particularly if you take out the bizarre Semitic tendency in Celtic.

The Italic languages are peculiar in that they’ve flourished twice. They’re the only example I know of a language group which developed into a whole range of spoken languages, all but one of which died out, only for that sole survivor to become another whole range. That was of course Latin, and its descendants the Romance languages, including Italian itself and also Catalan, Provençal, Romanian, French, Castilian and various others. Traces of the older Italic languages still exist in Italian dialects but the only one to emerge and spread from the Italian peninsula was Latin itself. The others were Oscan, Umbrian, South Picene, Faliscan, and possibly Ligurian, Sicel, Nuragic, Raetic and Venetic. There were also other less-closely related languages spoken before the founding of Rome on the peninsula and associate islands, including the unclassified Etruscan, which was definitely not IE but whose actual allegiance can’t be traced definitively, and also Illyrian languages and Greek, as well as Punic, an Afro-Asiatic language spoken in North Afrika. The actual Italic speakers had migrated southward from the trans Alpine region, and this is where the Celtic connection becomes apparent.

The area north of the Alps was Celtic at the time, insofar as the Celts ever really existed in their own right, so either the speakers of the Proto-Italic language were in contact with the speakers of Proto-Celtic or they actually were the speakers of Proto-Celtic, id est they were actually the same language. I’m going to use Latin spelling here as I write the numbers from one to ten in Proto-Italic:

oinos, duo, tres, quettuor, quenque, sex, septem, octo, novem, decem.

Here’s the same in Celtic using the same spelling:

oinos, duau, tris, quetuares, quenque, suexs, sextam, oxtu, navam, decam.

I’d say these are close enough to be in the same language spoken in different accents. Proto-Celtic has a more archaic flavour to me, and the presence of the A’s in “navam” and “decam” give them a kind of Sanskrit flavour – नव (nava) and दश (daśa) being the same words in that language. That doesn’t mean the rest of the two languages were as similar. For instance, even now some Q-Celtic uses a dual number – a form like the plural but for when there are two of something – but Italic has never had that, although it has traces such as “ambas/-os” in Castilian for “both”, and of course similar traces exist in English. That said, the languages are unusually similar.

You might be wondering how this can be reconstructed since this was all going on rather a long time ago. The answer is that Italic languages did in fact often have a written form, having alphabets derived from Greek, usually via Etruscan which was the high civilisation on the peninsula at the time. Each actually had a different script. Consequently it can be seen that there are a number of similar languages which have certain things in common and one-way processes can be identified. Italic is not puzzling in this respect. Celtic is somewhat more confusing, because the only surviving Celtic languages are the ones spoken in the British Isles and Breton, which is descended from a British Celtic language, and those only date from the first millennium CE in written form. There are older examples but these tend to be rather limited, consisting of short inscriptions. On the Iberian peninsula, five scripts existed which seem to have been derived directly from Phœnician. They tended to be syllabaries rather than alphabets, i.e. one character per syllable. Elsewhere, Celtic languages actually used Italic scripts, which considering they were in the same area is unsurprising but it illustrates the close contact between the two groups.

Turning to more general vocabulary, similarities are sometimes obscured by semantic drift, id est, changes in the meanings of words, as for example happened with our “silly”, “nice” and “gay”. For instance, the Proto-Celtic word for “snake” is “natrixs” but the Latin “natrix” means “water snake”, the Proto-Italic word being “anγwis”, which became the Latin “anguis”, which now means “slow worm”, and later the word for eel, “anguilla”. The vocabulary is in both cases also, unsurprisingly, both less “Latin” and less “Celtic” in character because it retains words from PIE which later disappeared and may also have picked up words from the Germanic tribes living nearby. Hence Latin “filia” for daughter is a replacement for a word closer to “daughter” which in Oscan, for example, is “futir”. Even so, various words are quite close or identical:

Proto-Italic	Proto-Celtic	English
tu	tu	thou
is	es	he
ques	quis	who
quid	quis	what
ne	ne	not
alios	alios	other
anγustos	angus	narrow
tenuis	tanauios	thin
viros	viros	man
mater	matir	mother
pater	φatir	father
piscis	φescos	fish
co	cu	dog
palma (palm)	φlama	hand
auzis	ausos	ear
cord	cridiom	heart
cano	caneti	sing

There are many more examples, and this is not cherry-picked, although the words are from a core vocabulary which tends to change less than average. What I have done is ignore vowel length and adjusted both sets of spelling to a kind of classical Latin standard, which brings out how similar Proto-Celtic and Latin really are.

However, it’s uncontroversial that Proto-Celtic and Proto-Italic are related. This is already established. A proper study would compare it with the successful Germanic branch of the KENTUM group. Fortunately this can be done. Here are the numbers from one to ten in Proto-Germanic:

ainaz, twai, þriz, fedwor, fimf, sehs, sebum, ahtou, newun, tehun.

These are, as could be expected, somewhat similar to Italic and Celtic but don’t have the uncanny affinity shown between those two. As for the list above, the Proto-Germanic version looks like this:

þu, iz, hwaz, hwat, ne, allaz, anguz, þunnuz. weraz, moder, fader, fiskaz, hundaz, handaz, auso, herto, singwaną.

In this case, though, the words selected are synonyms whose alternate forms are not found, and as Germanic language users ourselves we can spot some of these, such as “allaz” and “anguz” for “other” and “narrow”, both of which already existed in Proto-Germanic in recognisable forms as “anþeraz” and “narwaz”. This doesn’t happen so much with Proto-Italic and Celtic. Germanic is distinctive in having a large number of words not closely connected to other IE words. Apparent examples here are “hand” and “sing”. But it could still be that Germanic is simply the outlier and Celtic and Italic developed along more typical lines. Except that this isn’t so.

As well as the similarities between words, the two languages also share other features not found in Germanic or any other IE languages. For instance, the superlative, expressed in English by “-est”, and similarly in, for example, Greek, is expressed in Italic and Celtic using an ending based on “-isṃmo-“, as in Italian “fortissimo” and Old Irish “sinem” – “oldest”. The subjunctive mood of the verb is descended from the proto-IE optative (“would that it. . .”) in both cases, which is highly unusual. The genitive uses an I in its ending in both cases too, and there are several other grammatical similarities. Again, these could be primitive features which survived in Italic, Celtic and nowhere else rather than direct connections between the two, but something like the adaptation of the optative to the subunctive isn’t an archaism or that mood would have been like that in Sanskrit, for example, and it isn’t.

The hypothesis became popular from the 1860s and was attacked successfully by the Harvard linguist Calvert Watkins in 1966, so it could be that my attachment to the idea is anachronistic. The problem is supposed to be that the features held in common each connect only one Italic and one Celtic language, and not the same pairs at any point. This is interesting for another reason. Celtic languages are fairly well-known for falling into two subgroups: P-Celtic and Q-Celtic. Hence the Welsh for “five” is “pump” but the Irish equivalent is “cúig”, and the Old Irish for “son” is MAQ, later “mac”, but the Welsh word is “mab”. For a while it was thought that this division merely occurred in the British Isles, but it turns out that continental Celtic languages were also divided in this way. Something similar happens in today’s Italic languages with, for example, the Romanian “patru” for Italian “quattro”, although this development occurred after Latin split up. It also, though, took place in the older Italic languages. Oscan and Umbrian use P where Latin and Faliscan use QU. Hence a weird division can be made among Italo-Celtic where Q languages include Italian, French, Manx and Irish, among others, whereas P languages include Breton, Welsh, Oscan and Umbrian, reflecting a tendency in the entire group for this to happen, as it has in Romanian for example. However, all these similarities don’t mean that Romanian and Welsh have a common ancestor in Oscan, and likewise some of the other tendencies might follow from a pre-existent state which trends in that direction. It’s similar to the phenomenon where both English and German started off with a long I pronounced “ee” and a long U pronounced “oo”, which however became “ai” and “au” independently: min – mine/mein; hus – house/Haus.

Nonetheless, I’m not writing this with the courage of my convictions and that long list of identical and very similar words is hard to discount. This part of western Eurasia can be simplified into a series of peninsulas. There’s Scandinavia in the north, the main part of Western to Central Europe further south and the separate peninsulas of Iberia, Italy and Greece stretching into the Med. The Greeks and Illyrians occupied the last and are not so relevant to the situation. The Germanic peoples originate in Scandinavia, a peninsula which provides an obvious stronger separation from the others, and the characteristics of our languages clearly show the influence of the Uralic languages, for instance in the absence of a separate inflected future tense. It makes sense that Germanic languages would be the outliers in this respect. Then, further south lie the Tumulus and then Urnfield Cultures of the Bronze Age, itself followed by the clearly Celtic Hallstatt Culture. The peoples of the first two of these came to radiate south into Iberia and Italy, but there was a period during which the Etruscans dominated in Italy and only later came to cohabit with the Italic-speaking peoples. These clearly came from the north, trans Alpine region, and the Alps clearly constitute a barrier between the Italics and the Celts. What seems to have happened, or might have anyway, is that the Italo-Celtic speaking people north of the Alps and also spreading into Iberia got separated from the Italic speakers of Italy, and the languages started to diverge, but it clearly makes complete sense that the Urnfield Culture, lasting from 1300 – 750 BCE, or put another way, the five and a half centuries before Romulus and Remus, spoke a group of dialects which were ancestral to both Irish and Portuguese, as it were, along with everything therebetwixt.

These two branches fared very differently. Italic languages kind of triumphed, although most of the first season became subsumed into Latin dialects, and in the form of Latin came to dominate first much of Europe and then the wider globe, such as South and Central America, the Philippines, the former French colonies, and indirectly in the form of English with its extensive French borrowings. Celtic had a very successful period during which it was spoken in the British Isles, Gaul, Iberia, Central Europe and Anatolia, but was then eclipsed by first Latin and then Germanic, leaving it spoken only in Brittany, Ireland and the west of Great Britain and nearby islands, although it did also reach Nova Scotia and Patagonia in the end, in small communities. The grammar of the two halves today shows almost nothing recognisable in common except for things like the occasional letter I in unexpected places in Q-Celtic, but the vocabulary is still faintly connected. This, however, is unclear because of the influence of the Church, leading to loanwords from Ecclesiastical Latin. The Celts are also outside the Empire. The languages were spoken and finally throve best where Latin was not spoken. Their distribution was complementary, and this complementarity followed class and ethnic divisions as well as geographic ones.

The Tumulus Culture is named after the practice of interring bodies in mounds of earth. This practice seems to have spread from the Kurgans of the area north of the Black Sea, named after similar structures, who are widely believed to be the original Proto-IE speakers, so it makes sense that these people would’ve been speaking the common ancestor of Celtic and Italic languages. Their successors, the Urnfield Culture, are named after their tradition of cremating their dead and burying them in urns in fields. If these people were indeed Celts, there may a direct line between this practice at the Cremation Act 1902, which legalised crematoria in Wales, England and Scotland. This act was passed after a famous test case where in 1884 an eccentric Welsh medical doctor, Dr William Price, cremated the body of his five-month old son on a funeral pyre and was tried for it. He was re-enacting a Druidic practice in doing so and was cremated himself a few years later in 1893. Hence our current practice of widespread cremation in Britain may be directly descended from the Urnfield Culture.

At this point, I should make it clear that I know practically nothing about archæology, so I’m venturing well beyond my comfort zone here and you should take what I’m saying with a larger than usual pinch of salt. I should also point out that I don’t in fact know why Celts are not considered an ethnicity beyond a very sketchy idea that they are generally just what the Romans and Greeks thought of as the “not-we’s”,which can’t be quite true or it wouldn’t explain Germans.

The idea that the Urnfield Culture is the original Celtic culture, or Italo-Celtic, is only one of several competing theories about the origin of the Celts. There are also “central” and “western” theories. The western theory is that Celtic languages began along the Atlantic coast and were used as an auxiliary language between traders. This then spread eastward. This is interesting because it seems to imply that the areas where Celtic languages are currently spoken were close to their original territory. The idea of Atlantic Europe is anthropological and in current terms includes Portugal, the British Isles, Northwestern France, the Low Countries, the hinterland of the German coast, Jutland and Norway. Interestingly from a British perspective, it has a Southwest-Northeast orientation like divisions on our own island. It used to be claimed that there was considerable genetic unity among the humans of this area, but in the case of the British Isles any sign of this is obscured by the presence of the R1b haplogroup, of which I have a subclade. This originated from the Yamnaya in the Copper Age, who were what used to be called the Aryans. That is, they were the original PIE speakers. They’re the fair-skinned lactose-tolerant people who tend to occupy Europe.

The “central” theory is that Proto-Celtic arose in Gaul and radiated thence, which makes it easier to account for the similarities in ancient Celtic languages over a large area. It also means that the Celts began close to Italy, which means the Italo-Celtic hypothesis can be maintained more easily.

One thing I haven’t done here is mention the ancient Celtic languages much. The oldest known written Celtic is Lepontic, found in Cisalpine Gaul from about two centuries after the foundation of Rome. There are also Celtiberian, Gaulish, British, Galatian (spoken in what became Turkey – it’s been disputed whether this is a truly Celtic language), Noric and Gallaic. Celtiberian in particular is of interest here as it’s a Q-Celtic language which seems to be ancestral to Irish and therefore also Manx and Gàidhlig, confirming the origin story of the Irish that they came from Spain. There’s actually a fair amount of continuous text available in Celtiberian because of the Botorrita Plaques, which are bilingual Latin and Celtiberian law codes dating from the fifth century Anno Urbis Conditæ. This is a fairly raw transliteration of one of the plaques (the language did not use the Latin alphabet):

trikantam : bergunetakam : togoitos-kue : sarnikio (:) kue : sua : kombalkez : nelitomnekue [: to-ver-daunei : litom : nekue : daunei : litom : nekue : masnai : dizaunei : litom : soz : auguaresta[lo] : damai : uta : oskues : stena : verzoniti : silabur : sleitom : konsklitom : gabizetikantom [:] sanklistara : otanaum : togoitei : eni : uta : oskuez : boustom-ve : korvinom-vemakasiam-ve : ailam-ve : ambidiseti : kamanom : usabituz : ozas : sues : sailo : kusta : bizetuz : iomasekati : [a]mbidingounei : stena : es : vertai : entara : tiris : matus : dinbituz : neito : trikantameni : oisatuz : iomui : listas : titas : zizonti : somui : iom : arznas : bionti : iom : kustaikosarznas : kuati : ias : ozias : vertatosue : temeiue : robiseti : saum : dekametinas : datuz : someieni touzei : iste : ankios : iste : esankios : uze : areitena : sarnikiei : akainakubosnebintor : togoitei : ios : vramtiom-ve : auzeti : aratim-ve : dekametam : datuz : iom : togoitos-kuesarnikio-kue : aiuizas : kombalkores : aleites : iste : ires : ruzimuz : Ablu : ubokum

Gallaic was spoken in Gallicia north of what is now Portugal. Only the occasional word has been recorded, but there are traces in placenames in Galicia and Portugal. This is somewhat complicated by the Celtic Britons who settled there after the fall of Rome, although their influence was minor. Galician and Portuguese both have some Celtic vocabulary, which is presumably from Gallaic.

Gaulish is well-attested. There are more than five gross Gaulish inscriptions and French has more Celtic loanwords than any other non-Celtic language has. It had essentially the same vowels as Classical Latin and the main difference in the consonants were the presence of velar fricatives “kh” and “gh”, and the affricate “ts”. It had seven cases, including the instrumental which had been lost in Italic but does exist in Germanic. Unlike the living Celtic languages or Latin, word order was subject-verb-object, like English. Gaulish is also quite close to British itself. The general impression given by Gaulish is that it’s basically Latin with different words and endings, it not having any of the peculiarities one tends to associate with the surviving Celtic languages such as the confounding periphrastic approach, unusual syntax and consonant mutation. All of that appeared later. Latin and Greek are only distantly related to each other but their general approach to grammar is very similar. The same approach is found in Gaulish.

There are said to be more living speakers of Celtic languages today than there were in Roman or pre-Roman times, so in a way this is their heyday, if that’s true. I think it probably helps to know that there’s a host of lurking cognates to Romance words in Gàidhlig even though the spelling is, though to some extent justified, really annoying. I’ve decided, sight unseen, to reproduce the above list of cognates in Gàidhlig once again, in the form of a table with Italian and English equivalents:

Italian	Gàidhlig	English
tu	thu	thou
lui, esso	e	he
chi	cò	who
che	dè	what
non	chan eil	(is) not
altro	eile	other
stretto	caol	narrow
sottile	caol	thin
uomo	fear	man
madre	màthair	mother
padre	athair	father
pesce	iasg	fish
cane	cù	dog
mano	làmh	hand
orecchio	cluais	ear
cuore	cridhe	heart
cantare	seinn	sing

This isn’t very promising, I have to say. I don’t know how Welsh fares here, although I get the impression it’s less peculiar than Gàidhlig. At least the spelling is clearer.

To conclude then, I wonder if the Romance languages had been as marginalised as the Celtic whether they would have changed in equally peculiar ways. I now realise how little I know about Celtic and the Celts, which may in fact not really have much in common, and I also don’t know why it’s often denied that there is even such a thing as Celtic identity, and what political significance that idea has. And finally, after looking at all this evidence and making a cursory attempt to test it rather than seeking confirmation bias, I definitely accept that Italo-Celtic is a valid grouping of IE languages, more closely bound than either is to Germanic or Illyrian, and that in the late Bronze Age they were a single language with dialects, in close proximity to other languages which were related but not mutually intelligible with it. And I don’t know why anyone would claim the contrary, but then I’m not a linguist.

Startling Semitic-Celtic Parallels And Overinterpretation

Posted on 13/03/202113/03/2021 by zerothly

Some time ago in the 1980s I think, I made one of my many attempts to learn Gàidhlig and noticed something rather strange. I already had some knowledge of Hebrew and Arabic from when I was younger, and it suddenly struck me that the Celtic language shared some remarkable unusual features with the other two. From what I can recall, these included verb-subject-object word order, two genders – feminine and masculine – and something I can only vaguely remember about how prepositions and pronouns work. At the time, I didn’t know what to make of it. It seemed to be more than a coincidence because three always counts to my mind as more than chance allows, but it was difficult to think of a way of how it could’ve happened. I eventually settled on a rather vague conclusion that maybe Semitic language speakers had travelled north from the Maghreb into Iberia, which Q-Celtic languages are sometimes claimed to originate, and that they then influenced the ancestor of the Irish language in some way. However, this doesn’t work particularly well as it fails to explain how Welsh and Cornish also have these features. After a while, I just put it down to coincidence and my tendency to see patterns where none exist other than the ones my mind has imposed upon them.

At this point I’m going to veer off into probability to illustrate why three things in common is my threshold for statistical significance. It’s common to plump for one in twenty as the point at which something is considered significant, and scientific experiments often use this. In recent years I’ve seen rather too many dubious-looking scientific papers which seem to go for a much lower limit and I now wonder if there has been a new development in statistical theory which justifies this, or whether it’s more to do with “publish or perish”. Anyway, probabilities multiply, so if you flip a fair coin three times and it comes up heads every time the probability of that outcome is one in two times one in two times one in two. 2³ is eight, still below the point when one decides something is significant, but the probability of something happening is not always one in two. For fair dice, you’d only need to throw a six twice for it to become significant: one in thirty-six is six squared. Taking this the other way, the mean probability for three events to multiply up to one in twenty is of course the cube root of twenty, which is just over one in 2.7. However, this reasoning is faulty because we see patterns as opposed to the absence of patterns, so given the large number of other grammatical features one could pluck out of Celtic and Semitic languages, the ones that don’t fit might be ignored and the calculation then becomes extremely complicated because one then has to consider how to delineate specific grammatical features and how to count them, then work out what the chances are that two sets of languages share three grammatical features based on this and the number of possible options. For instance, with syntax the options, assuming a largely fixed word order which doesn’t always happen, are SVO, SOV, OVS, VSO, VOS and OSV, which is one in six. However, other features are quite arbitrary. There are languages out there with more than two dozen grammatical genders, for example. It’s possible to imagine a language whose every noun has a different gender.

Another pattern which definitely is meaningful which can be plucked out of Celtic languages as they are today is the fact that they and Romance languages, more specifically Italic languages, which are Romance languages plus Latin and its closest contemporary relatives, are closer to one another than they are to other branches of the Indo-European language family. Some of these features are the result of parallel evolution. For instance, all of the surviving six Celtic languages have two grammatical genders consisting of feminine and masculine, and this is also true of all Western Romance languages (though not of Romanian, which still has neuter). Besides this, other Indo-European languages tend to use an ending like “-est” to express the superlative of adjectives, but Italic and Celtic tend to use something like “-issimum” – “best” versus “bellissimo” for example. There are a number of other similarities which may be preserved ancient features lost from the other languages, features acquired because they were neighbours or features acquired in their common ancestral language. These are, though, easy to account for because Italic and Celtic just are obviously related, were spoken near each other and so on. The idea of a parallel between Celtic and Semitic is much harder to explain, which is why it might not exist at all.

Recently, I discovered that my personal will o’ the wisp is not in fact just mine. Professional linguists have noticed this too, and there are even theories about how it might have happened and a number of other features in common. VSO and inflected prepositions are just two of several parallels. I should explain that in Gàidhlig and its relatives, prepositions vary according to who they refer to, so for example “agam” means “at me” and “agat” “at thee”. The origin of these is easy to account for, that the words have simply been run together over the millennia, but few other languages do this. Arabic and Hebrew, on the other hand, do. The languages also do things with these prepositions which other languages don’t. They express possession and obligation with them. “The hair on her” – “am falt oirre” is “her hair” and “I need/want/must have a knife” is “tha bhuam sgian” – “there is from me (a) knife”. That “(a)” indicates something else they have in common: they all have a word for “the” but none for “a”. It’s unusual for a language to have a way of expressing definiteness without indefiniteness. Interestingly, Anglo-Saxon and Old Norse, both spoken in these isles, also had a way to say “the” but not one to say “a(n)”, and this may be a clue as to how these apparent coincidences happened. Breton, however, does have an indefinite article. Likewise, all the languages repeat the pronoun at the end of a relative clause – “the chair which I sat on it” and not “the chair (which) I sat on”. There’s also the way the word for “and” is used, or rather, a word for “and”: “agus” in Gàidhlig (there’s another word, “is”) and “wa” in Arabic (“ve” in today’s Hebrew). In English, “and” is a simple coördinating conjunction like “or” and “but”, but in the other languages it can also be used as a subordinating one. It can also mean “when” or “as”. This is also unusual. “Agus”/”ve” can also be used to mean “but” or “although”, and in fact as I understand it, the Arabic “wa” is the only option to express “but”. Besides this, there’s what’s known as the construct state genitive in English descriptions of Hebrew grammar. Arabic doesn’t say “the man’s house” but “man the house”, or “taigh an duine” in Gàidhlig – “the house man”. This is in spite of the fact that the language in question has a genitive form for the noun in question. This makes approximately eight features found in Celtic and Semitic languages but only rarely in others.

And there’s more. The surviving Celtic languages are unusual among Indo-European languages in having these features, and are in general quite aberrant compared to the others. That said, there are branches of the family which have unusual features for it, such as Armenian, which has grammar more like other languages than Indo-European in that it hangs successive suffixes off the ends of words per idea as opposed to having combined ideas in each suffix (in English we have, for example, a final S for genitive (possessive) and plural and don’t need anything extra). Even so, were it not for the known history and the fact that so much Celtic vocabulary is clearly similar to that of other European languages, nobody would guess Celtic languages were Indo-European. In fact, the very features which they share with Semitic languages are the ones which make them unique in the Indo-European family.

They are also emphatically not related to each other, or at least so distantly related that there are languages native to Kenya and Tanzania which are closer to Hebrew and Arabic and a dead language spoken in present day China which is closer to Welsh (and in fact English) than they are to each other. Semitic languages are part of a family now referred to as “Afro-Asiatic”, which also includes Tamazight, a Berber language, and Ancient Egyptian, spoken five thousand years ago and still nowhere near the speech of the Kurgans at the time which are ancestral to Celtic, Germanic and the like. There are, however, a few theories about how this has happened.

One apparently anomalous circumstance which can be seen from the New Testament is that Paul wrote a letter to the Galatians. These lived in Anatolia, the Asian portion of present-day Turkey, and they spoke a Celtic language. This language was clearly in close proximity to the Semitic lingua franca of that region at the time, Aramaic, as well as various others such as Assyrian. It’s therefore been suggested that the whole of the Celtic branch was influenced by this local connection, all the way across to Ireland in the end. To me, this seems a little far-fetched, but it is true that there’s a concentration of a particular set of genes which marks the Irish, and incidentally myself, as possible wanderers from the Indo-European ancestral land who went as far as possible at the time. This may make the so-called Celts the ultimate invaders in a way and contradicts the common mystical, matriarchal and peaceful image some people seem to have of them. This migration also forms part of another theory, that farming, having been invented in the Fertile Crescent where Semitic languages were spoken, then spread culturally across Europe to these islands and took linguistic features with it. Either of these ideas being true could be expected to imply that all Celtic languages, not just the modern survivors here and in Brittany, had these features in common.

Significantly, the speakers of Celtic languages were probably the first Indo-European speakers to arrive in Great Britain and Ireland. Prior to that, clearly there were other people living here who had their own spoken but unwritten languages. It’s possible that traces of these may survive in place names. It used to be thought that the Picts spoke a non-IE language, possibly related to Basque, but this has now been refuted. The features Irish, Welsh and the rest have in common with Hebrew and Arabic are also apparently shared with Tamazight and other languages of the Maghreb, although to me that’s hearsay – I haven’t checked them out. Consequently, one rather outré theory, is that before the Celts got here the folk of Albion and the Emerald Isle spoke a Semitic language, and Celtic was influenced by this when it got here. However, there doesn’t seem to be much reason to suppose this to be so other than the connection.

Leaving those theories aside, I would bring up the issue of linguistic universals, and particularly implicational universals. Some features are common to all spoken languages. For example, every known spoken language has a vowel like /a/ as in “father” in it, every language which distinguishes questions tonally involves changing the pitch of the voice towards the end of the sentence, and every language has at least some plural pronouns. There’s a particular set of implicational universals around SOV languages which they tend to have in common, such as being exclusively suffixing, to the extent that it used to be thought that there was a so-called “Altaic” language family including Turkish and Mongolian, and some would even include Japanese and Korean in that, but they’ve turned out not to be closely related but have sometimes grown more alike through contact, but they also have many of these implicational universals, suggesting to me some kind of possible “standard” human spoken language with those grammatical features. I would tentatively suggest, and I may well be wrong, that the features Celtic and Semitic languages share are in fact similarly implicational universals. Both of them have an unusual syntax and this may lead them both down the same path.

But there’s an extra layer to this which intrigues me. There used to be a famous Hebrew teacher who introduced the subject as “Gentlemen, this is the language God spoke” (yes, this is extremely sexist but it was a long time ago), and similarly Arabic is considered a particularly sacred language almost designed by God to write the Qur’an. Hence the features mentioned are used in two very important sacred texts, and if I’m going to go all religious and mystical on you, just maybe the Celtic and Semitic languages have a special place in spiritual practices, and this is about that. But leaving that aside, it still seems to me that the most likely explanation for the things they have in common is simply that they are a particular “type” of language, just as Japanese and Turkish are, without needing to have any genetic relationship.

They’re also both really annoying!

The issue of overinterpretation will have to be held over until tomorrow, sorry.