Single transliteration scheme for all CM languages?

Languages used in Carnatic Music & Literature
Post Reply
arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

Going by the earlier stated rule:
To summarize, for tamizh #n/#ng is ங், and "ng" is ஙக்
For kannada (and similar), #n/#ng is ಙ್, and "ng" is Mg (i.e. anuswara)
I get this:

English:
1. vA#ngmaya
2. vA#nmaya
3. vAngAdE
4. tangam
5. a#ng#nganam
6. a#n#nanam

Tamil:
1. வாங்மய
2. வாங்மய
3. வாங்காதே
4. தங்கம்
5. அங்ஙனம
6. அங்ஙனம்

Kannada:
1. ವಾಙ್ಮಯ
2. ವಾಙ್ಮಯ
3. ವಾಂಗಾದೇ
4. ತಂಗಂ
5. ಅಙ್ಙನಂ
6. ಅಙ್ಙನಂ

Is this ok. What will break this logic would be if a "ng" combination is there which does not translate to a ங or Mg (anuswara).

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

jayaram,

for the in nilavu, even malayalam script doesnt differentiate right? That would make it hard to represent it I think. But about combinatory words. Does the sound of ni change if you have some combined-word where the latter half is nilavu and so "ni would no longer be the first syllable? If it does change, then it may be possible for the logic (eventually) to find if it is the first syllable and generate qualifiers - but not sure.

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

So far ok.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:So far ok.
ok thanks. I will upload this version for testing. Thanks!

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

arunk wrote:
drshrikaanth wrote:So far ok.
ok thanks. I will upload this version for testing. Thanks!
Done. btw i added explicit support for tamizh ந by using ^n. i know you liked 2, but i thought ^nAmam is less intrusive than n2Amam - so i wanted people to get a feel for it. Something other than ^ can work too if people dont like it. Its easy to change the logic on this to another symbol. Of course, ^n and n are the same for kannada (ನ)
Some examples:

English:
^nam
nandi
na^ndi

Tamil:
நம்
நந்தி
நந்தி

Kannada:
ನಂ
ನಂದಿ
ನಂದಿ

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

i should add that actually ^ in ^nAmam is not required since ன cant start a word. But you need it in the middle of word as in tiru^nAmam
Last edited by arunk on 29 Nov 2006, 02:30, edited 1 time in total.

jayaram
Posts: 1317
Joined: 30 Jun 2006, 03:08

Post by jayaram »

Never understood the difference between ந and ன in Tamil. They both sound the same I think, so no difference in transliteration. Yes?

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

Arun
I really find ^ annoying. Do you mind introducing n2 as alternative spellinng if you want to retain ^n. Thanks. As I expressed earlier n and n2 clearly bring out that these are only 2 variations of the same sound (actually no difference now). This is not the case with ~n and #n

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

i think i should be able to also support n2. While your explanation makes sense, the fact that "2" is a suffix really ruins it for me :). But 2n is not that great and probably can cause other problems.

Of course, ^ probably has a similar effect on you :). I am not overly married to ^ in particular, but i prefer it better than n2.

Arun
Last edited by arunk on 29 Nov 2006, 05:12, edited 1 time in total.

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

arunk,

Tried out the following text in the test bed for testing s (as in sangam), sh (shankara) and Sh (Shanmukha) along with the visarga feature you added;

karuNA kaTAkSha Ela upEkSha duShTa shikShaka shiShTa rakShaka - duHkhahara shiva kR.pALu susamskR.ta bhaktara poreyO ~nAna dIkSe ittu.

I have highlighed the errored spots below:

In Tamizh it shows up as:

கருணா கடாக்ஷ ஏல உபேக்ஷ துஷ்ட ஷிக்ஷக ஷிஷ்ட ரக்ஷக - துஃகஹர ஷிவ க்ருபாளு சுசம்ஸ்க்ருத பக்தர பொரெயோ ஞான தீக்செ இத்து.

In kannada it shows up as:

ಕರುಣಾ ಕಟಾಕ್ಶ ಏಲ ಉಪೇಕ್ಶ ದುಶ್ಟ ಶಿಕ್ಶಕ ಶಿಶ್ಟ ಕ್ಶಕ - ದುಃಖಹರ ಶಿವ ಕೃಪಾಳು ಸುಸಮ್ಸ್ಕೃತ ಭಕ್ತರ ಪೊರೆಯೋ ಙಾನ ದೀಕ್ಷೆ ಇತ್ತು.

I think there is inconsistency in handling s, sh and Sh ;

s - should bein kannada, or ஸ் depending on the context in tamizh ( No doubt about that)

sh - should be in kannada, and , as for as I know :) in tamizh (you might use S or Sh or whatever - as long as it gives consistent results in both scripts - AND - you tell the user beforehand how he should spell this sound :)

Sh - should bein kannaDa should be in tamizh (Again, you might use S or Sh .. but specify it first)

The samyuktAkshara 'ksha' is rendered correctly in tamizh script, but incorrectly in kannada

(kSa gives the correct representation in kannaDa script ( ಕ್ಷ ) but wrongly shows up as க்ச in tamizh. kSha or ksha give the correct representation in tamizh ( க்ஷ ) but shows up as ಕ್ಶ in kannaDa.

visarga looks good.

more about ~nAna and saMskR.ta in the next post.

-Ramakriya
Last edited by ramakriya on 29 Nov 2006, 06:01, edited 1 time in total.

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

ramakriya wrote:more about ~nAna and saMskR.ta in the next post.

-Ramakriya
Before that another observation. The anunAsika (ma kAra) before a avargIya vyanjana should be represented by a anusvAra in kannaDa.

Here is what I get:

samyOjane samhAra samvahana samShOdhane samsAra

சம்யோஜனெ சம்ஹார சம்வஹன சம்ஷோதனெ சம்சார

ಸಮ್ಯೋಜನೆ ಸಮ್ಹಾರ ಸಮ್ವಹನ ಸಮ್ಶೋಧನೆ ಸಮ್ಸಾರ

-Ramakriya

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

ramakriya wrote:sh - should be in kannada, and , as for as I know :) in tamizh (you might use S or Sh or whatever - as long as it gives consistent results in both scripts - AND - you tell the user beforehand how he should spell this sound :)
Try "S" for this. Both "sh" and "Sh" stands for the next one (i.e. ಷ ஷ).

sa (samam), Sa (Syama) and Sha/sha (purusha/puruSha) are the 3 variants.

English: sh Sh Sa sa
Tamil: ஷ் ஷ் ச ச (although in some cases sa becomes the other one i.e. ஸ)
Kannada: ಶ್ ಶ್ ಷ ಸ

Is this ok - does it make sense? Am i missing one another variant? In tamizh both "sa" and "Sa" are currently the same (but eventually there would be additional qualifiers)
ramakriya wrote:The samyuktAkshara 'ksha' is rendered correctly in tamizh script, but incorrectly in kannada
Ok. I will check this.
(kSa gives the correct representation in kannaDa script ( ಕ್ಷ ) but wrongly shows up as க்ச in tamizh. kSha or ksha give the correct representation in tamizh ( க்ஷ ) but shows up as ಕ್ಶ in kannaDa.
Based on the representation for 3 sa's are "sa", "Sa" and "Sha/sha": i presume this is always ksha (or is there kSa too which results in a single letter)?

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

ramakriya wrote:
ramakriya wrote:more about ~nAna and saMskR.ta in the next post.

-Ramakriya
Before that another observation. The anunAsika (ma kAra) before a avargIya vyanjana should be represented by a anusvAra in kannaDa.

Here is what I get:

samyOjane samhAra samvahana samShOdhane samsAra

சம்யோஜனெ சம்ஹார சம்வஹன சம்ஷோதனெ சம்சார

ಸಮ್ಯೋಜನೆ ಸಮ್ಹಾರ ಸಮ್ವಹನ ಸಮ್ಶೋಧನೆ ಸಮ್ಸಾರ

-Ramakriya
Sorry. I dont know what those terms mean. Can you explain it in simpler terms please?

Thanks
Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

ramakriya wrote:arunk,

Tried out the following text in the test bed for testing s (as in sangam), sh (shankara) and Sh (Shanmukha) along with the visarga feature you added;

karuNA kaTAkSha Ela upEkSha duShTa shikShaka shiShTa rakShaka - duHkhahara shiva kR.pALu susamskR.ta bhaktara poreyO ~nAna dIkSe ittu.

In kannada it shows up as:

ಕರುಣಾ ಕಟಾಕ್ಶ ಏಲ ಉಪೇಕ್ಶ ದುಶ್ಟ ಶಿಕ್ಶಕ ಶಿಶ್ಟ ಕ್ಶಕ - ದುಃಖಹರ ಶಿವ ಕೃಪಾಳು ಸುಸಮ್ಸ್ಕೃತ ಭಕ್ತರ ಪೊರೆಯೋ ಙಾನ ದೀಕ್ಷೆ ಇತ್ತು.
You have spelt j~nAna wrongly. The jakAra must occur first and the anunAsika vyanjana is only a ottakShara.

Arun- samskRta etc, I have already pointed out that in some places we need bindu and in other placs we need makAra itself. So you need to specify in the script that m and M are never used interchangeably.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

ramakriya wrote:
ramakriya wrote:more about ~nAna and saMskR.ta in the next post.

-Ramakriya
Before that another observation. The anunAsika (ma kAra) before a avargIya vyanjana should be represented by a anusvAra in kannaDa.
This is not clear and not always the case either. You have words like saumya, and Amla and namrate, where the anuswAra/bindu is Not used. Here, the bindu occurs only when "sam" or some other such prefix is present which is not an integral part of the word e.g saMyAna, saMlApa, saMrAT etc.

With v, S, Sh and s and h, I think the bindu always occurs regardless of the form of the word. So the first 3 consonants of the avargIya vyanjanas(those that are not grouped as pentads) allow for both usages while the rest alwyas have the anuswAra when m is the part of a conjunct consonant.

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

drshrikaanth wrote:
ramakriya wrote:
ramakriya wrote:more about ~nAna and saMskR.ta in the next post.

-Ramakriya
Before that another observation. The anunAsika (ma kAra) before a avargIya vyanjana should be represented by a anusvAra in kannaDa.
This is not clear and not always the case either. You have words like saumya, and Amla and namrate, where the anuswAra/bindu is Not used. Here, the bindu occurs only when "sam" or some other such prefix is present which is not an integral part of the word e.g saMyAna, saMlApa, saMrAT etc.

With v, S, Sh and s and h, I think the bindu always occurs regardless of the form of the word. So the first 3 consonants of the avargIya vyanjanas(those that are not grouped as pentads) allow for both usages while the rest alwyas have the anuswAra when m is the part of a conjunct consonant.
you are right about ya, ra and la - Did not think of these cases

-Ramakriya

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

drshrikaanth wrote:[You have spelt j~nAna wrongly. The jakAra must occur first and the anunAsika vyanjana is only a ottakShara.
Had tried that as well :( that gives it wrong as well (first anunAsika in ottu instead of the second)

j~nAna

ஜ்ஞான

ಜ್ಙಾನ

-Ramakriya

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

I will look at this in detail a bit later. So 'M' must always mean bindu (i.e. explicit specification). Right now m and 'M' indeed are used iinterchangeably (i think). I will add support for explicit M support.
With v, S, Sh and s and h, I think the bindu always occurs regardless of the form of the word. So the first 3 consonants of the avargIya vyanjanas(those that are not grouped as pentads) allow for both usages while the rest alwyas have the anuswAra when m is the part of a conjunct consonant.
I can incorporate this if needed, so that you can get away with samh... or saMh... etc. If it is ambiguous only for "y", "r" and "l", i can make it so for any other consonant. That way an explicit bindu specification would be needed only if 'ma' consonant is followed by ya, ra or la variant. Would that be ok?

For ??? , it is ~nAna or ~nyAna (i thought i mentioned it before). j~n will confuse it presently as ramakriya observed. Anyway, ~nAna/~nyAna now comes out as ???. I am presuming now that this is wrong. How should it be?

Arun
Last edited by arunk on 29 Nov 2006, 18:37, edited 1 time in total.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

i cant for some reason get the tamizh and kannada representation for ~nAna/~nyAna to come out in that post. All of a sudden they are coming out as ???. May be it is the computer I am logged in from now.

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:I can incorporate this if needed, so that you can get away with samh... or saMh... etc. If it is ambiguous only for "y", "r" and "l", i can make it so for any other consonant. That way an explicit bindu specification would be needed only if 'ma' consonant is followed by ya, ra or la variant. Would that be ok?
Arun. Why give room for confusion. Simply male M and m no interchangeable. Tomorrow another exception could pop up tha would mandate this anyway.
For ??? , it is ~nAna or ~nyAna (i thought i mentioned it before). j~n will confuse it presently as ramakriya observed. Anyway, ~nAna/~nyAna now comes out as ???. I am presuming now that this is wrong. How should it be?
I dont think j~nAna would confuse anyone. That is the spelling I have seen used almost everywhere ever since my schooldays. This is alsothe way it is spelt in writing in kannaDa. In case you are not aware, the pronunciation of j~nAna in kannaDa and ~nAna in tamizh are different. This is how it looks ಜ್ಞಾನ. Can you see?

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:Arun. Why give room for confusion. Simply male M and m no interchangeable. Tomorrow another exception could pop up tha would mandate this anyway.
ok. if we are not sure that we have all cases - no need to make M/m interchangeable.

I can see you spelling for j~nAna (i cant quote you as I lose it :). Now i do remember the kannada spelling. I need to go back and check what the logic is doing - i dont think i incorporated it. I will fix this later today. I will make j~n/j~ny <=> ~n/~ny for tamizh. BTW, do telugu and sanskrit treat it like kannada (so that i can reuse the kannada logic as is)

Arun
Last edited by arunk on 29 Nov 2006, 19:01, edited 1 time in total.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:BTW, do telugu and sanskrit treat it like kannada (so that i can reuse the kannada logic as is)
Yes

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drs,

i think we may have a problem with not making M/m interchangeable at all w.r.t tamzh krithis. I am not sure and so please consider this:

I should be able to say "pAl vaDiyum mugam", but now it would require "pAl vaDiyuM mugaM" to get bindu right for other languages. May seem like not as big a deal as it doesnt affect the tamizh krithi phonetically, but still more effort would be required to get tamizh krithis right. I would like to avoid such cases i.e. where transl. for a certain language gets more complex due to the scheme being applicable to other languages. These are the kind of things I was hoping the programming logic to do as much heavy lifting as possible.

Atleast, we should make "m/M" interchangeable at the end of a word. That will fix the above case and can even make many sanskrit krithis easier with many words ending in "m" for dvitIya vibhakti (although in such cases, i would think most people notating sanskrit krithis would know sanskrit and so when to use M and when not).

The same argument can apply for "m" in the middle for tamizh krithis also. I dont yet know for sure as what to do there but the range of consonant sounds that can follow mei 'm' (ma) in tamizh words (native ones - see below) is much more limited i think. I can think of"sa" (imsai), "ba" (ambu), "ma" ( amma)? Perhaps these can morph to 'M' if approp? But that would apply only if such combinations always require bindu in kannada/telugu/sanskrit etc. Based on what you said before, i think this may be the case (??).

Now cm tamizh krithis by papanansam sivan etc. have a lot of sanskrit words imported as-is. That can make the range of possibilities more in tamizh (?) - but perhaps those can be required be use 'M' if bindu is needed?

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

Again not sure but I could have a similar problem with needing j~nAna/j~nyAna for the word implying knowledge

In tamizh the phonetic sound of the first syllable in this word is fully represented by the consonant ~na/~nya i.e. ஞ. But in kannada (and i guess otheres), that ~na/~nya consonant is ಞ and cannot represent the sound here. It needs j + ~na/~nya as in ಜ್ಞಾ.

The question is the actual phonetic sound of ಜ್ಞಾನ is (even slighltly) different from the tamizh word ஞான (as in a tamizh krithi), i.e. as sort of how vA#ngamaya is different in saMgam (or vAngadE => vAMgAdE)?

If so then if i have want to transliterate ஞான in a tamizh krithi as ~nyAna, how should it appear in kannada? Note that going other way is fine j~nya/j~na => ஞ

Again I want to avoid requiring tiruj~nAnasambandar. The "j" is unnecessary here in the context, and is needed only for rendering other languages - i would again ideally want this taken care of behind the cases

Also i presume j + ~na/~nya in kannada is not a special case and that there may be other combination of "some consonant" + ~na/~nya? If not that would be good news :)

Arun
Last edited by arunk on 29 Nov 2006, 21:01, edited 1 time in total.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:Atleast, we should make "m/M" interchangeable at the end of a word.
Yes you could do that.
That will fix the above case and can even make many sanskrit krithis easier with many words ending in "m" for dvitIya vibhakti (although in such cases, i would think most people notating sanskrit krithis would know sanskrit and so when to use M and when not).
Iam not sure what you mean here but if you are writing sanskrit in sanskrit, the "m" at the end also will be "m" only and not the circle/bindu.
The same argument can apply for "m" in the middle for tamizh krithis also. I dont yet know for sure as what to do there but the range of consonant sounds that can follow mei 'm' (ma) in tamizh words (native ones - see below) is much more limited i think. I can think of"sa" (imsai), "ba" (ambu), "ma" ( amma)? Perhaps these can morph to 'M' if approp? But that would apply only if such combinations always require bindu in kannada/telugu/sanskrit etc. Based on what you said before, i think this may be the case (??).
ANother thing I probably did not mention earlier. When m combines with itself as a conjunct as in amma, it is Never shown by bindu. It is alwas amma i.e as the latter itself.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:The question is the actual phonetic sound of ಜ್ಞಾನ is (even slighltly) different from the tamizh word ஞான (as in a tamizh krithi), i.e. as sort of how vA#ngamaya is different in saMgam (or vAngadE => vAMgAdE)?
It is vA#nmaya. Not vA#ngamaya. And the pronunciation is perceptible and clearly ddifferent from vAMgAdE.
If so then if i have want to transliterate ஞான in a tamizh krithi as ~nyAna, how should it appear in kannada? Note that going other way is fine j~nya/j~na => ஞ
I think you are safer writing it as j~nAna in kannaDa as that is how kannaDigas will pronounce it even in a tamizh kRti. The problem with lwriting it as ~nAnam is that they will probably have problems recognising the sound as words never begin with ~n in kannaDa.
Also i presume j + ~na/~nya in kannada is not a special case and that there may be other combination of "some consonant" + ~na/~nya? If not that would be good news :)
Iam afraid you cannot have the good news. There are many words such as aj~na, sarvaj~na, aj~nAna, saMj~ne etc in kannaDa.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:Iam not sure what you mean here but if you are writing sanskrit in sanskrit, the "m" at the end also will be "m" only and not the circle/bindu.
? So bindu for "m" at end of word is for kannada and not for sanskrit or is it not always the case for sanskrit? Ramakriya for bhajarE had:
bhajarE rE mAnasa mahiShAsura mardineeM ||
nija bhakta jaya shubhadAyineeM ||
bhavasAgara tAraka roopAM ||
shree vidyAM mahA kAma suMdareeM ||

From what little I know the sanskrit "rendition" of it do have dots above (just like kannada as anuswara following). In any case, i was thinking it would be easier to get this right for languages that require "ma" at end to require bindu/anuswara. Am i confused again :)?

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

No Arun. AFAIK the letter m only occurs at the end in sanskrit, not the bindu. In hindi of course it is different. But people often might simply follow the pattern in their language while writing sanskrit as well *and hence the M or the dot at the end). But strictly, it should be the letter only.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:It is vA#nmaya. Not vA#ngamaya.
Both are/should-be equiv. in this scheme currently (only because #ng is phonetically closer than #n)
And the pronunciation is perceptible and clearly ddifferent from vAMgAdE.
Yes i did see that.
I think you are safer writing it as j~nAna in kannaDa
No i meant if word occurs in tamizh krithis, i would like to avoid saying it as j~nAna.
The problem with lwriting it as ~nAnam is that they will probably have problems recognising the sound as words never begin with ~n in kannaDa.
The interpretor (logic) can do it so it interprets it like jn~ (which is why i asked that the question below)
drshrikaanth wrote:
Also i presume j + ~na/~nya in kannada is not a special case and that there may be other combination of "some consonant" + ~na/~nya? If not that would be good news :)
Iam afraid you cannot have the good news. There are many words such as aj~na, sarvaj~na, aj~nAna, saMj~ne etc in kannaDa.
No I meant some thing other than "j" before ~n. If it cannot, then j~n <=> j~ny <=> ~n <=> ~ny always and we have no problems. You can say ~nyAna/~nAna in a tamizh krithi, j~nAna/j~nyAna in a kannada krithi and it will come out right in both languages (with the help of the additiional logic). But if ~n can occur without j before it, that would be a problem.

Arun

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:No Arun. AFAIK the letter m only occurs at the end in sanskrit, not the bindu. In hindi of course it is different. But people often might simply follow the pattern in their language while writing sanskrit as well *and hence the M or the dot at the end). But strictly, it should be the letter only.
Ok. This should be no problem. The sanskrit interpretor can treat m/M as "m".

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:
drshrikaanth wrote:It is vA#nmaya. Not vA#ngamaya.
Both are/should-be equiv. in this scheme currently (only because #ng is phonetically closer than #n)
I do have an issue with the latter spelling vA#ngmaya. For those who read the english version, it introduces an exztra "g" sound which should not be there.
arunk wrote:
drshrikaanth wrote:I think you are safer writing it as j~nAna in kannaDa
No i meant if word occurs in tamizh krithis, i would like to avoid saying it as j~nAna.
I did get what you meant. Iam saying there will be some peculiarities of pronunciation by non-native speakers. No doubt some who are familiar with tamzih pronunciation will get it right but most will pronounce it as j~na only.
arunk wrote:
drshrikaanth wrote:]The problem with lwriting it as ~nAnam is that they will probably have problems recognising the sound as words never begin with ~n in kannaDa.
The interpretor (logic) can do it so it interprets it like jn~ (which is why i asked that the question below)
I meant they might simply not able to discern how to pronounce if you just wrote ~jAnam. Is it not better to at least be able to sing albeit with imperfect diction. If you want you can leave it as ~na rather than j~na for tamizh krtis.
No I meant some thing other than "j" before ~n. If it cannot, then j~n <=> j~ny <=> ~n <=> ~ny always and we have no problems. You can say ~nyAna/~nAna in a tamizh krithi, j~nAna/j~nyAna in a kannada krithi and it will come out right in both languages (with the help of the additiional logic). But if ~n can occur without j before it, that would be a problem.
hmm. No, I cannot think of other combinations offhand.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:
arunk wrote:No i meant if word occurs in tamizh krithis, i would like to avoid saying it as j~nAna.
I did get what you meant. Iam saying there will be some peculiarities of pronunciation by non-native speakers. No doubt some who are familiar with tamzih pronunciation will get it right but most will pronounce it as j~na only.
Sorry. I know you got what i meant, but i wanted to i would avoid having to write it in the scheme as j~nAna for a tamizh krithi. You are right that pronounciation will invariably vary among different speakers. While we can "help" some with super-script suffixes etc., you can only go so far.
drshrikaanth wrote:
No I meant some thing other than "j" before ~n. If it cannot, then j~n <=> j~ny <=> ~n <=> ~ny always and we have no problems. You can say ~nyAna/~nAna in a tamizh krithi, j~nAna/j~nyAna in a kannada krithi and it will come out right in both languages (with the help of the additiional logic). But if ~n can occur without j before it, that would be a problem.
hmm. No, I cannot think of other combinations offhand.
Ok. For now let me go with this.

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:No I meant some thing other than "j" before ~n. If it cannot, then j~n <=> j~ny <=> ~n <=> ~ny always and we have no problems. You can say ~nyAna/~nAna in a tamizh krithi, j~nAna/j~nyAna in a kannada krithi and it will come out right in both languages (with the help of the additiional logic). But if ~n can occur without j before it, that would be a problem.
You have a few words in sanskrit where ~n occurs with c andjust 2 or 3 with l (l~n God knows how to pronounce it). I must say these are very unlikely to occur in kRtis. The l~n combination- never. c~n possible but very slim chance.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:
arunk wrote:
drshrikaanth wrote:It is vA#nmaya. Not vA#ngamaya.
Both are/should-be equiv. in this scheme currently (only because #ng is phonetically closer than #n)
I do have an issue with the latter spelling vA#ngmaya. For those who read the english version, it introduces an exztra "g" sound which should not be there.
Ok. If you think #ng will mislead people, i can leave it as #n. I think usage of #n (when compared to ng/Mg) is rarer in tamizh as well as kannada . Just to complete the circle, i would like a malayalam perspective where i think this consonant is used a lot more.

jayaram - the question is Does the malayalam consonant closest to "#n" (as in vA#nmaya, a#n#anam/அங்ஙனம்), better phonetically represented (for malayalam words only) by #ng or would it mislead people by overemphasizing the "ga" sound"?

If #ng would mislead, i will remove the "g" and only accept #n.

Arun
Last edited by arunk on 29 Nov 2006, 22:12, edited 1 time in total.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:
arunk wrote:No I meant some thing other than "j" before ~n. If it cannot, then j~n <=> j~ny <=> ~n <=> ~ny always and we have no problems. You can say ~nyAna/~nAna in a tamizh krithi, j~nAna/j~nyAna in a kannada krithi and it will come out right in both languages (with the help of the additiional logic). But if ~n can occur without j before it, that would be a problem.
You have a few words in sanskrit where ~n occurs with c andjust 2 or 3 with l (l~n God knows how to pronounce it). I must say these are very unlikely to occur in kRtis. The l~n combination- never. c~n possible but very slim chance.
Even here there is hope. We can always first look for "j~n", "l~n" and "c~n" as a single unit, and if not then "~n" becomes "j~n".

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:jayaram - the question is Does the malayalam consonant closest to "#n" (as in vA#nmaya, a#n#anam/அங்ஙனம்), better phonetically represented (for malayalam words only) by #ng or would it mislead people by overemphasizing the "ga" sound"?
Sorry to butt in Arun. But from the little I know and the lot I have heard, there is no place for g in pronunciation(and certainly in spelling as you know).

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drshrikaanth wrote:
arunk wrote:jayaram - the question is Does the malayalam consonant closest to "#n" (as in vA#nmaya, a#n#anam/அங்ஙனம்), better phonetically represented (for malayalam words only) by #ng or would it mislead people by overemphasizing the "ga" sound"?
Sorry to butt in Arun. But from the little I know and the lot I have heard, there is no place for g in pronunciation(and certainly in spelling as you know).
ok. I will remove it. We can add it if many people later feel like it makes sense in some contexts (and even then we can recommend against it for kannada, sanskrit etc.)

Arun

jayaram
Posts: 1317
Joined: 30 Jun 2006, 03:08

Post by jayaram »

Srikanth is right regarding the #ng sound for this case. Please note, however, that we do have a distinct 'ng' syllable where the 'g' is heard clearly. Assume this is the same for the other languages also.

(By the way, Srikanth, I notice that you live in Maidstone, UK. Do you come to London to attend kutcheris? Very much look forward to meeting the legend in person! ;) )

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

arunk wrote:? So bindu for "m" at end of word is for kannada and not for sanskrit or is it not always the case for sanskrit? Ramakriya for bhajarE had:
bhajarE rE mAnasa mahiShAsura mardineeM ||
nija bhakta jaya shubhadAyineeM ||
bhavasAgara tAraka roopAM ||
shree vidyAM mahA kAma suMdareeM ||

From what little I know the sanskrit "rendition" of it do have dots above (just like kannada as anuswara following). In any case, i was thinking it would be easier to get this right for languages that require "ma" at end to require bindu/anuswara. Am i confused again :)?

Arun
DRS is absolutely correct on this - For bhajarE rE, the transliteration scheme used was for kannDa - I used the same text to generate samskrita unicode - AFAIK samskrita would use half consonent here.

-Ramakriya
Last edited by ramakriya on 29 Nov 2006, 22:31, edited 1 time in total.

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

drshrikaanth wrote:You have a few words in sanskrit where ~n occurs with c andjust 2 or 3 with l (l~n God knows how to pronounce it). I must say these are very unlikely to occur in kRtis. The l~n combination- never. c~n possible but very slim chance.
That's interesting - do you know any of these words?

-Ramakriya

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

I am adding support for M etc. and i have a question. How do the tamizh words "anbu" (affection) and "ambu" (arrow), get written in kannada. Or more generally does "nb" become "Mb"? (and if so, does "mb" remain "mb')?

Arun
Last edited by arunk on 29 Nov 2006, 22:45, edited 1 time in total.

ramakriya
Posts: 1877
Joined: 04 Feb 2010, 02:05

Post by ramakriya »

arunk wrote:I am adding support for M etc. and i have a question. How do the tamizh words "anbu" (affection) and "ambu" (arrow), get written in kannada. Or more generally does "nb" become "Mb"? (and if so, does "mb" remain "mb')?

Arun
If I were to write a tamizh kriti with the word anbu - I would retain it as such. 'nb' here should not be replaced with 'mb' .

For ambu, m will look like a bindu - But I think your scheme already takes care of that.

-Ramakriya
Last edited by ramakriya on 29 Nov 2006, 22:52, edited 1 time in total.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

ramakriya wrote:
drshrikaanth wrote:You have a few words in sanskrit where ~n occurs with c andjust 2 or 3 with l (l~n God knows how to pronounce it). I must say these are very unlikely to occur in kRtis. The l~n combination- never. c~n possible but very slim chance.
That's interesting - do you know any of these words?
yAc~nA and other combination words with yAc~na, pUgayal~na

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:I am adding support for M etc. and i have a question. How do the tamizh words "anbu" (affection) and "ambu" (arrow), get written in kannada. Or more generally does "nb" become "Mb"? (and if so, does "mb" remain "mb')?
Ramakriya has already answered this. To elaborate, mb uses bindu only because m and b both belong to the same pentad. Likewise. all combinations where the first nasal consonant and the 2nd consonant of a conjunct beong to the same pentad, the bindu will occur. Combinations of a nasal consonant with a consonant from another varga/pentad will retain the nasal as such. bindu will not/cannot occur here.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

ok. Anyway i think i was confused :)

(Corrected):
Right now in ng/ngh, nc/nch, nT/nTh, nt/nth, nd/ndha, 'n' becomes 'M'. So 'nb' would remain as 'nb'

Before 'm' and 'M' were sort for treated like 'n' (with some additional rules). That is changing now.

Per new rules:;
1. M is explicit bindu. It translates to "ma" for tamizh only.
2. Either m or M at end becomes bindu for kannada.
3. 'ms' and 'mb' also become 'Ms' and 'Mb' to account for tamizh (and is consistent with kannada).
4. The other tamizh occurences my (ramya), mm (amma) wont become M in kannada, and that is consistent.

Ok? Anything else. We know we can treat some more consonants that follow ma like #3 (even for non-tamizh), but not yet.

Arun
Last edited by arunk on 29 Nov 2006, 23:05, edited 1 time in total.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

drs - thanks. Our posts crossed. Although i need to cross check what you said (like what are in each pentads :).

Arun

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:Right now in ng, np, nc, nT, nt, the 'n' becomes 'M'. So 'nb' would remain as 'nb'
ARun read my previous post on the rules for M occurring in some places.
Per new rules:;
1. M is explicit bindu. It translates to "ma" for tamizh only.
2. Either m or M at end becomes bindu for kannada.
Fine
3. 'ms' and 'mb' also become 'Ms' and 'Mb' to account for tamizh (and is consistent with kannada).
These words in tamizh are anyway borrowings from sanskrit.
4. The other tamizh occurences my (ramya), mm (amma) wont become M in kannada, and that is consistent.
ramya(again a sanskrit word) will be ramya. But saMyama will have bindu. You did not forget what I wrote about both m and M occuring in combination with y, r and l did you?

And finally, when you introduce telugu, you can safely apply the same rules as for kannaDa as is.

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

Silly questions. Please be patient :). I see the pentats are:

k, kh, g, gh, nga,
ca, cha, ja, jha, nya,
Ta, Tha, Da, Dha, Na,
ta, tha, da, dha, na,
pa, pha, ba, bha, ma.

I immediately see then nt, nth, nd, ndh use bindu (not nn? like mm?).
Also, mp, pph, mb, mbh

I thought ramakriya mentioned that nca, nga etc. also become Mca, Mga etc. Which rule would that be?

Also do these rules apply for Na too (nga? nya? doesnt seem so but what do i know :))? Would these considered nasal consonants that follow these rules? (so kaNDEn in tamizh would use bindu?
drshrikaanth wrote:
4. The other tamizh occurences my (ramya), mm (amma) wont become M in kannada, and that is consistent.
ramya(again a sanskrit word) will be ramya. But saMyama will have bindu. You did not forget what I wrote about both m and M occuring in combination with y, r and l did you?
No i didnt. I am going to leave user to specify bindu here (i.e. "my" would remain "my", and if bindu applies you have to specify as "My"). For most tamizh words that you use "my" combo (yes - i do agree that they will be sanskrit imports), i am hoping/guessing they are not required. But it turns out to be so for some of them, user can specify it. They will be rare anyway in tamizh krithis, and asking for explicit qualification there is ok.
drshrikaanth wrote:And finally, when you introduce teugu, you can safely apply the same rules as for kannaDa as is.
Now that is good news ;)
Last edited by arunk on 29 Nov 2006, 23:20, edited 1 time in total.

drshrikaanth
Posts: 4066
Joined: 26 Mar 2005, 17:01

Post by drshrikaanth »

arunk wrote:Silly questions. Please be patient :). I see the pentats are:

k, kh, g, gh, nga,
ca, cha, ja, jha, nya,
Ta, Tha, Da, Dha, Na,
ta, tha, da, dha, na,
pa, pha, ba, bha, ma.
First you are confusing us all by using differen representations for some of the letters (nga, nya)
The pentad will be k---#n, c--~n, T--N, t---n, p---m

To repeat, each nasal, when combining with other from its home pentad, will Always be represented by a bindu. When combining with itself, never by a bindu. (#n and ~n do not combine with themselves) When combining with consonants from another pentad, never by bindu.
Also do these rules apply for Na too (nga? nya? doesnt seem so but what do i know :))?
Already answered above.
(so kaNDEn in tamizh would use bindu?
yes. So also in kannaDa (kaMDe)

arunk
Posts: 3424
Joined: 07 Feb 2010, 21:41

Post by arunk »

ok. By this, I obviously misunderstood what ramakriya said.

The combinations then are:
1. (Corrected) #ng, #ngh, #nd, #ndh (although in our scheme these can appear as ng, ngh, nd, ndh in the input e.g. when oming from a transliteration of a tamizh krithi. For example tangam, sandi etc.)
2. ~nc, ~nch, ~nj, ~njh (some examples please?)
3. NT, NT, ND, ND
4. nt, nth, nd, ndh
5. mp, mph, mb, mbh

Arun
Last edited by arunk on 30 Nov 2006, 00:15, edited 1 time in total.

Post Reply