bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#66470: 29.1; Provide Urdu input method


From: Rahguzar
Subject: bug#66470: 29.1; Provide Urdu input method
Date: Sat, 14 Oct 2023 11:50:42 +0200
User-agent: mu4e 1.10.7; emacs 29.1

Hi Eli,

Eli Zaretskii <eliz@gnu.org> writes:

> I responded to your question on help-gnu-emacs.

Thanks a lot! But changing ` to ' doesn't seem to make a difference.

>> 1) Better coverage of Urdu zabta takhti
>> http://www.unicode.org/L2/L2002/02004-uzt.pdf It is a code page
>> developed for Urdu and supposedly was the basis of current Urdu support
>> of Unicode. However I can't figure out exactly what is the mapping from
>> UZT to Unicode. If anyone has pointers please share.
>
> What are the specific issues you cannot figure out?

The paper I referenced describes the UZT code page but it is uses hand
drawn characters and I can't figure out what what Unicode code points
those characters correspond to. It is hard to search for them because
Unicode uses Arabic names for them which are different for Urdu ones and
the characters I am still looking for are highly unusual ones, so even
though Urdu is my first language I wouldn't be able to disambiguate
between similar looking characters to find one most suitable for Urdu.
But that is a more of a theoretical completeness issue. The letters that
are provided by the input method can be used to write all normal Urdu
text (i.e. one that doesn't have stretches of Farsi or Arabic in it).

> Anyway, a better place to ask about Unicode mapping is on the Unicode
> mailing list, unicode@corp.unicode.org.

Thanks, I will ask there.

>> 2) Another input method for the common Urdu keyboard support. Despite
>> Urdu being my mother tongue I never learned to type in it which is why I
>> prefer a phonetic keyboard. However I will like to make an input method
>> with the layout of a physical Urdu keyboard. If a keyboard layout was
>> standardized in Pakistan or India, I can't find the specification. If
>> such a document exists and anyone can point me to it, I will be happy to
>> use it as a basis for addition to the `urdu.el` in attached patch.
>> Otherwise I will use the layout of Urdu keyboard from some free software
>> project. But this will probably be closer to Emacs 30 than now.
>
> Does https://www.branah.com/urdu help?

I found that keyboard, but I also wanted to see if there was some
institutional standardization in Pakistan or India so as to find a
canonical resource. Web searches didn't reveal anything so if nothing
comes up, I will go with the one found on that page.

However I don't find the phonetic keyboard there that useful since it
has to put every character on one key. Using a combination of two
keys which Emacs can do is more systematic and easy to remember (at
least for me).

>>
>> 3) The key for 'w' is unused in the input method for now. I am tempted
>> to bind a version of `insert-char` but restricted to Arabic range of
>> Unicode to cover the rare cases not covered by the input method now. Any
>> opinions about that?
>
> That key is assigned characters in the above keyboard page.

This was for the input method in the attached file which is my attempt at
building a phonetic input method for Urdu taking some inspiration from
farsi-transliterate-banan in Emacs.

However if you and other maintainers prefer to replicate more widely used
keyboards (e.g. those on mobile devices) I will provide them instead.

>> ---
>>  lisp/leim/quail/urdu.el | 193 ++++++++++++++++++++++++++++++++++++++++
>>  1 file changed, 193 insertions(+)
>>  create mode 100644 lisp/leim/quail/urdu.el
>
> Thanks, but I'd prefer not to create a new file.  How about adding
> this to persian.el instead?

That is an option and another better one might be indian.el which is
where input methods for languages from Indo-Pakistan subcontinent are
now. Another consideration for where it should go is that other major
languages in Pakistan are also not represented among input methods.
These include Punjabi (which I can speak a little but not write) which
has two scripts: Emacs has a input methods for gurmukhi but not
shahmukhi https://en.wikipedia.org/wiki/Shahmukhi , although Wikipedia
indicates that an Urdu input method should be sufficient. There are also

Saraiki: https://en.wikipedia.org/wiki/Saraiki_language
Sindhi: https://en.wikipedia.org/wiki/Sindhi_language
Pashto: https://en.wikipedia.org/wiki/Pashto_alphabet
Balochi: https://en.wikipedia.org/wiki/Balochi_Standard_Alphabet

Which all use a script similar to Urdu/Farsi/Arabic one but with
variation that might benefit from a different input method.

Of these branah.com has keyboards for Sindhi and Pashto and I can try
making input methods based on those.

>> +(quail-define-package
>> + "urdu-phonetic" "Urdu" "اردو صوتی"
>> + `((?a . "a → ا , A → آ , ax → أ , u → ع")
>> +   (?y . "y → ی , e → ے , i → ئ")
>> +   (?r . "r → ر , R → ڑ")
>
> Besides the ` vs ' issue, I doubt that this kind of GUIDANCE is a good
> idea.  Emacs input methods have a standardized way of showing
> candidates and the keys to select them, so I would prefer to stick to
> the standardized guidance format instead.  In particular, showing the
> translations of both upper- and lower-case keys is not a good idea, I
> think.

I am a native Urdu speaker but I have never typed Urdu seriously until
recently which is why I tried to make an input method for it. I think
for a phonetic input method such disambiguation from guidance string
will be very helpful for me at least while I need these training wheels.
So if possible I would like to have it at least for my personal use. But
as I said earlier I can't make it work with either ` or '.

But once we figure out the other details mentioned above, I will send
another patch with standard guidance.

Thanks,
Rahguzar





reply via email to

[Prev in Thread] Current Thread [Next in Thread]