bug#59341: 29.0.50; Lisp files with other encoding than UTF-8?

bug-gnu-emacs

From:	Stefan Kangas
Subject:	bug#59341: 29.0.50; Lisp files with other encoding than UTF-8?
Date:	Thu, 17 Nov 2022 20:14:09 -0800

Eli Zaretskii <eliz@gnu.org> writes:

> No.  AFAIR, they are in utf-8-emacs because they include characters
> beyond the Unicode range, which UTF-8 cannot encode.  See, for
> example, the codepoints that start around line 645 in ind-util.el,
> which are used for converting between IS 13194 (ISCII) and Unicode.

I see, thanks.

Do we need these characters to be raw bytes in the source code though?
I was thinking of a change similar to the below, which would
incidentally make it a bit easier to read the code.

diff --git a/lisp/language/ind-util.el b/lisp/language/ind-util.el
index e2a21820f4..16161319ef 100644
--- a/lisp/language/ind-util.el
+++ b/lisp/language/ind-util.el
@@ -644,9 +644,9 @@ indian-dev-aiba-decode-region
     ;;Unicode vs IS13194  ;; only Devanagari is supported now.
     ((ucs-devanagari-to-is13194-alist
       '((?\x0900 . "[U+0900]")
-       (?\x0901 . " ")
-       (?\x0902 . " ")
-       (?\x0903 . " ")
+        (?\x0901 . "?\x180000")
+        (?\x0902 . "?\x180001")
+        (?\x0903 . "?\x180002")
        (?\x0904 . "[U+0904]")

[and so on]

This change would also avoid confusing external tools.  For example, the
code is completely unreadable in many external viewers, such as:

https://github.com/emacs-mirror/emacs/blob/master/lisp/language/ind-util.el#L647
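
To illustrate the idea behind the patch, here is a minimal sketch. It assumes #x180000 purely as an example of a code point above the Unicode range (the real values for ind-util.el may well differ), and `my/escape-demo' is a hypothetical name used only here. It checks that an ASCII-only escape in the source yields the same one-character string as the raw character would.

```elisp
;; Sketch only: #x180000 is an illustrative code point above the Unicode
;; range, not necessarily a value that ind-util.el actually needs.
(defun my/escape-demo ()
  "Return a list of checks on a character beyond the Unicode range."
  (let* ((unicode-max #x10FFFF)     ; last code point that Unicode defines
         (ch #x180000)              ; hypothetical non-Unicode character
         (from-escape "\x180000")   ; ASCII-only spelling in the source
         (from-char (string ch)))   ; the same character built at run time
    (list (> ch unicode-max)        ; beyond what standard UTF-8 covers
          (characterp ch)           ; still a valid Emacs character
          (<= ch (max-char))        ; within Emacs's extended range
          (string= from-escape from-char)
          (length from-escape))))
```

Evaluating (my/escape-demo) should give (t t t t 1): the escaped form denotes a single character, so a file written this way stays plain ASCII on disk while the Lisp reader still produces the non-Unicode character.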