bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes

bug-gnu-emacs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes

From:	Dmitry Gutov
Subject:	bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes
Date:	Wed, 17 Jan 2024 05:45:34 +0200
User-agent:	Mozilla Thunderbird

On 16/01/2024 04:32, Stefan Monnier via Bug reports for GNU Emacs, theSwiss army knife of text editors wrote:

Please don't call it "language".  That'd be confusing.  LSP is about
programming languages, so "language" is natural there.  But in Emacs,
a major mode is more general than that.  For example, it is not
unthinkable to consider mail-mode to be the extra-parent of
message-mode (or vice versa) -- but what is the "language" in that
case?

Isn't the language for such modes in this paradigm just the empty set?


I'm not too worried about those cases, indeed.
I'm more worried about the taxonomy of languages.
We currently have the taxonomy of major modes, with which we're pretty
familiar, and we've had many years to learn about its downsides,
complexity, as well as how to deal with them, but for languages we're
only familiar with the easy cases, which makes us judge the idea in
a way that may prove naive.


Some of us perhaps familiar with more cases than others.

IME, deciding what is the type of the content of a buffer is usually
trivial but with some notable caveats, such as XPM or Postscript files,
or "container formats" (like `.deb` or `.odt`, as well as things like
DocBook which can be considered either as their own format or as XML),

Sounds like DocBook could be viewed using different major modes. I'm notsure whether they should be classified as different languages in generalin such cases, but here is sounds like :doc_book vs :xml.

or "sublanguages" such as C being a subset of C++, or Javascript being
a subset of Typescript.  And I suspect the info we need will not always
be quite the same.

So far my understanding is that "languages" would not have a hierarchy -no relation of being a "subset" or etc, because different applicationswill likely need different relations between such languages. Or none atall, most of the time.

When a major mode is suitable for a number of languages, it can beexpressed externally, e.g. using several entries in major-mode-alist-alist.

So while there might be a good case to be made to add some API functions
to query the language/type(s) of a given buffer (I'm not sure we'd need
the language of a given major mode, OTOH), or to find the preferred
mode(s) for a given language/type, I think it's worthwhile to try and
tweak our major mode taxonomy because it is information we must have
and information we know we will always have, so we should strive to make
it as good as we can.

It shouldn't make it any harder to add language/type API functionality.
On the contrary it should make it easier.

[ As suggested elsewhere in this thread, we could even try and merge
   those taxonomies, e.g. using extra parents of the form `LANG-lang`.  ]

Inserting an extra parent called LANG-lang could work to contain the(mode->lang) mapping, but only if we decide that a mode can correspondto only one language, or if we are not going to run the language hook inthe mode function. But if we don't, the extra complexity seems not worthit: we'll need the lang->mode mapping somewhere else anyway. And lookingthere (in major-mode-remap-alist) we could fetch the reverse relationjust as well.

This also wouldn't bring any of the other features I enumerated togetherwith my patch.

[Prev in Thread]

Current Thread

[Next in Thread]

bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes, (continued)

Prev by Date: bug#68244: hash-table improvements
Next by Date: bug#68443: 30.0.50; Cannot commit from *vc-diff* if there is a deleted file.
Previous by thread: bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes
Next by thread: bug#68246: 30.0.50; Add non-TS mode as extra parent of TS modes
Index(es):
- Date
- Thread