bug#52749: G-expressions don't consistently preserve #nil

bug-guix

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#52749: G-expressions don't consistently preserve #nil

From:	Philip McGrath
Subject:	bug#52749: G-expressions don't consistently preserve #nil
Date:	Mon, 27 Dec 2021 13:38:42 -0500
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.3.1

Hi!

Just as a general disclaimer, I'm a Racketeer and only incidentally aSchemer, so I'm not very familiar with the universe of Guile libraries.


On 12/23/21 01:59, Maxime Devos wrote:
> Philip McGrath schreef op wo 22-12-2021 om 23:25 [-0500]:
>> G-expressions currently do not consistently preserve the distinction
>> between #nil and '(), which causes trouble for programs that rely on
>> that distinction. In particular, the issue affects programs that use
>> (guix build json), because that library uses #nil to represent the JSON
>> value `null', whereas it uses '() to represent an empty JSON array.
>
> The constant #nil is only for elisp compatibility and not something
> supposed to be used in Scheme code that isn't for Scheme/elisp
> compatibility, so this seems more a bug in (guix build json) to me.

That was not the impression I had gotten from `info "(guile)Nil"`. Forexample, I think someone who wanted to finish the implementationdescribed in `info "(guile)ECMAScript"` might want to use #nil for oneof the false-y ECMAScript values to take advantages of the documentedefficiencies in its bit-level representation. More concretely,guile-json@1 and guile-json@3 use #nil in the same way as (guix build json).


On 12/23/21 12:58, Maxime Devos wrote:
> Philip McGrath schreef op wo 22-12-2021 om 23:25 [-0500]:
>> G-expressions currently do not consistently preserve the distinction
>> between #nil and '(), which causes trouble for programs that rely on
>> that distinction. In particular, the issue affects programs that use
>> (guix build json), because that library uses #nil to represent the JSON
>> value `null', whereas it uses '() to represent an empty JSON array.
>>
>> The following program exposes the error:
>> [
>> ;...]
>>
>> ; This one fails!
>> (check-equal? (gexp->json-string #~'(@ ("k" . #nil)))
>>                 "{\"k\":null}"
>>                 "gexp: null in object")
>
> A simpler test:
>
> Compare this:
>    (cdr (gexp->approximate-sexp #~("stuff" . #nil)))
>    ; output: #nil --- seems like everything is ok?
>
> with:
>    (gexp->approximate-sexp #~("stuff" . #nil))
>    ; output: ("stuff") --- where did the #nil go?
>
> I think the idea is that, if you construct a list (a b c . #nil)
> in elisp, and pass it to Scheme, then Scheme should treat it as a
> Scheme list, so it should be printed as (a b c) when using Scheme's
> 'write' or 'display'.

Since `write` and `list?` are specified by various Scheme standards, Ithink it is the correct choice for `write` to use a Scheme-compatibleexternal representation for values recognized by `list?`, at least bydefault. (Perhaps a parameter could control this behavior?)

I think the behavior of `gexp->approximate-sexp` is at least defensible,since its documentation (`info guix "gexp->approximate-sexp"`) warnsthat "some information can be lost".

But I think the implementation of G-expressions faces more stringentobligations. I see it as analogous to the implementation of syntaxobjects, a macro expander, or a compiler, in that it should have asemantics-preserving representation of arbitrary Guile code, includingGuile's extensions to Scheme.

(I haven't yet understood at a theoretical level how "strata" and"staging" relate to the more familiar concept of "phases", but myintuition is that, while the R6RS model of phases wouldn't be enough, itseems like would probably to express staging/strata in terms of phaseswith Racket enhancements like the label phase level and arbitrarysubmodule-implemented phases.)


So, I agree that:

On 12/25/21 06:13, Maxime Devos wrote:

That said, it would be less surprising if the #nil/() distinction is
preserved by gexp->derivation and friends. This can be done by writing
our own 'write' procedure. Downside: it might be less efficient than
Guile's write which is implemented in C. Can be resolved by writing our
own 'write' procedure in C.

I haven't looked at the implementation at all, but extending `write`certainly would be a reasonable option, and, longer-term, it might bepossible to upstream a patch adding the needed behavior.

A more radical option could be to use a format other than plain-texts-expressions for compiled G-expressions. For example, Racket has aforward-compatible "fast-load serialization" binary format for the kindsof values that can be embedded in compiled code.[0] There are obviousdisadvantages to a binary format, but advantages include the ability topreserve source-location information and to avoid some the quirks thatcome with functions like `write` and `read`, for historical reasons orfor the convenience of humans writing code directly. The implementationis in Racket, so it should be fairly easy to port to Guile, if that werewanted.[1] Or maybe there's something related to Guile bytecode thatwould work, or maybe just making a `#nil`-preserving version of `write`would be easier.


-Philip

[0]: https://docs.racket-lang.org/reference/fasl.html

[1]:https://github.com/racket/racket/blob/master/racket/collects/racket/fasl.rkt

[Prev in Thread]

Current Thread

[Next in Thread]

bug#52749: G-expressions don't consistently preserve #nil, Philip McGrath, 2021/12/22
- bug#52749: G-expressions don't consistently preserve #nil, Maxime Devos, 2021/12/23
  - bug#52749: G-expressions don't consistently preserve #nil, Maxime Devos, 2021/12/25
    - bug#52749: G-expressions don't consistently preserve #nil, Philip McGrath <=
    - bug#52749: G-expressions don't consistently preserve #nil, Maxime Devos, 2021/12/27
- bug#52749: G-expressions don't consistently preserve #nil, Maxime Devos, 2021/12/23

Prev by Date: bug#52684: [BUG] Multiple Packages Failing to Build
Next by Date: bug#52829: GNOME Bluetooth integration is broken
Previous by thread: bug#52749: G-expressions don't consistently preserve #nil
Next by thread: bug#52749: G-expressions don't consistently preserve #nil
Index(es):
- Date
- Thread