UTF8 support

users-prolog

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

UTF8 support

From:	stamit
Subject:	UTF8 support
Date:	Fri, 03 Jun 2005 21:24:41 +0300

Hello to the list.

I did my own hack for support for UTF8 encoding in GNU Prolog. It peeks
ahead on the stream and makes each extended character/byte in a UTF8
sequence appear to be of the same type as the entire 'wide' character.
It uses the functions declared in <wctype.h>.

This logic has a problem with pushback *BUT* it is all internal to the
scanner (the scanner just treats non-ASCII characters specially).

Invalid UTF8 is read in the old-fashioned way.

Long '\XXXXX\' escape sequences are read/written.

I also did everything I could for atom_chars, atom_codes etc. but I
don't know gprolog's internals very well. They all seem to work OK
though.

There are #ifdef blocks and a new option in configure.in (untested).


src/BipsPl/c_supp.[ch]
        *Char* and *Code* functions updated
src/BipsPl/scan_supp.c
        UTF8_Hack_Peek_Next_Char
src/BipsPl/stream_supp.[ch]
        I had to add some fields to StmInf for the scanner to use
src/BipsPl/write_supp.c
        iswprint test
src/EnginePl/atom.[ch]
        UTF8_Hack_Classify_Char, Is_Valid_Code


I hope you will find all this useful.


Stamatis Mitrofanis

gprolog-1.2.16-utf8.patch
Description: Text Data

[Prev in Thread]

Current Thread

[Next in Thread]

UTF8 support, stamit <=

Prev by Date: No: Problems with Fedora 3
Previous by thread: No: Problems with Fedora 3
Index(es):
- Date
- Thread