pdf-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[pdf-devel] Re: [PATCH] Tokeniser API documentation


From: jemarch
Subject: [pdf-devel] Re: [PATCH] Tokeniser API documentation
Date: Tue, 19 May 2009 17:46:25 +0200
User-agent: Wanderlust/2.14.0 (Africa) SEMI/1.14.6 (Maruoka) FLIM/1.14.8 (Shijō) APEL/10.6 Emacs/23.0.92 (i686-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO)

   >    address@hidden PDF_TOKEN_READABLE_STRINGS
   >    +Encode strings in a human-readable way
   >    +(i.e., in hexadecimal or with special characters escaped).
   > 
   > What are the semantics of this writing flag? Given that any string
   > contents can be expressed in hexadecimal, when will it use the
   > escaping of special characters? 

   The idea is to leave the details up to the library, but produce
   something that a person can read while debugging (otherwise, we'd only
   need to escape backslash, carriage return, and unbalanced parentheses).
   Characters 0-9, 11-31, and 127 (at least) would be escaped, and the
   library might write the string as hex if most characters would be
   escaped.

Ok.

   >    address@hidden pdf_status_t pdf_token_keyword_new (const pdf_char_t 
address@hidden, pdf_size_t @var{size}, pdf_token_t address@hidden)
   ...
   > Maybe would be good to add error checking on the contents of the
   > keyword. For example, it cannot start with /. To return PDF_EBADDATA
   > in that case may be appropriate. What do you think? The same would
   > apply to the other token_*_new functions.

   Sometimes the validity of a token will depend on the flags used when
   writing it (e.g., a name can include any non-null character, unless
   _NO_NAME_ESCAPES is specified when writing it).  Also, these checks will
   be redundant when a constructor is used by the token reader; but they're
   probably not a significant performance hit, so I'm not opposed to doing
   them.

I would go to include them then.

   I'm not currently planning to check implementation limits in the
   constructors or token writer, since I don't expect the writer to have
   such limits.  We could easily check them, but it would just be for the
   benefit of readers.

I agree. The PDF writer module (that will use the token writer) will
care about those limits.

   The other things you commented on will be fixed in the next patch.

Ok, many thanks.

-- 
Jose E. Marchesi
address@hidden

GNU Project
http://www.gnu.org




reply via email to

[Prev in Thread] Current Thread [Next in Thread]