From: | jemarch |
Subject: | Re: [pdf-devel] Re: Modifications on pdf_token_read to get token boundaries |
Date: | Wed, 17 Jun 2009 19:51:01 +0200 |
User-agent: | Wanderlust/2.14.0 (Africa) SEMI/1.14.6 (Maruoka) FLIM/1.14.8 (Shijō) APEL/10.6 Emacs/23.0.60 (i686-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO) |
> Since the operation in the streams is inherently sequential, that
> schema makes sense in my opinion.

I agree. The new tell operation is now in the trunk.

> The addition of a new function to the tokeniser returning the offset
> of the beginning of the last read token would then use this
> pdf_stm_tell function.
>
> What do you think? If you agree I will implement it as part of my
> type4+tokeniser patch.

It looks good to me.

The easiest way to implement it in the token reader would probably be
to replace the "reader->state = X" statements with calls to a private
function (e.g. enter_state) that would also store the token's starting
position. Then in flush_token, store this value when emitting a token.

Many thanks for the hints :)

-- 
Jose E. Marchesi <address@hidden>
http://www.jemarch.net
GNU Project http://www.gnu.org
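
For illustration, the enter_state idea described in the message could look roughly like the sketch below. This is not the actual tokeniser code: the struct layout, the state names, the read_token helper and the `pos` field (standing in for whatever pdf_stm_tell would report) are all assumptions made up for this example. Only the names enter_state and flush_token come from the message.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical reader states; the real tokeniser has its own set. */
enum reader_state { STATE_NONE, STATE_KEYWORD };

/* Hypothetical, much simplified reader. */
struct reader
{
  enum reader_state state;
  size_t pos;         /* stand-in for the offset pdf_stm_tell would report */
  size_t state_pos;   /* offset recorded when the current state was entered */
  size_t token_begin; /* start offset of the last emitted token */
};

/* Replaces direct "reader->state = X" assignments: besides changing
   the state, remember where in the input the new state began. */
static void
enter_state (struct reader *reader, enum reader_state state)
{
  reader->state = state;
  reader->state_pos = reader->pos;
}

/* When emitting a token, publish the offset saved by enter_state. */
static void
flush_token (struct reader *reader)
{
  assert (reader->state != STATE_NONE);
  reader->token_begin = reader->state_pos;
  enter_state (reader, STATE_NONE);
}

/* Toy driver: read one space-separated word from BUF.  Returns 1 if
   a token was read, 0 at end of input. */
static int
read_token (struct reader *reader, const char *buf, size_t len)
{
  while (reader->pos < len && buf[reader->pos] == ' ')
    reader->pos++;                /* skip leading whitespace */
  if (reader->pos >= len)
    return 0;

  enter_state (reader, STATE_KEYWORD);
  while (reader->pos < len && buf[reader->pos] != ' ')
    reader->pos++;                /* consume the token body */
  flush_token (reader);
  return 1;
}
```

After each successful read_token call, token_begin holds the offset of the first character of the token just read, which is the value a "beginning of the last read token" function would return.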