swftools-common
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Swftools-common] Issues with embedded fonts when converting PDF to


From: Michael Haufler (scireum GmbH)
Subject: Re: [Swftools-common] Issues with embedded fonts when converting PDF to SWF (500 USD reward)
Date: Tue, 14 Jan 2014 18:05:16 +0100

Hello Hans,

the pdf text extractions also works for us. But we need the correct text inside the SWF an this is not correct. Just check the output of swfstrings page_00010.swf there are characters missing.

Greetings
Mike


freundliche Grüße
scireum GmbH

Michael Haufler
Geschäftsführer

-------------------------------------------------------------------------------------------------------
scireum GmbH, Alfred-Klingele-Straße 6, 73630 Remshalden

Unser Service - gerne für Sie da:
Vertrieb: (07151) 20637 10
Support: (07151) 20637 20 - E-Mail: address@hidden

Tel:  (07151) 20637-11 -  E-Mail: address@hidden
Fax: (07151) 20637-19 -  Internet: http://www.scireum.de
 
Geschäftsführer: Michael Haufler, Andreas Haufler
Amtsgericht Stuttgart, HRB 732171


2014/1/14 Hans J Nuecke <address@hidden>
Hallo Michael,
not sure I fully understood your issue, but probably your "-vvv" creates the problem.
To have triple verbosity I think the correct use is "-v -v -v" instead of "-vvv".

I converted the page and created a search index file (the pure text saved in a .txt file; we use that for search); and both looks OK; see attached.
The text was extracted with pdftotext (part of xpdf), with the setting "-enc UTF-8".

Hopefully this helps ;-)
Regards
Hans

Am 14.01.2014 16:40, schrieb Michael Haufler (scireum GmbH):
Hello All,

we have a strange issue with pdf2swf 0.9.2 
The PDF contains item numbers like this one: 09500214
From the source pdf I can copy + paste them perfectly.
However when I convert the pdf to swf the textinformation partially gets lost.

I already inspected pdf2swf with the -vvv option and in the log everything seems right.

But when I check the text with swfstrings i get some unicode crap instead of the original item number.

Here is the part of the Log for the second item on the Page with the Text:
"LSPG       1 09500214       105263    36"

VERBOSE Updating font to OBIGPM+Corbel-Italic-60-0
VERBOSE Updating font to XJVNLU+Corbel-62-0
TRACE   beginString(L) render=0
DEBUG   drawChar(102.047200,205.512000,c='L' (76), u=76 <1> 'L') CID=0 render=0 glyphid=20 font=0xc3d7d0 size=0.007471
TRACE   Placing shape ID 32
TRACE   Drawing char 20 in font 5 at 1,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
TRACE   beginString(SPG) render=0
DEBUG   drawChar(106.025200,205.512000,c='S' (83), u=83 <1> 'S') CID=0 render=0 glyphid=24 font=0xc3d7d0 size=0.007471
TRACE   Drawing char 24 in font 5 at 94,0 in color 000000ff
DEBUG   drawChar(110.248000,205.512000,c='P' (80), u=80 <1> 'P') CID=0 render=0 glyphid=22 font=0xc3d7d0 size=0.007471
TRACE   Drawing char 22 in font 5 at 194,0 in color 000000ff
DEBUG   drawChar(114.608500,205.512000,c='G' (71), u=71 <1> 'G') CID=0 render=0 glyphid=16 font=0xc3d7d0 size=0.007471
TRACE   Drawing char 16 in font 5 at 296,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
VERBOSE Updating font to ERHDFG+Corbel-58-0
TRACE   beginString(.w) render=0
DEBUG   drawChar(426.070600,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
TRACE   Drawing char 5 in font 4 at 7625,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
TRACE   beginString(.v...{.v.v.x.w.z) render=0
DEBUG   drawChar(442.105000,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
TRACE   Drawing char 40 in font 4 at 8002,0 in color 000000ff
DEBUG   drawChar(446.029450,205.512000,c=' ' (895), u=57 <1> '9') CID=1 render=0 glyphid=42 font=0xc21470 size=0.007471
TRACE   Drawing char 13 in font 4 at 8094,0 in color 000000ff
DEBUG   drawChar(449.953900,205.512000,c='{' (891), u=53 <1> '5') CID=1 render=0 glyphid=38 font=0xc21470 size=0.007471
TRACE   Drawing char 9 in font 4 at 8187,0 in color 000000ff
DEBUG   drawChar(453.878350,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
TRACE   Drawing char 40 in font 4 at 8279,0 in color 000000ff
DEBUG   drawChar(457.802800,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
TRACE   Drawing char 40 in font 4 at 8371,0 in color 000000ff
DEBUG   drawChar(461.727250,205.512000,c='x' (888), u=50 <1> '2') CID=1 render=0 glyphid=35 font=0xc21470 size=0.007471
TRACE   Drawing char 41 in font 4 at 8464,0 in color 000000ff
DEBUG   drawChar(465.651700,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
TRACE   Drawing char 5 in font 4 at 8556,0 in color 000000ff
DEBUG   drawChar(469.576150,205.512000,c='z' (890), u=52 <1> '4') CID=1 render=0 glyphid=37 font=0xc21470 size=0.007471
TRACE   Drawing char 42 in font 4 at 8648,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
TRACE   beginString(.w.v.{.x.|.y) render=0
DEBUG   drawChar(518.306650,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
TRACE   Drawing char 5 in font 4 at 9795,0 in color 000000ff
DEBUG   drawChar(522.231100,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
TRACE   Drawing char 40 in font 4 at 9887,0 in color 000000ff
DEBUG   drawChar(526.155550,205.512000,c='{' (891), u=53 <1> '5') CID=1 render=0 glyphid=38 font=0xc21470 size=0.007471
TRACE   Drawing char 9 in font 4 at 9980,0 in color 000000ff
DEBUG   drawChar(530.080000,205.512000,c='x' (888), u=50 <1> '2') CID=1 render=0 glyphid=35 font=0xc21470 size=0.007471
TRACE   Drawing char 41 in font 4 at 10072,0 in color 000000ff
DEBUG   drawChar(534.004450,205.512000,c='|' (892), u=54 <1> '6') CID=1 render=0 glyphid=39 font=0xc21470 size=0.007471
TRACE   Drawing char 10 in font 4 at 10164,0 in color 000000ff
DEBUG   drawChar(537.928900,205.512000,c='y' (889), u=51 <1> '3') CID=1 render=0 glyphid=36 font=0xc21470 size=0.007471
TRACE   Drawing char 7 in font 4 at 10257,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
TRACE   beginString(.y.|) render=0
DEBUG   drawChar(560.167450,205.512000,c='y' (889), u=51 <1> '3') CID=1 render=0 glyphid=36 font=0xc21470 size=0.007471
TRACE   Drawing char 7 in font 4 at 10780,0 in color 000000ff
DEBUG   drawChar(564.091900,205.512000,c='|' (892), u=54 <1> '6') CID=1 render=0 glyphid=39 font=0xc21470 size=0.007471
TRACE   Drawing char 10 in font 4 at 10872,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
TRACE   endTextObject() render=0 textstroke=(nil) clipstroke=(nil)
TRACE   saveState 0xc69e70
DEBUG   updateLineDash, 0 dashes
VERBOSE Updating font to ERHDFG+Corbel-58-0


This is how I converted the file:

pdf2swf -T 9 -G -f -vvv catalog.pdf -o 1.swf

Here is the complete log file as zipped Textfile: http://m.scireum.de/cid-font-log.txt.zip

To speed things up a little we offer 500 USD payed per paypal to the fist person who can solve our issue.  

The issue is resolved if we can convert the pdf to swf with the correct text in the swf

For further information just contract me address@hidden


Greetings

Michael Haufler



---------------
SWFTools-common is a self-managed list. To subscribe/unsubscribe, or amend an existing subscription, please kindly point your favourite web browser at:<http://lists.nongnu.org/mailman/listinfo/swftools-common>



---------------
SWFTools-common is a self-managed list. To subscribe/unsubscribe, or amend an existing subscription, please kindly point your favourite web browser at:<http://lists.nongnu.org/mailman/listinfo/swftools-common>


reply via email to

[Prev in Thread] Current Thread [Next in Thread]