Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Computing > Computer Support > Pasting copied text turns to garbage code

Reply
Thread Tools

Pasting copied text turns to garbage code

 
 
ComeMon!
Guest
Posts: n/a
 
      11-11-2003
Hi,

Last week I met below case:

I opened a PDF file, from which I highlighted a line of English text
and copied it to clipboard with the help of right mouse key. As I
used right mouse key to paste the text in opened notepad window, the
text was displayed as garbage. This is the first time I met such case
with text copied and pasted from PDF file opened in acrobat reader.

Was there any special thing set in the pdf file that caused this ?


 
Reply With Quote
 
 
 
 
Chris D
Guest
Posts: n/a
 
      11-12-2003
Sounds like there wasn't enough font information stored in the PDF to
allow extraction/pasting. Were the fonts embedded in the document?
What created the PDF?

Chris.
___________________________________

Chris Dahl - CTO, Solutions
ARTS PDF

http://www.artspdf.com/


(ComeMon!) wrote in message news:<. com>...
> Hi,
>
> Last week I met below case:
>
> I opened a PDF file, from which I highlighted a line of English text
> and copied it to clipboard with the help of right mouse key. As I
> used right mouse key to paste the text in opened notepad window, the
> text was displayed as garbage. This is the first time I met such case
> with text copied and pasted from PDF file opened in acrobat reader.
>
> Was there any special thing set in the pdf file that caused this ?
>
>

 
Reply With Quote
 
 
 
 
ComeMon!
Guest
Posts: n/a
 
      11-13-2003
(Chris D) wrote in message news:<. com>...
> Sounds like there wasn't enough font information stored in the PDF to
> allow extraction/pasting. Were the fonts embedded in the document?
> What created the PDF?


Try this
http://faculty.ist.unomaha.edu/pdasg...sis/thesis.pdf

>
> Chris.
> ___________________________________
>
> Chris Dahl - CTO, Solutions
> ARTS PDF
>
> http://www.artspdf.com/
>
>
> (ComeMon!) wrote in message news:<. com>...
> > Hi,
> >
> > Last week I met below case:
> >
> > I opened a PDF file, from which I highlighted a line of English text
> > and copied it to clipboard with the help of right mouse key. As I
> > used right mouse key to paste the text in opened notepad window, the
> > text was displayed as garbage. This is the first time I met such case
> > with text copied and pasted from PDF file opened in acrobat reader.
> >
> > Was there any special thing set in the pdf file that caused this ?
> >
> >

 
Reply With Quote
 
Andreas Lobinger
Guest
Posts: n/a
 
      11-13-2003
Aloha,

"ComeMon!" schrieb:
> > Sounds like there wasn't enough font information stored in the PDF to
> > allow extraction/pasting. Were the fonts embedded in the document?
> > What created the PDF?

> Try this
> http://faculty.ist.unomaha.edu/pdasg...sis/thesis.pdf


Build by TeX via dvips, contains subsetted fonts and non-standard encoding
vector. There is nearly nothing you can do to change this behaviour.
With a low-level pdf-editor tool you could change both the font definition
and encoding vector. Or you extract the information about the encoding
vector and write a script (or use unix's tr) to translate the
copied string.

Wishing a happy day
LOBI
 
Reply With Quote
 
Will Henney
Guest
Posts: n/a
 
      11-13-2003
(ComeMon!) wrote in message news:<. com>...
> (Chris D) wrote in message news:<. com>...
> > Sounds like there wasn't enough font information stored in the PDF to
> > allow extraction/pasting. Were the fonts embedded in the document?
> > What created the PDF?

>
> Try this
> http://faculty.ist.unomaha.edu/pdasg...sis/thesis.pdf
>


Looks like a typical case of someone forgetting the -Ppdf option to
dvips in the second stage of the conversion chain latex->dvi->ps->pdf.
As a result, the pdf file has Type 3 bitmapped fonts with a non-standard
encoding (this also explains why the fonts look fuzzy in acroread).

Luckily, there is a solution:

http://www.tex.ac.uk/cgi-bin/texfaq2html?label=pkfix

You'll need to convert back to ps, fix the fonts with pkfix, then
convert back to pdf.
 
Reply With Quote
 
Will Henney
Guest
Posts: n/a
 
      11-13-2003
Sorry, the pkfix method I gave in my previous post doesn't seem
to work if you only have the PDF file. You need at least
the original PS file from which the PDF file was generated. At
least it didn't work for me when using any of pdf2ps (effectively
ghostscript), pdftops, or "Print to file" in acroread.
 
Reply With Quote
 
Will Henney
Guest
Posts: n/a
 
      11-13-2003
Andreas Lobinger <> wrote in message news:<>...
> "ComeMon!" schrieb:
> > > Sounds like there wasn't enough font information stored in the PDF to
> > > allow extraction/pasting. Were the fonts embedded in the document?
> > > What created the PDF?

> > Try this
> > http://faculty.ist.unomaha.edu/pdasg...sis/thesis.pdf

>
> Build by TeX via dvips, contains subsetted fonts and non-standard encoding
> vector. There is nearly nothing you can do to change this behaviour.
> With a low-level pdf-editor tool you could change both the font definition
> and encoding vector. Or you extract the information about the encoding
> vector and write a script (or use unix's tr) to translate the
> copied string.


I found a PS version of the thesis on the website, so the pkfix trick
can be used after all. In case you don't have easy access to the
relevant tools, I've put a fixed PDF version for you at

http://www.astrosmo.unam.mx/~w.henne...esis_pkfix.pdf

You should now be able to cut-and-paste to your heart's content. I do
hope I'm not aiding and abetting plagiarism here
 
Reply With Quote
 
ComeMon!
Guest
Posts: n/a
 
      11-13-2003
Andreas Lobinger <> wrote in message news:<>...
> Aloha,
>
> "ComeMon!" schrieb:
> > > Sounds like there wasn't enough font information stored in the PDF to
> > > allow extraction/pasting. Were the fonts embedded in the document?
> > > What created the PDF?

> > Try this
> > http://faculty.ist.unomaha.edu/pdasg...sis/thesis.pdf

>
> Build by TeX via dvips, contains subsetted fonts and non-standard encoding
> vector. There is nearly nothing you can do to change this behaviour.
> With a low-level pdf-editor tool you could change both the font definition
> and encoding vector. Or you extract the information about the encoding
> vector and write a script (or use unix's tr) to translate the
> copied string.


I want to know more about different encoding standards & format
definitions of pdf files. Any url can I go for further information
?

>
> Wishing a happy day
> LOBI

 
Reply With Quote
 
Aandi Inston
Guest
Posts: n/a
 
      11-13-2003
(ComeMon!) wrote:

>I want to know more about different encoding standards & format
>definitions of pdf files. Any url can I go for further information


The PDF Reference has it all.
----------------------------------------
Aandi Inston http://www.quite.com
Please support usenet! Post replies and follow-ups, don't e-mail them.

 
Reply With Quote
 
 
 
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Pasting text into Outlook Express dwm Computer Support 24 09-06-2007 05:38 PM
Pasting multiple lines into a single line text box Adam Plocher HTML 1 06-13-2007 05:40 PM
Pasting copied text from pdf turns to garbage code BTsinner Computer Support 0 06-08-2007 07:39 AM
Templates - Garbage In Garbage Not Out ramiro_b@yahoo.com C++ 1 07-25-2005 04:48 PM
Pasting code with tab indentation to irb benny Ruby 1 05-26-2005 05:46 PM



Advertisments
 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57