why does copy/pasting from pdfs sometimes have a problem with the ‘ti’ combination of letters?

403 viewsOtherTechnology

seems that when you copy and paste, sometimes there’s a question mark in a box replacing letters, particularly t or ti in some strange unicode error. but why in my experience does it only affect these characters?

In: Technology

4 Answers

Anonymous 0 Comments

PDF is not a text format its a format designed for printers.

A PDF can literaly just be a picture of some text. Unlike a word document thats actualy a text formating software.

There is thousands of different ways to still extract text from an PDF, but in some cases the information of the text is just not parr of the PDF file instead its just a bunch of pixels.

Some software might to try OCR (optical character recognition) but thats just not accurate in every case.

You are viewing 1 out of 4 answers, click here to view all answers.