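Most likely, it's because the PDF files contain scanned images of text rather than an actual text layer, so an ordinary text extractor finds little or nothing to read. You need to run OCR (Optical Character Recognition) on the pages to recover the text from the scans.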
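
Here is a rough sketch of how that could look in Python. It is only an illustration and rests on assumptions that are not in the question: that the third-party packages `pdf2image` and `pytesseract` are installed, along with the Poppler and Tesseract binaries they rely on. The function names `looks_scanned` and `ocr_pdf`, the 20-character threshold, and the 300 DPI setting are my own hypothetical choices, not anything standard.

```python
def looks_scanned(page_texts, min_chars_per_page=20):
    """Heuristic guess: a PDF is probably scanned if its pages yield almost no text.

    page_texts is a list with one entry per page, holding whatever your
    current extractor returned (a string, possibly empty, or None).
    The threshold is an arbitrary assumption; tune it for your files.
    """
    if not page_texts:
        return True
    total = sum(len((text or "").strip()) for text in page_texts)
    return total < min_chars_per_page * len(page_texts)


def ocr_pdf(path, dpi=300, lang="eng"):
    """Render each page to an image and run Tesseract on it.

    Returns a list with one recognized-text string per page.
    """
    # Imported here so looks_scanned() works even without these packages.
    from pdf2image import convert_from_path  # needs Poppler installed
    import pytesseract  # needs the Tesseract binary installed

    pages = convert_from_path(path, dpi=dpi)
    return [pytesseract.image_to_string(page, lang=lang) for page in pages]
```

You would call `looks_scanned(...)` on the text your existing extractor returned, and fall back to `ocr_pdf("your_file.pdf")` only when it reports `True`. OCR is slower and less accurate than reading a real text layer, so it makes sense to use it only for files that need it.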