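import PyPDF2

# Raw string so the backslash in the Windows path is not treated as an escape.
pdfName = r'path\Yiibaipoint.pdf'
# PdfFileReader, getPage and extractText are the PyPDF2 1.x/2.x names.
read_pdf = PyPDF2.PdfFileReader(pdfName)
page = read_pdf.getPage(0)  # pages are indexed from 0
page_content = page.extractText()
print(page_content)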

Running the program above produces the following output:

Yiibai Point originated from the idea that there exists a class of readers who respond better 
to online content and prefer to learn new skills at their own pace from the comforts of their 
drawing rooms.

The journey commenced with a single tutorial on HTML in 2006 and elated by the response 
it generated, we worked our way to adding fresh tutorials to our repository which now 
proudly flaunts a wealth of tutorials and allied articles on topics ranging from programming
languages to web designing to academics and much more.

Reading Multiple Pages

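To read a PDF that contains multiple pages and print each page together with its page number, loop over the pages and call getPageNumber() on each one. The example below uses a PDF file with two pages; the content of each page is printed under its own page heading.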

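import PyPDF2

# Raw string so the backslash in the Windows path is not treated as an escape.
pdfName = r'Path\Yiibaispoint2.pdf'
read_pdf = PyPDF2.PdfFileReader(pdfName)

for i in range(read_pdf.getNumPages()):
    page = read_pdf.getPage(i)
    # getPageNumber() is zero-based, so add 1 for a human-readable page number.
    print('Page No - ' + str(1 + read_pdf.getPageNumber(page)))
    page_content = page.extractText()
    print(page_content)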

Running the example code above produces the following result:

Page No - 1
Yiibai Point originated from the idea that there exists a class of readers who respond better to 
online content and prefer to learn new skills at their own pace from the comforts of their drawing 
rooms. 


Page No - 2

The journey commenced with a single tutorial on HTML in 2006 and elated by the response it 
generated, we worked our way to adding fresh tutorials to our repository which now proudly flaunts 
a wealth of tutorials and allied articles on topics ranging from p
rogramming languages to web 
designing to academics and much more.
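
The loop above uses the camelCase method names from older PyPDF2 releases. As a rough sketch only, the same page-by-page printing could be written with the newer PdfReader interface, assuming a PyPDF2 release that provides PdfReader, reader.pages and extract_text(). The helper names extract_page_texts and format_pages are hypothetical and introduced here purely for illustration; the path in the usage comment is the placeholder from the example.

```python
def extract_page_texts(pdf_path):
    """Return the extracted text of each page as a list of strings."""
    # Imported lazily so format_pages() below works even without PyPDF2.
    # Assumption: a PyPDF2 release that exposes PdfReader is installed.
    from PyPDF2 import PdfReader

    reader = PdfReader(pdf_path)
    # Fall back to an empty string if a page yields no extractable text.
    return [page.extract_text() or "" for page in reader.pages]


def format_pages(texts):
    """Prefix each page's text with a 'Page No - N' heading, counting from 1."""
    lines = []
    for number, text in enumerate(texts, start=1):
        lines.append("Page No - " + str(number))
        lines.append(text)
    return "\n".join(lines)


# Example usage (placeholder path taken from the example above):
# print(format_pages(extract_page_texts(r'Path\Yiibaispoint2.pdf')))
```

Iterating with enumerate(..., start=1) gives the 1-based page number directly, so no call to getPageNumber() is needed in this variant.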


Processing PDF