Utilities based on 'libpoppler' for extracting text, fonts, attachments and metadata from a PDF file. Also supports high-quality rendering of PDF documents into PNG, JPEG, or TIFF format, or into raw bitmap vectors for further processing in R.
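A minimal sketch of the features listed above, assuming the package and its Poppler system dependency are already installed. The sample PDF is generated with base R purely for illustration; the file name and its contents are placeholders, not something shipped with pdftools.

```r
library(pdftools)

# Create a small one-page PDF with base R so the example is self-contained.
# (Placeholder input: substitute the path to any PDF of your own.)
pdf_file <- tempfile(fileext = ".pdf")
grDevices::pdf(pdf_file, width = 6, height = 4)
plot.new()
text(0.5, 0.5, "Hello from pdftools")
invisible(grDevices::dev.off())

# Text extraction: a character vector with one element per page.
txt <- pdf_text(pdf_file)

# Metadata: a list that includes, among other fields, the page count.
info <- pdf_info(pdf_file)

# Fonts used in the document and any embedded attachments.
fonts <- pdf_fonts(pdf_file)
attachments <- pdf_attachments(pdf_file)

# Rendering: page 1 as a raw bitmap array for further processing in R.
bitmap <- pdf_render_page(pdf_file, page = 1, dpi = 72)

# Optionally save the bitmap as PNG; 'png' is only a suggested package,
# so check that it is available first.
if (requireNamespace("png", quietly = TRUE)) {
  png_file <- tempfile(fileext = ".png")
  png::writePNG(bitmap, png_file)
}
```

The same bitmap can likely be passed to the writers in the other suggested packages (jpeg, webp) in the same way; raising `dpi` gives a higher-resolution rendering at the cost of memory.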

Documentation

Manual: pdftools.pdf
Vignette: None available.

Maintainer: Jeroen Ooms <jeroen at berkeley.edu>

Author(s): Jeroen Ooms

Install the package and any missing dependencies by running this line in your R console:

install.packages("pdftools")

Depends
Imports Rcpp (>= 0.12.12)
Suggests jpeg, png, webp, testthat
Enhances
Linking to Rcpp
Reverse depends pdfsearch
Reverse imports crminer, rcoreoa, readtext, textreadr
Reverse suggests goldi, hunspell, magick, tesseract
Reverse enhances
Reverse linking to

Package pdftools
Materials
URL https://ropensci.org/blog/2016/03/01/pdftools-and-jeroen (blog) https://github.com/ropensci/pdftools#readme (devel) https://poppler.freedesktop.org (upstream)
Task Views
Version 1.4
Published 2017-09-01
License MIT + file LICENSE
BugReports https://github.com/ropensci/pdftools/issues
SystemRequirements Poppler C++ interface library and headers
NeedsCompilation yes
Citation
CRAN checks pdftools check results
Package source pdftools_1.4.tar.gz