Contributing Member
convert pdf into html
Is there any way to convert a PDF file to an HTML file in Java?
---------------------
suresh
Contributing Member
Re: convert pdf into html
There are very few things that can be done with a PDF file after it has been created, and extracting structured layout information is not one of them. PDF is a hard format to read. Libraries like JPedal and PDFBox (listed below) can extract the text contained in a PDF, but that's about as good as it gets.
iText - library to create PDFs
FOP - library to create PDFs (and other formats) from XML by using XSL-FO transformations
PDFBox - library to create PDFs; can also extract text
JPedal - library to extract text from PDFs
PDFTextStream - commercial, library to extract text from PDFs
Adobe AcrobatViewer for JavaBean - freeware, library to display and print PDFs; this library hasn't been updated in a long time and has problems displaying files that were created with recent PDF versions.
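Since plain text is about all you can get out, the HTML part is something you would have to build yourself. Here is a rough sketch of that second step only, assuming the text has already been extracted with one of the libraries above (for example PDFBox's PDFTextStripper). The class and method names (TextToHtml, escapeHtml, toHtml) are made up for illustration and are not part of any of those libraries. It uses only the standard library and treats blank lines as paragraph breaks, which is a guess about how the extracted text is laid out.

```java
final class TextToHtml {

    /** Escapes the characters that have special meaning in HTML. */
    static String escapeHtml(String s) {
        StringBuilder out = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default:  out.append(c);
            }
        }
        return out.toString();
    }

    /**
     * Builds a minimal HTML page from plain text.
     * Assumption: blocks separated by blank lines are paragraphs;
     * single line breaks inside a block become <br>.
     */
    static String toHtml(String title, String text) {
        StringBuilder html = new StringBuilder();
        html.append("<!DOCTYPE html>\n<html>\n<head><title>")
            .append(escapeHtml(title))
            .append("</title></head>\n<body>\n");
        for (String block : text.split("\\R\\s*\\R")) {
            String trimmed = block.trim();
            if (!trimmed.isEmpty()) {
                html.append("<p>")
                    .append(escapeHtml(trimmed).replaceAll("\\R", "<br>\n"))
                    .append("</p>\n");
            }
        }
        html.append("</body>\n</html>\n");
        return html.toString();
    }

    public static void main(String[] args) {
        // Stand-in for text that a PDF library would have extracted.
        String extracted = "First paragraph, with a < b.\n\nSecond paragraph\non two lines.";
        System.out.print(toHtml("Converted PDF", extracted));
    }
}
```

Fonts, columns, tables and images from the original PDF are not carried over by this approach; you only get the text wrapped in paragraphs.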