Is there any way to convert pdf file to html file in Java?
---------------------
suresh
There are very few things that can be done with a PDF file after it has been created; extracting structured layout information is not one of them. Libraries like JPedal and PDFBox (listed below) can extract the text contained in a PDF, but that's about as good as it gets.
PDF is a hard-to-read format. The best one can do is try to extract the text contained in a PDF file.
[B]iText[/B] - library to create PDFs
[B]FOP[/B] - library to create PDFs (and other formats) from XML by using XSL-FO transformations
[B]PDFBox[/B] - library to create PDFs; can also extract text
[B]JPedal[/B] - library to extract text from PDFs
[B]PDFTextStream[/B] - commercial library to extract text from PDFs
[B]Adobe AcrobatViewer for JavaBean[/B] - freeware library to display and print PDFs ([B]introductory article[/B]); this library hasn't been updated in a long time and has problems displaying files that were created with recent PDF versions.
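
Since plain text is about all that can be recovered, the closest thing to a "PDF to HTML" conversion is usually to wrap the extracted text in simple markup. The following is only a rough sketch of that second step, using nothing but the standard library. It assumes the text has already been extracted into a String by one of the libraries above (for example, PDFBox's PDFTextStripper); the class name TextToHtml and its methods are made up for illustration, and treating blank lines as paragraph breaks is a guess, since extracted text does not reliably preserve the PDF's layout.

```java
public class TextToHtml {

    // Escape the characters that have special meaning in HTML.
    static String escape(String s) {
        StringBuilder out = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default:  out.append(c);
            }
        }
        return out.toString();
    }

    // Turn extracted text into a minimal HTML page.
    // Assumption: blank lines separate paragraphs; single line
    // breaks inside a paragraph are kept as <br>.
    static String toHtml(String text) {
        StringBuilder html = new StringBuilder();
        html.append("<html><body>\n");
        for (String para : text.split("\\r?\\n\\s*\\r?\\n")) {
            String trimmed = para.trim();
            if (trimmed.isEmpty()) {
                continue; // skip empty chunks
            }
            html.append("<p>")
                .append(escape(trimmed).replaceAll("\\r?\\n", "<br>\n"))
                .append("</p>\n");
        }
        html.append("</body></html>\n");
        return html.toString();
    }

    public static void main(String[] args) {
        // Placeholder input; in practice this would be the String
        // returned by a text extraction library.
        String extracted = "Title line\n\nFirst paragraph, 1 < 2 & 3 > 2.\nSecond line.";
        System.out.print(toHtml(extracted));
    }
}
```

Fonts, images, tables, and positioning are all lost this way; only the text survives.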