Results 1 to 2 of 2

Thread: convert pdf into html

  1. #1
    Contributing Member
    Join Date
    Sep 2006
    Answers
    962

    convert pdf into html

    Is there any way to convert pdf file to html file in Java?

    ---------------------
    suresh


  2. #2
    Contributing Member
    Join Date
    Sep 2006
    Answers
    962

    Re: convert pdf into html

    There are very few things that can be done with a PDF file after it has been created; extracting structured layout information is not one of them. Libraries like JPedal and PDFBox (listed below) can extract the text contained in a PDF, but that's about as good as it gets.

    PDF is a hard to read format. The best one can do is try to extract the text contained in a PDF file.

    iText - library to create PDFs
    FOP - libray to create PDFs (and other formats) from XML by using XSL-FO transformations
    PDFBox - library to create PDFs; can also extract text
    JPedal - library to extract text from PDFs
    PDFTextStream - commercial, library to extract text from PDFs
    Adobe AcrobatViewer for JavaBean - freeware, library to display and print PDFs;
    introductory article ; this library hasn't been updated in a long time and has problems displaying files that were created with recent PDF versions.


Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
About us
Applying for a job can be a stressful and frustrating experience, especially for someone who has never done it before. Considering that you are competing for the position with a at least a dozen other applicants, it is imperative that you thoroughly prepare for the job interview, in order to stand a good chance of getting hired. That's where GeekInterview can help.
Interact