Consulting

Results 1 to 7 of 7

Thread: Extract data from PDF

  1. #1

    Smile Extract data from PDF

    Hi

    Can someone share code sample to extract data from text based pdf to Excel.

    Thanks in advance

  2. #2
    Moderator VBAX Guru Simon Lloyd's Avatar
    Joined
    Sep 2005
    Location
    UK
    Posts
    3,003
    Location
    You would need some software to convert the pdf back to a readable format, once you do that how will you be able to target which portion of the pdf has the data you need?
    Regards,
    Simon
    Please read this before cross posting!
    In the unlikely event you didn't get your answer here try Microsoft Office Discussion @ The Code Cage
    If I have seen further it is by standing on the shoulders of giants.
    Isaac Newton, Letter to Robert Hooke, February 5, 1675 English mathematician & physicist (1642 - 1727)

  3. #3
    Distinguished Lord of VBAX VBAX Grand Master Bob Phillips's Avatar
    Joined
    Apr 2005
    Posts
    25,453
    Location
    There is product called PDFExtract that creates Excel or Word documents rom PDF.
    ____________________________________________
    Nihil simul inventum est et perfectum

    Abusus non tollit usum

    Last night I dreamed of a small consolation enjoyed only by the blind: Nobody knows the trouble I've not seen!
    James Thurber

  4. #4
    Hi

    I know of that as well as other products like Able2Extract but I was wondering if we can do that using Acrobat APIs and VBA.

    Thanks

  5. #5
    Distinguished Lord of VBAX VBAX Grand Master Bob Phillips's Avatar
    Joined
    Apr 2005
    Posts
    25,453
    Location
    Why reinvent the wheel?
    ____________________________________________
    Nihil simul inventum est et perfectum

    Abusus non tollit usum

    Last night I dreamed of a small consolation enjoyed only by the blind: Nobody knows the trouble I've not seen!
    James Thurber

  6. #6
    Hi

    Extracting the data would be part of the solution and there would be further procesing based on requirements. If it was possible to extract using VBA, that would make it integrated rather than having separate solutions to extract the data and then process it.

    Thanks

  7. #7
    VBAX Guru Kenneth Hobs's Avatar
    Joined
    Nov 2005
    Location
    Tecumseh, OK
    Posts
    4,956
    Location
    In any solution, you will have to extract data and then process it.

    A .net project that includes the iTextSharp.dll could probably extract the text. I have worked with the iTextSharp.dll to some extent.

    Another method that might be easily used is pdftk. You could use Shell() to shell to it and pass command line parameters and values. A ShellWait() routine may be needed to allow it time to complete the process. http://www.pdflabs.com

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •