Hi
Can someone share code sample to extract data from text based pdf to Excel.
Thanks in advance
Hi
Can someone share code sample to extract data from text based pdf to Excel.
Thanks in advance
You would need some software to convert the pdf back to a readable format, once you do that how will you be able to target which portion of the pdf has the data you need?
Regards,
Simon
Please read this before cross posting!
In the unlikely event you didn't get your answer here try Microsoft Office Discussion @ The Code Cage
If I have seen further it is by standing on the shoulders of giants.
Isaac Newton, Letter to Robert Hooke, February 5, 1675 English mathematician & physicist (1642 - 1727)
There is product called PDFExtract that creates Excel or Word documents rom PDF.
____________________________________________
Nihil simul inventum est et perfectum
Abusus non tollit usum
Last night I dreamed of a small consolation enjoyed only by the blind: Nobody knows the trouble I've not seen!
James Thurber
Hi
I know of that as well as other products like Able2Extract but I was wondering if we can do that using Acrobat APIs and VBA.
Thanks
Why reinvent the wheel?
____________________________________________
Nihil simul inventum est et perfectum
Abusus non tollit usum
Last night I dreamed of a small consolation enjoyed only by the blind: Nobody knows the trouble I've not seen!
James Thurber
Hi
Extracting the data would be part of the solution and there would be further procesing based on requirements. If it was possible to extract using VBA, that would make it integrated rather than having separate solutions to extract the data and then process it.
Thanks
In any solution, you will have to extract data and then process it.
A .net project that includes the iTextSharp.dll could probably extract the text. I have worked with the iTextSharp.dll to some extent.
Another method that might be easily used is pdftk. You could use Shell() to shell to it and pass command line parameters and values. A ShellWait() routine may be needed to allow it time to complete the process. http://www.pdflabs.com