Quantcast
Channel: KNIME RSS
Viewing all articles
Browse latest Browse all 4157

Extracting Comments section from PDF and page numbers

$
0
0

I have a PDF that is annotated with comments.  I would like to extract the comments section information ONLY from the PDF along with the page number they come from using KNIME.

The Tika parser can capture all the contents of the PDF, but there is too much text to extract the relevant comments as they are not tagged.  

Would be nice to hear from anyone in the KNIME community which has solved this problem.

Thanks

 


Viewing all articles
Browse latest Browse all 4157

Trending Articles