RTTSoftware Support Forum
PDF-ShellTools => General => Topic started by: lkaiser on February 10, 2016, 09:02:48 PM
-
Hi RTT,
Are there any API method to get the text from a PDF file ?
I am searching how to extract adresses informations (and other localized informations if possible) from files containing invoices for sorting and grouping by destination (country, region , city ... )
Thank you in advance
Lionel
-
Take a look to the Text and TextEx properties of the IPDFPage object (http://www.rttsoftware.com/Manuals/STIndex.htm?pageURL=ST/English/MyScriptsAPI.htm#IPDFPage).
There is a "View text" My Script (http://www.rttsoftware.com/Manuals/STIndex.htm?pageURL=ST/English/MyScripts.htm), under the samples tab, that demos the TextEx property.
-
I managed to extract only the desired text thanks to the code of the demo .
Maybe in future developments there will be a method to extract text or items from a specified rectangular area in the page.
Thank You RTT