RTTSoftware Support Forum

 1 
 on: September 07, 2010, 07:04:42 PM 
Started by BlueBell - Last post by BlueBell
Nice one!!
I only had to go as far as clicking the 'Default' option in the Help menu.
Many thanks - I will get my credit card out now  :)
Cheers

 2 
 on: September 07, 2010, 06:50:50 PM 
Started by BlueBell - Last post by RTT
PDFE can't use the Foxit reader directly, so it has to host it through an embedded IE instance.
Because of this, Foxit must be configured to show PDFs embedded in IE. This is normally set up when Foxit is installed, so either something has changed since you installed it, or you installed it with custom settings.
To try to fix it, start Foxit stand-alone. If you are using Windows Vista or Windows 7 with UAC turned on, make sure you run it as administrator (go to where the Foxit Reader .exe is located, right-click it and select the "Run as Administrator" option).

- First, make sure that in the Foxit Internet preferences (menu Tools>Preferences-Internet) the "Display PDF in Browser" option is set.
- Then check that the menu option "Help>Set to Default PDF Reader" is also checked.

Now start IE and try to open an online PDF (just google for "filetype:pdf" to get one). If Foxit shows the PDF in IE, it will show in PDFE too.
Also make sure that in the PDFE preferences, menu Edit>Preferences, the Internal PDF Reader is set to the "Internet Explorer Default" option. In your case, going by what you described, it is already set.

If none of this works, and you feel comfortable editing the Windows registry, open this key

HKEY_CLASSES_ROOT\MIME\Database\Content Type\application/pdf

and change its CLSID value to the Foxit reader CLSID:
CLSID={14E8BBD8-1D1C-4D56-A4DA-D20B75EB814E}
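For anyone who prefers to apply the registry change from a file rather than by hand, here is a minimal sketch that only builds the text of a .reg file for the key and CLSID quoted above. The function name `build_reg_file` is made up for this example, and the CLSID is simply the one given in this post, so it is an assumption that it matches your installed Foxit version. Back up the registry before importing anything.

```python
# Sketch only: builds the text of a .reg file, it does not touch the registry.
# The CLSID default is the value quoted in the post above; whether it matches
# a given Foxit Reader version is an assumption.

PDF_MIME_KEY = r"HKEY_CLASSES_ROOT\MIME\Database\Content Type\application/pdf"
FOXIT_CLSID = "{14E8BBD8-1D1C-4D56-A4DA-D20B75EB814E}"


def build_reg_file(clsid=FOXIT_CLSID, key=PDF_MIME_KEY):
    """Return .reg file text that sets the CLSID string value under `key`."""
    lines = [
        "Windows Registry Editor Version 5.00",  # standard .reg header
        "",
        "[" + key + "]",
        '"CLSID"="' + clsid + '"',
        "",
    ]
    # .reg files conventionally use Windows line endings.
    return "\r\n".join(lines)


if __name__ == "__main__":
    print(build_reg_file())
```

Save the printed text as a .reg file and double-click it (with administrator rights) to import it.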

Let me know if the problem persists, and this time tell me which Windows version you are using, and whether it is 32 or 64 bit.
The Foxit Reader forums are also a good place to get assistance on this, because the only thing you really need to fix is getting Foxit to show in IE when you open an online PDF with it. After that, it will show in PDFE too.

 3 
 on: September 07, 2010, 04:27:20 PM 
Started by BlueBell - Last post by BlueBell
I like the software a lot... but... I am running Foxit PDF reader [got rid of Acrobat entirely]. I can set the double-click to work with Foxit OK, but the internal reader [PDF View> All>] just throws up a blank explorer window [I am running Firefox, don't touch MS Explorer if I can help it]. Before that I get two windows open, trying to download a file, one offering to look for a programme to open files with a .pdf extension.
Can you help?

 4 
 on: September 07, 2010, 02:50:47 PM 
Started by straxx - Last post by RTT
The physical limit of the database is imposed by the 2GB maximum size of any single database file. For the files that grow largest, that size is driven by the amount of metadata in the indexed PDFs.
I currently have a test database with more than 250'000 indexed PDFs, with some very large custom metadata fields (to test some things), and the biggest database file is only 250MB (and this is much bigger than in a normal scenario, because of these dummy-filled custom fields), so 100'000 is peanuts.
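To put rough numbers on this, here is a back-of-the-envelope sketch. It assumes the largest database file grows linearly with the number of indexed PDFs and reads "250MB" and "2GB" as binary units; both are assumptions, and `estimate_max_pdfs` is a name made up for this example, not part of the tool.

```python
# Rough capacity estimate, assuming (hypothetically) that the largest
# database file grows linearly with the number of indexed PDFs.

MAX_DB_FILE_BYTES = 2 * 1024**3  # the 2GB per-file limit mentioned above


def estimate_max_pdfs(indexed_pdfs, largest_file_bytes,
                      limit_bytes=MAX_DB_FILE_BYTES):
    """Scale the observed PDF count up to the per-file size limit."""
    return limit_bytes * indexed_pdfs // largest_file_bytes


if __name__ == "__main__":
    # Figures from the test database: 250'000 PDFs, 250MB largest file.
    max_pdfs = estimate_max_pdfs(250_000, 250 * 1024**2)
    years = max_pdfs // 100_000  # at ~100'000 new PDFs per year
    print(max_pdfs, years)  # prints: 2048000 20
```

So even with the inflated test metadata, the estimate lands at about two million PDFs, or roughly 20 years at 100'000 files per year. Real databases will differ, since metadata size per PDF varies.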
But for that number of files, be sure to use a good directory hierarchy. Don't put all these PDFs in the same directory, or you are going to have problems with the Windows Shell itself.
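One possible way to build such a hierarchy, sketched here under assumptions: files are spread by year and then by a two-character hash bucket, so 100'000 files per year end up as roughly 390 per folder on average (256 buckets). The layout and the name `sharded_path` are purely illustrative, any scheme that keeps folders reasonably small would do.

```python
import hashlib
from pathlib import PurePath


def sharded_path(root, year, filename):
    """Return root/year/bucket/filename, where bucket is a 2-hex-digit hash.

    The bucket is derived from the file name, so the same name always maps
    to the same folder, and names are spread evenly over 256 folders.
    """
    digest = hashlib.sha1(filename.encode("utf-8")).hexdigest()
    bucket = digest[:2]  # "00".."ff"
    return PurePath(root, str(year), bucket, filename)


if __name__ == "__main__":
    # Hypothetical archive root and file name, for illustration only.
    print(sharded_path("archive", 2010, "scan_000123.pdf"))
```

A date-based layout (year/month/day) works just as well if the scans arrive at a steady rate.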

 5 
 on: September 07, 2010, 02:31:41 PM 
Started by risa - Last post by RTT
Quote
When is patch 3 expected to be released?

I have no date yet. Patch releases are essentially for fixing bugs, and right now it includes too few fixes to justify a new release.

 6 
 on: September 07, 2010, 07:50:08 AM 
Started by straxx - Last post by straxx
Do you have any recommendations on the maximum number of PDFs you can index and maintain in a database?
I have ~100'000 files per year, each around 10-15 pages, all scanned and full-text indexed.
Is it possible to manage this number of documents with PDF Explorer, for searching over the web server?

 7 
 on: September 06, 2010, 08:09:37 AM 
Started by risa - Last post by risa
Thanks. It works fine.
When is patch 3 expected to be released?

 8 
 on: August 26, 2010, 03:32:12 PM 
Started by risa - Last post by RTT
I have now added the [FP] constant to the tool. This change will be included in build 59 patch 3.

Meanwhile, you can skip the "edit in Excel" step if you create a custom grid layout with a dynamic calculated column set to show the file path + filename, "FP+F".
You can then export this additional column, already filled with the data you want, and, while importing, you just need to redirect this column to the field you want to use.
Note: Don't export the field you are going to use as the final destination of this data (e.g. keywords), or you will have to unassign it while importing.
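If the exported CSV still needs to be reworked outside the tool, the manual Excel copy can be scripted. This is only a sketch: the column names `FilePath` and `Keywords` are placeholders, not the names the tool actually exports, and `redirect_column` is a helper made up for this example.

```python
import csv
import io


def redirect_column(csv_text, source="FilePath", target="Keywords"):
    """Copy every value of the `source` column into the `target` column.

    The target column is appended if the CSV does not have it yet.
    Column names are hypothetical; adjust them to the exported header.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    fieldnames = list(reader.fieldnames or [])
    if source not in fieldnames:
        raise KeyError("column not found: " + source)
    if target not in fieldnames:
        fieldnames.append(target)

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        row[target] = row[source]  # overwrite or fill the destination field
        writer.writerow(row)
    return out.getvalue()


if __name__ == "__main__":
    exported = "Filename,FilePath\na.pdf,C:\\docs\\a.pdf\n"
    print(redirect_column(exported))
```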

 9 
 on: August 26, 2010, 08:01:56 AM 
Started by risa - Last post by risa
I want to put the filename with its file path into a grid column (e.g. keywords). This is a fixed value and should not change if the file is moved, etc.
Using [F] in the Batch Edit Info field gives the filename. I thought [FP] would give the path, but it doesn't work.

For now I export the grid to Excel, copy the filename (with file path) to the column I want, and then import the CSV file.
Is there a smarter way to do it?

 10 
 on: August 25, 2010, 12:31:14 AM 
Started by Lothar - Last post by RTT
Quote
In Batch tools -> Search & Extract I can only choose 1 page (too few, the ISBN is usually between page 3 and page 5) or all pages (way too many)

Why not the first and last 10 pages?

That's a good point! I will try to address this in the next version.

Quote
the given regexp misses a lot of ISBNs and I can't manage to use the regexps that I found in other scripts:


RE_ISBN = re.compile("(?:ISBN[ -]*(?:|10|13)|International Standard Book Number)[:\s]*(?:|, PDF ed.|, print ed.|\(pbk\)|\(electronic\))[:\s]*([-0-9Xx]{10,25})",
                     re.MULTILINE)

// This is a combination of strict and relaxed versions of ISBN number format
var reISBN=/(ISBN[\:\=\s][\s]*(?=[-0-9xX ]{13})(?:[0-9]+[- ]){3}[0-9]*[xX0-9])|(ISBN[\:\=\s][ ]*\d{9,10}[\d|x])/g;

The regular expressions component I'm using only supports a subset of the Perl regular expression syntax, so many of the regular expressions out there are incompatible unless they are modified to match the supported syntax.
You can check the supported syntax in the attached file.

Try this one, which also covers the relaxed version:

(\d{3}[-]\d{1,5}[-]\d{1,7}[-]\d{1,6}[-][\d,x,X]|\d{1,5}[-]\d{1,7}[-]\d{1,6}[-][\d,x,X])|(ISBN[\:\=\s][ ]*\d{9,10}[\d|x])

In this case only the full match matters, so don't forget to set capturing group 0 to "Extract".
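To check the expression outside the tool, here is a small sketch using Python's `re` module, which accepts this pattern unchanged. It is an assumption that Python's engine behaves the same as the component used in the tool for this pattern, and the sample text is made up. Taking `group(0)` corresponds to extracting the full match.

```python
import re

# The expression from the post above, split over three lines for readability.
ISBN_PATTERN = re.compile(
    r"(\d{3}[-]\d{1,5}[-]\d{1,7}[-]\d{1,6}[-][\d,x,X]"
    r"|\d{1,5}[-]\d{1,7}[-]\d{1,6}[-][\d,x,X])"
    r"|(ISBN[\:\=\s][ ]*\d{9,10}[\d|x])"
)


def extract_isbns(text):
    """Return the full match (group 0) of every ISBN-like string in text."""
    return [m.group(0) for m in ISBN_PATTERN.finditer(text)]


if __name__ == "__main__":
    sample = ("ISBN 978-0-306-40615-7 (print), "
              "ISBN 0-306-40615-2, ISBN: 0306406152")
    for isbn in extract_isbns(sample):
        print(isbn)
    # prints:
    # 978-0-306-40615-7
    # 0-306-40615-2
    # ISBN: 0306406152
```

Running it against text copied from the PDFView (text only mode) is a quick way to tell whether a miss comes from the expression or from the extracted text.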

If it continues to miss ISBNs, please mail me one of these PDFs so I can take a look.
Some of the misses may also be related to the quality of the extracted text. You can use the PDFView (text only mode), or the text extractor tool, to better understand what text the tool is processing.
