Creating Text-Searchable PDFs (Tequesta Series)
- Go to original document source in Twizzler (libdlc on ‘artemis’) > Tequesta > FI06102501 > SN03633705 > Done (or just use the ‘Shortcut to Done’ on desktop)
- Highlight document folder in question, click ‘Copy to’ (on top icon bar) > libdlc on ‘artemis’ (S:) > Tequesta > OCR_input, and then click ‘Copy’
- After copying, go to Twizzler > Tequesta > OCR_input (or just use ‘Shortcut to OCR_input’ on desktop) and rename copied version by using the last two digits of the year given in parenthesis, then underscore and 1 (e.g. '67_1(1967)' should be changed to '67_1')
- Double click on folder and put all covers in a subfolder called 'covers'
- Erase all files except for tif files (e.g. Thumbs.db)
- Highlight the information in the address bar and go to Edit > Copy
- Go to ‘Tequesta Job’ (located in the C: drive under Prdev- Job or just use the ‘Shortcut to tequesta.job’ on desktop)
- Highlight text starting at (S:) and stopping before the last slash (\*.tif |APPEND)
- Click Edit > Paste AND SAVE! It should look something like this: S:\Tequesta\OCR_input\01_1\*.tif |APPEND (You should have 5 lines of text in ‘Job’)
- Close all windows
- Go to ‘PrimeOCR Job Server’ and click ‘Start’
- After completion, go back to ‘OCR_input’, then the volume in question, then open the newly made PDF file
- Click search, and test a word and a number for quality control
- Print a few pages from PDF for quality control
- If everything is well, go back to OCR_input and after the completed volume, type underscore then complete (e.g. 67_1_complete)