Page 1 of 1

Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Wed Jan 10, 2018 7:12 pm
by teriibi
Hi,

Can someone update me about those two solutions I plan...

1) Having to Extract lines of text from a PDF monthly document,
rebuild my own look alike format.
Complete all fields and make a printable completed similar PDF doc.
Or..
2) Fulfill all empty fields with data from a managing stack...and print it too.
(in that case no content import/export, just editing/adding text above/within the PDF file.

I believe the second solution is less flexible since the fields to complete will be at a sligthly different location and with dif sizes, each month. :shock:

Which ever is doable under LC. ?!!
TKS

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Fri Jan 12, 2018 4:53 pm
by MaxV
teriibi wrote:
Wed Jan 10, 2018 7:12 pm

2) Fulfill all empty fields with data from a managing stack...and print it too.
(in that case no content import/export, just editing/adding text above/within the PDF file.
This is easy with livecode and more flexible.
The PDF format is one way to untouchable, when something is in PDF can't come back anymore. PDF was created to print, not to modify it or else.

If you design correctly your LIVECODE template, the result will be always the same.

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Fri Jan 12, 2018 8:43 pm
by teriibi

Code: Select all

when something is in PDF can't come back anymore.
Uuuhh...did know ppl at Adobe were that mean !
aiaiiii... :?

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Sat Jan 13, 2018 5:53 am
by FourthWorld
Are these documents meant to be printed?

I wonder if a more modern, maleable format like ePub may be a good fit.

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Sat Jan 13, 2018 1:12 pm
by teriibi
Actualy, just 1 copy to paper
All other - look alike - copies would be only virtualy shared under some electronic format sqlite/pdf/excel

I thought of just making an empty Form from scratch with sqlite. then export it to PDF,
+ probably an APP that syncro over the net witht the same monthly data...so no need of a PDF for those users.

still the 1 paper copy is needed.
the burden is having to manualy copy/paste new data everymonth from delivered PDF to the LC stack.
Thats the way I will go if I cant automate it afterall.. :|

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Mon Jan 15, 2018 2:30 pm
by MaxV
Livecode can create PDF, so centralize all on livecode.

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Mon Jan 15, 2018 9:58 pm
by teriibi
For the same final doc. :

Which content is lighter to transfer/downlaod from a Server to several Devices ?
PDF data content ...or ...EPUB data content ?
or it is just the same.
Can one build an epub doc from a device ?
Same for PDF ?
Or one has to download ready made EPUBS end PDFs files ?

finaly
Can one print an Epub doc as easely as one print PDF from any devices Oses, IOs, Android/Linux/windows ?

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Fri Jan 19, 2018 1:31 am
by MaxV
epub is lighter than PDF
epub is just a text file, very easy to write.
PDF is a nightmare.

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Fri Jan 19, 2018 11:04 am
by teriibi
,,,any other format that could offer a more "fancy look" like a PDF print ?
I think thats why PDF is so popular today..

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Wed Jan 24, 2018 2:07 pm
by MaxV
You can use HTML, any web page is in HTML and is fancy.

I think that HTML probably is the best for you, because:
  • just a single HTML file for multiple pages
  • it is pure text, so is very easy to edit, write, modify
  • it auto-formats to device or paper format
  • whan you print it, it autosplits on multiple pages
  • just use a browser to see it

Re: Extracting text from a PDF file or filling a montlhy PDF Courses Form

Posted: Tue Jun 16, 2020 3:24 pm
by richmond62
There are PDF documents and PDF documents . . .

Older PDF documents are, to all intents and purposes just multi-page images, and as such are useless re importing text
and can generally only be converted to text using OCR, which I can assure you is a headache of gargantuan proportions,
especially when the text is in non-Latin script.

Newer PDF documents contain an embedded text layer which can be selected, copied and pasted elsewhere . . .

Mind you, I have no idea how you would extract that embedded text layer into, say, a LiveCode textField.