I have 40+ forms and I want to make sure that common fields have the same names. This would greatly help me in storing and sorting the entries.
Apart from that i want to make sure the formatting is consistent through all the forms. Is there a way i can extract this data from the pdf forms preferably to excel file, so that i can check / make corrections.
the properties i am looking for are:
- Field ID
- Field Name
- Field Type
- Font
- Font Size
- Font Color
- Alignment
- Multiline
- Date Format
The Form DATA FILE (FDF or eXtended XFDF) is easily exported from the PDF and can be manipulated from blank to be imported to a PDF to fill the fields automatically. In some ways similar to the PDF it can contain binary media but is predominantly text based thus easy to parse.
It could look something like this, so very easy to import to other applications
HOWEVER it is only the field data thus you cant change style or colour those are part of the PDF page data and thus for changing a pages content you need a full blown editor /API with analysis and modification abilities, rather than import export function. There are many GUI PDF editors with API or SDK abilities such as windows Foxit Phantom but you need to use your favourite choice for platform or programming language, thuse for JS you can use iText Aspose or Spire etc...