Skip to main content
Zapier Step

Input

NameDescriptionRequired
PDF URLProvide the URL of the source PDF document or a filetoken:// link from PDF.co Built-In Files Storage. For cloud services like Google Drive or Dropbox, ensure the link is publicly accessible without a password.Yes
Extract Fillable FieldsEnable this option (true) to extract information about fillable fields in the PDF form.No
Custom ProfilesA JSON string which adds options for the conversion process. See Custom Profiles for more.No

Source PDF URL & Google

When using Google Drive, it’s typically recommended to choose the File option. For more advanced file integration techniques, see Integrating File Sources with pdf.co.
Google File

Output

NameDescription
infoAn object containing the PDF information such as Title, PageCount, Author, Subject, CreationDate, etc. It also contains the FormField object array in case we have the Extract Fillable Fields property enabled.
SimplifiedFieldsData.field_1An Object containing form field information such as FieldName, Value, Type, AltFieldName and PageIndex. Please note: this output object will only produced if input is configured for Extract Fillable Fields.
SimplifiedFieldsData.field_2An object holding another form field information, following the same pattern as SimplifiedFieldsData.field_1 for each output file.
errorDetails of any errors (if any).
statusThe response status code. If all good this will be 200.
jobIdThe unique identifier for the job.
creditsThe credits spent on the process.
remainingCreditsThe credits left on your account.
durationThe time it took for the process.

Custom profiles

Use Custom Profiles to enhance your workflow with additional processing options. Enter JSON configuration to customize OCR settings, output format, text extraction methods, and more.
Custom Profiles

Sample JSON

{ "ImageOptimizationFormat": "JPEG", "JPEGQuality": 25, "ResampleImages": true, "ResamplingResolution": 120, "GrayscaleImages": true }
You can use any regular API parameter from the API Reference within Zapier using the std_params feature in profiles. The std_params enables the definition of regular API parameters in a JSON format, See Standard Parameters for detailed documentation and examples.
ParameterTypeDefaultDescription
OCRModestringAutoSpecifies how OCR (Optical Character Recognition) should process input content, offering various modes to tailor text extraction based on content type such as images, fonts, and vector graphics. For more information, see OCR Extraction Modes.
OCRResolutioninteger300Use this parameter to change the OCR resolution from the default 300 dpi. The range is from 72 to 1200 dpi.
RotationAngleinteger-Use manual rotation to handle PDFs with vertically drawn text. Normally, OCR automatically detects page rotation in PDFs and extracts text accurately. However, in some cases, the PDF might not have an actual rotated page --- Rather, the text itself is drawn vertically. In such scenarios, auto-detection may fail. You can use this parameter to manually set the page rotation. The available angles are: 0, 1, 2, 3.
LineGroupingModestringNoneControls line grouping in PDF text extraction. Modes: None (no grouping), GroupByRows (merge rows if all cells align), GroupByColumns (merge cells by column), JoinOrphanedRows (merge single-cell rows to above if no separator).
ConsiderFontColorsbooleanfalseControls whether font colors should be considered when detecting table structure and merging text objects during PDF extraction. Set to true to consider font colors.
DetectNewColumnBySpacesRatiostring1.2Controls how spaces between words are interpreted for column detection in PDF text extraction. It defines the ratio of space width that determines when text should be treated as being in separate columns.
AutoAlignColumnsToHeaderbooleantrueControls how columns are detected and aligned during table extraction from PDF documents. It affects both table structure detection and text extraction with formatting preservation. Set to true to automatically align columns to the header row. When set to true (default), the row with the most columns is used as the header, and all other rows are aligned to this structure --- ideal for well-structured tables. When set to false, columns are analyzed independently across all rows to build the structure, which works better for inconsistent or irregular tables.
OCRImagePreprocessingFiltersobject-Image preprocessing filters for OCR. Refer to OCRImagePreprocessingFilters for usage examples.
    .AddGrayscalebooleanfalseConverts to grayscale before OCR.
    .AddGammaCorrectionarray[string (float format)][“1.4”]Adds a gamma correction filter.
DataEncryptionAlgorithmstring-Controls the encryption algorithm used for data encryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256.
DataEncryptionKeystring-Controls the encryption key used for data encryption. See User-Controlled Encryption for more information.
DataEncryptionIVstring-Controls the encryption IV used for data encryption. See User-Controlled Encryption for more information.
DataDecryptionAlgorithmstring-Controls the decryption algorithm used for data decryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256.
DataDecryptionKeystring-Controls the decryption key used for data decryption. See User-Controlled Encryption for more information.
DataDecryptionIVstring-Controls the decryption IV used for data decryption. See User-Controlled Encryption for more information.

OCRImagePreprocessingFilters

To set image preprocessing filters, please use:
{
 "profiles": "{
    "ExtractShadowLikeText": false,
    "OCRMode": "Auto",
    "OCRImagePreprocessingFilters.AddGrayscale()": [],
    "OCRImagePreprocessingFilters.AddGammaCorrection()": [
        1.4
    ]
}"
}