Skip to main content
Make Step

Input

NameDescriptionRequired
Import OptionsChoose the input source, either Upload a File or Import a File From URL.Yes

Upload a File
NameDescriptionRequired
DataUpload a file using raw binary data from another module. Note: This requires additional credits as it first uploads to PDF.co Temporary Files Storage.Yes
Output File NameSpecify a custom file name for the output file.No
Import a File From URL
NameDescriptionRequired
URLProvide the URL to the source PDF document, or a filetoken:// link from PDF.co Built-In Files Storage. If you use another cloud service such as Google Drive or Dropbox ensure the link is publicly accessible.Yes
Output File NameSpecify a custom file name for the output file.No
NameDescriptionRequired
Convert TypeSelect the type of conversion. Default is PDF to Text.No
PagesEnter a comma-separated list of page indices (or ranges) for processing. Leave empty to include all pages. The first page is numbered 0 (zero). For example: 0,1-2,5-.No
PasswordIf the PDF is password-protected, enter the password here.No
InlineChoose Yes to include a copy of the output data in JSON output mode. If set to No, only a link to the output will be included in this mode.No
Execution ModeSelect Sync for small tasks up to 10 seconds. Choose Async for standard jobs, or Async For Large Docs for tasks over 30 seconds. Use Job Check module for retrieving results in large tasks.No
ProfilesAdd custom options for the process in a JSON string format. See API Profiles for more details.No
Output Links ExpirationSet the expiration time in minutes for output links. Default is 60 minutes. Increase this limit with a Business Plan or higher, see plans here for details.No
Export TypeChoose between Download a File or JSON Output. Default is Download a File.No

Integrating External File Sources

Streamline your Make workflows with external file sources like Google Drive and Dropbox using their unique actions. Discover efficient integration strategies in our guide: File Source Integrations in Make.

Output

NameDescription
urlThis is the temporary URL provided by the PDF.co file server.
BodyRepresents the raw output data. This is generated only when the Export Type option is set to JSON Output.
DataRepresents the output binary data. This data is generated only when the Export Type option is set to Download a File.
StatusIndicates the response status code. A success status is returned if the operation is successful.
outputLinkValidTillSpecifies the timestamp until which the url remains accessible.
errorProvides details about any errors encountered during the process, if applicable.
File NameThe designated name of the output file.
Job IdA unique identifier assigned to the job.
creditsThe amount of credits utilized for the process.
Remaining CreditsDisplays the balance of credits available in your account.
durationThe duration of time the process took to complete.

Profiles

To display the Profiles fields, you must enable Advanced Settings by clicking the toggle:
Advanced Settings
You can set additional options for the operation used in the PDF.co module by using Profiles. A profile is a string in JSON-like format containing predefined parameters.

Here’s an example of a Custom Profiles input:

{ "outputDataFormat": "base64" }
With this input, the PDF.co module will return the output in base64 format. You can find the list of available parameters for customizing profiles in the PDF.co operation documentation below:
You can use any regular API parameter from the API Reference within Make using the std_params feature in profiles. The std_params enables the definition of regular API parameters in a JSON format, See Standard Parameters for detailed documentation and examples.
ParameterTypeDefaultDescriptionAvailable for
outputDataFormatstring-If you require your output as base64 format, set this to base64.PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
ColumnDetectionModestringContent Groups And BordersControls column detection/alignment in PDF table extraction. See Column Detection Mode for more information.PDF to CSV, PDF to XLS
OCRModestringAutoSpecifies how OCR (Optical Character Recognition) should process input content, offering various modes to tailor text extraction based on content type such as images, fonts, and vector graphics. For more information, see OCR Extraction Modes.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
OCRResolutioninteger300Use this parameter to change the OCR resolution from the default 300 dpi. The range is from 72 to 1200 dpi.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
RotationAngleinteger-Use manual rotation to handle PDFs with vertically drawn text. Normally, OCR automatically detects page rotation in PDFs and extracts text accurately. However, in some cases, the PDF might not have an actual rotated page --- rather, the text itself is drawn vertically. In such scenarios, auto-detection may fail. You can use this parameter to manually set the page rotation. The available angles are: 0, 1, 2, 3.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
LineGroupingModestringNoneControls line grouping in PDF text extraction. Modes: None (no grouping), GroupByRows (merge rows if all cells align), GroupByColumns (merge cells by column), JoinOrphanedRows (merge single-cell rows to above if no separator).PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
ConsiderFontColorsbooleanfalseControls whether font colors should be considered when detecting table structure and merging text objects during PDF extraction. Set to true to consider font colors.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DetectNewColumnBySpacesRatiostring1.2Controls how spaces between words are interpreted for column detection in PDF text extraction. It defines the ratio of space width that determines when text should be treated as being in separate columns.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
AutoAlignColumnsToHeaderbooleantrueControls how columns are detected and aligned during table extraction from PDF documents. It affects both table structure detection and text extraction with formatting preservation. Set to true to automatically align columns to the header row. When set to true (default), the row with the most columns is used as the header, and all other rows are aligned to this structure --- ideal for well-structured tables. When set to false, columns are analyzed independently across all rows to build the structure, which works better for inconsistent or irregular tables.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
OCRImagePreprocessingFiltersobject-Image preprocessing filters for OCR. Refer to OCRImagePreprocessingFilters for usage examples.
    .AddGrayscalebooleanfalseConverts to grayscale before OCR.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
    .AddGammaCorrectionarray[string (float format)][“1.4”]Adds a gamma correction filter.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
RenderTextObjectsbooleantrueControls whether to render text objects in the PDF document. When set to true, it will render all text objects in the PDF document. Set to false to skip over text objects during rendering. See Disable Text Layer for more information.PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
RenderImageObjectsbooleantrueRender image objects or not.PDF to JPG, PDF to PNG, PDF to WEBP
RenderVectorObjectsbooleantrueRender vector objects or not.PDF to JPG, PDF to PNG, PDF to WEBP
JPEGQualityinteger85See profiles.JPEGQuality.PDF to JPG
WEBPQualityinteger75See profiles.WEBPQuality.PDF to WEBP
TIFFCompressionstringLZWSee profiles.TIFFCompression.PDF to TIFF
RenderingResolutioninteger120See Set Image Resolution for more information.PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
OptimizeImagesbooleantrueSome PDF may have high quality images used in the document and you may need to keep the quality of these images in the output HTML. By default PDF to HTML is optimizing images and you can easily turn it off. See Control Image Quality for more information.PDF to HTML
OutputPageWidthinteger1024Control page width (in pixels) for output HTML. Height is calculated and used according to the original pdf pages ratio. See Control Output Page Width for more information.PDF to HTML
AdditionalCssStylesstring""To inject CSS for layout options in your HTML. Example: #canvas { zoom: 50%; }. Scale the div that contains all generated HTML pages by 50%. See Inject CSS for more information.PDF to HTML
SaveVectorsbooleanfalseControls whether to save vector graphics during PDF to HTML conversion. Set to true to save vector graphics.PDF to CSV, PDF to JSON, PDF to XLS, PDF to XML
SaveImagesstringNoneControls how images are saved during PDF to HTML conversion. Modes: None (no images), OuterFile (save to sub-folder), Embed (embed as Base64 data:URI).PDF to CSV, PDF to JSON, PDF to XLS, PDF to XML, PDF to HTML
ConsiderFontSizesbooleanfalseSet to true to make the converter consider font size differences in document text when detecting and parsing table structures. This can be helpful in cases where tables are formatted using different font sizes to distinguish between headers, data cells, or other structural elements.PDF to CSV, PDF to JSON, PDF to XLS, PDF to XML
ExtractionAreaarray[number]-Extract text in a specific area by defining the extraction area - set with points in the format [x, y, width, height].PDF to CSV, PDF to JSON, PDF to XLS, PDF to XML
ExtractShadowLikeTextbooleantrueControls whether to extract invisible text from a PDF document. Set to false to skip over invisible text during extraction. This is particularly useful when dealing with PDFs that contain hidden text layers or when you only want to extract visible content. When this value is set to false, OCRMode must be set to Auto to properly apply the shadow text filtering effect.PDF to CSV, PDF to JSON, PDF to XLS, PDF to XML
DataEncryptionAlgorithmstring-Controls the encryption algorithm used for data encryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DataEncryptionKeystring-Controls the encryption key used for data encryption. See User-Controlled Encryption for more information.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DataEncryptionIVstring-Controls the encryption IV used for data encryption. See User-Controlled Encryption for more information.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DataDecryptionAlgorithmstring-Controls the decryption algorithm used for data decryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DataDecryptionKeystring-Controls the decryption key used for data decryption. See User-Controlled Encryption for more information.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF
DataDecryptionIVstring-Controls the decryption IV used for data decryption. See User-Controlled Encryption for more information.PDF to CSV, PDF to JSON, PDF to Text, PDF to XLS, PDF to XML, PDF to HTML, PDF to JPG, PDF to PNG, PDF to WEBP, PDF to TIFF