Search In Pdf Pn

Input

NameDescriptionRequired
PDF URLProvide the URL to the source PDF document, or a filetoken:// link from PDF.co Built-In Files Storage. If you use another cloud service such as Google Drive or Dropbox ensure the link is publicly accessible.Yes
Search QuerySpecify the text you wish to search for within the PDF document (e.g. company name).Yes
Use Regular ExpressionsSet to true to enable regular expression search for the searchString(s) parameter.No
PagesSpecify the page indices to search, using comma-separated values or ranges (e.g., “0,1,2-” or “1,2,3-7”). Page indexing starts at 0. Use “!” before a number to count from the end (e.g., “!0” for the last page). Leave empty to search all pages. The input must be a string.No
File NameFile name for the generated output, the input must be in string format.No
Webhook URLThe callback URL or Webhook used to receive the output data.No
Output Links Expiration (In Minutes)Set the expiration time for the output link in minutes. After this specified duration, any generated output file(s) will be automatically deleted from PDF.co Temporary Files Storage. The maximum duration for link expiration varies based on your current subscription plan. To store permanent input files (e.g. re-usable images, pdf templates, documents) consider using PDF.co Built-In Files Storage.No
InlineSet to true to return results inside the response. Otherwise, the endpoint will return a URL to the output file generated.No
Word Matching ModeWordMatchingMode defines how search terms match PDF text. Modes: None (exact string match only), SmartMatch (default; flexible word boundary match, includes letters/digits/punctuation), ExactMatch (strict word boundaries, whole-word match only).No
PasswordThe password of the password-protected PDF fileNo
HTTP UsernameHTTP auth user name if required to access source URL.No
HTTP PasswordHTTP auth password if required to access source URL.No
Custom ProfilesUse JSON to customize PDF processing with options like output resolution, OCR settings, text extraction methods, encryption, and image handling. Check our Custom Profiles section to see all available parameters for your current endpoint.No

Custom Profiles

You can set additional options for the operation used in the PDF.co node by using Custom Profiles. A custom profile is a string in JSON-like format containing predefined parameters. Here’s an example of a Custom Profiles input:
{ 'OCRDetectPageRotation': true }
With this input, the PDF.co operation will rotate the scanned PDF automatically. You can find the list of available parameters for customizing profiles in the PDF.co operation documentation below:
ParameterTypeDefaultDescription
ColumnDetectionModestringContentGroupsAndBordersControls column detection/alignment in PDF table extraction. Modes: ContentGroupsAndBorders (default; text + lines), ContentGroups (text grouping only), Borders (lines only), BorderedTables (OCR-based for bordered tables), ContentGroupsAI (AI for dense/complex layouts).
DetectionMinNumberOfRowsinteger1Minimum number of rows to detect in a table
DetectionMinNumberOfColumnsinteger1Minimum number of columns to detect in a table
DetectionMaxNumberOfInvalidSubsequentRowsAllowedinteger0Maximum number of invalid subsequent rows allowed in a table
DetectionMinNumberOfLineBreaksBetweenTablesinteger0Minimum number of line breaks between tables
EnhanceTableBordersbooleantrueEnhance table borders or not
OCRDetectPageRotationbooleanfalseControls whether to detect page rotation in the PDF document when OCR applied. Set to true to detect page rotation. See Support page rotation for more information.
DataEncryptionAlgorithmstring-Controls the encryption algorithm used for data encryption. See User-Controlled Encryption for more information. The available algorithms are: AES128AES192AES256.
DataEncryptionKeystring-Controls the encryption key used for data encryption. See User-Controlled Encryption for more information.
DataEncryptionIVstring-Controls the encryption IV used for data encryption. See User-Controlled Encryption for more information.
DataDecryptionAlgorithmstring-Controls the decryption algorithm used for data decryption. See User-Controlled Encryption for more information. The available algorithms are: AES128AES192AES256.
DataDecryptionKeystring-Controls the decryption key used for data decryption. See User-Controlled Encryption for more information.
DataDecryptionIVstring-Controls the decryption IV used for data decryption. See User-Controlled Encryption for more information.
You can also use Custom Profiles to:

Output

NameDescription
jobIdUnique identifier for the background job.
pageCountNumber of pages in the PDF document.
errorIndicates whether an error occurred (false means success)
statusStatus code of the request (200, 404, 500, etc.). For more information, see Response Codes.
creditsNumber of credits consumed by the request
remainingCreditsNumber of credits remaining in the account
durationTime taken for the operation in milliseconds
urlDirect URL to the final PDF file stored in S3.
nameName of the output file
outputLinkValidTillTimestamp indicating when the output link will expire