# Split PDF Based on Text Search

> Add this step to your Zapier workflow to split PDF documents based on a text search, with OCR support for scanned files. It is particularly useful for creating new PDF files from sections of the original document, identified by specific text or by regular expression patterns.

<Frame>
  <img src="https://mintcdn.com/pdfco/N4Le3Ib-q2JX4RLs/images/integrations/zapier/zapier-step27.png?fit=max&auto=format&n=N4Le3Ib-q2JX4RLs&q=85&s=6269b2c89921ecb1de5d0f09208123cd" alt="Zapier Step" width="2226" height="2370" data-path="images/integrations/zapier/zapier-step27.png" />
</Frame>

## Input

| Name                                   | Description                                                                                                                                                                                                                                           | Required |
| -------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | -------- |
| **Source PDF URL**                     | Provide the URL to the source PDF document, or a `filetoken://` link from [PDF.co Built-In Files Storage](https://app.pdf.co/files). If you use another cloud service such as **Google Drive** or **Dropbox**, ensure the link is publicly accessible. | Yes      |
| **Text Search String**                 | Specify the text string for searching within the PDF pages.                                                                                                                                                                                           | Yes      |
| **Enable Case-Sensitive Search**       | Activate this to `True` for case-sensitive search. Default is `False`.                                                                                                                                                                                | No       |
| **Enable Regular Expression Search**   | Set this to `True` to incorporate regular expressions in your search. The default is `False`.                                                                                                                                                         | No       |
| **Exclude Pages with Identified Text** | Opt this to `True` to exclude pages where the text is found. Default is `False`.                                                                                                                                                                      | No       |
| **OCR Language**                       | Select the [OCR language](/api-reference/pdf-make-text-searchable-or-unsearchable) for text recognition in scanned PDFs. Default is English.                                                                                                          | No       |
| **Base Filename for New PDFs**         | Define the base filename for the newly created segmented PDF files.                                                                                                                                                                                   | No       |
| **Custom Profiles**                    | A `JSON` string which adds options for the conversion process. See [Custom Profiles](#custom-profiles) for more.                                                                                                                                      | No       |

### Source PDF URL & Google

<Note>
  When using **Google Drive**, the **File** option is typically recommended. For more advanced file integration techniques, see [Integrating File Sources with PDF.co](/integrations/zapier/input-file-sources).

  <Frame>
    <img src="https://mintcdn.com/pdfco/tXGo3rbTS_pEF5es/images/integrations/zapier/zapier-google-input-source.png?fit=max&auto=format&n=tXGo3rbTS_pEF5es&q=85&s=8e304dac8851d0b17c9500f25c2d41c8" alt="Google File" width="819" height="102" data-path="images/integrations/zapier/zapier-google-input-source.png" />
  </Frame>
</Note>

## Output

| Name                  | Description                                                                                                                        |
| --------------------- | ---------------------------------------------------------------------------------------------------------------------------------- |
| `url1`                | The temporary URL of the first output file, hosted on the PDF.co file server.                                                      |
| `url2`                | The temporary URL of the second output file. The same `urlN` pattern is used for every output file.                                |
| `urls`                | This is an array of temporary URLs, each pointing to an output file.                                                               |
| `outputLinkValidTill` | Timestamp until which the output URLs remain available.                                                                            |
| `error`               | Error details, if any.                                                                                                             |
| `status`              | The [response status](/api-reference/introduction) code. `200` indicates success.                                                  |
| `jobId`               | The unique identifier for the job.                                                                                                 |
| `credits`             | The credits spent on the process.                                                                                                  |
| `remainingCredits`    | The credits left on your account.                                                                                                  |
| `duration`            | The time it took for the process.                                                                                                  |

## Custom profiles

Use Custom [Profiles](/api-reference/profiles) to add processing options to your workflow. Enter a `JSON` configuration to customize OCR settings, output format, text extraction methods, and more.

<Frame>
  <img src="https://mintcdn.com/pdfco/tXGo3rbTS_pEF5es/images/integrations/zapier/custom-profiles.png?fit=max&auto=format&n=tXGo3rbTS_pEF5es&q=85&s=3a96b0395b56c9977724ee05327aa571" alt="Custom Profiles" width="843" height="111" data-path="images/integrations/zapier/custom-profiles.png" />
</Frame>

### Sample JSON

```json theme={null}
{ "ImageOptimizationFormat": "JPEG", "JPEGQuality": 25, "ResampleImages": true, "ResamplingResolution": 120, "GrayscaleImages": true }
```

<Tip>
  You can use any regular API parameter from the [API Reference](/api-reference) within Zapier using the `std_params` feature in profiles. The `std_params` feature lets you define regular API parameters in JSON format. See [Standard Parameters](/api-reference/profiles#standard-parameters) for detailed documentation and examples.
</Tip>

| Parameter                 | Type   | Default | Description                                                                                                                                                            |
| ------------------------- | ------ | ------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `outputDataFormat`        | string | -       | Set this to `base64` to return the output in base64 format.                                                                                                            |
| `DataEncryptionAlgorithm` | string | -       | Controls the encryption algorithm used for data encryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256. |
| `DataEncryptionKey`       | string | -       | Controls the encryption key used for data encryption. See User-Controlled Encryption for more information.                                                             |
| `DataEncryptionIV`        | string | -       | Controls the encryption IV used for data encryption. See User-Controlled Encryption for more information.                                                              |
| `DataDecryptionAlgorithm` | string | -       | Controls the decryption algorithm used for data decryption. See User-Controlled Encryption for more information. The available algorithms are: AES128, AES192, AES256. |
| `DataDecryptionKey`       | string | -       | Controls the decryption key used for data decryption. See User-Controlled Encryption for more information.                                                             |
| `DataDecryptionIV`        | string | -       | Controls the decryption IV used for data decryption. See User-Controlled Encryption for more information.                                                              |
