Installing and enabling the Document Processing Service component
You can enhance the text analysis of file attachments by Pega Email Bot or Pega
Intelligent Virtual Assistant (IVA), by using the Document Processing Service (DPS). The
Document Processing Service provides optical character recognition (OCR), and analysis
of forms and tables in image attachments for email bots and chatbots. You can also use
DPS in your custom flows as it provides extension points and automations for custom
solutions.
By installing and enabling the DPS component in Pega Platform, you remove the need for third-party OCR software in your
application. Instead, your email bot performs text analysis of image-based file attachments
in a service on the cloud.
Before you begin: If the DPS component is not available in your application as discovered in step
3, download and prepare the DPS component for installation on your system:
- Obtain a .zip file for the component from Pega Marketplace.
- Extract the .zip file contents to your local hard drive. The extracted files include a /component folder.
A Pega Platform application and an email bot together use machine learning models to detect entities in email attachments, for example, Microsoft Word (DOC/DOCX), Open Document Format (ODT), Portable Document Format with text layer (PDF), Rich Text Format (RTF), and Simple Text (TXT). With the DPS component, the list of supported file types also includes images, for example, JPG, PNG, and PDF files without a text layer.