Language support for NLP
Pega Platform provides text analytics based on natural language processing (NLP) that you can use to detect, process, and structure text data from email, chatbots, and social media platforms. Depending on the language of the analyzed content, various text analytics features help you obtain accurate analysis results.
Pega Platform can detect 35 languages, out of which the Pega-NLP ruleset supports 16 languages for analysis:
Before you configure your text models, define the languages that you want to use for NLP. For more information, see Enabling languages for NLP.
Languages that Pega Platform can detect
Pega Platform can detect the following languages. You can use the languages detected, including the ones that are not supported by the Pega-NLP ruleset, to connect to external NLP providers that process texts in these languages:
- Arabic
- Basque
- Belarusian
- Bulgarian
- Catalan
- Croatian
- Czech
- Danish
- Dutch
- English
- Esperanto
- Finnish
- French
- German
- Hindi
- Hungarian
- Icelandic
- Indonesian
- Italian
- Japanese
- Korean
- Lithuanian
- Mandarin
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Spanish
- Swedish
- Turkish
- Ukrainian
- Vietnamese
Languages that Pega Platform can analyze
You can create text models in Pega Platform and use them to analyze texts. Some NLP features are only available for certain languages, as shown in the following table.
Language | Continuous learning | Text extraction | Topic detection | Small talk detection | Intent analysis | Sentiment analysis |
Croatian | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Czech | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Danish | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Dutch | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
English | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
French | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
German | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Italian | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Japanese | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Norwegian | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Polish | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Portuguese | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Russian | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Spanish | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Swedish | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Turkish | ✓ | ✓ | ✓ | Not available | Not available | Not available |
Previous topic Analyzing natural language Next topic Enabling languages for NLP