Configuring the Data Flow service
This content applies only to On-premises and Client-managed cloud environments
In the Data Flow service, you can run data flows in batch mode or real time (stream) mode. Specify the number of Pega Platform threads that you want to use for running data flows in each mode.
For more information, see Assigning node types to nodes for on-premises environments.
- In the header of Dev Studio, click .
- In the Service list, select the node types for which you want to
configure the number of threads.Batch nodes process batch data flow runs. Real-time nodes process streaming data flows.
- In the Data flow nodes section, click Edit settings.
- In the Thread count field, enter the number of threads that you
want to use for running data flows in the selected mode.To scale the Data Flow service vertically, increase the current number of threads.
For example: If you divide the source of a data flow into five partitions, Pega Platform divides the data flow run into five assignments, and then processes the assignments simultaneously on separate threads, if five threads are available. Pega Platform calculates the number of available threads by multiplying the thread count by the number of nodes. For example, with two nodes and the thread count set to 5, the data flow run uses five threads and five threads remain idle.
- Click Submit.
Previous topic Configuring the Adaptive Decision Manager service for Pega Cloud environments Next topic Configuring the Real Time Data Grid service