Skip to main content


         This documentation site is for previous versions. Visit our new documentation site for current releases.      
 

Data Flow service

Updated on July 5, 2022

This content applies only to On-premises and Client-managed cloud environments

The Data Flow service enables running data flows in batch and real time (stream) modes. Data flows are data pipelines that read, transform, and write data. With data flows, you can, for example, run decisions, perform text analysis, and execute real-time aggregations.

With the Data Flow service, you can run data flows in either batch mode or real time (stream) mode, depending on the type of data flow. Batch and real time modes process data independently and do not affect each other. The higher the number of nodes for a mode, the higher the use of the mode. For example, using more nodes is useful when performing batch runs that require data-intensive computing.

Depending on the partitioning configuration of data flow instances, a data flow can process data on a different number of nodes than the number configured for the Data Flow service.

Have a question? Get answers now.

Visit the Support Center to ask questions, engage in discussions, share ideas, and help others.

Did you find this content helpful?

Want to help us improve this content?

We'd prefer it if you saw us at our best.

Pega.com is not optimized for Internet Explorer. For the optimal experience, please use:

Close Deprecation Notice
Contact us