Configuring Hadoop settings for an HBase connection
Use the HBase settings in the Hadoop data instance to configure connection details for the HBase data sets.
By using the Hadoop infrastructure, you can process large amounts of data directly on the Hadoop cluster and reduce the data transfer between the Hadoop cluster and the Pega Platform. Hadoop configuration instances are records in the SysAdmin category and belong to the Data-Admin-Hadoop class.
- In the header of Dev Studio, click .
- On the Create Hadoop form, enter a description and a name for the Hadoop data instance.
- Click Create and open.
- On the Connection tab of a Hadoop data instance, select the Use HBase configuration.
- In the Client list, select one of the HBase client
implementations.The selection of this setting depends on the server configuration.
Choices Actions REST - In the Port field, provide the port on which the REST
gateway is set up.The default port is 20550.
- Optional: To use custom settings, select the Advanced configuration check box.
- Optional: In the REST host field, specify a custom REST host that is different from the one defined in the common configuration.
- Optional: In the Response timeout field, enter the number of
milliseconds to wait for the server response. Enter zero to remove the
timeout.The default timeout is 5000.
Java - In the Port field, provide the port for the Zookeeper
service.The default port is 2181.
- Optional: To use custom settings, select the Advanced configuration check box.
- Optional: In the Zookeeper host field, specify a custom HBase Zookeeper host that is different from the one defined in the common configuration.
- Optional: In the Response timeout field, enter the number of
milliseconds to wait for the server response. Enter zero to remove the
timeout.The default timeout is 5000.
- Optional: To enable secure connections, select the Use authentication check box, and then go to step 6.
- In the Port field, provide the port on which the REST
gateway is set up.
- Optional: To configure secure connections for Java, perform the following actions:
- In the Master kerberos principal field, enter the Kerberos principal name of the HBase master node as defined and authenticated in the Kerberos Key Distribution Center, typically in the following format: hbase/<hostname>@<REALM>
- In the Client kerberos principal field, enter the Kerberos principal name of a user as defined in Kerberos, typically in the following format: <username><hostname>@<REALM>
- In the Keystore field, enter the name of a keystore that
contains a keytab file with the keys for the user that you defined in the
Client kerberos principal field.
- Test the connection to the HBase master node by clicking Test connectivity.
Previous topic Creating an HBase data set record Next topic Creating a Decision Data Store data set