(SM-1911) GCP (Google Cloud Platform) BigQuery

This page provides general guidelines on how to set up GCP Storage and Big Query to work with Datavard Storage Management.

Contents

1. GCP prerequisites

The entry point to GCP is https://console.cloud.google.com/. Log in with the company account.

1.1 Project

Select a project or create a new one

Overview of the project resources is always accessible via a click on

 

1.2 Service account

The key component for GCP access from the SAP system will be the Service account with the generated key in JSON format. Download the key file, it will be used for authentication to GCP later in the configuration.

1.3 Storage bucket

The next step is the creation of a Storage bucket with appropriate permissions (read/write access for Service account). Markdown the bucket name for later configuration.

1.4 BigQuery dataset

Similarly, BigQuery dataset needs to be created with appropriate permissions (read/write) for the Service account. The dataset name will be used later.

 

2. SAP configuration

2.1 JSON key

Step one on the SAP side is to upload the GCP Service account key file (from step 1.2) to the application server. Standard path for sensitive files used by Datavard Java connector is /sapmnt/<SID>/global/security/dvd_conn. All information necessary for authentication is contained within, so access to the file should be limited to authorized personnel (through both operating system and SAP transactions).

2.2 Datavard Java connector

All communication between the SAP system and GCP is facilitated by the Datavard Java connector. The transaction used for configuration and management is /DVD/JCO_MNG. For configuration steps, please refer to section JCO Management in (SM-1911) Hadoop Storage Setup.

2.2.1 Datavard Java connector GCP libraries

In addition to standard Datavard Java connector configuration, GCP libraries have to be transported into the system <TO ADD DOWNLOAD LINK>.

2.4 Storage Management setup

The final step in SAP & GCP connectivity is the creation of two storages in transaction /DVD/SM_SETUP. One for the transfer of flat files, the other for table management and SQL queries.

  1. Google Cloud Storage (GCS) - as seen in the screenshot, required fields are
    Storage ID - logical name of the storage connection
    Storage type - GCS (stands for Google Cloud Storage)
    Bucket name - Name of the storage bucket from step 1.3
    Configuration path - physical path to JSON key file from step 2.1
    Java connector RFC - TCP/IP RFC destination used for communication with Datavard Java connector (configured in step 2.2)

     

  2. Google BigQuery storage (BIGQUERY) - required fields
    Storage ID - logical name of the storage connection
    Storage Type - BIGQUERY
    Dataset - name of the BigQuery dataset from step 1.4
    Reference Storage - Google Cloud storage used for the flat-file transfer
    Java connector RFC - TCP/IP RFC destination used for communication with Datavard Java connector

     

When all configuration steps are completed and there is no problem with permissions, automatic storage check shows a green tick.