(Glue-2002) Prerequisites
SAP system
- SAP NetWeaver 7.01 SPS15 or higher (ABAP stack)
- Datavard’s Reuse Library 2.05 on the NetWeaver ABAP stack
- Datavard Glue installed
- Active HTTP/HTTPS service on the SAP system (which acts as a client to the Hadoop services)
- Go to the ICM Monitor (transaction SMICM) and use the path within the menu: Goto à
- Check for Active HTTP/HTTPS service on the SAP system (which acts as a client to the Hadoop services)
- WebHDFS/HttpFS connector set up successfully.
Hadoop cluster
- Apache Hadoop version 2.0.0 or higher is installed within a corporate infrastructure, either on-premise or within the cloud (e.g. Microsoft Azure, Amazon’s AWS or Cloudera CDH5).
- The recommended Hadoop version is 2.4.0 or higher where major supportability improvements and bug fixes were applied to WebHDFS and HttpFS.
- Enable/Install the HttpFS or WebHDFS service on the Hadoop cluster.
- HttpFS is Datavard’s recommended option. It is more efficient because it does not require redirection to be handled by the client. It is also easier to install and configure within a corporate infrastructure because it acts as gateway (single point of access); therefore, it can be placed behind a firewall.
Enable/Install the Oozie service on the Hadoop cluster.
- The recommended Oozie version is 4.0.0 or higher where major supportability improvements and bug fixes were applied. For extended job log information Oozie version 4.2.0 or higher is required.
- Apply the desired security mechanism according to the chapter Security of this document.