Xtract Universal is a standalone SAP connector that extracts data from SAP systems (e.g. ERP and BW) and loads it into a range of destination environments. Its high degree of automation significantly reduces development time.

Xtract Universal can be easily integrated with different ETL tools, especially when customers are using the ETL tools for data transformation and the integration of non-SAP source systems.

In this blog, we will discuss the integration with Pentaho Data Integration (PDI).


Pentaho Data Integration, codenamed Kettle, is a data integration product with many built-in capabilities. It is available in a community edition as well as a commercial enterprise edition. PDI consists of a core data integration (ETL) engine and GUI applications that allow the user to define data integration jobs and transformations.

We will describe two possible scenarios for SAP data extraction using Xtract Universal and Pentaho Data Integration (PDI).

Scenario 1 with a staging database

In the first scenario, Pentaho triggers Xtract Universal, which extracts the SAP data and loads it into a destination. We can select any destination that is supported by both Xtract Universal and Pentaho. In this example, we work with SQL Server, but the destination can also be another database, a data warehouse system or cloud storage.

Pentaho executes the SAP extraction by calling the Xtract Universal Command Line (xu.exe). The picture below shows the architecture.


In Xtract Universal, we have defined an extraction with SQL Server as the destination. When Pentaho triggers it, Xtract Universal extracts the SAP data and loads it into the SQL Server database. After that, Pentaho can read the SAP data from SQL Server and process it further.
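The trigger step in this scenario can be sketched outside of Pentaho as a small Python script; inside PDI the same call would typically sit in a shell-type job entry. This is only an illustration under stated assumptions: the installation path, host, port 8065 and the extraction name "KNA1" are placeholders, and the sketch assumes that xu.exe accepts the extraction URL as its single argument and returns exit code 0 on success. Check the Xtract Universal help for the exact command-line syntax of your version.

```python
import subprocess
from urllib.parse import urlencode


def build_xu_command(xu_path, host, port, extraction):
    """Assemble the argument list for the command-line tool.

    Assumption: the tool takes the extraction URL as its only argument.
    All values passed in here are placeholders, not defaults from the product.
    """
    url = f"http://{host}:{port}/?" + urlencode({"name": extraction})
    return [xu_path, url]


def run_extraction(cmd):
    """Run the command and report whether it finished with exit code 0.

    Assumption: exit code 0 means the extraction succeeded.
    Returns a (success, stderr_text) tuple.
    """
    result = subprocess.run(cmd, capture_output=True, text=True)
    return result.returncode == 0, result.stderr


if __name__ == "__main__":
    # Hypothetical path and extraction name, for illustration only.
    command = build_xu_command(r"C:\XtractUniversal\xu.exe", "localhost", 8065, "KNA1")
    ok, errors = run_extraction(command)
    print("Extraction succeeded" if ok else f"Extraction failed: {errors}")
```

Only after this call reports success would the downstream step read the staged table from SQL Server.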

Scenario 2 with direct access via web service

In the second scenario, Pentaho would call a web service provided by Xtract Universal for the SAP extraction. The picture below shows the architecture.


In Pentaho, we trigger the extraction with an HTTP call. Xtract Universal extracts the data from SAP and delivers it directly to Pentaho in CSV format. In Pentaho, we can then process the delivered data and load it into the desired destination.
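To make the data flow of this scenario concrete, here is a minimal Python sketch of what the HTTP step does: request the extraction and parse the CSV response into rows. It is not the Pentaho configuration itself. The host, port 8065, the "name" query parameter, the extraction name, the comma delimiter and the UTF-8 encoding are all assumptions for illustration; the actual URL and CSV settings depend on how the extraction is configured in Xtract Universal.

```python
import csv
import io
from urllib.parse import urlencode
from urllib.request import urlopen


def extraction_url(host, port, extraction):
    """Build the URL used to trigger the extraction (assumed URL layout)."""
    return f"http://{host}:{port}/?" + urlencode({"name": extraction})


def parse_csv(text, delimiter=","):
    """Turn CSV text with a header row into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text), delimiter=delimiter))


def fetch_rows(url, encoding="utf-8"):
    """Call the web service and parse the CSV body it returns."""
    with urlopen(url) as response:
        return parse_csv(response.read().decode(encoding))


if __name__ == "__main__":
    # Hypothetical server and extraction name, for illustration only.
    rows = fetch_rows(extraction_url("localhost", 8065, "KNA1"))
    print(f"Received {len(rows)} rows")
```

Because the rows arrive directly in the calling process, no staging database is needed; the trade-off is that the data is not persisted anywhere until the caller writes it to its destination.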

For more details, please see the detailed walkthrough in our help.


Written by Khoder Elzein

Khoder is responsible for presales and always has an attentive ear for our customers, prospects and partners. When travelling in Germany or abroad, he provides customer support at PoCs, workshops and training courses. He also looks after the further development of our software solutions. Khoder has been working in IT since the turn of the millennium; he has been a member of team Theobald since 2009. When it comes to private interests, family, nature and reading feature at the top of his list – along with innovative fusion cuisine, as you may infer from his favourite dish of Swabian cheese noodles with tabouleh.