Import and export pipelines

Creating a data pipeline from scratch can be time-consuming, especially when we need to perform the same merge or transformation step on a similar log. To streamline this process, we can now reuse the merge and transformation operations from previously created pipelines.

In Apromore 10.1, we can export a data pipeline (which stores the merge and transformation operations) and import the pipeline on a similar log to reuse the merge and transformation operations.

Imagine that we are creating a data pipeline. In the Merge step, we append two logs.

Import002

In the Transform step, we replace “Credit” in the Card Type column with “Postpaid” and drop rows where the dispute amount is less than 4.

Import003

But in another data pipeline, we need to perform the same merge and transformation we have just created here. Perhaps a second user wishes to create the same data pipeline using the same event logs. It could also be that extracted files get updated, and the user wishes to recreate the data pipeline on the updated file. First, we ensure we save the data pipeline.

Import004

In the Transform step, click Export to download the data pipeline for future use.

Note

We can also export the data pipeline by clicking Export in the Merge Pipeline window.

Import005

The pipeline is downloaded as a JSON file.

Now, we can create another data pipeline that utilizes the merging and transformation operation in the earlier pipeline. In the new pipeline, extract the logs to create the data pipeline.

Note

The extracted logs in the new data pipeline must have the same file names and column names as the logs from which the data pipeline was downloaded.

Import006

In the Merge step, click Import to upload the downloaded data pipeline JSON.

Import007

The same merge operation will be performed automatically.

Import008

The transformation operation will also be performed automatically.

Import009

We can perform any further actions to merge or transform the data before loading it into Apromore.