Your submission was sent successfully! Close

You have successfully unsubscribed! Close

Thank you for signing up for our newsletter!
In these regular emails you will find the latest updates about Ubuntu and upcoming events where you can meet our team.Close

Pentaho Data Integration quickly with Juju


on 3 March 2016

This article was last updated 8 years ago.

Tom Barber at Meteorite explains how beneficial Juju Charms has been when migrating customer services from old, tricky servers.

I promised a demo of Juju GUI in my earlier blog and we’ve worked hard on a Pentaho Data Integration charm, so I figured we’d combine the two.

For those of you who don’t know, Juju is an Application Modelling platform developed by Canonical. Pentaho Data Integration is the best Open Source ETL toolkit on the planet. So why not combine the best of both worlds?

At Meteorite we have been migrating customer services from old, tricky to maintain servers to Juju managed clusters and its been an eye opening experience, in a good way! The latest product in our armoury to get the treatment is PDI. Our brief for this is threefold:

Firstly we need an easy way to deploy PDI on a bunch of different servers, in different clouds and maintain it, along with that it needs to provide on demand and scheduled ETL execution.

We also need a way to spin up and configure remote Carte servers for remote execution of ETL jobs and transformations, we would also like self configuring clusters as an added bonus.

Big Data deployment to work with the Big Data charms Juju already provides.

The charm is still under development, don’t expect everything to work yet, some of the functionality hasn’t even got any code written yet, but the charm is available here

Of our brief 1 and 2 are 90% complete, 3 will be done in the not too distant future.

But of course it wouldn’t be cricket if we didn’t give you a quick video demo of how easy it is so spin up a PDI cluster so here we are…

Apologies for the lack of contrast on the terminal, I need to remember to resolve that before the next video, but luckily we’re mostly interested in the GUI and webpages so hopefully it doesn’t impact things too much!

Original post

About the author

Tom Barber is the founder and technical director of, a consulting company specialising in the Saiku Analytics platform. His weekly duties include BI consulting, Scala & Java programming and tinkering with System Administration frameworks. In his spare time Tom is a regular blogger and open source committer. You can read more about Tom on the blog.

Ubuntu cloud

Ubuntu offers all the training, software infrastructure, tools, services and support you need for your public and private clouds.

Newsletter signup

Get the latest Ubuntu news and updates in your inbox.

By submitting this form, I confirm that I have read and agree to Canonical's Privacy Policy.

Related posts

Kubernetes backups just got easier with the CloudCasa charm from Catalogic

For a native integration for Canonical’s Kubernetes platform, Juju was the perfect fit, and the charm makes consuming CloudCasa seamless for users.

What is a Kubernetes operator?

Kubernetes is the open source, industry-standard platform for deploying, managing and scaling containerized applications – and applications on Kubernetes are...

Operate popular open source on Kubernetes – Attend Operator Day at KubeCon EU 2024

Operate popular open source on Kubernetes – Attend Operator Day at KubeCon EU 2024