How to Install IBM Watsonx Data 2.0 Developer Edition on Ubuntu EC2 (On-Premises)

2 minute read

Setting up IBM Watsonx Data 2.0 Developer Edition on an Ubuntu EC2 instance enables you to leverage IBM’s data lakehouse capabilities on the cloud. This guide provides detailed steps, from configuring entitlement to starting the Watsonx Data containers.

For guidance on creating an EC2 instance, check out my previous blog: How to Create an AWS EC2 Instance.Make sure the instance type bigger eg.t3.xlarge and allow All traffic

Prerequisites

Ensure that you have:

An IBM Entitlement Key for Watsonx Data.
An Ubuntu EC2 instance in AWS.

Step 1: Set Up Entitlement Key

Log in to your IBM container library.
Go to Add New key and create a new API key for entitlement.
Store the API key securely, as you’ll need it for the Watsonx Data installation.

Step 2: Install Docker

Watsonx Data requires Docker to manage its containers. Install Docker as follows:

# Update package information
sudo apt update

# Install Docker
sudo apt install -y docker.io

# Start Docker service
sudo systemctl start docker
sudo systemctl enable docker

Verify Docker installation:

docker --version

Step 3: Set Up Installation Directory and Environment Variables

Switch to root user, create an installation directory, and set the necessary environment variables.

sudo su -
mkdir watsonxdata
cd watsonxdata

# Set environment variables
export LH_ROOT_DIR=<NEW DIRECTORY-watsonxdata>
export LH_RELEASE_TAG=latest
export IBM_LH_TOOLBOX=cp.icr.io/cpopen/watsonx-data/ibm-lakehouse-toolbox:$LH_RELEASE_TAG
export LH_REGISTRY=cp.icr.io/cp/watsonx-data
export PROD_USER=cp
export IBM_ENTITLEMENT_KEY=<YOUR_IBM_ENTITLEMENT_KEY>
export IBM_ICR_IO=cp.icr.io

Replace <YOUR_IBM_ENTITLEMENT_KEY> with the key obtained in Step 1.

For Docker, set DOCKER_EXE as follows:

export DOCKER_EXE=docker

Step 4: Download and Extract Watsonx Data Developer Package

Pull the Watsonx Data developer package and copy it to the host system:

$DOCKER_EXE pull $IBM_LH_TOOLBOX
id=$($DOCKER_EXE create $IBM_LH_TOOLBOX)
$DOCKER_EXE cp $id:/opt - > /tmp/pkg.tar
$DOCKER_EXE rm $id

Extract the package and verify the checksum:

tar -xf /tmp/pkg.tar -C /tmp
cat /tmp/opt/bom.txt
cksum /tmp/opt/*/*
tar -xf /tmp/opt/dev/ibm-lh-dev-*.tgz -C $LH_ROOT_DIR

Step 5: Authenticate with IBM Registry

$DOCKER_EXE login ${IBM_ICR_IO} --username=${PROD_USER} --password=${IBM_ENTITLEMENT_KEY}

Step 6: Run Setup Script

Run the setup script to initialize the Watsonx Data Developer environment. You can set a custom password with the --password option; otherwise, the default password is password.

$LH_ROOT_DIR/ibm-lh-dev/bin/setup --license_acceptance=y --runtime=$DOCKER_EXE

Step 7: Start Watsonx Data Containers

Start the Watsonx Data containers using the following command:

$LH_ROOT_DIR/ibm-lh-dev/bin/start

Step 8: Access Watsonx Data Console

Open the Watsonx Data console by visiting https://<YOUR_EC2_PUBLIC_IP>:9443 (or the port specified during setup).
Log in with the username ibmlhadmin and the password you set during setup (default is password).

Managing Watsonx Data

Check Container Status

To view the status of all containers:

$LH_ROOT_DIR/ibm-lh-dev/bin/status --all

Stop All Containers

To stop all containers:

$LH_ROOT_DIR/ibm-lh-dev/bin/stop

Start/Stop a Specific Container

To manage individual containers, use stop_service and start_service commands. Replace <container_name> with the name from the docker ps output:

$LH_ROOT_DIR/ibm-lh-dev/bin/stop_service <container_name>
$LH_ROOT_DIR/ibm-lh-dev/bin/start_service <container_name>

Step 9: Log In to Watsonx Data

Once Watsonx Data is up and running, access the login page via your browser at https://<YOUR_EC2_PUBLIC_IP>:9443.

Login Page: Enter your username and password to access the Watsonx Data console.

After that you will see the Dashboard

Dashboard

Step 10: Infrastructure Manager

After logging in, navigate to the Infrastructure Manager to monitor and manage system resources and services.

Infrastructure Manager

Infrastructure Manager: View and control Watsonx Data’s underlying infrastructure and resource allocations.

Step 11: Explore the Query Workspace

Use the Query Workspace to write and run SQL queries directly within Watsonx Data.

Query Workspace

Query Workspace: Execute SQL queries and analyze data with Watsonx Data’s SQL editor.

Step 12: Access Query History

The Query History section lets you review past queries, making it easy to track, repeat, or debug previous SQL commands.

Query History

Query History: Review and manage past queries for efficient workflow management.

Congratulations! You have successfully installed and configured IBM Watsonx Data 2.0 Developer Edition on your Ubuntu EC2 instance.

Resources:

For more insights, check out my Blog Section.

Share on

Twitter Facebook LinkedIn

Abdul Rahman

How to Install IBM Watsonx Data 2.0 Developer Edition on Ubuntu EC2 (On-Premises)

Prerequisites

Step 1: Set Up Entitlement Key

Step 2: Install Docker

Step 3: Set Up Installation Directory and Environment Variables

Step 4: Download and Extract Watsonx Data Developer Package

Step 5: Authenticate with IBM Registry

Step 6: Run Setup Script

Step 7: Start Watsonx Data Containers

Step 8: Access Watsonx Data Console

Managing Watsonx Data

Check Container Status

Stop All Containers

Start/Stop a Specific Container

Step 9: Log In to Watsonx Data

Step 10: Infrastructure Manager

Step 11: Explore the Query Workspace

Step 12: Access Query History

Share on

Leave a comment

You may also enjoy

PySpark for Data Engineers: Build Scalable Data Pipelines

06 Feb 2025

Mastering Apache Kafka: A Comprehensive Guide

28 Jan 2025

How to Install Apache Airflow Using Docker and Write Your First DAG

01 Jan 2025

How to tune a foundational model in watsonx.ai

13 Dec 2024