The PIM project enables the spinning up of an AI environment with very little user intervention, adjacent to other workloads running on IBM Power. These workloads might be running on any of the supported operating systems on IBM Power: IBMi, AIX, or Linux, as long as they are managed by a Hardware Management Console (HMC). The PIM solution leverages Bootable Containers (bootc), a modern tool for deploying and configuring immutable Linux systems. PIM provides an end-to-end solution for AI stack installation by creating a Logical Partition (LPAR) with a specified AI stack image. This involves network and storage attachment, and the LPAR is then booted with the configured image.
- Seamless Update: System updates are automatic if a newer version of the image is publicly available. Otherwise, when the user upgrades via PIM upgrade command with the latest credentials, the system updates are pulled and applied from the configured private registry over a reboot of the system.
- Rollback: bootc preserves the state of the system. In case of a disruption in Updates, the system can be rolled back to a previous version.
- Makes admin's management simple by easing day 2 operations like monitoring, upgrading and managing.
- Provides end-to-end software lifecycle management operations like launch, destroy, update-config, update-compute, rollback and status.
- Provides AI inferencing capability on CPU currently. The intent is to provide inferencing-based accelerators available on the platform as and when they become available.
- PIM currently supports IBMi and Linux operating system environments. Support for deploying from AIX will be added at a later date.
PIM has 2 personas, namely the builder and the deployer.
- Builder: Someone who builds a bootable AI container image to bring up the AI stack with the deployer flow. Refer to builder-guide.md for more details.
- Deployer: Someone who deploys a PIM solution to bring up the AI stack in IBM core environments. Refer to deployer-guide.md for more details.
To get started, you can follow steps below to build and deploy a simple entity extraction application which uses vLLM.
- Step 1: Build the application
- Step 2: Containerize the application
- Step 3: Build the Base image
- Follow base image building steps given here.
- Step 4: Build the AI image
- Step 1: Set up PIM
- You can follow the steps to set up PIM on your IBMi/Linux machine given here.
- Step 2: Configure your AI partition
- Read through this guide and fill appropriate values.
- Use final image built on builder step 4 in
ai.image
field.
- Step 3: Run the Launch Command
python cli/pim.py launch
- Step 4: Access the application
- After the
launch
command successfully creates the partition, you can access the UI for the entity extraction application on port8501
, using the IP address provided during the launch configuration in config.ini.
- After the
To successfully deploy PIM, various components of the IBM Power software stack would at the minimum have to be at the levels listed below:
Component | P10 | P11 |
---|---|---|
Host Management Console(HMC) | 1061 | 1110 |
Partition Firmware(PFW) | 1050 | 1110 |
Virtual IO Server(VIOS) | 4.1.1.0 | 4.1.1.00 |
IBMi | 7.5 | 7.6 |