Power Inference Microservices (PIM)

The PIM project enables the spinning up of an AI environment with very little user intervention, adjacent to other workloads running on IBM Power. These workloads might be running on any of the supported operating systems on IBM Power: IBMi, AIX, or Linux, as long as they are managed by a Hardware Management Console (HMC). The PIM solution leverages Bootable Containers (bootc), a modern tool for deploying and configuring immutable Linux systems. PIM provides an end-to-end solution for AI stack installation by creating a Logical Partition (LPAR) with a specified AI stack image. This involves network and storage attachment, and the LPAR is then booted with the configured image.

Key highlights of the PIM solution

Seamless Update: System updates are automatic if a newer version of the image is publicly available. Otherwise, when the user upgrades via PIM upgrade command with the latest credentials, the system updates are pulled and applied from the configured private registry over a reboot of the system.
Rollback: bootc preserves the state of the system. In case of a disruption in Updates, the system can be rolled back to a previous version.
Makes admin's management simple by easing day 2 operations like monitoring, upgrading and managing.
Provides end-to-end software lifecycle management operations like launch, destroy, update-config, update-compute, rollback and status.
Provides AI inferencing capability on CPU currently. The intent is to provide inferencing-based accelerators available on the platform as and when they become available.
PIM currently supports IBMi and Linux operating system environments. Support for deploying from AIX will be added at a later date.

PIM Personas

PIM has 2 personas, namely the builder and the deployer.

Builder: Someone who builds a bootable AI container image to bring up the AI stack with the deployer flow. Refer to builder-guide.md for more details.
Deployer: Someone who deploys a PIM solution to bring up the AI stack in IBM core environments. Refer to deployer-guide.md for more details.

Getting started

To get started, you can follow steps below to build and deploy a simple entity extraction application which uses vLLM.

Builder steps

Step 1: Build the application
- vLLM:
  - Open source vLLM application can be used.
- Entity Extraction App:
  - A sample entity extraction application is provided here that uses the OpenAI /chat/completion API to do entity extraction.
Step 2: Containerize the application
- vLLM:
  - Follow the instructions in the README to build the vLLM application's container image. It has a script that pulls open-source vLLM code base and builds a container image.
- Entity Extraction App:
  - README for steps to build the container image for an entity extraction application.
Step 3: Build the Base image
- Follow base image building steps given here.
Step 4: Build the AI image
- vLLM:
  - Follow steps given here to build the vLLM AI image, ensure that you use the image created in step-3 as base image(FROM).
- Entity Extraction App:
  - Scripts and README to build the AI image are given here, ensure that you use the vLLM AI image created in the previous step as base image(FROM) here.

Deployer steps

Step 1: Set up PIM
- You can follow the steps to set up PIM on your IBMi/Linux machine given here.
Step 2: Configure your AI partition
- Read through this guide and fill appropriate values in config.ini.
- Use final image built on builder step 4 in ai.image field.
Step 3: Run the Launch Command

python cli/pim.py launch

Step 4: Access the application
- After the launch command successfully creates the partition, you can access the UI for the entity extraction application on port 8501, using the IP address provided during the launch configuration in config.ini.

Supported Versions

To successfully deploy PIM, various components of the IBM Power software stack would at the minimum have to be at the levels listed below:

Component	P10	P11
Hardware Management Console(HMC)	1061	1110
Partition Firmware(PFW)	1050	1110
Virtual IO Server(VIOS)	4.1.1.0	4.1.1.00
IBMi	7.5	7.6

Name		Name	Last commit message	Last commit date
Latest commit History 297 Commits
base-image		base-image
cli		cli
docs		docs
examples		examples
.gitignore		.gitignore
.python-version		.python-version
.whitesource		.whitesource
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
DCO1.1.txt		DCO1.1.txt
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
install_ibmi.sh		install_ibmi.sh
install_linux.sh		install_linux.sh
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Power Inference Microservices (PIM)

Key highlights of the PIM solution

PIM Personas

Getting started

Builder steps

Deployer steps

Supported Versions

About

Uh oh!

Releases 1

Uh oh!

Contributors 10

Uh oh!

Languages

License

IBM/project-pim

Folders and files

Latest commit

History

Repository files navigation

Power Inference Microservices (PIM)

Key highlights of the PIM solution

PIM Personas

Getting started

Builder steps

Deployer steps

Supported Versions

About

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Uh oh!

Contributors 10

Uh oh!

Languages