Conversation


@ngxson ngxson commented Oct 24, 2025

It seems the "OCR model race" has started. This model is one of the few "low-hanging fruits" that we can easily support in llama.cpp.

The model features:

  • Qwen3 as language model
  • Mistral3 as vision encoder (the difference being that LightOnOCR does not use the [IMG_BREAK] token)

Original model: https://huggingface.co/lightonai/LightOnOCR-1B-1025

GGUF model: https://huggingface.co/ggml-org/LightOnOCR-1B-1025-GGUF

To try it:

llama-server -hf ggml-org/LightOnOCR-1B-1025-GGUF -c 8192

# open http://localhost:8080 and try uploading an image

Important note: this model requires a specific input structure; see the chat template.

The structure seems to be:

  • Starts with an empty system message
  • Then a user message. All images must be contained in this message; no instructions are needed

Example:

{
  "messages": [{
    "role": "system",
    "content": ""
  }, {
    "role": "user",
    "content": [{
      "type": "image_url",
      "image_url": {"url": "data:image/png;base64,......"}
    }]
  }]
}
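The structure above can be assembled programmatically. Here is a minimal sketch, assuming llama-server's OpenAI-compatible `/v1/chat/completions` endpoint; the `build_ocr_request` helper is hypothetical, not part of llama.cpp:

```python
import base64
import json

def build_ocr_request(image_bytes: bytes, mime: str = "image/png") -> dict:
    """Build a chat payload matching the structure LightOnOCR expects:
    an empty system message, then a single user message whose content
    contains only the image(s), with no text instructions."""
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "messages": [
            {"role": "system", "content": ""},
            {
                "role": "user",
                "content": [
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:{mime};base64,{b64}"},
                    }
                ],
            },
        ],
    }

# Example: build a payload from raw bytes. A real call would POST this
# (JSON-encoded) to http://localhost:8080/v1/chat/completions.
payload = build_ocr_request(b"\x89PNG fake bytes")
print(json.dumps(payload, indent=2))
```

To send it with only the standard library, `urllib.request.urlopen` with a `Content-Type: application/json` header would suffice; any OpenAI-compatible client works as well.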

@ngxson ngxson requested a review from CISC as a code owner October 24, 2025 23:15
@github-actions github-actions bot added examples python python script changes labels Oct 24, 2025

@ggerganov ggerganov left a comment


Very cool!

[screenshot: OCR output in the llama-server web UI]

The command in OP should be llama-server instead of llama-cli.
