Description
Model description
As discussed in #31836, I would like to add the MiniCPM-o model.
I believe the MiniCPM family has had a significant impact on the LMM and LLM fields.
Currently, the MiniCPM-o code lives on the Hugging Face Hub rather than in Transformers, which makes maintenance very difficult.
Therefore, I want to add models like MiniCPM-o to Transformers so that they can receive ongoing support and maintenance.
While many vision LMMs are available on Hugging Face, a considerable number of any-to-any models,
such as Qwen2.5-Omni-7B, MiniCPM-o-2_6, and Janus-Pro-7B, are implemented in their own separate repositories.
I want to implement these any-to-any models in Transformers so that they can leverage the features of the Hugging Face ecosystem.
Additionally, an any-to-any pipeline could serve as a good template for any-to-any models added in the future.
If you have any suggestions, please let me know!
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation
https://github.com/OpenBMB/MiniCPM-o
https://huggingface.co/openbmb/MiniCPM-o-2_6