with llama3-v inside
https://github.com/OpenBMB/MiniCPM-V
https://github.com/mbzuai-oryx/Video-LLaVA
https://github.com/OpenGVLab/InternVL
https://github.com/THUDM/CogVLM2
PaliGemma (multimodal) from Google:
https://hf-mirror.com/google/paligemma-3b-pt-224
https://github.com/google-research/big_vision/blob/main/big_vision/configs/proj/paligemma/README.md
https://github.com/huggingface/blog/blob/main/paligemma.md
lots of open-source models can be found at: