Project structure of: THUDM/CogVLM
CogVLM
Demos, dataset notes, and utilities for the CogVLM and CogAgent visual language models
basic_demo
CLI demos and a Gradio web demo for CogVLM and CogAgent
cli_demo_hf.py
CLI demo for the Hugging Face version of the models (CogAgent), with the Vicuna tokenizer, argparse options, and GPU support
cli_demo_sat.py
CLI chat demo for the SAT (SwissArmyTransformer) version of the models, with distributed inference and language selection
web_demo.py
Gradio web demo for CogVLM and CogAgent
composite_demo
Interactive chat and image-based composite language model demo.
client.py
Python client for composite language model interaction
conversation.py
Conversation class with role, content, image, translation support.
demo_agent_cogagent.py
CogAgent conversation agent demo
demo_chat_cogagent.py
Chatbot with image processing and Chinese model integration
demo_chat_cogvlm.py
Chat demo with image processing and Conversation object initialization.
main.py
Chat application using CogAgent and CogVLM models with image uploads and prompts.
utils.py
Dynamic indexing AI image analysis tool
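
The conversation.py entry above describes a Conversation class carrying a role, content, an image, and translation support. A minimal sketch of what such a class could look like follows. Everything in it (the `Role` enum, the field names, `display_text`, and `render_history`) is hypothetical and chosen for illustration; it is not the repository's actual code.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Role(Enum):
    """Hypothetical speaker roles for a chat turn."""
    USER = "user"
    ASSISTANT = "assistant"


@dataclass
class Conversation:
    """One chat turn: who spoke, what was said, and an optional image."""
    role: Role
    content: str
    image: Optional[str] = None               # assumed: a path or URL to the image
    translated_content: Optional[str] = None  # assumed: translation of `content`

    def display_text(self) -> str:
        # Prefer the translation when one is present.
        return self.translated_content or self.content


def render_history(history: List[Conversation]) -> str:
    """Flatten a list of turns into plain text, marking turns that carry an image."""
    lines = []
    for turn in history:
        marker = " [image]" if turn.image else ""
        lines.append(f"{turn.role.value}{marker}: {turn.display_text()}")
    return "\n".join(lines)
```

A dataclass keeps each turn a plain record, so a demo script can hold the chat history as an ordinary list and render it however the UI requires.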
dataset.md
Bilingual visual instruction dataset for CogVLM v1.0 training
finetune_demo
Scripts for fine-tuning the models and evaluating the results
openai_demo
OpenAI-style chat API demo built on FastAPI for CogVLM and CogAgent, with GPU memory handling
README.md
Improved AI models with HuggingFace support and templates.
requirements.txt
Python dependencies: NLP, deep learning, data visualization, and web development packages
utils
Utility scripts for model merging and dataset splitting, plus model definitions and helper modules
merge_model.py
Loads a fine-tuned checkpoint and merges it into a single model
models
Efficient CLIP models and mixin for parallelism
split_dataset.py
Shuffles and splits the specified files reproducibly
utils
Helper modules for chat, dataset loading, grounding, and language and vision preprocessing
__init__.py
Re-exports helper functions from the modules in this package
chat.py
Chat helper that processes input images with optional processors and runs chat mode
dataset.py
Loads .jpg images, converts them to RGB, extracts text from the file names, and returns the data with ids
grounding_parser.py
Parses grounding output and overlays text and boxes on images
language.py
PyTorch language model utilities with tokenization and preprocessing.
vision.py
Image preprocessor built on Torchvision transforms
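
The dataset.py entry above describes a loader that finds .jpg images, converts them to RGB, and derives text from the file names. The sketch below illustrates that flow under stated assumptions: the function names are hypothetical, the rule "the label is the part of the file name before the first dot" is an assumption (the listing does not say how text is extracted), and Pillow is assumed for image loading. It is not the repository's implementation.

```python
from pathlib import Path
from typing import List, Tuple


def label_from_filename(path: Path) -> str:
    """Assumed rule: the label is the file name up to the first '.'."""
    return path.name.split(".")[0]


def list_jpg_samples(root: str) -> List[Tuple[str, str]]:
    """Return (image_path, label) pairs for every .jpg under root.

    Paths are sorted so that a sample's position can serve as a stable id.
    """
    paths = sorted(Path(root).rglob("*.jpg"))
    return [(str(p), label_from_filename(p)) for p in paths]


def load_sample(samples: List[Tuple[str, str]], index: int) -> dict:
    """Load one sample as an RGB image together with its label and id."""
    from PIL import Image  # Pillow, imported lazily so listing works without it

    image_path, label = samples[index]
    with Image.open(image_path) as img:
        image = img.convert("RGB")  # convert() returns a new, fully loaded image
    return {"image": image, "label": label, "id": index}
```

Separating the listing step from the image loading keeps the file scan cheap and defers decoding until a sample is actually requested.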