Project structure of: BlinkDL/RWKV-LM
RWKV-LM
RWKV-based Text Generation and Chatbots
README.md
Documents the RWKV language model, its features, and its optimizations.
RWKV-v1
Neural network training, Transformer models, text generation utilities.
src
Neural network training, Transformer models, RWKV-v1 source code, text generation utilities.
model.py
Neural network weights, time curves, Transformer models with rotary positional embeddings.
trainer.py
Train models, handle datasets, adjust learning rate.
utils.py
Top-k and top-p sampling with temperature control for text generation.
train.py
Trains RWKV language model efficiently.
RWKV-v2-RNN
Trains and saves RWKV-v2 RNN models.
cuda
CUDA implementation of RNN forward/backward passes
run.py
Trains RWKV text generation model, evaluates on dataset.
src
RWKV-v2 RNN Language Generation
train.py
Trains RWKV-v2 RNN, adjusts params, saves models at intervals.
RWKV-v3
Train RWKV-v3 models for verification
cuda
CUDA RNN, dot product, shared memory, timex op, PyTorch kernel.
run.py
RWKV model training script
src
Training datasets and tokenizer for RWKV-v3
train.py
Trains RWKV-LM and GPT, optimizes GPU, saves partial models.
verify.py
Verifies the RWKV model using GPT; creates RWKV-GPT and RWKV-RNN.
RWKV-v4
RWKV-v4: Transformer RNN Language Model Framework
cuda
Efficient CUDA RNN ops, overflow avoidance for WKV model passes.
run.py
RWKV language model text generation script
src
Transformer model and trainer with utilities.
train.py
Trains RWKV-v4 model with DeepSpeed and FP16.
verify.py
RWKV model initialization and comparison script
RWKV-v4neo
Trains customizable RWKV chatbot models.
chat.py
Multilingual chatbot with RWKV language model.
cuda
CUDA-accelerated RWKV ops for BFloat16
img_demoAE.py
Image processing code with ToBinary, R_ENCODER, and neural network model.
math_demo
run.py
RWKV NLP model, tokenization classes, RWKV_RNN functions.
run.py
RWKV Language Model: Text Generation & Refinement
src
Trains RWKV, AWS S3 storage, prime number utilities.
train.py
Trains RWKV language model with customizable parameters, PyTorch Lightning.
verify.py
Runs the RWKV language model and prints its outputs for verification.
RWKV-v5
Large-scale RWKV text model utilities.
cuda
CUDA-optimized neural network operations and code for shared memory, matrix ops, and parallelism.
demo.jsonl
Sample JSONL data; topics include a restaurant attack, AI advancements, a legal drama, and growth.
make_data.py
Binary file manipulation, RWKV indexing, prime number verification.
src
RWKV Transformer Training with DeepSpeed and Utilities
tokenizer
train.py
Trains RWKV Language Model with PyTorch Lightning and DeepSpeed
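
Sampling sketch (illustrative only)
Several utils.py and run.py entries above mention top-k and top-p sampling with temperature control. The following is a minimal standard-library sketch of that general technique, not the repository's implementation. The function names (softmax, filter_top_k_top_p, sample_token) and their parameters are hypothetical and chosen for illustration; temperature is assumed to be greater than zero.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities after temperature scaling."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def filter_top_k_top_p(probs, top_k=0, top_p=1.0):
    """Keep the most likely tokens, then renormalize.

    top_k=0 disables the top-k cut; top_p=1.0 disables the nucleus cut.
    Returns a dict mapping token index to renormalized probability.
    """
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k > 0:
        order = order[:top_k]
    kept = []
    cumulative = 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:  # smallest prefix whose mass reaches top_p
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


def sample_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=random):
    """Sample one token index from the filtered distribution."""
    probs = softmax(logits, temperature)
    filtered = filter_top_k_top_p(probs, top_k, top_p)
    tokens = list(filtered)
    weights = [filtered[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]
```

Lower temperatures sharpen the distribution before filtering, and a smaller top_k or top_p narrows the candidate set. With top_k=1 the sketch reduces to greedy decoding.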