Page 8 | Blog of James Brown

Autonomous Machines & Society.

2024-02-28

Multimodal Autoregressive Unsupervised Learning

It is so good that I can login to Kaggle on smartphone.

Instruction following image editing via prompt engineering like Mini DALLE3 or multimodal model like CogCoM

Recently search engine & browser augmented generation has become popular. A great tool for creating video scripts.

Google has released a new architecure called Genie, which can generate latent action space only using video unsupervised training.

Do not ever use NTFS in Linux. If you unfortunatedly face disk inaccessible problem when accessing NTFS disks, in the first case run chkdsk /f onthen reboot into Windows twice. The usage of the /f parameter iimportant!

After fixing, copy all files to another place, format the disk into ext4 or xfs, then recover the files.

GPT2 models from huggingface accept inputs_embeds as parameter of instance call and method “generate”.

Typically to adapt ViT into LLM you need LayerNorm and a linear projection layer.

You cannot put custom embedding into text generaton pipeline.

To encode token manually in GPT2:

1 2	inputs_embeds = model.transformer.wte(input_ids)

In LLaMA:

def embed_tokens(self，token_ids):
if hasattr(self.llama_model.base_model, "model"): # with lora
embeds = self.llama_model.base_model.model.model.embed_tokens(token_ids)
else:
self.llama_model.base_model.embed_tokens(token_ids)
return embeds

Install pytorch gpu with conda:

1 2	conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

Set HF_HUB_OFFLINE=1 while loading local models, to prevent accessing network.

Set Environment="OLLAMA_MODELS=<model_storage_path>" in ollama systemd service file. Remember to change username and usergroup too, and set appropriate permission to model storage path.

In Windows, set it in system environment variables, or create a directory symlink by:

1 2	mklink /D C:\Users\<User>\.ollama\models E:\AI\Ollama\Models

2024-02-26

Windows High Contrast Theme Switching

Download ThemeSwitcher after Windows 11 for switching themes.

Write the following content into hctweak1.theme:

; Copyright � Microsoft Corp.
[Theme]
; High Contrast Black - IDS_THEME_DISPLAYNAME_HCBLACK
DisplayName=@%SystemRoot%\System32\themeui.dll,-2103
SetLogonBackground=0
; Computer - SHIDI_SERVER
[CLSID\{20D04FE0-3AEA-1069-A2D8-08002B30309D}\DefaultIcon]
DefaultValue=%SystemRoot%\System32\imageres.dll,-109
; UsersFiles - SHIDI_USERFILES
[CLSID\{59031A47-3F72-44A7-89C5-5595FE6B30EE}\DefaultIcon]
DefaultValue=%SystemRoot%\System32\imageres.dll,-123
; Network - SHIDI_MYNETWORK
[CLSID\{F02C1A0D-BE21-4350-88B0-7367FC96EF3C}\DefaultIcon]
DefaultValue=%SystemRoot%\System32\imageres.dll,-25
; Recycle Bin - SHIDI_RECYCLERFULL SHIDI_RECYCLER
[CLSID\{645FF040-5081-101B-9F08-00AA002F954E}\DefaultIcon]
Full=%SystemRoot%\System32\imageres.dll,-54
Empty=%SystemRoot%\System32\imageres.dll,-55
[Control Panel\Colors]
ActiveTitle=0 0 0
Background=0 0 0
ButtonFace=0 0 0
ButtonText=255 255 255
GrayText=166 166 166
Hilight=142 227 240
HilightText=38 59 80
HotTrackingColor=117 233 252
InactiveTitle=0 0 0
InactiveTitleText=128 128 128
TitleText=255 255 255
Window=0 0 0
WindowText=255 255 255
Scrollbar=0 0 0
Menu=0 0 0
WindowFrame=255 255 255
MenuText=255 255 255
ActiveBorder=255 255 0
InactiveBorder=0 128 0
AppWorkspace=0 0 0
ButtonShadow=0 0 0
ButtonHilight=192 192 192
ButtonDkShadow=255 255 255
ButtonLight=255 255 255
InfoText=255 255 255
InfoWindow=0 0 0
ButtonAlternateFace=192 192 192
GradientActiveTitle=128 0 128
GradientInactiveTitle=0 128 0
MenuHilight=128 0 128
MenuBar=0 0 0
[Control Panel\Cursors]
Arrow=%SystemRoot%\cursors\aero_arrow.cur
Help=%SystemRoot%\cursors\aero_helpsel.cur
AppStarting=%SystemRoot%\cursors\aero_working.ani
Wait=%SystemRoot%\cursors\aero_busy.ani
NWPen=%SystemRoot%\cursors\aero_pen.cur
No=%SystemRoot%\cursors\aero_unavail.cur
SizeNS=%SystemRoot%\cursors\aero_ns.cur
SizeWE=%SystemRoot%\cursors\aero_ew.cur
Crosshair=
IBeam=
SizeNWSE=%SystemRoot%\cursors\aero_nwse.cur
SizeNESW=%SystemRoot%\cursors\aero_nesw.cur
SizeAll=%SystemRoot%\cursors\aero_move.cur
UpArrow=%SystemRoot%\cursors\aero_up.cur
DefaultValue=Windows Default
DefaultValue.MUI=@main.cpl,-1020
Hand=%SystemRoot%\cursors\aero_link.cur
Link=
[Control Panel\Desktop]
Wallpaper=
TileWallpaper=0
WallpaperStyle=10
Pattern=
[VisualStyles]
Path=%SystemRoot%\resources\themes\Aero\AeroLite.msstyles
ColorStyle=NormalColor
Size=NormalSize
HighContrast=3
[boot]
SCRNSAVE.EXE=
[MasterThemeSelector]
MTSM=RJSPBS
[Sounds]
; IDS_SCHEME_DEFAULT
SchemeName=@%SystemRoot%\System32\mmres.dll,-800

Use like ThemeSwitcher.exe <absolute_path_to_theme_file>

Activate Windows using Microsoft-Activation-Scripts

1
2
3

pip3 uninstall tb-nightly tensorboardX tensorboard
python3 -m tensorboard.main --help

2024-02-23

Provide Feedback To Xbox

Kinect is gone from Xbox. I do not think it will come back anytime soon.

Visit here or here to submit feedback.

2024-02-17

Magica: Open Source File Type Recognition Model From Google

https://github.com/google/magika

2024-02-15

Manual And Automatic Video Creation

抖音信息采集 (Playwright)

Cinelerra recommends a lots of Linux distros for media workshop

Learn video editing on Capcut Learning.

Capcut has released its online video editor (basically shit) so that one can use it on multiple platforms.

Find target audience advertisement recommendation via online platforms.

The live streaming and the audio/music may need some transcription to be more efficient to manage and create.

Make inheritance of existing projects, in order to keep track of ideas and sort by average and total view counts per group.

Link your reusable part and ideas to statistics in order to find out the most efficient workflow.

For quickly testing your unique ideas, you can use manual video editing, save your workspace to specific folder, produce video alongside the project file, and upload it with specialized tool in order to keep track of the video.

If automatic upload is not “convenient” enough for you, consider manually input the video link afterwards.

Autotrack the video statistics after upload.

Link your live streaming workspace with the live streaming platform and streaming time, create a more accurate perspective about your actural impact to the audience.

2024-02-15

Emulator World And Keybindings, Video Resources Collection

Configuration file might be found after first launch, and xemu is at /root/.local/share/xemu/xemu/xemu.toml

You may collect video resource from random computer actions and semantically indexed, but better be partially guided by LLM, since that makes more sense even inside a small video excerpt.

Downloading games from Romsfun is fast.

Steam deck has done a lot of things to port PC games to Linux. Most works are around Proton, a Wine fork which supports DirectX features.

EmuDeck has a list of supported emulators.

Many games can be downloaded for emulators, and played on various platforms.

Arch Wiki contains a section around gamepad.

Keyboard can handle all gamepad functions, no mouse required.

It turned out that I hate emulators. I do not want to download separate games and set them up just for random inputs. If that is the case, we should make it much more convenient and error resistant.

I do not want to buy a physical console like Xbox Series S along with a strange HID to gamepad adaptor. It will not accelerate the development of cybergod.

2024-02-15

Mood Control

Drinking coffee is prohibited. It is bad for mood.

Sometimes I just feel very angry when the bed or body is too warm, losing control and have a very unstable mind. Maybe it will be better if wearing girl style clothes, because that makes me cooler, literally.

Always stay away from heated areas, no matter it is about temperature, noises or arguments.

Sometimes my mind is crazy just as if using a hammer smashing at my balls. It is like a testosterone or adrenaline fest. Maybe some female hormone will help.

To handle this kind of imminent stress, some additional help might be necessary.

A gulp of fresh air, a glass of cold water, air conditioner, strawberry gum, mint candies, oil and inhaler, yogurt, essential oil, dark chocolate, prescription medicine etc.

Some additional medicine or drug might help. Others including alcohol, cigarette, dope.

2024-02-13

Suspicious Activity Found On Instant Messaging Network

People tend to seek for inspiration and conflicts on chatting software. They can be used to create video and article posts.

Sometimes your credential will be exposed by someone. Maybe you need to buy nameless QQ accounts to chat.

Suspicious account: 153-748-0445

Beginners

What is Retrieval-Augmented Generation (RAG)? What is any of this?
We have libraries in Python and TypeScript; the Python lib has more features
Installation and getting started:
Python
TypeScript
I want to customize…
LlamaHub is a collection of data connectors, agent tools and pre-written code
We have notebooks that show how to use the tools on LlamaHub
create-llama is a command-line tool that sets up a starter full-stack app for you
Installation instructions
We strongly recommend starting with this if you’re not sure where to start!
The RAG CLI is a command-line tool that will index and query files for you
Good for quickly checking if indexing documents works
Building a low-code chatbot with Streamlit (this tool is called RAGs)
What can I use?
All the vector stores we support
All the LLMs we support
All the embedding models we support
What’s the best way load and parse data?
The ingestion pipeline
How do I get full control over how my query engine works?
The query pipeline

Intermediate

We have several pre-built repos you may want to fork and modify:
SEC Insights, an advanced full-stack RAG app using multi-document agents
A full-stack demo repo using MongoDB Atlas
A (very similar) demo repo using Azure Cosmos DB (a MongoDB clone)
Create_llama_projects is a list of starter apps that can be installed with create-llama
RAG over embedded tables
Multi-document agent
Multi-modal (images)
Llamabot, a Slack bot
Gives step-by-step instructions to build a Slack bot with memory
LlamaParse, our advanced PDF parsing service
There’s a special $500 + swag prize for best use of LlamaParse!
Customizing prompts
Hybrid search
Vector search + keyword (BM25) search

Advanced

These are mostly notebooks showing how we did it
Customizing module types
Every component — retrievers, parsers, query engines, agents — can be customized.
Combining multiple data sources
e.g. Slack + docs + API
Routing across multiple sources
Shows tool use, lets the LLM decide what tool to pick
Multi-document queries
Answering a single question that needs multiple sources to answer
Examples of agents
Autonomous decision-making using LlamaIndex
Extracting structured outputs from unstructured data
e.g. Turn web pages into JSON
Multi-modal examples
Interpreting images, retrieving images
Auto-retrieval aka metadata filtering
Get the LLM to pre-filter data by tag in your database
Recursive retrieval
Treat sub-nodes differently when retrieving
Text to SQL
Query a database using an LLM
Pandas query engine
Query ingested tables using Pandas

Blog of James Brown

2024-02-28

Multimodal Autoregressive Unsupervised Learning

2024-02-26

Windows High Contrast Theme Switching

2024-02-24

Using Tensorboard

2024-02-23

Provide Feedback To Xbox

2024-02-17

Magica: Open Source File Type Recognition Model From Google

2024-02-15

Manual And Automatic Video Creation

2024-02-15

Emulator World And Keybindings, Video Resources Collection

2024-02-15

Mood Control

2024-02-13

Suspicious Activity Found On Instant Messaging Network

2024-02-07

Llamaindex Hackathon Resource Guide

Beginners

Intermediate

Advanced

Links

Blog of James Brown

2024-02-28 Multimodal Autoregressive Unsupervised Learning

2024-02-26 Windows High Contrast Theme Switching

2024-02-24 Using Tensorboard

2024-02-23 Provide Feedback To Xbox

2024-02-17 Magica: Open Source File Type Recognition Model From Google

2024-02-15 Manual And Automatic Video Creation

2024-02-15 Emulator World And Keybindings, Video Resources Collection

2024-02-15 Mood Control

2024-02-13 Suspicious Activity Found On Instant Messaging Network

2024-02-07 Llamaindex Hackathon Resource Guide

Beginners

Intermediate

Advanced

Links

2024-02-28

Multimodal Autoregressive Unsupervised Learning

2024-02-26

Windows High Contrast Theme Switching

2024-02-24

Using Tensorboard

2024-02-23

Provide Feedback To Xbox

2024-02-17

Magica: Open Source File Type Recognition Model From Google

2024-02-15

Manual And Automatic Video Creation

2024-02-15

Emulator World And Keybindings, Video Resources Collection

2024-02-15

Mood Control

2024-02-13

Suspicious Activity Found On Instant Messaging Network

2024-02-07

Llamaindex Hackathon Resource Guide