video and audio needs to be analysised separately.
audio can be processed by chunks, splited tracks, while video can be itered frame by frame.
video and audio needs to be analysised separately.
audio can be processed by chunks, splited tracks, while video can be itered frame by frame.
when xorg fails, one must use commandline to debug problems.
put ‘3’ after the longest line of boot commands.
use ssh to collect logs even if the main interface is stuck somehow (like libinput faliure)
reference:
https://www.linuxandubuntu.com/home/how-to-boot-into-linux-command-line/amp
working on archlinux arm:
libwacom 2.1.0-1
not working on kali linux:
libwacom-bin 2.2.0-1
full reference:
https://github.com/DIGImend/digimend-kernel-drivers/issues/514
sudo nano /etc/X11/xorg.conf
Section “InputClass”
Identifier “Tablet”
Driver “wacom”
MatchDevicePath “/dev/input/event*”
MatchUSBID “6161:4d15”
EndSection
to debug input problems:
check /etc/logs/Xorg.0.logs
video download:
https://github.com/Evil0ctal/Douyin_TikTok_Download_API
https://github.com/Johnserf-Seed/TikTokDownload
https://github.com/rouze-d/tiktok-download
https://github.com/CuriousYoda/tiktok-downloader
video api and deduplication:
https://github.com/VideoData/DY-Data
many scrapers:
https://github.com/Jack-Cherish/python-spider
video multi download tool:
https://github.com/smalls0098/video-parse-tools
tiktok scrapers:
https://github.com/drawrowfly/tiktok-scraper
tiktok api:
淘宝视频 哇偶视频似乎取消了视频上方的搜索接口
首页的视频推荐似乎更好看一些 推荐算法更先进
逛逛被单独分了一个专栏 在那里可以搜索视频
“您的分享过于频繁,请稍后重试”
出现这种情况需要更换qq号
how about let’s use appium for unlocking phone, airtest for actual testing?
appium can only unlock phone by removing password.
password with ampersand needs to be quoted/escaped.
that might need another supervisor
set up accessibility servicr for autox either by the switch inside settings (with root) or run this command:
1 | adb shell settings put secure enabled_accessibility_services packagname/servicename |
1 | am start -n org.autojs.autoxjs.v6/org.autojs.autojs.external.shortcut.ShortcutActivity -a android.intent.action.MAIN -e path "/storage/emulated/0/脚本/show_toast.js" |
现在autojs是付费的 但这两个都不能替代appium或者airtest
autox repo vscode plugin
hook current application context
android virtual cam xposed安卓虚拟摄像头 android virtual camera on xposed hook
highly suspected source of ‘token’, the miniapp json generator the MiniArkShareModelBuilder
transformArkShareJson
ShareQQArkHelper
MiniProgramShareUtils
MiniProgramShareUtils.newShareInfoRequest
ShareManager
MiniProgramShareUtils.shareToChatDirectly
adb shell am start -d 启动应用之uri被*吃了
inspeckage Android Package Inspector - dynamic analysis with api hooks, start unexported activities and more. (Xposed Module)
找出APP的SchemeURL(抓取APP意图/intent)的常用方法
隐式启动 这是一款开发者辅助工具,帮助开发者发现手机上的应用的快捷启动,原理是利用 Android 提供的隐式启动 Activity 来快速启动某个应用的某个界面,如快速发微博、发朋友圈、扫一扫,快速切换 vpn 等
adb/安卓/按键精灵/autojs/uniapp/ec打开SchemeURL的方法及常用SchemeURL整理
com.zwk.xintent (intent traffic monitoring tool) release orginal repo
use am broadcast
to send indirect intent
sending a boot-complete broadcast
exploiting broadcast receivers
usage of am and common android shell commands
post content on we.taobao.com
淘宝直播
https://zhuanlan.zhihu.com/p/91192587
packet capture + batch m3u8 download
5 666:/鱼里家庭布偶猫舍的场间简直太火爆了,快来看!
可爱的宝宝找家咯 ,快来场间 https://m.tb.cn/h.fJErCBC?sm=b4d048
———————口———————
!们那之学有为上家得人多啊
淘宝网页无法播放直播
最新版只有微淘入口
淘宝逛逛 有用户名 ID
67信生对在然有为上子是去你嘻 https://m.tb.cn/h.fJE9C6B?sm=ee59f9 怕鱼的小猫咪~~
this video is flipable
淘宝网页版
use kaggle for testing, if we can connect to it we are good for 12 hours.
the campus network allows dns query, might allow dns port based proxies.
use dig for DNS avaliability check.
dig baidu.com
dns proxies:
https://0day.work/tunneling-all-traffic-over-dns-with-a-socks-proxy/
https://serverfault.com/questions/962961/socks-proxy-over-dns
generate audio and music: audiolm
audio ai timeline best audio generation models
generate music using diffusion
ecantorix generate singing voice using lmms and espeak
read/write deepvoice binary file
decrypt acestudio binary project file
谷歌AI歌手震撼来袭!AudioLM简单听几秒,便能谱曲写歌 https://www.kuxai.com/article/398
论文地址:https://arxiv.org/abs/2110.08813
我的fork仓库:https://github.com/innnky/VISinger
scoredraft in python:
https://m.bilibili.com/video/av19085545
https://github.com/fynv/ScoreDraft
ACE studio 公测:
ACE Studio是时域科技旗下的AI歌声合成引擎,通过毫无妥协的高表现力人声,解除演唱能力的羁绊,释放人们的音乐想象力。
2022年7月12日,ACE Studio公测开启。
为了在正式收费之前提供更好的稳定性体验,本轮公测期间,所有AI歌手和编辑器功能均可免费使用。下载软件后,使用手机号注册登录,即可开始使用。Mac/Win双端均可使用。
下载地址:https://ace-studio.timedomain.tech/
oss singing software:
Microsoft muzic
https://alternativeto.net/software/ecantorix/about/
https://alternativeto.net/software/openutau/about/
https://alternativeto.net/software/cadencii/about/
voice style transfer:
https://github.com/andabi/deep-voice-conversion
https://rebryk.github.io/convoice-demo/
https://github.com/mazzzystar/randomCNN-voice-transfer
https://ebadawy.github.io/post/speech_style_transfer/
you can alter the order of generated lyrics, fitting into the sequence of the original lyrics.
lyrics generation:
https://zhuanlan.zhihu.com/p/137214305
https://baijiahao.baidu.com/s?id=1666495322826772953
https://github.com/dengxiuqi/Lyricist-torch
https://github.com/zipper112/LyricsGeneration
https://github.com/coder-yuzhiwei/wangfeng-lyrics-generator
https://github.com/jianyq/Tong-Music
video download:
https://github.com/Evil0ctal/Douyin_TikTok_Download_API
https://github.com/Johnserf-Seed/TikTokDownload
https://github.com/rouze-d/tiktok-download
https://github.com/CuriousYoda/tiktok-downloader
video api and deduplication:
https://github.com/VideoData/DY-Data
many scrapers:
https://github.com/Jack-Cherish/python-spider
video multi download tool:
https://github.com/smalls0098/video-parse-tools
tiktok scrapers:
https://github.com/drawrowfly/tiktok-scraper
tiktok api:
A fastai/PyTorch package for unpaired image-to-image translation.
https://github.com/tmabraham/UPIT?auto_subscribed=false&email_source=explore
视听分割 视频注意力机制
only segment video objects that make sounds, video/audio combined segmentation:
https://github.com/OpenNLPLab/AVSBench
video object tracking and segmentation unified framework:
https://github.com/MasterBin-IIAU/Unicorn
video object segmentation handle long video with ease:
https://github.com/hkchengrex/XMem
when removing video watermarks, remember to ease in/out. that is said, do not stop blurring immediately after the end mark. instead, extend the blur time and decrease blur level incrementally. also, the blur ease-in is needed for the start mark, blur ahead of the start mark and ease in incrementally.
descriptive information generation from video/image:
https://github.com/BAAI-WuDao/CogView
https://github.com/BAAI-WuDao/BriVL
https://github.com/PaddlePaddle/PaddleVideo/blob/develop/docs/zh-CN/install.md
video understanding/captioning:
https://github.com/rohit-gupta/Video2Language
https://github.com/byeongjokim/Automatic-Baseball-Commentary-Generation-Using-DeepLearning
https://github.com/shhdSU/Image_Captioning_DeepLearning
https://github.com/jayleicn/recurrent-transformer
https://github.com/terry-r123/Awesome-Captioning
https://github.com/vijayvee/video-captioning
https://github.com/scopeInfinity/Video2Description
https://github.com/xiadingZ/video-caption.pytorch
https://github.com/YehLi/xmodaler
https://github.com/sujiongming/awesome-video-understanding
action recognition:
https://github.com/mit-han-lab/temporal-shift-module
https://github.com/yjxiong/temporal-segment-networks
https://github.com/yjxiong/tsn-pytorch
https://github.com/open-mmlab/mmaction
https://github.com/jinwchoi/awesome-action-recognition
The data remaining only have texts, danmaku, likes, titles, intros, comments, tags, image/video analysis results(short description). You can only generate video from generated metadata or given rules. Find similar words, similar danmaku, similar features, comments or the inverse, according to the selected topic and main idea.
Analyze video when downloaded, mark its highlights, analyze texts and danmaku. Get video segments and audio segments.
Collect pictures/videos with given rules, namely finding the head of somebody, with how many likes, keywords.
Split audio and grab the main speaker. clone the voice and perhaps changes the gender.
Split video and do human/image segmentation if human/target is found. put it onto another human/target’s background masking the original human, with similar areas and movements.
Analyze video with off-topic(offline) and of-topic(online) sources.
Remove watermark according to username.
Generate danmaku and generate video accordingly. Generate texts and generate video accordingly. Doing faceswap, talking head and human/image segmentation accordingly.
免费gpt文本生成:彩云小梦 以及小梦海外版
小梦的中文文本有涉及政治的检测器 不能把敏感内容塞进小梦
对话生成
https://huggingface.co/thu-coai/CDial-GPT_LCCC-large/tree/main
twitter generator inspired by influencers:
https://github.com/gdemos01/TwitterInfluencerAI
chinese LM
清华130b大模型
测试地址:https://huggingface.co/spaces/THUDM/GLM-130B
模型仓库:https://github.com/THUDM/GLM-130B
https://github.com/Morizeyao/GPT2-Chinese
https://zhuanlan.zhihu.com/p/352028922
https://github.com/TsinghuaAI/CPM-1-Finetune
https://github.com/TsinghuaAI/CPM-1-Generate
https://github.com/TsinghuaAI/CPM-2-Pretrain
gpt2/cpm tutorial:
https://www.cnblogs.com/wwj99/p/12503545.html