Pyjom Schedules

pyjom

schedule

This article features discussions on various video and audio processing projects. These projects cover a wide range of tasks such as Natural Language Understanding (NLU), trending topic discovery, loudness improvement, watermark removal, enhanced highlight algorithms, voice changing technology, locating older content, and live streaming for quality enhancement.

Published

August 18, 2022

pyjom dev schedules

stunts and meme content

"emergency scam food" (how to blend Paimon with Rick Astley?)

recommendation

use txtai to do NLU and recommend things to people
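
A minimal sketch of the recommendation idea, under loud assumptions: it does not call txtai. It swaps in plain word-count vectors and cosine similarity as a dependency-free stand-in for the embedding search txtai would provide. The names `tokenize`, `cosine` and `recommend`, and the idea of building a profile from a viewer's history, are illustrative and not part of pyjom.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two Counter word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def recommend(history, catalog, limit=3):
    """Rank catalog titles by similarity to everything in the viewer's history."""
    profile = Counter(t for text in history for t in tokenize(text))
    scored = [(cosine(profile, Counter(tokenize(item))), item) for item in catalog]
    # Drop items that share no words with the profile at all.
    scored = [(score, item) for score, item in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [item for _, item in scored[:limit]]


if __name__ == "__main__":
    history = ["funny cat video", "cat playing piano"]
    catalog = ["cat chasing laser", "stock market news", "dog and cat friends"]
    print(recommend(history, catalog))
```

With a real embedding model the scoring step would compare sentence vectors instead of word counts, so "kitten" and "cat" could match even without shared words.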

dog/cat video generation

make render engine runnable

issues:

video length too long (10 mins)

cause: a speed calculation error.

BGM somehow not in sync (BPM/clip ranges too broad?)
plan: analyze the peaks (abrupt changes) in the BGM and pick the louder ones, using pyloudnorm to measure audio volume

pip3 install pyloudnorm

import soundfile as sf
import pyloudnorm as pyln
data, rate = sf.read("0055014.wav") # load audio (with shape (samples, channels))
print(data.shape) # check the sample and channel counts
meter = pyln.Meter(rate) # create BS.1770 meter
loudness = meter.integrated_loudness(data) # measure integrated loudness of the whole file (LUFS)
print(loudness)

place video cuts on the loudest points; detect abrupt changes with talib, or just take the derivative and apply a Gaussian average
video too repetitive (corpus too small?)
subtitles are not removed and the active region is not cropped (reviewer's resource not used? I would rather advise doing this directly, since it requires less computational power)
no minimum motion threshold (reviewer's fault? also recommended to do this in the producer)
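
A minimal sketch of the "derivative plus Gaussian average" idea from the issue list above, assuming a mono signal held in a NumPy array. It uses a windowed RMS level as a crude stand-in for the loudness that pyloudnorm would report, and it does not use talib. The function names, the 0.1 s window and the sigma value are illustrative assumptions, not pyjom code.

```python
import numpy as np


def rms_envelope_db(samples, rate, window_s=0.1):
    """Windowed RMS level in dB, one value per non-overlapping window."""
    win = max(1, int(round(rate * window_s)))
    n_frames = len(samples) // win
    frames = np.asarray(samples[: n_frames * win], dtype=float).reshape(n_frames, win)
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    return 20.0 * np.log10(np.maximum(rms, 1e-10))  # floor avoids log(0)


def gaussian_smooth(values, sigma=2.0):
    """Gaussian-weighted moving average; edges are padded so length is kept."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (x / sigma) ** 2)
    kernel /= kernel.sum()
    padded = np.pad(values, radius, mode="edge")
    return np.convolve(padded, kernel, mode="valid")


def abrupt_rises(envelope_db, sigma=2.0, top_k=5):
    """Frame indices of the steepest rises in the smoothed envelope, steepest first."""
    slope = np.diff(gaussian_smooth(envelope_db, sigma))
    # Keep local maxima of the positive slope only.
    candidates = [
        i
        for i in range(1, len(slope) - 1)
        if slope[i] > 0 and slope[i] >= slope[i - 1] and slope[i] > slope[i + 1]
    ]
    candidates.sort(key=lambda i: slope[i], reverse=True)
    return candidates[:top_k]


if __name__ == "__main__":
    rate = 8000
    t = np.arange(4 * rate) / rate
    # Quiet for two seconds, then loud: one abrupt change at t = 2 s.
    signal = np.where(t < 2.0, 0.01, 0.5) * np.sin(2 * np.pi * 440 * t)
    envelope = rms_envelope_db(signal, rate)
    print(abrupt_rises(envelope, top_k=1))
```

A returned index `i` marks the jump between window `i` and window `i + 1`, so a candidate cut time is roughly `(i + 1) * window_s` seconds.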

text to video, template based video generator (this is perhaps the most complex video generator ever. do it with caution, it might also include the flipcard, narrator and slideshow based generators)

generator models subarchitecture (subcategories of template based generators)

flipcard

slideshow (video and audio, might also include the dog&cat video!)

narrator

summarized video

policy evasion, NSFW filters

remove all hints from image, video, audio and script that may lead to copyright issues

analyze the media content and metadata, relationships

analyze danmaku
paraphrase the script
cut the crap and understand each clip’s meaning

process the video clips, like changing the human figure, changing faces, stylizing the video, adding 2d to 3d effects

process the audio clips, like changing voice, adding sound effects, separating audio/music tracks, ducking
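
Of the audio steps above, ducking is the easiest to sketch: lower the music whenever the voice track is active. The following is a minimal illustration assuming both tracks are mono NumPy arrays at the same sample rate. The function names, threshold, reduction amount and fade time are assumptions chosen for the example, not values from pyjom.

```python
import numpy as np


def moving_average(values, width):
    """Moving average that keeps the input length by padding the edges."""
    padded = np.pad(values, (width // 2, width - 1 - width // 2), mode="edge")
    return np.convolve(padded, np.ones(width) / width, mode="valid")


def duck_music(music, voice, rate, threshold_db=-30.0, reduction_db=-12.0,
               window_s=0.05, fade_s=0.2):
    """Return (ducked_music, gain): music is attenuated while voice is active."""
    n = min(len(music), len(voice))
    music = np.asarray(music[:n], dtype=float)
    voice = np.asarray(voice[:n], dtype=float)

    # Short-term voice level in dB from the moving mean of the squared signal.
    win = max(1, int(round(rate * window_s)))
    power = moving_average(voice ** 2, win)
    level_db = 10.0 * np.log10(np.maximum(power, 1e-12))

    # Target gain: reduced where the voice is above the threshold, unity elsewhere.
    reduced = 10.0 ** (reduction_db / 20.0)
    target = np.where(level_db > threshold_db, reduced, 1.0)

    # Smooth the gain curve so the music fades instead of clicking.
    fade = max(1, int(round(rate * fade_s)))
    gain = moving_average(target, fade)
    return music * gain, gain


if __name__ == "__main__":
    rate = 8000
    t = np.arange(2 * rate) / rate
    voice = np.where(t >= 1.0, 0.5, 0.0) * np.sin(2 * np.pi * 440 * t)
    music = np.ones_like(t)
    ducked, gain = duck_music(music, voice, rate)
    print(gain[2000], gain[14000])
```

The final mix would be `ducked + voice`; a real pipeline would probably use separate attack and release times rather than one symmetric fade.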

index, retrieve and align video and audio content according to our collected database

qq managing

mitm chats in friends
mitm chats in groups
source and send pictures to qzone
source and send pictures to chat
reduce posting frequency by group size and feedback
post video links relevant to the group topic

personal info collecting and email/sms bulk sending

avoid mail being trashed or turned into junk
collect and make mail templates for mail posting

voice changer

vst based voice changer
train or find a decent voice generator (mature female voice corpus, soft male voice corpus)

please look for these on Bilibili, in QQ groups, or in any other relevant places, thanks

live streaming

source the video

if the source is the same site, prefer video that is at least one month old and audio that is at least half a month old

prepare some space for storing live streaming data
source the audio
automatic interactions
handle the vtuber model’s actions
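
The age rule for same-site sources in the list above can be sketched as a small filter. This assumes "one month" means 30 days and "half a month" means 15 days, and that each candidate carries a publish date. The constants and the `is_old_enough` helper are hypothetical names for illustration.

```python
from datetime import datetime, timedelta

# Assumed thresholds: "one month" as 30 days, "half a month" as 15 days.
MIN_VIDEO_AGE = timedelta(days=30)
MIN_AUDIO_AGE = timedelta(days=15)


def is_old_enough(kind, published, now, same_site=True):
    """Apply the age rule only when the source is on the same site as the stream."""
    if not same_site:
        return True
    min_age = MIN_VIDEO_AGE if kind == "video" else MIN_AUDIO_AGE
    return now - published >= min_age


if __name__ == "__main__":
    now = datetime(2022, 8, 18)
    print(is_old_enough("video", datetime(2022, 7, 1), now))
    print(is_old_enough("audio", datetime(2022, 8, 10), now))
```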

pyjom dev schedules

stunts and meme content

recommendation

topic discovery/acquiring

trending topics

popular topics

personal/customized topics

dog/cat video generation

make render engine runnable

text to video, template based video generator (this is perhaps the most complex video generator ever. do it with caution, it might also include the flipcard, narrator and slideshow based generators)

generator models subarchitecture (subcategories of template based generators)

flipcard

slideshow (video and audio, might also include the dog&cat video!)

narrator

summarized video

policy evasion, NSFW filters

analyze the media content and metadata, relationships

process the video clips, like changing the human figure, changing faces, stylizing the video, adding 2d to 3d effects

process the audio clips, like changing voice, adding sound effects, separating audio/music tracks, ducking

index, retrieve and align video and audio content according to our collected database

retrieve and align video and audio according to our smart search agent (keyword extractor, related words) and do live compilation

qq managing

personal info collecting and email/sms bulk sending

voice changer

live streaming