Autonomous Machines & Society.

2022-09-17
语音转文字 Stt Speech To Text

语音转文字 asr stt speech to text

online

字说APP的api

逆向搜狗输入法 绕过签名验证

搜狗输入法apk的api

微软stt

https://github.com/cuberwr/bilibiliSTT

多家免费stt

https://github.com/1c7/Translate-Subtitle-File

offline

https://github.com/metavoiceio/metavoice-src

pyannote segment audio according to different speakers, detect voice activity

speechbrain very advanced speech related ai library, with almost everything related to speech

vosk

paddlespeech


paper of Google USM (universal speech model) supporting 1000 languages


whisper.cpp perform fast voice to text operation using cpu rather than gpu

whisperx improve time accuracy with forced alignment

whisper gui buzz

whisper by openai, with multilingual and translation avaliable, can detect under background music and noise, with slience,

Read More

2022-09-17
Story Continuation, Story-Dalle Able To Generate Story And Image At The Same Time

Read More

2022-09-17
audio-visual active speaker detection

Read More

2022-09-17
Agi (Artificial General Intelligence) Related Projects

said by HTM, AGI knows what it did to the world (self-awareness), also signals from sensors.

google research

gwern wrote a fiction. he thinks agi starts from automl-zero which is similar to lazero and metalazero by name and perspective.

by design lazero can be deeply aligned, inspecting and studying user’s actions. it also has its own exploration space. however, these expectations can never be fully satisfied at the same time. if you want more power, you have to let go.

lucidrains repositories

this one got lots of state-of-the-art implementations for close-sourced papers and also repos for AGI. stunning.

JEPA-pytorch (WIP) yann lecun’s version how agi will be built

PaLM scaling language model with pathways

side projects

make a video text to video generation

nuwa text to video generation

opencog

moses (supervised) for evolutionary program synthesis

repos on github

he4o

aixijs general reinforcement learning in browser repo

opennars

brain simulator 2 on windows platform

DQfD: Learning from Demonstrations for Real World Reinforcement Learning (paper)

mit class on AGI

jiaxiaogang’s god-knows-what theory and training logs

awesome deep reinforcement learning (deep-rl)

awesome agicocosci exhausitive list of papers and repos for cognitive science and AGI

introduction and links on AGI

Read More

2022-09-17
Video Quality Assessment, Audio Quality Assessment

video

paperswithcode benchmark

fast-vqa

ssim based vqs

audio

mosquito

Read More

2022-09-17
speech recognition

Read More

2022-09-17
Mindsdb, In-Database Machine Learning, Hidden Markov Model For Time Series Processing, Output A Label As Such For Each Element In The Time Series

MindsDB

documentation

cloud mindsdb editor

warning: this thing could break your dependencies. better use docker instead.

1
2
3
docker pull mindsdb/mindsdb
# pip3 install mindsdb

HMMLearn (unsupervised)

most useful feature:

training and inferring the hidden states

supervised hmm learning

seqlearn

pomegranate (both supervised and unsupervised)

documentation

All models that support labeled data support semi-supervised learning, including naive Bayes classifiers, general Bayes classifiers, and hidden Markov models.

While probability Distributions are frequently used as components of more complex models such as mixtures and hidden Markov models, they can also be used by themselves. Many data science tasks require fitting a distribution to data or generating samples under a distribution. pomegranate has a large library of both univariate and multivariate distributions which can be used with an intuitive interface.

General Mixture Models (GMMs) are an unsupervised probabilistic model composed of multiple distributions (commonly referred to as components) and corresponding weights. This allows you to model more complex distributions corresponding to a singular underlying phenomena. For a full tutorial on what a mixture model is and how to use them, see the above tutorial.

Hidden Markov Models

Bayes Classifiers and Naive Bayes

Markov Chains

Bayesian Networks

Markov Networks

Read More

2022-09-17
Javascript Python Bridge

jspybridge

javascript in python:

1
2
pip3 install javascript

1
2
3
4
5
from javascript import require, globalThis
chalk, fs = require("chalk"), require("fs")
print("Hello", chalk.red("world!"), "it's", globalThis.Date().toLocaleString())
fs.writeFileSync("HelloWorld.txt", "hi!")

access python from javascript:

1
2
npm i pythonia

1
2
3
4
5
6
7
8
9
10
11
import { python } from 'pythonia'
// Import tkinter
const tk = await python('tkinter')
// All Python API access must be prefixed with await
const root = await tk.Tk()
// A function call with a $ suffix will treat the last argument as a kwarg dict
const a = await tk.Label$(root, { text: 'Hello World' })
await a.pack()
await root.mainloop()
python.exit() // Make sure to exit Python in the end to allow node to exit. You can also use process.exit.

Read More

2022-09-17
Opencv Corner Detection

fast algorithm for corner detection

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
import numpy as np
import cv2 as cv
from matplotlib import pyplot as plt
img = cv.imread('blox.jpg',0) # `<opencv_root>/samples/data/blox.jpg`
# Initiate FAST object with default values
fast = cv.FastFeatureDetector_create()
# find and draw the keypoints
kp = fast.detect(img,None)
img2 = cv.drawKeypoints(img, kp, None, color=(255,0,0))
# Print all default params
print( "Threshold: {}".format(fast.getThreshold()) )
print( "nonmaxSuppression:{}".format(fast.getNonmaxSuppression()) )
print( "neighborhood: {}".format(fast.getType()) )
print( "Total Keypoints with nonmaxSuppression: {}".format(len(kp)) )
cv.imwrite('fast_true.png', img2)
# Disable nonmaxSuppression
fast.setNonmaxSuppression(0)
kp = fast.detect(img, None)
print( "Total Keypoints without nonmaxSuppression: {}".format(len(kp)) )
img3 = cv.drawKeypoints(img, kp, None, color=(255,0,0))
cv.imwrite('fast_false.png', img3)

Read More

2022-09-17
Color Transfer Between Images, Histogram Based Style Transfer

图像调色风格转换 可以创建蹦迪特效 让视频或者图片五彩斑斓

color transfer between images

1
2
pip install color_transfer

official scikit-learn histogram matching

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
import matplotlib.pyplot as plt
from skimage import data
from skimage import exposure
from skimage.exposure import match_histograms
reference = data.coffee()
image = data.chelsea()
matched = match_histograms(image, reference, channel_axis=-1)
fig, (ax1, ax2, ax3) = plt.subplots(nrows=1, ncols=3, figsize=(8, 3),
sharex=True, sharey=True)
for aa in (ax1, ax2, ax3):
aa.set_axis_off()
ax1.imshow(image)
ax1.set_title('Source')
ax2.imshow(reference)
ax2.set_title('Reference')
ax3.imshow(matched)
ax3.set_title('Matched')
plt.tight_layout()
plt.show()

histogram matching

Read More