Beat Syncing and Music Recognition

audio analysis
beat detection
music analysis
This article lists audio and music recognition tools such as audioFlux, inaSpeechSegmenter, Mousai, Music Emotion Recognition, Picard, AcoustID, MixingBear, madmom, pyAudioAnalysis, music21, the UrbanSound8K dataset, MeowDetector, and more. It also notes plans to use QQ Music's recognition engine for identifying music within China, to look into Premiere beat-sync plugins, and to categorize videos by music structure and genre.
Published

May 11, 2022


Beat syncing, music recognition, funny video collection

now we have audioFlux, an alternative to librosa, but faster


AudioOwl for tempo, beat, and note identification:

https://github.com/dodiku/AudioOwl

CNN-based audio segmentation toolkit that detects speech, music, and speaker gender:

https://github.com/ina-foss/inaSpeechSegmenter

speech/music detection using Keras:

https://github.com/qlemaire22/speech-music-detection

awesome deep learning music:

https://github.com/ybayle/awesome-deep-learning-music

music genre classification / music classification / music recommendation / music search:

https://github.com/mlachmish/MusicGenreClassification

https://github.com/kristijanbartol/Deep-Music-Tagger

https://github.com/tae-jun/resemul

https://github.com/Insiyaa/Music-Genre-Classification

music recognition services:

audioid, SoundHound

maybe consider some Chinese tools? none found yet.

music-radar recognizes music:

https://github.com/keshavbhatt/music-radar

Mousai uses the free AudD API to recognize music:

https://github.com/SeaDve/Mousai

music emotion recognition:

https://github.com/SeungHeonDoh/Music_Emotion_Recognition

music tagging and recognition, using AcoustID fingerprints and a community-maintained music database (MusicBrainz):

https://github.com/metabrainz/picard

https://musicbrainz.org/doc/AcoustID

MixingBear (similar to neuralmix):

https://github.com/dodiku/MixingBear

madmom

https://github.com/CPJKU/madmom

http://madmom.readthedocs.org

music classification, general-purpose audio analysis package:

pyAudioAnalysis

Mathematica audio silence removal and segmentation:

https://zhuanlan.zhihu.com/p/43165678

music21 for music recognition:

https://zhuanlan.zhihu.com/p/35140033

music21 for midi analysis:

https://pypi.org/project/music21/

https://music21.readthedocs.io/en/latest

https://zhuanlan.zhihu.com/p/73564852

sound recognition and localization:

https://reality.ai/automotive-sound-recognition-localization/

UrbanSound8K dataset (6 GB):

https://www.kaggle.com/datasets/chrisfilo/urbansound8k

Fourier transform based cat meow detection:

https://github.com/EricDavidWells/MeowDetector
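
A rough sketch of what Fourier-based detection of a specific sound can look like: compute the spectrum of a short clip and measure how much of its energy falls inside a target frequency band. This is not taken from the MeowDetector repository; the function names, the naive DFT, and the 500 to 800 Hz band in the usage note are all assumptions chosen for illustration, and a real detector would use an FFT library and a tuned band.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Magnitudes of the DFT bins 0..n/2 (naive O(n^2) transform, short clips only)."""
    n = len(samples)
    return [
        abs(sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def band_energy_ratio(samples, sample_rate, low_hz, high_hz):
    """Fraction of non-DC spectral energy that lies between low_hz and high_hz."""
    n = len(samples)
    mags = dft_magnitudes(samples)
    total = sum(m * m for m in mags[1:])  # skip the DC bin
    if total == 0:
        return 0.0
    band = sum(
        m * m
        for k, m in enumerate(mags)
        if k >= 1 and low_hz <= k * sample_rate / n <= high_hz
    )
    return band / total


def sine(freq_hz, sample_rate, n):
    """Hypothetical helper: n samples of a pure tone, used only for the demo."""
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]
```

For example, `band_energy_ratio(sine(600, 8000, 400), 8000, 500, 800)` is close to 1, while the same call on a 2000 Hz tone is close to 0; a detector could flag a clip when the ratio exceeds some threshold.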

building sound event classifier:

https://ignitarium.com/building-an-ai-based-sound-event-classifier/

real-time continuous sound event classification (usually via silence detection):

https://medium.com/@chathuranga.15/real-time-sound-event-classification-83e892cf187e

https://medium.com/@chathuranga.15/sound-event-classification-using-machine-learning-8768092beafc
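
The silence-detection step mentioned above can be sketched as a simple energy gate: split the audio into short frames, mark frames whose RMS energy is below a threshold as quiet, and emit one segment per run of loud frames. This is a minimal sketch, not the method from the linked articles; the function names, the 20 ms frame size, the 0.02 threshold, and the five-frame silence gap are all assumed values that would need tuning on real recordings.

```python
import math


def rms(frame):
    """Root-mean-square energy of one frame of float samples in [-1, 1]."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def split_on_silence(samples, sample_rate, frame_ms=20,
                     threshold=0.02, min_silence_frames=5):
    """Return (start_sec, end_sec) spans of non-silent audio."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    n_frames = len(samples) // frame_len
    segments = []
    start = None  # frame index where the current event began
    quiet = 0     # consecutive quiet frames seen inside the current event
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        if rms(frame) >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence_frames:
                end = i - quiet + 1  # first quiet frame closes the event
                segments.append((start * frame_len / sample_rate,
                                 end * frame_len / sample_rate))
                start, quiet = None, 0
    if start is not None:  # audio ended while an event was still open
        end = n_frames - quiet
        segments.append((start * frame_len / sample_rate,
                         end * frame_len / sample_rate))
    return segments


def demo_signal(sample_rate=1000):
    """Synthetic clip: 0.2 s silence, 0.3 s tone, 0.5 s silence, 0.2 s tone, 0.1 s silence."""
    def tone(n):
        return [0.5 * math.sin(2 * math.pi * 50 * t / sample_rate) for t in range(n)]
    return [0.0] * 200 + tone(300) + [0.0] * 500 + tone(200) + [0.0] * 100
```

Each returned span could then be passed to a classifier as one candidate sound event, so the classifier only runs on the parts of the stream that contain sound.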

cry detection:

https://www.amberou.com/cry-detection

https://github.com/umangkk5/Infant-Cry-Detection-System/blob/master/site-packages/soundfile.py

UrbanSound8K classifier:

https://github.com/awln/urban8k-audio-classifier

laugh detection:

https://github.com/ideo/LaughDetection

gunshot detection:

https://github.com/hasnainnaeem/Gunshot-Detection-in-Audio

dog bark detector:

https://github.com/t04glovern/dog-bark-detection

https://devopstar.com/2020/04/13/dog-bark-detector-machine-learning-model

https://dsp.stackexchange.com/questions/23466/detect-dog-barks

Get a music recognition API, preferably QQ Music's recognition, since it is a recognition engine for music in China.

If the music cannot be recognized, analyze the video description to see whether it mentions a BGM.

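Beat syncing: BPM detection already exists in the earlier autoup project; check whether other analysis software offers it as well. Premiere one-click beat-sync plugins may be backed by open-source libraries.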
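
Once a BPM value is available from any of the tools above, placing cuts on the beat is mostly arithmetic: beats fall every 60 / BPM seconds after the first beat. The sketch below builds that beat grid and snaps a proposed cut time to the nearest beat. It assumes a constant tempo and a known first-beat offset; `beat_times` and `snap_to_beat` are hypothetical helper names, not functions from autoup or from any Premiere plugin.

```python
def beat_times(bpm, duration, first_beat=0.0):
    """Timestamps in seconds of every beat in [first_beat, duration), constant tempo."""
    if bpm <= 0:
        raise ValueError("bpm must be positive")
    interval = 60.0 / bpm  # seconds between consecutive beats
    times = []
    k = 0
    while first_beat + k * interval < duration:
        times.append(round(first_beat + k * interval, 3))
        k += 1
    return times


def snap_to_beat(t, beats):
    """Move a proposed cut point t to the nearest beat timestamp (beats must be non-empty)."""
    return min(beats, key=lambda b: abs(b - t))


beats = beat_times(120, 2.0)     # 120 BPM means one beat every 0.5 s
cut = snap_to_beat(1.2, beats)   # a cut proposed at 1.2 s moves to the beat at 1.0 s
```

Songs with tempo changes would need real beat tracking (for example with madmom or librosa) instead of a fixed grid.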

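From existing beat-synced videos, cut out the segments without text; use the music structure to distinguish the climax, beginning, middle, and other parts; group the videos by music genre tag.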

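For funny videos, ones with pure laughter are best, with large movements and no dialogue; use reverse image search on screenshots to collect similar videos.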