tab session manager needs google account to operate, while it can still do offline syncing without google cloud.
seems that it can only hook up with newly opened tabs instead of existing ones.
tab session manager needs google account to operate, while it can still do offline syncing without google cloud.
seems that it can only hook up with newly opened tabs instead of existing ones.
better use nomachine instead, which is based on nx
password: 472831
commands:
1 | # necessary env for gui target, though may not suitable for xvfb |
A curated list of awesome data labeling tools
labelImg - LabelImg is a graphical image annotation tool and label object bounding boxes in images
CVAT - Powerful and efficient Computer Vision Annotion Tool
labelme - Image Polygonal Annotation with Python
VoTT - An open source annotation and labeling tool for image and video assets
imglab - A web based tool to label images for objects that can be used to train dlib or other object detectors
Yolo_mark - GUI for marking bounded boxes of objects in images for training neural network Yolo v3 and v2
PixelAnnotationTool - Software that allows you to manually and quickly annotate images in directories
OpenLabeling - Label images and video for Computer Vision applications
imagetagger - An open source online platform for collaborative image labeling
Alturos.ImageAnnotation - A collaborative tool for labeling image data
deeplabel - A cross-platform image annotation tool for machine learning
MedTagger - A collaborative framework for annotating medical datasets using crowdsourcing.
Labelbox - Labelbox is the fastest way to annotate data to build and ship computer vision applications
turktool - A modern React app for scalable bounding box annotation of images
Pixie - Pixie is a GUI annotation tool which provides the bounding box, polygon, free drawing and semantic segmentation object labelling
OpenLabeler - OpenLabeler is an open source desktop application for annotating objects for AI appplications
Anno-Mage - A Semi Automatic Image Annotation Tool which helps you in annotating images by suggesting you annotations for 80 object classes using a pre-trained model
CATMAID - Collaborative Annotation Toolkit for Massive Amounts of Image Data
make-sense - makesense.ai is a free to use online tool for labelling photos
LOST - Design your own smart Image Annotation process in a web-based environment
Annotorious - A JavaScript library for image annotation.
Sloth - Tool for labeling image and video data for computer vision research.
YEDDA - A Lightweight Collaborative Text Span Annotation Tool (Chunking, NER, etc.). ACL best demo nomination.
ML-Annotate - Label text data for machine learning purposes. ML-Annotate supports binary, multi-label and multi-class labeling.
TagEditor - Annotation tool for spaCy
SMART - Smarter Manual Annotation for Resource-constrained collection of Training data
PIAF - A Question-Answering annotation tool
EchoML - Play, visualize, and annotate your audio files
audio-annotator - A JavaScript interface for annotating and labeling audio files.
audio-labeler - An in-browser app for labeling audio clips at random, using Docker and Flask.
wavesurfer.js - Simple annotations tool, check the example.
peak.js - Browser-based audio waveform visualisation and UI component for interacting with audio waveforms, developed by BBC UK.
Praat - Doing Phonetics By Computer
Aubio - Tool designed for the extraction of annotations from audio signals.
UltimateLabeling - A multi-purpose Video Labeling GUI in Python with integrated SOTA detector and tracker
VATIC - VATIC is an online video annotation tool for computer vision research that crowdsources work to Amazon’s Mechanical Turk.
Curve - Curve is an open-source tool to help label anomalies on time-series data
TagAnomaly - Anomaly detection analysis and labeling tool, specifically for multiple time series (one time series per category)
time-series-annotator - The CrowdCurio Time Series Annotation Library implements classification tasks for time series.
WDK - The Wearables Development Toolkit (WDK) is a set of tools to facilitate the development of activity recognition applications with wearable devices.
webKnossos - webKnossos is an open-source web-based tool for visualizing, annotating, and sharing large 3D image datasets. It features fast 3D data browsing, skeleton (line-segment) annotations, segmentation and proof-reading tools, mesh visualization, and collaboration features. The public instance webknossos.org hosts a collection of published datasets and can be used without a local setup.
KNOSSOS - KNOSSOS is a software tool for the visualization and annotation of 3D image data and was developed for the rapid reconstruction of neural morphology and connectivity.
Label Studio - Label Studio is a configurable data annotation tool that works with different data types
Dataturks - Dataturks support E2E tagging of data items like video, images (classification, segmentation and labelling) and text (full length document annotations for PDF, Doc, Text etc) for ML projects.
issues were found when launching apps on fixed ports.
maybe you should create this entry inside your lazero
package? no need for uploading to pypi, just keep it under pyjom
and leave a local install script there.
make sure all related services are going to launch after the redis_service.service
target. on macos or windows this may vary.
allocate multiple unused ports at once or they may overlap.
abandon ports found on redis.
python to get unused port:
1 | def getUnusedLocalhostPort(): |
install redis-py:
1 | pip install redis |
python send port to redis:
1 | import redis |
1 | journalctl -u <serviceName>.service |
1 | cd /etc/systemd/system |
maybe we should add some autorestart configs at it?
frpc_service.service
1 | [Unit] |
pyjom_webdav_rclone_service.service
1 | [Unit] |
tempthrottle.service
1 | [Unit] |
clash_fastgithub.service
1 | [Unit] |
tujia_scraper_qq_bot.service
1 | [Unit] |
sync_git_repos_syncdog.service
1 | [Unit] |
tune-a-video first recognize video content, then tweak it to fit the need
ComfyUI: A powerful and modular stable diffusion GUI.
civitai is a place for sharing stable diffusion models like anything v5 and surreality and ai arts.
now you can use controlnet to enhance the generation, give the figure skeleton. huggingface introduction
karlo: dalle2 replicate, karlo huggingface space, text to image (can be used for semantic search)
DiT diffusion with transformer
custom diffusion rlhf?
scribble-diffusion turn sketch into drawings
字体普遍画的很拉 需要用专门的ocr强化训练字体
fontdiffusion?
stable diffusion font generating
diffusionbee stable diffusion for macos m1
QQ搜索 异次元的我 免费画画 AI合成 (seems this can only be opened within qq, currently)
https://huggingface.co/hakurei/waifu-diffusion,这个ai是可以本地部署的,电脑配置可以的朋友们试试
novelai 有泄露的模型
imagen
dreambooth
dalle-mini, with space hosted on huggingface
中文版DALL-E is not open sourced (yet). it provides api for evaluation
1 | import numpy as np |
https://github.com/jina-ai/discoart
dalle-2
stable diffusion as dalle2 alternative
nvidia provided ai paint tool
text to image:
这个人的空间链接目前可以访问@2022 september 4
可以在被拉黑了之后快速点击右上角的分享链接 分享到其他人 其他群里面 或者点击生成链接 即可在浏览器里面查看这个人的动态 但是不知道这个链接有没有时效性 现在看起来就是一堆乱码 app里面的分享也不知道有没有时效性
不知道能不能搜索或者遍历 如果不能的话只能黑进去了 不过那样的话出来的数据肯定更多
要知道被拉黑,本地肯定有用户的ID, 有了ID就可以拿过去到其他新注册的Soul账号上面使用 通过底层api访问
可以考虑用Frida或者网上的一些脚本来分析破解SoulAPP 单独使用Frida估计不能利用Python遍历 还是需要破解协议证书才可以自由访问
speechbrain has features of Speech Recognition, Speaker Recognition, Speech Enhancement, Speech Processing, Multi Microphone Processing, Text-to-Speech, and also supports Spoken Language Understanding, Language Modeling, Diarization, Speech Translation, Language Identification, Voice Activity Detection, Sound classification, Grapheme-to-Phoneme, and many others.
视频里面的语言分为图片上面打出来的字幕以及人说的话
涉及到的问题分别为: 图片文字的语言分类 以及音频语言分类
online speech recognition
pip install SpeechRecognition
offline, need to provide language id:
https://pypi.org/project/automatic-speech-recognition/
use paddlespeech if possible, for chinese and english
use google cloud to detect language type in image:
https://github.com/deduced/ml-ocr-lang-detection
Detects and Recognizes text and font language in an image
https://github.com/JAIJANYANI/Language-Detection-in-Image
图片语言文字分类 可以用easyocr实现 加载多个模型 比如 中文加英文加日语 b站其他语言的可能也不怎么受欢迎 最多再加韩语
可以从视频简介 标题 链接里面提取出句子 每个句子进行语言分类 确定要使用的OCR模型 也有可能出现描述语言和视频图片文字语言不一致的情况
wolfram language提供了一个图片分类器 分类出来的结果可能很有意思 可以结合苹果的图片关注区域生成器来结合使用
ImageIdentify[pictureObj]
这个方法还支持subcategory分类 支持多输出 具体看文档
https://www.imageidentify.com/about/how-it-works
wolfram支持cloud deploy 到wolfram cloud不过那样可能不行
lingua performs good in short text, can be used in java or kotlin
supporting detecting different languages:
cld2 containing useful vectors containing text spans python binding
1 | import pycld2 as cld2 |
original cld3 is designed for chromium and it relies on chromium code to run
additional Python language related library from geeksforgeeks:
textblob is a natural language processing toolkit
1 | from textblob import TextBlob |
langid performs good in short text
google language detection library in python: langdetect
javascript:
https://github.com/wooorm/franc
python version of franc:
pyfranc
wlatlang.org provides whatlang-rs as rust package, also whatlang-py as python bindings
focus on person only, crop video and leave only human region untouched:
https://github.com/ConceptCodes/portal-zoomer
focus/zoom on given object using pytweening, a easing/tweening function collection.
to tell you, pytweening is initially developed for pyautogui (by the same author at least), probably for evading AI detection, passing captcha or somehow, but it could also be used in animation rendering.
or just use ffmpeg. you need to handcraft those formulas anyway.
does vidpy/mltframework and some other libs supports that? requires investigation.
macos mount ntfs read-only by default.
code from mounty.app
mounty is somehow not working so manual remount is needed.
one needs to click the remount button to mount it again under /Users/jamesbrown/.mounty/Toshiba3000
1 | sudo umount /Volumes/Toshiba3000 |