Autonomous Machines & Society.

2022-06-10

Bilibili Up主入站了解运作机制

bilibili up主入站了解运作机制开篇动画

视频提交之后转码（可能会二次压缩）

ai加人工审核血腥暴力色情（ai识别）

涉政时政版权违反社区规范

推荐系统

根据场景特点（分区话题标签相关度）用户行为（点击率）稿件互动质量（回复的内容是否是积极的）

精准多样（不能重复完全一样）视频内容

热门热搜筛选更严格没进去不是流量限流

限流是流量更高的时候第二道审核出现问题

锤人引战粗俗的问题

Game Player’s Logic

玩家玩游戏的逻辑

“Looks like you are caring about me.”

Gaming starts from dissatisfactory of the reality. Be it loneliness, anger, tirement or stress.

玩游戏都是有人引导玩的都是因为别人玩所以跟着玩的固有的社交属性

玩游戏和职业可能没多大关系

跟着玩会形成流派会在不同的区域分化比如正版盗版单机联机

玩家自闭的因素是因为游戏本身的复杂性以及封闭性重复性玩家变得不想考虑外界的事情只想关心游戏本身的事情

可以把游戏相关的视频从外网搬到内网把游戏视频搬运过来也可以引用游戏元素头像吸引游戏玩家的流量可以切分游戏剪辑视频转化游戏攻略之类的视频和文案

sound, visual effects, scripts

Classic scenes picked from danmaku peaks.

malware, hacking

do it!

referral spam

Referral spam is used to get into a targeted businesses analytics by visiting their

site multiple times with different IP’s from the domain you’re trying to market.

keyword stuffing

encourage you to add more “hot” words, by training language model in a supervised way, or just use plain replace hacks.

change open graph data for same page once in a while

open graph tutorial

free open graph extractor

scrape and rinse content

scrape from wayback machine (older but clean), check plagiarism in copyscape

private blog networks

create a bunch of websites, all refer to your own contents.

Instagram: Instagress

G+: Circlescope

Linkedin: Elink

Twitter: Managedflitter and Twitter toolkit

chrome extension

Youtube: Tubebuddy

Facebook: Facebook automation toolkit

chrome extension

Pinterest: NinjaPinner

Tumblr (blogging platform): Tumbleninja

Email: Pitchbox

jumpcut courses

jumpcut.com provides digital marketing courses:

viral academy(free as youtube influencer 101 course), automated income machine(forbidden), video ads bootscamp(nothing, avaliable on freecoursedl.com), contageous content(currently premium)

it features different camera angles of narriator’s portraits, relative animations and pdf sheets for fill-in-blank tasks. it emphasizes on email ads and audience funnel/filter.

many free course providers now offer jumpcut academy 2.0. they also offer technical analysis/ quantative analysis on marketing and other courses. though sometimes it is better to get to the code.

中文的自媒体教程和它也是讲一个类型的内容，不过更倾向于实操和自动化。当然也有国外的自媒体教程。

我觉得教程看太多了可能不利于代码实现。可以先把记录放在这里等待之后利用。

搬运教程/课程也是一个好的方向，好的教程/课程需要被翻译或者洗稿，避免被维权。

知乎上也有被我收藏下来的一个自媒体提高视频播放量的视频。他讲的是如何用连词，问句强行引起观众注意（拍摄技巧也可以强行引起观众注意，或者其他的方法，如连贯的语音）。可能知乎上面还有更多关于提高播放量以及和jumpcut有关的内容。

Source code semantic search tool audit tool

can be used to analysis bilibili source code or large code base

sourcegraph/sourcegraph:

in go

github/semantic:

in haskell

sonarqube:

code audit tool

2022-06-09

Remote Jobs

remote jobs in china:

https://github.com/LinuxSuRen/remote-jobs-in-china

remote jobs from stack overflow:

quantative financial stock market analysis

gs-quant

pyportfolioopt

amplfinance

造模拟炒股软件走势跟着大盘收集用户交易数据

随机假设买卖点随机假设成交构成无数虚拟账户交易或者用某种假设原理拆分不同来源的买卖单

time-series-transformers huggingface blog

深入浅出Python量化交易配套代码

数据来源

pandas_datareader by yahoo 是国外股票的数据

tushare 国内股票数据

模型建立

open source code for economics modeling in python/julia

pandas pipe for streamline processing of real time data

time series forcast

有类似的软件我下载到windows上面过 (agent based simulation/agent based modeling) called altreva adaptive modeler

fms

还原持仓的对象每一个账户都要详细分析分析每个账户什么时候买入卖出还有不买入不卖出观望的那些人所有人都要还原

实盘接口

tools for high frequency trading low latency trading tool

高频交易工具 低延迟交易工具

要抢涨停板网络必须要好下单速度要快

joinquant

easytrader

实盘易支持多个客户端服务端要钱的 sdk都是服务端的client

tools

mytt 通达信公式转换器

funcat 通达信公式转换器

models and frameworks

general reinforcement learning:

https://github.com/DLR-RM/stable-baselines3

crypto trading bot, support all crypto trading markets:

https://github.com/freqtrade/freqtrade

qlib by microsoft, quantatitive financial analysis:

https://github.com/microsoft/qlib

reinforcement financial deep learning package:

https://github.com/AI4Finance-Foundation/FinRL

openbb_terminal:

https://github.com/OpenBB-finance/OpenBBTerminal

zipline

https://github.com/quantopian/zipline

pyalgotrade

https://github.com/gbeced/pyalgotrade

quantaxis:

https://github.com/yutiansut/QUANTAXIS

vn.py:

https://github.com/vnpy/vnpy

https://www.vnpy.com/docs/

talib:

https://github.com/mrjbq7/ta-lib

https://www.programcreek.com/python/example/92322/talib.EMA?msclkid=425d0f6cb5dd11ec9da2a03aa72194cd

superalgos:

https://github.com/Superalgos/Superalgos

https://superalgos.org

2022-06-09

bilibili 生活区up 培训

NLP NLG Packages

questgen.ai:

generate question from essay, imitate interaction

增加观众互动性生成问题

question answering question generator

甲骨 jiagu nlp包 provided by ownthink:

https://github.com/ownthink/Jiagu

中文分词
词性标注
命名实体识别
知识图谱关系抽取
关键词提取
文本摘要
新词发现
情感分析
文本聚类

haystack:

nlp framework

neural search neural text search

semantic search

summarization

question answering

snownlp:

chinese segmentation, pinyin, sentiment analysis (情感分析), word tags, keywords, summary, tf-idf similarity, classification, 繁体转简体

Search Engines DIY

my custom search engine built upon thesaurus/synonyms/antenyms, fzf and grep

RETRO retrieval based attention net, though using faiss, unclear if it is search related. on page 8 of the paper there are different retrieval based models for selections. LDA (topic modeling) can assist search by discovering similar topics.

download nltk data here. when downloading manually, beware of the url path and id, so you would put things in order.

you would patch nltk in order to download via proxy. these data files are hosted on github assets.

check keyword urlopen and filedownloder.py under /data/data/com.termux/files/usr/lib/python3.10/site-packages/nltk

maybe you can explore further with online search engines? select your keyword then search again.

thesaurus will slow down things. make it into a preprocessor.

related shits can be found here

search engine optimization

advertools

zinc search

markuplm markup language model used for feature rich information extraction, webqa, arxiv paper: reading wikipedia to answer open domain questions

zinc search, go implementation of elastic search alternative

I bet there are many many alternatives. even for a relational database or graph database it can be a search engine by its nature.

how the heck can i search my own notes? slice it into little segments? standard excerpt included.

search for search engine in github.

search engines are related to spiders/crawlers.

how to utilize these search engines is a problem/challenge. use url filters, generic extractors, readbility.js, summarizers like sumy.

many specialized search engines that can search image, video and audio. one example is Jina

semantic search tool, multimedia search tool, neural search tool

https://github.com/searxng/searxng

parse popular search engine results like baidu, bing:

https://github.com/bisohns/search-engine-parser

search and scrape news

https://github.com/01joy/news-search-engine

image search engine

https://github.com/matsui528/sis

search engines used by hackers, social engineering, onion sites:

https://github.com/edoardottt/awesome-hacker-search-engines

search engine with customized recommendation:

https://github.com/mtianyan/FunpySpiderSearchEngine

seo tools 百度下拉词获取推荐词相关词

https://github.com/marcobiedermann/search-engine-optimization

a self-hosted search engine that can be deployed on heroku, google alike:

https://github.com/benbusby/whoogle-search

txtai:

semantic search tool

pip3 install txtai

using sentence-transformer models from huggingface sentence embedding

https://github.com/neuml/txtai

yacy:

distributed search engine circumvent censorship

provide rss feeds

searx:

meta search engine self-hosted

has third-party hosted searx websites avaliable:

https://searx.space/ total 83 online(currently)

mwmbl:

distributed crawler central search engine, can be self-hosted

written in python

video search engine:

generate summary from frames

https://github.com/AkshatSh/VideoSearchEngine

yuno:

context based search engine for anime, anime search engine with transformer and deep learning. text based search. more like a semantic search tool, or neural search tool.

Yuno is a context based search engine that indexes over 0.5 million anime reviews and other anime informations. To help you find anime with specific properties. This search engine will help people of r/AnimeSuggest who are looking for specific type of anime to watch.

This search engine was created to solve the problem of finding an object with specific properties and the object in this case is anime. But this search engine can be easily extended to any domain like books,movies,etc. Without the need of any kind of handcrafted dataset.

TypeSense:

dedicated client for every popular programminhg language

consume much fewer ram than meilisearch

need to write custom web interface via nodejs

upload data via client api

MeiliSearch:

good for small dataset

consume whoopy 900mb for my 9mb json dataset.

has intuitive web interface.

upload document via web post.

2022-06-07

Hardware For Fun Moment Capturing

A head-mounted coaxis camera, coaxis long-range microphone, buffered record mode, two separate buttons for recording and saving, implementation in micropython. large battery.

Blog of James Brown

2022-06-10

Bilibili Up主入站了解运作机制

bilibili up主入站了解运作机制开篇动画

2022-06-09

Understanding The Psychology Of Gaming And Its Impact On Community Formation

Game Player’s Logic

2022-06-09

Jumpcut Analysis, Social Media Marketing, Blackhat Seo

malware, hacking

referral spam

keyword stuffing

scrape and rinse content

private blog networks

jumpcut courses

2022-06-09

Source Code Semantic Search Tool

Source code semantic search tool audit tool

2022-06-09

Remote Jobs

2022-06-09

Quantatitive Financial Stock Market Analysis

quantative financial stock market analysis

数据来源

模型建立

实盘接口

tools

models and frameworks

2022-06-09

bilibili 生活区up 培训

2022-06-09

Nlp Packages

NLP NLG Packages

2022-06-08

Search Engines

Search Engines DIY

my custom search engine built upon thesaurus/synonyms/antenyms, fzf and grep

search engine optimization

2022-06-07

Hardware For Fun Moment Capturing

Links

Blog of James Brown

2022-06-10 Bilibili Up主 入站了解 运作机制

bilibili up主 入站了解 运作机制 开篇动画

2022-06-09 Understanding The Psychology Of Gaming And Its Impact On Community Formation

Game Player’s Logic

2022-06-09 Jumpcut Analysis, Social Media Marketing, Blackhat Seo

malware, hacking

referral spam

keyword stuffing

open graph metadata tag, easy for sharing/advertising webpages to social media

scrape and rinse content

private blog networks

social media automation tools

jumpcut courses

2022-06-09 Source Code Semantic Search Tool

Source code semantic search tool audit tool

2022-06-09 Remote Jobs

2022-06-09 Quantatitive Financial Stock Market Analysis

quantative financial stock market analysis

数据来源

模型建立

实盘接口

tools

models and frameworks

2022-06-09 bilibili 生活区up 培训

2022-06-09 Nlp Packages

NLP NLG Packages

2022-06-08 Search Engines

Search Engines DIY

my custom search engine built upon thesaurus/synonyms/antenyms, fzf and grep

search engine optimization

2022-06-07 Hardware For Fun Moment Capturing

Links

2022-06-10

Bilibili Up主入站了解运作机制

bilibili up主入站了解运作机制开篇动画

2022-06-09

Understanding The Psychology Of Gaming And Its Impact On Community Formation

2022-06-09

Jumpcut Analysis, Social Media Marketing, Blackhat Seo

2022-06-09

Source Code Semantic Search Tool

2022-06-09

Remote Jobs

2022-06-09

Quantatitive Financial Stock Market Analysis

2022-06-09

bilibili 生活区up 培训

2022-06-09

Nlp Packages

2022-06-08

Search Engines

2022-06-07

Hardware For Fun Moment Capturing