from textrank4zh import TextRank4Keyword, TextRank4Sentence
content = "" # 这里是python采集下来的content html内容
text = re.sub('<.*?>','',content)
text = re.sub(r'\s','',text)
zy = ''
tr4s = TextRank4Sentence()
tr4s.analyze(text=text, lower=True, source = 'all_filters')
# 可修改num值，设置摘要长度。
for item in tr4s.get_key_sentences(num=10):
zy = zy + item.sentence

2，利用google翻译双向翻译洗稿

之前有接触一个所谓人工智能洗稿的网站小发猫，说的是利用NLP算法进行洗稿，本来我以为洗稿只有同义词替换这个办法。

后来研究了一下小发猫，我首先觉得这个绝对不是利用什么所谓的NLP算法来洗稿，研究了一下发现可能是利用google翻译进行双向翻译，就是先中文翻译英文，然后再拿翻译出来的英文再翻译成中文。

自己也开发了一个这样的伪原创工具，发现其实并不好用。如果不仔细读，这样双向翻译出来的文章还能读，但是仔细读的话。其实语法习惯还有用词根本不准确，甚至有些情况还改变了这句话原有的语义。

does appium have linux accessibility implementation?

windows a11y:

https://github.com/blackrosezy/gui-inspect-tool

pywinauto

bookmark_history_search sucks. it does not include webpage summaries, only title, which makes searching the history very hard. the solution is to use readbility.js to visit and summarize these pages, and update these documents in meilisearch.

a11y is the general term for accessibility, for web browsers like firefox. however, there’s implementation for gnome internally.

linux a11y:

https://github.com/shubhamvasaikar/Auto-GUI-Testing

gnome accessibility toolkit(atk)

https://gitlab.gnome.org/GNOME/pyatspi2

https://gitlab.com/dogtail/dogtail

https://www.freedesktop.org/wiki/Accessibility/PyAtSpi2Example/

accessibility implementation in different toolkits:

https://github.com/GNOME/at-spi2-core/blob/e83d5558d2fbded5b345b0af254f26865e148e49/devel-docs/toolkits.md

Toolkits that use the DBus APIs directly

GTK4

Sources: gtk4/gtk/a11y

Qt5

Sources: qtbase/src/gui/accessible/linux

WebKit

Sources: WebKit/Source/WebCore/accessibility/atspi

Toolkits that use ATK

GTK3

Sources: gtk3/gtk/a11y

gnome-shell / St / via clutter’s cally

Sources: mutter/clutter/clutter/cally

Mozilla Firefox

Sources: gecko-dev/accessible/atk

Chromium

Uses both ATK and libatspi?

Sources:

chromium/ui/accessibility/platform/auralinux (atk)

chromium/ui/accessibility/platform/inspect/auralinux (atspi)

chromium/content/browser/accessibility/auralinux (atspi and atk)

LibreOffice

Sources: LibreOffice/core/vcl/unx/gtk3/a11y

Java Swing - via java-atk-wrapper

Sources: java-atk-wrapper

vercel hosts frontend only apps, could be useful if you want.

可以提取关键词然后到百度必应上面搜索获取相关内容注意语种一致性

search huggingface with julia or python:

huggingface_hub(python)

可以用huggingface的api来翻译对接英文的chatbot (blenderbot, dialo-gpt)

add timeout to these api requests

可以把训练好的中文chatbot放到huggingface上面去用kaggle放

https://github.com/yangjianxin1/GPT2-chitchat

could use this method to generate title for videos. i mean generally.

could host the model on huggingface, or baidu aistudio, heroku or your own machine

configure accelerated inference on huggingface (free for cpu, paid gpu):

https://huggingface.co/docs/api-inference/quicktour

huggingface inference apis:

https://huggingface.co/inference-api

huggingface conversational (chatbot) models:

https://huggingface.co/models?pipeline_tag=conversational&sort=downloads

heroku, use fastapi as interface:

https://fastapi.tiangolo.com

https://www.kaggle.com/getting-started/208405

https://signup.heroku.com

heroku alternatives:

back4app, google app engine

aistudio api, maybe you need to train or find a paddpepaddle based chatbot:

https://ai.baidu.com/ai-doc/AISTUDIO/bk3e382cq#创建在线api服务

一个项目可以创建至多五个沙盒服务, 并选择其中一个沙盒服务部署为线上服务.

沙盒服务如果连续超过24小时无调用将自动调整为暂停状态.

线上服务如果连续超过14天无调用将自动调整为暂停状态.

paddlenlp

https://aistudio.baidu.com/aistudio/projectdetail/3723144?channelType=0&channel=0

paddlepaddle chat model:

plato2

https://github.com/PaddlePaddle/Knover

https://github.com/PaddlePaddle/Knover/tree/develop/projects/PLATO-2

https://aistudio.baidu.com/aistudio/projectdetail/1886227?channelType=0&channel=0

中文chatbot:

https://github.com/zhaoyingjun/chatbot

https://github.com/Dimsmary/Ossas_ChatBot

教程

https://github.com/lcdevelop/ChatBotCourse

https://github.com/fendouai/Awesome-Chatbot

语料库

https://github.com/codemayq/chinese_chatbot_corpus

mysql path:

jdbc:mysql://10.33.163.33:3306/HTS_DB?characterEncoding=UTF-8

root

pipeline@123

Logs Path:

/data/data/Local/DeviceTest/20220406163617_hts_project/resources/HTS/android-hts/logs

under logs:

%Y.%m.%d_%H.%M.%S_

select the latest folder

under selected folder:

device_logcat_test__.txt.gz

decompress using: (before that os.chdir to the selected folder)

gzip -d

does the decompression remove the .gz file?

it will.

log format per line:

DfxTestLog: A1__DfxTestTime = datai=4

from 1 to 13:

A1test1..4

A2test5

A3test1..2

B1test1

B2test4..6

D1test1

M1test1

Tables:

Performance_Baseline_Info

testValue date(%Y-%m-%d) hmsVersion(HMSCore660319) baselineId_id deviceId_id(1,2,4,3,5) deviceType(phone<-1|wearable<-2|car<-4|tv<-3|ecodevice<-5)

Performance_Daily_Data

id features indicators baseValue

Performance_Device_Info

id(to the deviceId_id) model type sn cpu

SEO 蓝海词飙升词竞争度搜索人气转化率成交价（视频长度）

we need suggestion, related topics, also search results.

can be used in title generation.

title/message as query (-> keyword -> suggested query) -> search results -> extract response/title

suggestion, trending topics/keywords

black hat seo, https://www.blackhatworld.com/forums/black-hat-seo.28/

paste your link ‘elsewhere’, submit your link to search engine somehow, visit your link from search engine somehow

seo without website

write a blog on github?

create short links and submit them to search engine

get query count, perform n-gram analysis

https://www.aeripret.com/ngrams-analysis-seo/

https://www.pemavor.com/seo-keyword-clustering-with-python/

i have bookmarked links for further use on macbook chrome.

advertools is a professional SEO library, productivity & analysis tools to scale your online marketing

可以用分析股价的方法分析搜索关键词其中股价对应搜索频率（实时）播放量对应成交量（实时）也可能不对反正这个模型肯定要先收集数据然后再建模画k线当然也不必完全拘泥于全盘还原收集到的数据能反映实际情况得到最优解也就是发个视频预估播放量最大就行用深度学习模型

寻找潜在爆款话题标签

快排参数上首页

https://github.com/sopify-bot/seo

分为主动点击换IP点击

以及优化自身关键词被动优化两种方式

蓝海词可以从零开始做可以由现有词语延伸可以寻找已有的蓝海词

蓝海词是产品关键词的一种，又被称为“零少词”、“长尾词”。具体是指前台具备一定买家搜索热度，但供应商发布产品较少，通常该词下对应的精确匹配产品数量不超过3页，因而同行竞争度较低的关键词。一旦供应商能准确使用这些词语，并能结合信息质量发布一条合格的产品信息，将获得曝光和点击的快速提升

红海泛指竞争相当激烈的市场。在红海中，产业边界是明晰和确定的，游戏的竞争规则是已知的。身处红海的企业试图表现得超过竞争对手，以攫取已知需求下的更大市场份额

淘宝标题撰写技巧：标题流量的3架马车，飙升词+蓝海词+销量卡位词

什么是飙升词？就是在短时间内热度迅速攀升，并且持续上升的词！

蓝海词就是那些搜索热度非常高，但这个词下面的在线产品却很少的词。

这种词可以让我们避免和红海大词竞争，获取很多隐藏流量！

淘宝界面除了能够综合排序之外，我们还能通过销量来排序。

关键词卡位就是寻找点击量和你差不多的视频商品所拥有的关键词语标签这样按照播放量排序的时候就会排到这些视频中间

https://github.com/timesler/facenet-pytorch

https://github.com/JDAI-CV/FaceX-Zoo

https://github.com/justadudewhohacks/face-api.js

https://github.com/cmusatyalab/openface

https://github.com/davidsandberg/facenet

https://github.com/ageitgey/face_recognition

https://github.com/jerry1900/faceRecognition

Topic Generation 话题发现趋势发现热点发现文本分类

bert documentation

https://github.com/MaartenGr/BERTopic

新词发现（可用于挖掘热点热词蓝海词）

https://github.com/zhanzecheng/Chinese_segment_augment

https://github.com/bojone/word-discovery

https://github.com/blmoistawinde/HarvestText

文本分类文本匹配文本检索

https://github.com/lining0806/Naive-Bayes-Classifier

https://github.com/649453932/Bert-Chinese-Text-Classification-Pytorch

https://github.com/gaussic/text-classification-cnn-rnn

https://github.com/yongzhuo/Keras-TextClassification

https://github.com/youthpasses/bayes_classifier

https://github.com/Roshanson/TextInfoExp

https://github.com/aceimnorstuvwxz/toutiao-multilevel-text-classfication-dataset

https://github.com/CementMaker/cnn_lstm_for_text_classify

https://github.com/hellonlp/classifier_multi_label_textcnn

https://github.com/cjymz886/text-cnn

https://github.com/terrifyzhao/bert-utils

https://github.com/649453932/Chinese-Text-Classification-Pytorch

https://github.com/HappyShadowWalker/ChineseTextClassify

https://github.com/XqFeng-Josie/TextCNN

https://github.com/tensorlayer/text-antispam

https://github.com/MachineLP/TextMatch

看看别人的数据来源是什么

知乎神回答知乎同类回答排行榜 github排行榜同类内容

比较视频可以用段落总结关键词来做

free open source animation software for linux, by sourceforge.net

three.js javascript 3d library

typed.js imitate typing animation

anime.js javascript animation engine

synfig 2d vector based animation library

countup.js animate counting up to a number

vivus.js drawing animation imitator

Libreoffice Impress或者其他的动画工具制作视频比如synfig blender three.js

https://ask.libreoffice.org/t/convert-impress-presentation-to-video/33952

https://ask.libreoffice.org/t/how-to-turn-libreoffice-impress-into-video-mp4-format/20589

同样的可以制作冷知识问答的动画视频通过收集百度 bing搜索相关词语如果是问句问题就拿来搜索如果出现了放大版本的句子就收集下来就是回答

Blog of James Brown

2022-07-16

Github Gitee 大文件大型Repo如何上传

2022-07-15

模板创作模式自媒体洗稿

网页转文章

2022-07-15

Pyatspi Dogtail Gnome Accessibility Gui Inspect Tool For Linux A11Y

2022-07-14

复读机 Chatbot

时序数据库

智能问答

近义词

话题建模句向量

2022-07-14

Chatbot, Self-Hosted Model, Cloud Deploy, Cloud Services, Free Website Hosting Service

2022-07-14

Harmonyos Device Log To Mysql

2022-07-14

Seo 蓝海词竞争度

SEO 蓝海词飙升词竞争度搜索人气转化率成交价（视频长度）

seo without website

2022-07-13

人脸识别 Face Recognition

2022-07-13

Topic Generation 话题发现趋势发现热点发现

Topic Generation 话题发现趋势发现热点发现文本分类

2022-07-13

Powerpoint 比较视频制作方法 Animation Software Oss Scriptable Flipcard

Links

Blog of James Brown

2022-07-16 Github Gitee 大文件大型Repo如何上传

2022-07-15 模板创作模式 自媒体 洗稿

网页转文章

2022-07-15 Pyatspi Dogtail Gnome Accessibility Gui Inspect Tool For Linux A11Y

2022-07-14 复读机 Chatbot

时序数据库

智能问答

近义词

话题建模 句向量

2022-07-14 Chatbot, Self-Hosted Model, Cloud Deploy, Cloud Services, Free Website Hosting Service

2022-07-14 Harmonyos Device Log To Mysql

2022-07-14 Seo 蓝海词 竞争度

SEO 蓝海词 飙升词 竞争度 搜索人气 转化率 成交价（视频长度）

seo without website

2022-07-13 人脸识别 Face Recognition

2022-07-13 Topic Generation 话题发现 趋势发现 热点发现

Topic Generation 话题发现 趋势发现 热点发现 文本分类

2022-07-13 Powerpoint 比较视频制作方法 Animation Software Oss Scriptable Flipcard

Links

2022-07-16

Github Gitee 大文件大型Repo如何上传

2022-07-15

模板创作模式自媒体洗稿

2022-07-15

Pyatspi Dogtail Gnome Accessibility Gui Inspect Tool For Linux A11Y

2022-07-14

复读机 Chatbot

话题建模句向量

2022-07-14

Chatbot, Self-Hosted Model, Cloud Deploy, Cloud Services, Free Website Hosting Service

2022-07-14

Harmonyos Device Log To Mysql

2022-07-14

Seo 蓝海词竞争度

SEO 蓝海词飙升词竞争度搜索人气转化率成交价（视频长度）

2022-07-13

人脸识别 Face Recognition

2022-07-13

Topic Generation 话题发现趋势发现热点发现

Topic Generation 话题发现趋势发现热点发现文本分类

2022-07-13

Powerpoint 比较视频制作方法 Animation Software Oss Scriptable Flipcard