update: goose3
parent
1ee8f1e9ef
commit
ebb896bd41
|
@ -635,7 +635,7 @@ Python 实现的数据库。
|
|||
* [newspaper](http://hao.importnew.com/python-newspaper/):使用 Python 进行新闻提取,文章提取以及内容策展。[官网](https://github.com/codelucas/newspaper)
|
||||
* opengraph:一个用来解析开放内容协议(Open Graph Protocol)的 Python 模块。[官网](https://github.com/erikriver/opengraph)
|
||||
* [python-goose](http://hao.importnew.com/python-goose/):HTML 内容/文章提取器(python2)。[官网](https://github.com/grangier/python-goose)
|
||||
* [goose3](https://github.com/goose3/goose3): HTML 内容/文章提取器(python3)。[官网](http://goose3.readthedocs.io/en/latest/index.html)
|
||||
* [goose3](http://goose3.readthedocs.io/en/latest/index.html): HTML 内容/文章提取器(python3)。[官网](https://github.com/goose3/goose3)
|
||||
* python-readability:arc90 公司 readability 工具的 Python 高速端口。[官网](https://github.com/buriy/python-readability)
|
||||
* sanitize:为杂乱的数据世界带来调理性。[官网](https://github.com/Alir3z4/python-sanitize)
|
||||
* sumy:一个为文本文件和 HTML 页面进行自动摘要的模块。[官网](https://github.com/miso-belica/sumy)
|
||||
|
|
Loading…
Reference in New Issue