bettafish-company/MindSpider/requirements.txt
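老葛 64b94d79f9 feat(xhs): integrate the xhshow library to improve signature generation and request parameters (#330)
* feat(xhs): integrate the xhshow library to improve signature generation and request parameters

- Introduce the xhshow library for Xiaohongshu API signature generation
- Replace the previous seccore_signv2_playwright signing approach
- Support separate signing paths for GET and POST requests
- Add fault tolerance when reading the b1 value from localStorage
- Update the x-t timestamp to millisecond precision
- Add xsec_token and xsec_source parameters to the creator-notes endpoint
- Support passing the verification token and channel source via configuration
- Update the dependency file to include the xhshow library
- Adjust the configuration example to meet the new token parameter requirements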
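
As a rough illustration of two items above (the millisecond x-t timestamp and the optional xsec_token / xsec_source parameters), a minimal sketch might look like the following. The function names, the `user_id` and `cursor` fields, and the dict layout are assumptions made for illustration; they are not taken from this commit or from the xhshow library, and no signing is performed here.

```python
import time
from typing import Dict, Optional


def build_x_t() -> str:
    """Return the current Unix time in milliseconds as a string.

    Hypothetical helper: shows only the millisecond precision, not signing.
    """
    return str(int(time.time() * 1000))


def creator_notes_params(
    user_id: str,
    cursor: str = "",
    xsec_token: Optional[str] = None,
    xsec_source: Optional[str] = None,
) -> Dict[str, str]:
    """Build query parameters, adding the xsec_* fields only when configured.

    The field names user_id and cursor are assumptions for this sketch.
    """
    params = {"user_id": user_id, "cursor": cursor}
    if xsec_token is not None:
        params["xsec_token"] = xsec_token
    if xsec_source is not None:
        params["xsec_source"] = xsec_source
    return params
```

Keeping both xsec_* values optional lets a caller omit them when the configuration does not supply a token.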

* Delete MindSpider/DeepSentimentCrawling/MediaCrawler/config/xhs_config.py

Remove the configuration file

* Add xhs_config.py for Xiaohongshu platform settings

Restore the file that was deleted by mistake

---------

Co-authored-by: gehongbin <gehongbin@autohome.com.cn>
Co-authored-by: Doiiars <doiiars@qq.com>
2025-11-29 14:26:26 +08:00


# MindSpider AI crawler project dependencies
# Core dependencies for the combined BroadTopicExtraction and DeepSentimentCrawling modules
# ===============================
# Database
# ===============================
pymysql==1.1.0
aiomysql==0.2.0
aiosqlite==0.21.0
asyncpg
sqlalchemy
psycopg[binary]
# ===============================
# HTTP requests and networking
# ===============================
httpx==0.28.1
requests==2.32.3
aiofiles~=23.2.1
# ===============================
# Topic extraction module dependencies
# ===============================
openai
pydantic-settings
numpy
pandas==2.2.3
regex
tqdm
python-dateutil
pytz
# ===============================
# MediaCrawler crawler dependencies
# ===============================
Pillow>=10.0.0
playwright==1.45.0
tenacity==8.2.2
opencv-python
redis~=4.6.0
pydantic==2.5.2
fastapi==0.110.2
uvicorn==0.29.0
python-dotenv==1.0.1
jieba==0.42.1
wordcloud==1.9.3
matplotlib==3.9.0
parsel==1.9.1
pyexecjs==1.5.1
typer>=0.12.3
pyhumps==3.8.0
xhshow>=0.1.3
# ===============================
# Utilities
# ===============================
beautifulsoup4
lxml
loguru