惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News | PayPal Newsroom
Security Archives - TechRepublic
Security Archives - TechRepublic
Hacker News: Ask HN
Hacker News: Ask HN
H
Hacker News: Front Page
Apple Machine Learning Research
Apple Machine Learning Research
TaoSecurity Blog
TaoSecurity Blog
Help Net Security
Help Net Security
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
V
V2EX
Hugging Face - Blog
Hugging Face - Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
人人都是产品经理
人人都是产品经理
博客园 - 三生石上(FineUI控件)
Security Latest
Security Latest
Cloudbric
Cloudbric
WordPress大学
WordPress大学
S
SegmentFault 最新的问题
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
www.infosecurity-magazine.com
www.infosecurity-magazine.com
Know Your Adversary
Know Your Adversary
A
Arctic Wolf
L
LangChain Blog
Application and Cybersecurity Blog
Application and Cybersecurity Blog
The GitHub Blog
The GitHub Blog
P
Proofpoint News Feed
W
WeLiveSecurity
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
M
MIT News - Artificial intelligence
Google DeepMind News
Google DeepMind News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
The Cloudflare Blog
小众软件
小众软件
NISL@THU
NISL@THU
云风的 BLOG
云风的 BLOG
P
Privacy & Cybersecurity Law Blog
S
Security @ Cisco Blogs
博客园 - 【当耐特】
I
InfoQ
Vercel News
Vercel News
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
P
Proofpoint News Feed
O
OpenAI News
Google DeepMind News
Google DeepMind News
N
News and Events Feed by Topic
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
K
Kaspersky official blog
T
Threat Research - Cisco Blogs
量子位
宝玉的分享
宝玉的分享

Bear docs

Private blogs Favicons and logos Multi-language blog Comments Roadmap Not seeing your post? Code of Conduct Upgrading from subscription to lifetime Neat Bear features
RSS Subscriber analytics
hidden (docs · 2025-02-19 · via Bear docs

Bear docs

With the dramatic rise of LLM scraper bots and other data collection services, the RSS and Atom feeds of blogs on all platforms, including Bear, are being mercilessly scraped.

These bots do not identify themselves as scraper bots, but as browsers and RSS readers. Unfortunately there is no way (that I'm currently aware of1) to determine whether this is a legitimate RSS reader or a bot since the IP addresses change with each request. This suggests that the bots are part of a broader bot network (or many bot networks) and completely ignore robots.txt.

Since RSS readers cannot execute Javascript or CSS, I can't present a challenge or captcha on the feeds since this will block both legitimate and bot traffic.

This has lead to the RSS subscriber count in analytics being completely incorrect, since each bot IP address is earmarked as a unique RSS subscriber during a 24 hour period.

This is compounded by the fact that large RSS reader platforms like Feedly do one request for all of their users subscribed to a specific feed, essentially logging many subscribers as 1.

Due to these reasons I've opted to remove the RSS subscriber count from the analytics dashboard.

Herman

  1. If you have any ideas on how to identify scraper bots vs legitimate RSS readers, please send me an email.↩