惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

D
Darknet – Hacking Tools, Hacker News & Cyber Security
V
Vulnerabilities – Threatpost
Cloudbric
Cloudbric
G
GRAHAM CLULEY
S
Securelist
Schneier on Security
Schneier on Security
Help Net Security
Help Net Security
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
Project Zero
Project Zero
Spread Privacy
Spread Privacy
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
Cisco Talos Blog
Cisco Talos Blog
T
Tailwind CSS Blog
博客园_首页
有赞技术团队
有赞技术团队
Simon Willison's Weblog
Simon Willison's Weblog
Stack Overflow Blog
Stack Overflow Blog
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
Latest news
Latest news
T
Tor Project blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Attack and Defense Labs
Attack and Defense Labs
www.infosecurity-magazine.com
www.infosecurity-magazine.com
O
OpenAI News
J
Java Code Geeks
T
Tenable Blog
K
Kaspersky official blog
AWS News Blog
AWS News Blog
S
Security @ Cisco Blogs
The GitHub Blog
The GitHub Blog
T
Threatpost
月光博客
月光博客
H
Heimdal Security Blog
Security Latest
Security Latest
The Hacker News
The Hacker News
Y
Y Combinator Blog
A
Arctic Wolf
Apple Machine Learning Research
Apple Machine Learning Research
C
Cisco Blogs
美团技术团队
Microsoft Security Blog
Microsoft Security Blog
Hugging Face - Blog
Hugging Face - Blog
T
The Blog of Author Tim Ferriss
C
CERT Recently Published Vulnerability Notes
D
Docker
Google Online Security Blog
Google Online Security Blog
D
DataBreaches.Net
V
Visual Studio Blog
H
Help Net Security

Bear docs

Private blogs Favicons and logos Multi-language blog Comments Roadmap Not seeing your post? Code of Conduct Upgrading from subscription to lifetime Neat Bear features
RSS Subscriber analytics
hidden (docs · 2025-02-19 · via Bear docs

Bear docs

With the dramatic rise of LLM scraper bots and other data collection services, the RSS and Atom feeds of blogs on all platforms, including Bear, are being mercilessly scraped.

These bots do not identify themselves as scraper bots, but as browsers and RSS readers. Unfortunately there is no way (that I'm currently aware of1) to determine whether this is a legitimate RSS reader or a bot since the IP addresses change with each request. This suggests that the bots are part of a broader bot network (or many bot networks) and completely ignore robots.txt.

Since RSS readers cannot execute Javascript or CSS, I can't present a challenge or captcha on the feeds since this will block both legitimate and bot traffic.

This has lead to the RSS subscriber count in analytics being completely incorrect, since each bot IP address is earmarked as a unique RSS subscriber during a 24 hour period.

This is compounded by the fact that large RSS reader platforms like Feedly do one request for all of their users subscribed to a specific feed, essentially logging many subscribers as 1.

Due to these reasons I've opted to remove the RSS subscriber count from the analytics dashboard.

Herman

  1. If you have any ideas on how to identify scraper bots vs legitimate RSS readers, please send me an email.↩