惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Vercel News
Vercel News
O
OpenAI News
Project Zero
Project Zero
博客园 - 聂微东
博客园 - 叶小钗
V
V2EX
S
Schneier on Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
C
Check Point Blog
Hugging Face - Blog
Hugging Face - Blog
L
LangChain Blog
S
Secure Thoughts
Microsoft Security Blog
Microsoft Security Blog
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
G
GRAHAM CLULEY
Scott Helme
Scott Helme
T
The Exploit Database - CXSecurity.com
博客园 - 司徒正美
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
S
SegmentFault 最新的问题
Simon Willison's Weblog
Simon Willison's Weblog
Hacker News: Ask HN
Hacker News: Ask HN
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
月光博客
月光博客
T
Tailwind CSS Blog
The GitHub Blog
The GitHub Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
N
Netflix TechBlog - Medium
C
Cisco Blogs
Hacker News - Newest:
Hacker News - Newest: "LLM"
腾讯CDC
Blog — PlanetScale
Blog — PlanetScale
D
Docker
D
DataBreaches.Net
WordPress大学
WordPress大学
G
Google Developers Blog
美团技术团队
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
Tor Project blog
H
Help Net Security
罗磊的独立博客
H
Hacker News: Front Page
T
The Blog of Author Tim Ferriss
Jina AI
Jina AI
NISL@THU
NISL@THU
W
WeLiveSecurity
P
Proofpoint News Feed
AWS News Blog
AWS News Blog
MyScale Blog
MyScale Blog

cs.AI updates on arXiv.org

暂无文章

SecureBERT: A Domain-Specific Language Model for Cybersecurity
Ehsan Aghaei, Xi Niu, Waseem Shadid, Ehab Al-Shaer · 2022-04-06 · via cs.AI updates on arXiv.org

Natural Language Processing (NLP) has recently gained wide attention in cybersecurity, particularly in Cyber Threat Intelligence (CTI) and cyber automation. Increased connection and automation have revolutionized the world's economic and cultural infrastructures, while they have introduced risks in terms of cyber attacks. CTI is information that helps cybersecurity analysts make intelligent security decisions, that is often delivered in the form of natural language text, which must be transformed to machine readable format through an automated procedure before it can be used for automated security measures. This paper proposes SecureBERT, a cybersecurity language model capable of capturing text connotations in cybersecurity text (e.g., CTI) and therefore successful in automation for many critical cybersecurity tasks that would otherwise rely on human expertise and time-consuming manual efforts. SecureBERT has been trained using a large corpus of cybersecurity text.To make SecureBERT effective not just in retaining general English understanding, but also when applied to text with cybersecurity implications, we developed a customized tokenizer as well as a method to alter pre-trained weights. The SecureBERT is evaluated using the standard Masked Language Model (MLM) test as well as two additional standard NLP tasks. Our evaluation studies show that SecureBERT\footnote{\url{https://github.com/ehsanaghaei/SecureBERT}} outperforms existing similar models, confirming its capability for solving crucial NLP tasks in cybersecurity.