惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

阮一峰的网络日志
阮一峰的网络日志
D
Darknet – Hacking Tools, Hacker News & Cyber Security
S
Schneier on Security
The Last Watchdog
The Last Watchdog
Cyberwarzone
Cyberwarzone
S
Securelist
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cyber Attacks, Cyber Crime and Cyber Security
L
Lohrmann on Cybersecurity
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
博客园 - 司徒正美
The Cloudflare Blog
V
V2EX
博客园_首页
博客园 - 聂微东
Vercel News
Vercel News
人人都是产品经理
人人都是产品经理
G
GRAHAM CLULEY
T
Tenable Blog
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
L
LINUX DO - 最新话题
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
SecWiki News
SecWiki News
博客园 - 三生石上(FineUI控件)
S
Secure Thoughts
N
News | PayPal Newsroom
T
The Blog of Author Tim Ferriss
The GitHub Blog
The GitHub Blog
T
Troy Hunt's Blog
博客园 - 【当耐特】
Forbes - Security
Forbes - Security
H
Hacker News: Front Page
A
About on SuperTechFans
B
Blog RSS Feed
Engineering at Meta
Engineering at Meta
MongoDB | Blog
MongoDB | Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
罗磊的独立博客
D
DataBreaches.Net
P
Privacy & Cybersecurity Law Blog
Schneier on Security
Schneier on Security
Application and Cybersecurity Blog
Application and Cybersecurity Blog
Google DeepMind News
Google DeepMind News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Jina AI
Jina AI
D
Docker
P
Proofpoint News Feed

CIFAR

CIFAR researchers Pablo Jarillo-Herrero and Allan MacDonald awarded 2026 Kavli Prize in Nanoscience CIFAR researchers win prestigious sustainability award for study on climate-resilient farming Statement on Canada’s National AI Strategy CIFAR report charts bold research directions for the future of the Arctic Government of Canada and CIFAR announce $24M investment in top AI Talent De-risking and de-mystifying large-scale AI CIFAR earns national and global recognition for impact storytelling - CIFAR The next generation of research leaders starts here: Meet CIFAR’s newest Global Scholars “the House that Pigs built:” CIFAR researchers unveil how pig-derived materials shape everyday life The new frontier of AI and neurology CIFAR awards over $1M to support sociotechnical challenges in AI safety Advancing lifelong health: How CIFAR’s partnership with Manulife drives breakthrough research CIFAR’s Gilles Brassard and Charles H. Bennett receive 2025 ACM A.M. Turing Award for pioneering quantum information science Privacy by Design CIFAR announces research program decisions for 2025-2026 CIFAR and Mitacs partner to attract top next-gen talent to Canada
CIFAR commits $1M to global AI safety initiative
Justine Brooks · 2026-02-20 · via CIFAR

Four new Canadian research projects join the UK-led international AI Alignment Project where they will receive up to $165K per year

Four new research projects have been funded under the Canadian AI Safety Institute (CAISI) Research Program at CIFAR. As part of the AI Alignment Project, an international funding coalition led by the UK AI Security Institute, this initiative supports groundbreaking work in the field of AI alignment, ensuring advanced AI systems remain safe, secure and beneficial to society. 

The selected projects, leveraging fields like game theory, statistics and physics, will receive $165,000 for one year (with the possibility of an extension for a second year), alongside specialized compute resources and expert support to bridge the gap between AI’s rapid development and the safety frameworks needed to govern it. 

The addition of these new initiatives brings the total number of research projects at the CAISI Research Program at CIFAR up to sixteen, joining existing Catalyst Projects and Solution Networks in their efforts to drive high-impact AI safety research and implement practical solutions to the benefit of all. The impact of these projects and the overall program was recently highlighted in our 2025 Year in Review: Building Safe AI for Canadians report. 

“The AI Alignment Project is an important step in furthering Canada’s long history of leadership in driving AI research that is safe and trustworthy. As AI becomes increasingly present in our lives, it is more important than ever to ensure it is aligned with our values and serves the public good. By investing in the work of these Canadian researchers, we are building long-term economic resilience while cementing Canada’s position as a global leader in responsible AI development and deployment.”

— The Honourable Evan Solomon, Minister of Artificial Intelligence and Digital Innovation and Minister Responsible for the Federal Economic Development Agency for Southern Ontario

“The AI Alignment Project represents an opportunity to build on Canada’s strong ties with our international partners and work towards a common goal of furthering AI safety research. The selected projects cover a variety of key alignment issues that deserve our immediate attention and will contribute to ensuring AI systems are safe, trustworthy and interpretable, benefiting Canadians and beyond.” 

Catherine Régis and Nicolas Papernot, co-directors of the CAISI Research Program at CIFAR.

Game-theoretic safety guarantees for advanced AI systems

Zhijing Jin (Canada CIFAR AI Chair, Vector Institute, University of Toronto)

As information systems become increasingly AI-centric and autonomous, traditional security frameworks no longer adequately address questions of safety, control and privacy, especially in situations where multiple AI agents collaborate autonomously. Canada CIFAR AI Chair Zhijing Jin proposes using game theory, a robust theoretical framework, to provide provable guarantees to mitigate misaligned behaviours and offer concrete tools for policymakers and AI developers to maintain control in multi-agent scenarios.

Sample-efficient online fine-tuning against resistant behaviors: statistical foundations for post-training alignment

Linglong Kong (Canada CIFAR AI Chair, Amii, University of Alberta)

Modern AI systems deployed in the real world often develop emergent misalignment (e.g., reward hacking, deceptive alignment) after deployment, an internal behavioural failure that causes them to deviate from their intended goals. Canada CIFAR AI Chair Linglong Kong proposes a statistical framework for sample-efficient online fine-tuning to establish whether corrective training can serve as a trustworthy safety mechanism or whether more fundamental safeguards are required.

Scaling laws, data distributions, and learning dynamics: simulated high-energy physics data as a benchmark for data in the wild

Yonatan Kahn (University of Toronto)

Current theoretical AI research often relies on overly simple data modeling, making it difficult to answer fundamental questions about scaling laws or whether models truly learn underlying latent parameters. University of Toronto Professor Yonatan Kahn proposes a method of physics-based data generation that will provide ‘ground-truth information’ to researchers, allowing them to predict and test some of the fundamental questions around how AI learns, generalizes and scales.

A unified statistical framework for quantifying rare event risks for language models

Bei Jiang (Canada CIFAR AI Chair, Amii, University of Alberta)

One of the central challenges of AI alignment is quantifying extremely small failure rates – probabilities so low that ordinary testing will never observe them. ‘Long-tail failures,’ like jailbreaks, policy evasion or subtle safety violations, are where oversight is weakest, making it impossible to compare models, set safety standards or measure model regression. Canada CIFAR AI Chair Bei Jiang will address this issue using standard statistical tools designed for these rare-event problems, which are currently underused in large language model evaluation.