Deep Dive: AI

When hackers take on AI: Sci-fi – or the future?
2022-08-30 · via Deep Dive: AI

osi deep dive episode 3


Because we lack a fundamental understanding of the internal mechanisms of current AI models, it is hard to predict what they will do when they encounter situations outside their training data. Today’s guest has a few theories about what might happen, with potentially catastrophic results. Tuning in, you’ll hear from Connor Leahy, one of the founders of EleutherAI, a grassroots collective of researchers working to open source AI research. He is also the founder and CEO of Conjecture, a startup researching the interpretability and safety of AI. In today’s episode, Leahy elaborates on some of the technical problems that he and other researchers are running into and the creativity that will be required to solve them. We also look at some of the nefarious ways he sees AI evolving in the future and how he believes computer security hackers could help mitigate these risks without curbing technological progress. We close on an optimistic note, with Leahy encouraging early-career researchers to focus on the ‘massive orchard’ of low-hanging fruit in interpretability and AI safety and sharing his vision for this extremely valuable field of research.

To learn more, don’t miss this conversation with EleutherAI co-founder Connor Leahy! Full transcript.

Key Points From This Episode:

  • The true story of how EleutherAI started as a hobby project during the pandemic.
  • Why Leahy believes that it’s critical that we understand AI technology.
  • The importance of making AI more accessible to those who can do valuable research.
  • What goes into building a large model like this: data, engineering, and computing.
  • Leahy offers some insight into the truly monumental volume of data required to train these models and where it is sourced from.
  • A look at Leahy’s (very specific) perspective on making EleutherAI’s models public.
  • Potential consequences of releasing these models; will they be used for good or evil?
  • Some of the nefarious ways in which Leahy sees AI technology evolving in the future.
  • Mitigating the risks that AI poses; how we can prevent these systems from spinning out of control without curbing progress.
  • Focusing on solvable technical problems to build systems with embedded safeguards.
  • Why Leahy wishes more computer security hackers would work on AI problems.
  • Low-hanging fruit in interpretability and AI safety for early-career researchers.
  • Why Leahy is optimistic about understanding these problems better going forward.
  • The creativity required to come up with new ways of thinking about these problems.
  • In closing, Leahy encourages listeners to take a shot at linear algebra, interpretability, and understanding neural networks.

Links Mentioned in Today’s Episode:

Credits

Special thanks to volunteer producer, Nicole Martinelli. Music by Jason Shaw, Audionautix.

This podcast is sponsored by GitHub, DataStax and Google.

No sponsor had any right or opportunity to approve or disapprove the content of this podcast.

This work is licensed under a Creative Commons Attribution 4.0 International License.

The views expressed in this podcast are the personal views of the speakers and are not the views of their employers, the organizations they are affiliated with, their clients or their customers. The information provided is not legal advice.