惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

阮一峰的网络日志
阮一峰的网络日志
D
Darknet – Hacking Tools, Hacker News & Cyber Security
S
Schneier on Security
The Last Watchdog
The Last Watchdog
Cyberwarzone
Cyberwarzone
S
Securelist
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cyber Attacks, Cyber Crime and Cyber Security
L
Lohrmann on Cybersecurity
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
博客园 - 司徒正美
The Cloudflare Blog
V
V2EX
博客园_首页
博客园 - 聂微东
Vercel News
Vercel News
人人都是产品经理
人人都是产品经理
G
GRAHAM CLULEY
T
Tenable Blog
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
L
LINUX DO - 最新话题
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
SecWiki News
SecWiki News
博客园 - 三生石上(FineUI控件)
S
Secure Thoughts
N
News | PayPal Newsroom
T
The Blog of Author Tim Ferriss
The GitHub Blog
The GitHub Blog
T
Troy Hunt's Blog
博客园 - 【当耐特】
Forbes - Security
Forbes - Security
H
Hacker News: Front Page
A
About on SuperTechFans
B
Blog RSS Feed
Engineering at Meta
Engineering at Meta
MongoDB | Blog
MongoDB | Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
罗磊的独立博客
D
DataBreaches.Net
P
Privacy & Cybersecurity Law Blog
Schneier on Security
Schneier on Security
Application and Cybersecurity Blog
Application and Cybersecurity Blog
Google DeepMind News
Google DeepMind News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Jina AI
Jina AI
D
Docker
P
Proofpoint News Feed

Zara Zhang

To learn anything, first unlearn school Build for one: AI and the age of personal leverage Six lessons I learned from volunteering at the Computer History Museum Beyond the AI tutor: What AI can do for language learning Why I’m not breastfeeding (and feeling awesome about it) 7 habits of highly effective cross-border collaborators Remote work sucks: Why I love going to the office Why I’m not learning to code (and why tech needs more humanists) Why I don’t make long-term plans
Why we built TLDW
Zara Zhang · 2025-10-28 · via Zara Zhang

I’m excited to launch a side project I’ve been building with two friends: TLDW (Too long; didn’t watch), a tool that helps people learn from long YouTube videos.

TLDW was inspired by my own pain points when learning from YouTube.

Over the past year, I’ve noticed a shift in my content consumption habits, where I’ve been spending more and more hours watching long YouTube videos to learn: video podcasts, founder interviews, lectures, tutorials… Many of them are more than 1-hour long and packed with high-value information.

I also started to listen to audio podcasts less and started watching more of the video versions on YouTube. Because:

1. Many of them are very dense in information, and I get easily distracted when listening to audio only. I can focus better when watching the video version.

2. Many of these shows contain visual information like product demos, screen shares, and slides. Also, even watching the speaker’s dynamics and body language can help me digest the content better.

Apparently, many other people feel the same, because YouTube is now the biggest podcasting service in the US. Yep, YouTube, not Apple Podcasts or Spotify.

When I recommended my followers do the same, I noticed that many people didn’t have the attention span to sit through these hour-long videos. Instead, they tend to toss the link to an AI tool (say Gemini or NotebookLM) and get it to generate a text summary.

I don’t like text summaries.

1. Converting video (a high-bandwidth format) into text (a low-bandwidth format) is very problematic. You lose all the rich visual information (demos, slides, speakers’ emotional cues). You also lose lots of details that made this video worth consuming in the first place: the anecdote/example that hits you in the gut, the turn of phrase that makes a line memorable… Many times, it’s not the gist of the video that we remember, but the little details.

2. Text summaries are generic (unless you put in an extremely detailed prompt specifying what you’re looking for, which 99% of people don’t do). Your summary and my summary will look the same. But given the same hour-long video, we will obviously care about different things. An AI researcher watching a talk by Sam Altman might pay attention to parts about model training, whereas a product manager might be more interested in how models are being productized. Summaries should be personalized.

So here’s the dilemma: How do we allow people to consume long-form videos efficiently, while also preserving the fidelity of the original content AND making it a personalized experience?

We felt like the answer lies in this new format: highlight reels.

In the past, AI has mostly been used for “compression”: turning a 1-hour video into a 1-paragraph summary. But what if we use it for “filtering” or “curation” instead? Pick out the 5 minutes that I should pay attention to, but just give me the original clip without watering it down.

This is what gave rise to TLDW’s key feature: personalized highlight reels. We run the video transcript by an LLM which will identify the most high-signal parts of a video. These “highlight reels” are then displayed using different colors on the progress bar, and clicking on each will take you directly to the corresponding timestamp. And this can be personalized, because different viewers care about different things: Say you’re someone working in education, you could type in “future of education”, and the model will highlight all the parts that discussed education in the video.

We believe “personalized highlight reels” are a better solution than “generic text summary”. Since launch, many users said they really like this user interface, and that this is the tool they’ve been waiting for.

In addition, we also have a few features that addressed my own pain points when trying to watch technical videos to learn AI:

1. When you come across a term/jargon you don’t understand, you can select it in the transcript and click “Explain” – the AI will explain it for you in CONTEXT. The key here is “in context”. Before, I used to manually type out these terms in ChatGPT, but ChatGPT didn’t have the context of what I was watching. The meaning of words/phrases changes depending on the context. Because our AI has the whole video transcript in its context window, it’s able to provide a context-specific explanation in one click.

2. When you come across a memorable line, you can select it in the transcript and click “Take notes”, and this will automatically be added to your personal notebook.

3. You can chat with the video transcript. Ask questions such as “what are the juiciest quotes” or “how does the speaker feel about [insert topic]”.

4. All your past videos and notes are saved in one central place, so this could become your personal learning hub.

Things we’re planning to add in the future:

1. Being able to share quotes/clips easily with other people/on social media, so that you can learn in public

2. Multilingual support and adding translation features

3. Enhancing the personalization: the AI could learn about your taste and interests and recommend reels based on those

I worked on TLDW on nights and weekends with 2 amazing partners: Samual Zhang (developer) and Yiqi Yan (designer). This was not our full-time job; we’ve been working on it out of genuine passion for the problem space and a desire to get our hands dirty building in AI. We also open-sourced the code here.

The product is still very early and has a lot of room for improvement, so would appreciate your feedback through emailing me (zara.r.zhang@gmail.com) or tagging me on X (zarazhangrui).

Try TLDW here: tldw.us or see a pre-generated example here.

(This post first appeared on my Substack)

Unknown's avatar

Published by Zara Zhang

Zara Zhang works at ByteDance in its Beijing office. Previously, she was an investment analyst at GGV Capital (first in the Menlo Park office, then in the Beijing office), a venture capital firm that invests in companies in the US, China, and other emerging markets. She has interned as a reporter covering China’s tech industry for The Information. Her writings have been published on The Harvard Crimson, Harvard Magazine, Foreign Policy, Huffington Post, and China Personified. She has also worked as a marketing intern at ZhenFund. Zara graduated from Harvard University Phi Beta Kappa with a degree in psychology. At Harvard, she wrote and edited for The Harvard Crimson, led the organization of Harvard China Forum (a 1,000-people conference featuring leaders from China and the US), and ran a weekly newsletter about food around the university. Zara grew up in Changchun, a city in northeast China, and received her secondary education in Singapore. A language enthusiast, she is trained in Chinese-English interpretation and translation, speaks French and Japanese, and can sing in Cantonese. Zara co-hosted “996”, a podcast where she and GGV managing partner Hans Tung interviewed leaders in US-China cross-border tech and entrepreneurship. Listen on iTunes, Overcast, Spotify, SoundCloud, or search “996” in any podcast app.