Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation - 惯性聚合

推荐订阅源

Schneier on Security

Netflix TechBlog - Medium

Stack Overflow Blog

博客园 - 三生石上(FineUI控件)

Y Combinator Blog

The GitHub Blog

钛媒体：引领未来商业与生活新知

Recorded Future

Microsoft Security Blog

aimingoo的专栏

博客园 - 司徒正美

Palo Alto Networks Blog

The Cloudflare Blog

Google Developers Blog

大猫的无限游戏

LINUX DO - 最新话题

Cyber Security Advisories - MS-ISAC

Hugging Face - Blog

Recent Announcements

The Hacker News

Cyber Attacks, Cyber Crime and Cyber Security

人人都是产品经理

Hackread – Cybersecurity News, Data Breaches, AI and More

博客园 - 聂微东

Threat Intelligence Blog | Flashpoint

Know Your Adversary

Privacy International News Feed

Security Latest

Fortinet All Blogs

Kaspersky official blog

罗磊的独立博客

博客园 - 郭新晨

人工智能的数学基础 MATLAB 2025b 安装教程 A curated list of awesome voice conversion, projects and communities Automatically generate, translate, and overlay subtitles for any video Automatically generate and overlay subtitles for any video SoftVC VITS Singing Voice Conversion 有手就行！Sovits AI人声模型训练 A curated roadmap based on my 6 years of experience form zero to become a skilled AI Speech Engineer. This roadmap covers everything from fundamentals to cutting-edge AI新宠DocExt：纯本地文档抽取，开源免费还无依赖！你还在为OCR头疼吗？ LangExtract万字实战指南：基于LLM文本结构化工具 Get your documents ready for gen AI HMM隐马尔可夫模型的例子、原理、计算和应用 GUI for a Vocal Remover that uses Deep Neural Networks We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence - 郭新晨开源语音分离工具大比拼：人声 VS 背景音乐 ⚔️ - 获取干净训练语音 (数据截至 2025年4月17日)！！！ VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Mel Spectrogram VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram A fork to record speaker output with python. PyAudio with PortAudio for Windows | Extended | Loopback | WASAPI | Latest precompiled Version 头条号爬虫案例今日头条评论爬虫 - 使用Selenium自动化采集头条文章评论的Python工具 LibriheavyMix - 郭新晨 Open-source datasets and deep learning models for separating sounds 自动抓取 Credential - wxdown 程序版微信视频号 API SDK 视频号、小程序、抖音、快手、小红书、直播流、m3u8、酷狗、QQ音乐等常见网络资源下载! 微信公众号文章的爬虫微信公众号文章爬取 CLI 工具一个用于采集微信公众号文章和数据的轻量级爬虫工具 - 郭新晨 About This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners vsr using gan A curated list of resources for video super-resolution using diffusion models About A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution About Image Super-Resolution for Anime-Style Art Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

郭新晨 · 2026-04-29 · via 博客园 - 郭新晨

此内容由惯性聚合(RSS阅读器)自动聚合整理，仅供阅读参考。原文来自 — 版权归原作者所有。