惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

S
Schneier on Security
腾讯CDC
N
Netflix TechBlog - Medium
GbyAI
GbyAI
Stack Overflow Blog
Stack Overflow Blog
博客园 - 三生石上(FineUI控件)
Y
Y Combinator Blog
Jina AI
Jina AI
The GitHub Blog
The GitHub Blog
云风的 BLOG
云风的 BLOG
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
U
Unit 42
Vercel News
Vercel News
Recorded Future
Recorded Future
Microsoft Security Blog
Microsoft Security Blog
aimingoo的专栏
aimingoo的专栏
博客园 - 司徒正美
IT之家
IT之家
S
Securelist
T
Tenable Blog
P
Palo Alto Networks Blog
MyScale Blog
MyScale Blog
The Cloudflare Blog
G
Google Developers Blog
Scott Helme
Scott Helme
大猫的无限游戏
大猫的无限游戏
T
Threatpost
L
LINUX DO - 最新话题
雷峰网
雷峰网
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Hugging Face - Blog
Hugging Face - Blog
Recent Announcements
Recent Announcements
The Hacker News
The Hacker News
C
Cyber Attacks, Cyber Crime and Cyber Security
人人都是产品经理
人人都是产品经理
H
Hackread – Cybersecurity News, Data Breaches, AI and More
博客园 - 聂微东
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
P
Privacy International News Feed
Security Latest
Security Latest
Cyberwarzone
Cyberwarzone
F
Fortinet All Blogs
L
LangChain Blog
G
GRAHAM CLULEY
K
Kaspersky official blog
爱范儿
爱范儿
I
Intezer
罗磊的独立博客
B
Blog RSS Feed

博客园 - 郭新晨

人工智能的数学基础 MATLAB 2025b 安装教程 A curated list of awesome voice conversion, projects and communities Automatically generate, translate, and overlay subtitles for any video Automatically generate and overlay subtitles for any video SoftVC VITS Singing Voice Conversion 有手就行!Sovits AI人声模型训练 A curated roadmap based on my 6 years of experience form zero to become a skilled AI Speech Engineer. This roadmap covers everything from fundamentals to cutting-edge AI新宠DocExt:纯本地文档抽取,开源免费还无依赖!你还在为OCR头疼吗? LangExtract万字实战指南:基于LLM文本结构化工具 Get your documents ready for gen AI HMM隐马尔可夫模型的例子、原理、计算和应用 GUI for a Vocal Remover that uses Deep Neural Networks We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence - 郭新晨 开源语音分离工具大比拼:人声 VS 背景音乐 ⚔️ - 获取干净训练语音 (数据截至 2025年4月17日)!!! VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Mel Spectrogram VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram A fork to record speaker output with python. PyAudio with PortAudio for Windows | Extended | Loopback | WASAPI | Latest precompiled Version 头条号爬虫案例 今日头条评论爬虫 - 使用Selenium自动化采集头条文章评论的Python工具 LibriheavyMix - 郭新晨 Open-source datasets and deep learning models for separating sounds 自动抓取 Credential - wxdown 程序版 微信视频号 API SDK 视频号、小程序、抖音、快手、小红书、直播流、m3u8、酷狗、QQ音乐等常见网络资源下载! 微信公众号文章的爬虫 微信公众号文章爬取 CLI 工具 一个用于采集微信公众号文章和数据的轻量级爬虫工具 - 郭新晨 About This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners vsr using gan A curated list of resources for video super-resolution using diffusion models About A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution About Image Super-Resolution for Anime-Style Art Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
郭新晨 · 2026-04-29 · via 博客园 - 郭新晨
https://github.com/MoonshotAI/Kimi-Audio