惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

人人都是产品经理
人人都是产品经理
S
Secure Thoughts
Recent Announcements
Recent Announcements
B
Blog
博客园_首页
Blog — PlanetScale
Blog — PlanetScale
G
Google Developers Blog
宝玉的分享
宝玉的分享
N
Netflix TechBlog - Medium
WordPress大学
WordPress大学
Help Net Security
Help Net Security
Forbes - Security
Forbes - Security
The Register - Security
The Register - Security
aimingoo的专栏
aimingoo的专栏
L
LINUX DO - 最新话题
T
Tailwind CSS Blog
Google DeepMind News
Google DeepMind News
Recorded Future
Recorded Future
Stack Overflow Blog
Stack Overflow Blog
Webroot Blog
Webroot Blog
P
Privacy International News Feed
H
Help Net Security
PCI Perspectives
PCI Perspectives
Jina AI
Jina AI
K
Kaspersky official blog
B
Blog RSS Feed
T
The Exploit Database - CXSecurity.com
Apple Machine Learning Research
Apple Machine Learning Research
S
Security Affairs
C
Cisco Blogs
云风的 BLOG
云风的 BLOG
C
CERT Recently Published Vulnerability Notes
罗磊的独立博客
P
Palo Alto Networks Blog
F
Fortinet All Blogs
Google DeepMind News
Google DeepMind News
L
Lohrmann on Cybersecurity
Latest news
Latest news
Engineering at Meta
Engineering at Meta
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Microsoft Security Blog
Microsoft Security Blog
博客园 - 司徒正美
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
I
InfoQ
小众软件
小众软件
P
Proofpoint News Feed
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
www.infosecurity-magazine.com
www.infosecurity-magazine.com
MongoDB | Blog
MongoDB | Blog
美团技术团队

cs.SD updates on arXiv.org

Probing Token Spaces under Generator Shift in AI-Generated Music Detection Deep Neural Network for Musical Instrument Recognition using MFCCs Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game Emotion Recognition from Speech based on Relevant Feature and Majority Voting AudioMNIST: Exploring Explainable Artificial Intelligence for Audio Analysis on a Simple Benchmark Singing Style Transfer Using Cycle-Consistent Boundary Equilibrium Generative Adversarial Networks A Predictive Model for Music Based on Learned Interval Representations DNN-HMM based Speaker Adaptive Emotion Recognition using Proposed Epoch and MFCC Features ASR-based Features for Emotion Recognition: A Transfer Learning Approach A Universal Music Translation Network Deep Speech Denoising with Vector Space Projections Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation Speaker-Invariant Training via Adversarial Learning The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus Pop Music Highlighter: Marking the Emotion Keypoints Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs Deep Learning Based Speech Beamforming Deep Predictive Models in Interactive Music Expectation Learning for Adaptive Crossmodal Stimuli Association Neural Style Transfer for Audio Spectograms Learning audio and image representations with bio-inspired trainable feature extractors An analysis of incorporating an external language model into a sequence-to-sequence model Learning to Fuse Music Genres with Generative Adversarial Dual Learning Audio Cover Song Identification using Convolutional Neural Network Deep Neural Networks for Multiple Speaker Detection and Localization HoME: a Household Multimodal Environment Now Playing: Continuous low-power music recognition Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music with LSTMs Deep Within-Class Covariance Analysis for Robust Audio Representation Learning Framework for evaluation of sound event detection in web videos Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention Listening to the World Improves Speech Command Recognition Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning Generating Nontrivial Melodies for Music as a Service Research on several key technologies in practical speech emotion recognition MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment A Categorical Approach for Recognizing Emotional Effects of Music Capturing Long-term Temporal Dependencies with Convolutional Networks for Continuous Emotion Recognition Learning Musical Relations using Gated Autoencoders An Improved Residual LSTM Architecture for Acoustic Modeling Generative Statistical Models with Self-Emergent Grammar of Chord Sequences Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification Learning and Evaluating Musical Features with Deep Autoencoders Monaural Audio Speaker Separation with Source Contrastive Estimation Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders MidiNet: A Convolutional Generative Adversarial Network for Symbolic-domain Music Generation Transfer learning for music classification and regression tasks Note Value Recognition for Piano Transcription Using Markov Random Fields Sound-Word2Vec: Learning Word Representations Grounded in Sounds Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices Lyrics-to-Audio Alignment by Unsupervised Discovery of Repetitive Patterns in Vowel Acoustics Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition SampleRNN: An Unconditional End-to-End Neural Audio Generation Model Imposing higher-level Structure in Polyphonic Music Generation using Convolutional Restricted Boltzmann Machines and Constraints A Unit Selection Methodology for Music Generation Using Deep Neural Networks Algorithmic Songwriting with ALYSIA DeepBach: a Steerable Model for Bach Chorales Generation Learning Filter Banks Using Deep Learning For Acoustic Signals Composing Music with Grammar Argumented Neural Networks and Note-Level Encoding Song From PI: A Musically Plausible Network for Pop Music Generation Maximum entropy models for generation of expressive music Weakly Supervised PLDA Training Decision Making Based on Cohort Scores for Speaker Verification Discovering Sound Concepts and Acoustic Relations In Text Style Imitation and Chord Invention in Polyphonic Music with Exponential Families Inpainting of long audio segments with similarity graphs Explaining Deep Convolutional Neural Networks on Music Classification CaR-FOREST: Joint Classification-Regression Decision Forests for Overlapping Audio Event Detection Fractal Dimension Pattern Based Multiresolution Analysis for Rough Estimator of Person-Dependent Audio Emotion Recognition Label Tree Embeddings for Acoustic Scene Classification Polymetric Rhythmic Feel for a Cognitive Drum Computer The "Horse'' Inside: Seeking Causes Behind the Behaviours of Music Content Analysis Systems Symbolic Music Data Version 1.0 Towards Playlist Generation Algorithms Using RNNs Trained on Within-Track Transitions Audio Event Detection using Weakly Labeled Data An Argument-based Creative Assistant for Harmonic Blending Wavelet Scattering on the Pitch Spiral Sports highlights generation based on acoustic events detection: A rugby case study Emotion Analysis of Songs Based on Lyrical and Audio Features Modeling State-Conditional Observation Distribution using Weighted Stereo Samples for Factorial Speech Processing Models Plagiarism Detection in Polyphonic Music using Monaural Signal Separation Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation Computoser - rule-based, probability-driven algorithmic music composition Automatic Fado Music Classification Music and Vocal Separation Using Multi-Band Modulation Based Features A Stochastic Temporal Model of Polyphonic MIDI Performance with Ornaments Outer-Product Hidden Markov Model and Polyphonic MIDI Score Following Phoneme discrimination using KS algebra I Beyond Markov Chains, Towards Adaptive Memristor Network-based Music Generation A Mixed Graphical Model for Rhythmic Parsing An Approach for Classification of Dysfluent and Fluent Speech Using K-NN And SVM Evolving Musical Counterpoint: The Chronopoint Musical Evolution System An end-to-end machine learning system for harmonic analysis of music On Macroscopic Complexity and Perceptual Coding Particle Filtering on the Audio Localization Manifold Inter Genre Similarity Modelling For Automatic Music Genre Classification
AID: Open-source Anechoic Interferer Dataset
Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets · 2022-08-05 · via cs.SD updates on arXiv.org

A dataset of anechoic recordings of various sound sources encountered in domestic environments is presented. The dataset is intended to be a resource of non-stationary, environmental noise signals that, when convolved with acoustic impulse responses, can be used to simulate complex acoustic scenes. Additionally, a Python library is provided to generate random mixtures of the recordings in the dataset, which can be used as non-stationary interference signals.