We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions

[ total of 17 entries: 1-17 ]
[ showing up to 25 entries per page: fewer | more ]

Fri, 23 Aug 2019

[1]  arXiv:1908.08160 [pdf]
Title: Sound Localization and Separation in Three-dimensional Space Using a Single Microphone with a Metamaterial Enclosure
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Applied Physics (physics.app-ph)
[2]  arXiv:1908.08044 [pdf, other]
Title: Coarse-to-fine Optimization for Speech Enhancement
Journal-ref: Interspeech 2019
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Thu, 22 Aug 2019

[3]  arXiv:1908.07517 [pdf]
Title: AI for Earth: Rainforest Conservation by Acoustic Surveillance
Comments: Accepted to KDD2019 Workshop on Data Mining and AI for Conservation
Subjects: Sound (cs.SD); Databases (cs.DB); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[4]  arXiv:1908.07750 (cross-list from cs.CV) [pdf, other]
Title: A Realistic Face-to-Face Conversation System based on Deep Neural Networks
Comments: Accepted to ICCV 2019 workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[5]  arXiv:1908.07656 (cross-list from eess.AS) [pdf]
Title: Survey on Deep Neural Networks in Speech and Vision Systems
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Signal Processing (eess.SP); Machine Learning (stat.ML)
[6]  arXiv:1908.07590 (cross-list from cs.IR) [pdf, other]
Title: From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories
Comments: In the Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2019)
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Wed, 21 Aug 2019

[7]  arXiv:1908.07324 [pdf]
Title: A Microphone Array and Voice Algorithm based Smart Hearing Aid
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[8]  arXiv:1908.06969 [pdf, other]
Title: Music Transcription Based on Bayesian Piece-Specific Score Models Capturing Repetitions
Comments: 17 pages, 9 figures, version submitted to IEEE/ACM TASLP
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[9]  arXiv:1908.07409 (cross-list from stat.AP) [pdf, other]
Title: Onset detection: A new approach to QBH system
Comments: 30 pages, 26 figures
Subjects: Applications (stat.AP); Information Retrieval (cs.IR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[10]  arXiv:1908.07107 (cross-list from cs.HC) [pdf, other]
Title: Fuzzy C-Means Clustering and Sonification of HRV Features
Comments: 5 pages, 5 figures
Journal-ref: 2019 the IEEE/ACM 4th International Conference on Connected Health: Applications, Systems and Engineering Technologies: EdgeDL WorkshopAt: Washington, D.C, sep- 25-27
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[11]  arXiv:1908.07045 (cross-list from eess.AS) [pdf, other]
Title: Salient Speech Representations Based on Cloned Networks
Comments: Interspeech 2019
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)

Tue, 20 Aug 2019

[12]  arXiv:1908.06752 [pdf, other]
Title: Towards Generating Ambisonics Using Audio-Visual Cue for Virtual Reality
Comments: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[13]  arXiv:1908.06593 [pdf, other]
Title: Audio query-based music source separation
Comments: 8 pages, 7 figures, Appearing in the proceedings of the 20th International Society for Music Information Retrieval Conference (ISMIR 2019) (camera-ready version)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[14]  arXiv:1908.06468 [pdf, other]
Title: Efficient Context Aggregation for End-to-End Speech Enhancement Using a Densely Connected Convolutional and Recurrent Network
Comments: 5 pages
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[15]  arXiv:1908.06248 [pdf, other]
Title: JVS corpus: free Japanese multi-speaker voice corpus
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)

Mon, 19 Aug 2019

[16]  arXiv:1908.05863 [pdf, other]
Title: Sub-Spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion
Comments: accepted in the 2019 IEEE International Workshop on Signal Processing Systems (SiPS2019)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[17]  arXiv:1908.05743 (cross-list from eess.AS) [pdf, other]
Title: State-of-the-art Speech Recognition using EEG and Towards Decoding of Speech Spectrum From EEG
Comments: Extended version of paper which is under review. arXiv admin note: text overlap with arXiv:1906.08871
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[ total of 17 entries: 1-17 ]
[ showing up to 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 1908, contact, help  (Access key information)