Showing 1 - 20 of 53,615

1

Switchable Deep Beamformer for High-quality and Real-time Passive Acoustic Mapping
Yi Zeng ; Jinwei Li ; Hui Zhu ; et al.
Ultrasound in Medicine & Biology. 51:1901-1914

FOS: Computer and inform... Computer Science - Machi... Sound (cs.SD) Artificial Intelligence... Computer Science - Artif... Audio and Speech Process...
Academic journal
Save to List
2

Sparse wavefield reconstruction and denoising with boostlets
Zea, Elias ; Laudato, Marco ; Andén, Joakim
2025 International Conference on Sampling Theory and Applications (SampTA). :1-5

FOS: Computer and inform... Sound (cs.SD) Beräkningsmatematik wavefields Signalbehandling Fluid Mechanics
Academic journal
Save to List
3

Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
Yasuda, Masahiro ; Nguyen, Binh Thien ; Harada, Noboru ; et al.

Computer Science - Sound Electrical Engineering a...
Report
Save to List
4

Joint ASR and Speaker Role Tagging with Serialized Output Training
Xu, Anfeng ; Feng, Tiantian ; Narayanan, Shrikanth

Electrical Engineering a... Computer Science - Sound
Report
Save to List
5

AC/DC: LLM-based Audio Comprehension via Dialogue Continuation
Fujita, Yusuke ; Mizumoto, Tomoya ; Kojima, Atsushi ; et al.

Electrical Engineering a... Computer Science - Compu... Computer Science - Sound
Report
Save to List
6

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs
Futami, Hayato ; Tsunoo, Emiru ; Kashiwagi, Yosuke ; et al.

Computer Science - Compu... Computer Science - Sound Electrical Engineering a...
Report
Save to List
7

Fine-Grained control over Music Generation with Activation Steering
Panda, Dipanshu ; Joe, Jayden Koshy ; R, Harshith M ; et al.

Computer Science - Sound Computer Science - Artif... Electrical Engineering a...
Report
Save to List
8

The 2025 PNPL Competition: Speech Detection and Phoneme Classification in the LibriBrain Dataset
Landau, Gilad ; Özdogan, Miran ; Elvers, Gereon ; et al.
NeurIPS 2025 Competition Track

Computer Science - Machi... Computer Science - Sound Electrical Engineering a...
Report
Save to List
9

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
Wang, Zhenzhi ; Yang, Jiaqi ; Jiang, Jianwen ; et al.

Computer Science - Compu... Computer Science - Artif... Computer Science - Sound
Report
Save to List
10

Training-Free Voice Conversion with Factorized Optimal Transport
Lobashev, Alexander ; Yermekova, Assel ; Larchenko, Maria

Computer Science - Sound Computer Science - Compu... Computer Science - Machi... Electrical Engineering a...
Report
Save to List
11

A Study on Speech Assessment with Visual Cues
Ahmed, Shafique ; Zezario, Ryandhimas E. ; Saleem, Nasir ; et al.

Electrical Engineering a... Computer Science - Sound Electrical Engineering a...
Report
Save to List
12

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
Sudo, Yui ; Fujita, Yusuke ; Kojima, Atsushi ; et al.

Computer Science - Sound Computer Science - Compu... Electrical Engineering a...
Report
Save to List
13

Ming-Omni: A Unified Multimodal Model for Perception and Generation
AI, Inclusion ; Gong, Biao ; Zou, Cheng ; et al.

Computer Science - Artif... Computer Science - Compu... Computer Science - Compu... Computer Science - Machi... Computer Science - Sound Electrical Engineering a...
Report
Save to List
14

A Technique for Isolating Lexically-Independent Phonetic Dependencies in Generative CNNs
Šegedin, Bruno Ferenc

Computer Science - Compu... Computer Science - Sound Electrical Engineering a...
Report
Save to List
15

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research
Attia, Ahmed Adel ; Liu, Jing ; Espy-Wilson, Carl

Computer Science - Sound Computer Science - Artif... Computer Science - Compu... Electrical Engineering a...
Report
Save to List
16

Fractional Fourier Sound Synthesis
Gutiérrez, Esteban ; Cádiz, Rodrigo ; Long, Carlos Sing ; et al.

Computer Science - Sound Electrical Engineering a...
Report
Save to List
17

PHRASED: Phrase Dictionary Biasing for Speech Translation
Wang, Peidong ; Xue, Jian ; Zhao, Rui ; et al.

Computer Science - Compu... Computer Science - Artif... Computer Science - Sound Electrical Engineering a...
Report
Save to List
18

Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?
Törö, Tuukka ; Suni, Antti ; Šimko, Juraj

Computer Science - Compu... Computer Science - Sound Electrical Engineering a...
Report
Save to List
19

Higher-Order Network Representation of J. S. Bach's Solo Violin Sonatas and Partitas: Topological and Geometrical Explorations
Mrad, Dima ; Najem, Sara

Computer Science - Sound Electrical Engineering a... Physics - Physics and So...
Report
Save to List
20

Teaching Physical Awareness to LLMs through Sounds
Wang, Weiguo ; Nie, Andy ; Zhou, Wenrui ; et al.

Computer Science - Sound Computer Science - Artif... Computer Science - Multi... Computer Science - Robot... Electrical Engineering a...
Report
Save to List

Filter