Treffer: A Technique for Isolating Lexically-Independent Phonetic Dependencies in Generative CNNs

Title:

A Technique for Isolating Lexically-Independent Phonetic Dependencies in Generative CNNs

Authors:

Šegedin, Bruno Ferenc

Publication Year:

2025

Collection:

Computer Science

Subject Terms:

Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Document Type:

Report Working Paper

Access URL:

http://arxiv.org/abs/2506.09218

Accession Number:

edsarx.2506.09218

Database:

arXiv

Weitere Informationen

The ability of deep neural networks (DNNs) to represent phonotactic generalizations derived from lexical learning remains an open question. This study (1) investigates the lexically-invariant generalization capacity of generative convolutional neural networks (CNNs) trained on raw audio waveforms of lexical items and (2) explores the consequences of shrinking the fully-connected layer (FC) bottleneck from 1024 channels to 8 before training. Ultimately, a novel technique for probing a model's lexically-independent generalizations is proposed that works only under the narrow FC bottleneck: generating audio outputs by bypassing the FC and inputting randomized feature maps into the convolutional block. These outputs are equally biased by a phonotactic restriction in training as are outputs generated with the FC. This result shows that the convolutional layers can dynamically generalize phonetic dependencies beyond lexically-constrained configurations learned by the FC.

Treffer: A Technique for Isolating Lexically-Independent Phonetic Dependencies in Generative CNNs

Weitere Informationen

Links

Zusatz-Funktionen