본문으로 건너뛰기

#Speech Recognition

10개의 포스트

[논문리뷰] NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations

댓글 수 로딩 중

[논문리뷰] DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition

댓글 수 로딩 중

[논문리뷰] MiDashengLM: Efficient Audio Understanding with General Audio Captions

댓글 수 로딩 중

[논문리뷰] POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

댓글 수 로딩 중

[논문리뷰] HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition

댓글 수 로딩 중

[논문리뷰] RIR-Mega: a large-scale simulated room impulse response dataset for machine learning and room acoustics modeling

댓글 수 로딩 중

[논문리뷰] Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap

댓글 수 로딩 중