TorchAudio: Building Blocks for Audio and Speech Processing TorchAudio Examples

Liquid Audio - Speech-to-Speech models

We present LFM2-Audio-1.5B, Liquid AI's first end-to-end audio foundation model. Built with low-latency in mind, the lightweight LFM2 backbone enables real time speech-to-speech conversations without ...

GitHub

Kokoro Web - Free AI Text to Speech

Kokoro Web is powered by hexgrad/Kokoro-82M, an open-weight 82 million parameter Text-to-Speech model available on Hugging Face. Despite its lightweight architecture, it delivers comparable quality to ...

IEEE

What Are They Doing? Joint Audio-Speech Co-Reasoning

Abstract: In audio and speech processing, tasks usually focus on either the audio or speech modality, even when both sounds and human speech are present in the same audio clip. Recent Auditory Large ...

deseret

‘We built it for them’: New Logan Institute of Religion building opens for tours

Once housing the second institute of religion established by The Church of Jesus Christ of Latter-day Saints, the old Logan Institute of Religion building long-pioneered a space where young adult ...

IEEE

AudioSetCaps: An Enriched Audio-Caption Dataset Using Automated Generation Pipeline With Large Audio and Language Models

Abstract: With the emergence of audio-language models, constructing large-scale paired audio-language datasets has become essential yet challenging for model development, primarily due to the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results