Abstract: Personalizing a speech synthesis system is a highly desired application, where the system can generate speech with the user’s voice with rare enrolled recordings. There are two main ...
Abstract: Expressive text-to-speech (TTS) aims to synthesize speech with varying speaking styles to better reflect human speech patterns. In this study, we attempt to use natural language as a style ...
In this work, we introduce DINOv, a Visual In-Context Prompting framework for referring and generic segmentation tasks. For visualization and demos, we also recommend trying T-Rex demo link, which is ...
remove-circle Internet Archive's in-browser video "theater" requires JavaScript to be enabled. It appears your browser does not have it turned on. Please see your ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results