ZHISHENG ZHENG's picture

9 7

ZHISHENG ZHENG

zhisheng01

https://zhishengzheng.com/

zhisheng147

AI & ML interests

LLM, Speech and Audio Processing

Organizations

None yet

zhisheng01's activity

upvoted a paper 16 days ago

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Paper • 2408.14886 • Published 23 days ago • 8

upvoted a paper 23 days ago

Language Model Can Listen While Speaking

Paper • 2408.02622 • Published Aug 5 • 37

upvoted 3 papers 2 months ago

Autoregressive Speech Synthesis without Vector Quantization

Paper • 2407.08551 • Published Jul 11 • 13

Video-to-Audio Generation with Hidden Alignment

Paper • 2407.07464 • Published Jul 10 • 16

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4 • 35

upvoted a paper 3 months ago

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

Paper • 2407.02869 • Published Jul 3 • 18

upvoted a paper 4 months ago

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Paper • 2406.02430 • Published Jun 4 • 28

upvoted 2 papers 7 months ago

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27 • 185

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25 • 55