Zhengxiao Du
Zhengxiao Du
Home
Experience
Blog
Publications
CV
Contact
Light
Dark
Automatic
3
Scaling Speech-Text Pre-training with Synthetic Interleaved Data
peech language models (SpeechLMs) accept speech input and produce speech output, allowing for more natural human-computer interaction …
Aohan Zeng
,
Zhengxiao Du
,
Mingdao Liu
,
Lei Zhang
,
Shengmin Jiang
,
Yuxiao Dong
,
Jie Tang
Preprint
Understanding Emergent Abilities of Language Models from the Loss Perspective
Recent studies have put into question the belief that emergent abilities in language models are exclusive to large models. This …
Zhengxiao Du
,
Aohan Zeng
,
Yuxiao Dong
,
Jie Tang
Preprint
GLM-130B: An Open Bilingual Pre-Trained Model
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters. It is an attempt to …
Aohan Zeng
,
Xiao Liu
,
Zhengxiao Du
,
Zihan Wang
,
Hanyu Lai
,
Ming Ding
,
Zhuoyi Yang
,
Yifan Xu
,
Wendi Zheng
,
Xiao Xia
,
Weng Lam Tam
,
Zixuan Ma
,
Yufei Xue
,
Jidong Zhai
,
Wenguang Chen
,
Peng Zhang
,
Yuxiao Dong
,
Jie Tang
Code
Preprint
GPT Understands, Too
While GPTs with traditional fine-tuning fail to achieve strong results on natural language understanding (NLU), we show that GPTs can be …
Xiao Liu
,
Yanan Zheng
,
Zhengxiao Du
,
Ming Ding
,
Jiezhong Qiu
,
Zhilin Yang
,
Jie Tang
Preprint
Code
Cite
×