Zhengxiao Du
Understanding Emergent Abilities of Language Models from the Loss Perspective
Recent studies have put into question the belief that emergent abilities in language models are exclusive to large models. This …
Zhengxiao Du, Aohan Zeng, Yuxiao Dong, Jie Tang
Preprint
GLM-130B: An Open Bilingual Pre-Trained Model
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters. It is an attempt to …
Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
Code
Preprint
GPT Understands, Too
While GPTs with traditional fine-tuning fail to achieve strong results on natural language understanding (NLU), we show that GPTs can be …
Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang
Preprint
Code