Yu Zhu "祝宇"
Assistant Professor, Information Systems
University of Delaware, USA

Ph.D., Information Systems
University of Utah, USA
Ph.D., Finance
Zhejiang University, China

My res|

Quicktake: BPE, WordPiece, and SentencePiece

Yu Zhu published on 2023-06-19 included in series

One table to compare popular tokenization methods: BPE, WordPiece, and SentencePiece.

Implementation details: From the original Transformer to GPT

Yu Zhu published on 2023-06-10 included in series

This article comapres the implementation details between the original Transformer and GPT. These tricks are critical to performance but not always explained in the paper.

Transformers (deep learning models) are better at predicting the tail distribution

Yu Zhu published on 2023-05-30 included in series

In a setting of PEAD (post-earnings-announcement-drift) prediction using earnings call transcripts, I found Transformers (deep learning models) have a larger performance lead on extreme data points (data at the tails of the distribution).