Zhiyuan Zhang
Latest
Understanding and Improving Layer Normalization. NeurIPS 2019.
Muse: Parallel Multi-scale Attention for Sequence to Sequence Learning. arXiv 2019.