参考文献
1、Brown, T., et al、(2020)、Language Models are FewShot Learners、*Advances in Neural Information Processing Systems*.
2、Devlin, J., et al、(2019)、BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding、*arXiv preprint arXiv:1810.04805*.
3、Vaswani, A., et al、(2017)、Attention is All You Need、*Advances in Neural Information Processing Systems*.