1、自然语言生成 (Natural Language Generation, NLG)
研究如何让机器自动生成高质量的文本。
推荐论文:
Large Language Models Are FewShot Learners by Brown et al., 2020
GPT3: Language Models Are FewShot Learners by Brown et al., 2020
2、文本摘要 (Text Summarization)
研究如何将长文档压缩为简洁的摘要。
推荐论文:
A DiscourseAware Neural Abstractive Summarizer by Nallapati et al., 2016
PEGASUS: Pretraining with Extracted Gapsentences for Abstractive Summarization by Zhenzhong et al., 2020
3、情感分析 (Sentiment Analysis)
研究如何识别文本中的情感倾向。
推荐论文:
BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding by Devlin et al., 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach by Liu et al., 2019
4、对话系统 (Dialogue Systems)
研究如何构建能够进行自然对话的AI系统。
推荐论文:
The Wizard of Wikipedia: KnowledgePowered Conversational AI by Dinan et al., 2018
ULMFiT: Universal Language Model FineTuning for Text Classification by Howard & Ruder, 2018
5、知识图谱 (Knowledge Graphs)
研究如何从文本中提取结构化知识。
推荐论文:
Knowledge Graph Embedding: A Survey of Approaches and Applications by Wang et al., 2017
Question Answering over Knowledge Graphs: Question Understanding via Fragment Selection by Dong et al., 2015