Dongyan Zhao中文


Wangxuan Institute of Computer Technology
Peking University
Office: No.128 North Zhongguancun Street, Haidian District, Beijing 100080, P. R. of China
Phone: (86) 10-82529252
Email: zhaodongyan AT pku DOT edu DOT cn 

I am a Professor in Wangxuan Institute of Computer Technology (WICT), Peking University (PKU), China. I received B.S., M.S. and Ph.D. in Computer Science from Department of Computer Science and Technology of PKU in 1991, 1994 and 2000 respectively. 

Research Biography

My major research interests include Natural Language Processing, Semantic Data Management and Knowledge-based Intelligent System. Recently, I am interested in several research topics including Information Extraction, Knowledge Graph, Question Answering & Reading Comprehension, Dialogue System and Knowledge-based Intelligence applications. 

As a distinguished member of China Computer Federation (CCF), I'm the secretary-general of CCF TCCI (Technical Committee on Chinese Information Technology, renamed as Technical Committe on Natural Language Processing in 2020) from 2010 to 2019, member of CCF Task Force on Big Data and member of CCF Network and Data Communications, as well as a senior member of CIPS Social Media Processing Committee.

In recent years, while undertaking 15 national and 8 provincial/ministerial scientific research projects (include National Natural Science Foundation of China, National Hi-Tech Project, etc.), I am/was the PI in 11 projects, including 7 national research projects.

Having published 100 referred papers (more than 60 of them are top ranked by CCF, such as ACL, AAAI, IJCAI, KDD, WWW, SIGMOD, VLDB; AI, TODS, VLDB Journal, TKDE etc.), obtained 20 patents, I won 7 official awards in national and provincial/ministerial level, including National Awards of Scientific and Technological Process, Second Prize (Ranking as the first).

I also won China Youth Science and Technology Award (2007) and Special Award of Technological Innovation titled "Honor of Science and Technology" by Beijing Municipal Government (2007).

Research Group: Web Information Processing Lab


Selected Publications (From 2017 only)


Refereed Journal Papers:

  • Peng Peng, Lei Zou, Lei Chen, Dongyan Zhao: Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload. IEEE Trans. Knowl. Data Eng. 31(4): 670-685 (2019) (CCF Rank A)
  • Liwei Chen, Yansong Feng, Songfang Huang, Bingfeng Luo, Dongyan Zhao: Encoding implicit relation requirements for relation extraction: A joint inference approach,Artificial Intelligence 265: 45-66 (2018) (CCF Rank A)
  • Peng Peng, Lei Zou, Zhenqin Du, Dongyan Zhao: Using partial evaluation in holistic subgraph search. Frontiers Comput. Sci. 12(5): 966-983 (2018)
  • Yanyan Jia, Yansong Feng, Yuan Ye, Chao Lv, Dongyan Zhao, Improve Discourse Parsing with Two-Step Neural Transition-Based Model, ACM Transactions on Asian and Low-Resource Language Information Processing, TALLIP 17(2): 11:1-11:21 (2018)
  • Sen Hu, Lei Zou, Jeffrey Xu Yu, Haixun Wang, Dongyan Zhao: Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs. IEEE Trans. Knowl. Data Eng. TKDE 30(5): 824-837 (2018) (CCF Rank A)
  • Youhuan Li, Lei Zou, Huaming Zhang, Dongyan Zhao: Longest Increasing Subsequence Computation over Streaming Sequences. IEEE Trans. Knowl. Data Eng. TKDE 30(6): 1036-1049 (2018) (CCF Rank A)
  • Tao Chen, Mansheng Li, Qiang He, Lei Zou, Youhuan Li, Cheng Chang, Dongyan Zhao, Yunping Zhu: LiverWiki: a wiki-based database for human liver. BMC Bioinformatics 18(1): 452:1-452:11 (2017)
  • Weiguo Zheng, Lei Zou, Lei Chen, Dongyan Zhao: Efficient SimRank-based Similarity Join, ACM Transactions on Database Systems, TODS 42(3): 16:1-16:37 (2017) (CCF Rank A)

Conference Papers:


  • Zhenxin Fu, Shaobo Cui, Mingyue Shang, Feng Ji, Dongyan Zhao, Haiqing Chen, Rui Yan: Context-to-Session Matching: Utilizing Whole Session for Response Selection in Information-Seeking Dialogue Systems. KDD 2020: 1605-1613(CCF Rank A)
  • Lisong Qiu, Yingwai Shiu, Pingping Lin, Ruihua Song, Yue Liu, Dongyan Zhao, Rui Yan: What If Bots Feel Moods? SIGIR 2020: 1161-1170(CCF Rank A)
  • Chongyang Tao, Wei Wu, Yansong Feng, Dongyan Zhao, Rui Yan: Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection. SIGIR 2020: 1865-1868(CCF Rank A)
  • Zhenxin Fu, Yu Wu, Hailei Zhang, Yichuan Hu, Dongyan Zhao, Rui Yan: Be Aware of the Hot Zone: A Warning System of Hazard Area Prediction to Intervene Novel Coronavirus COVID-19 Outbreak. SIGIR 2020: 2241-2250(CCF Rank A)
  • Zechang Li, Yuxuan Lai, Yansong Feng, Dongyan Zhao: Domain Adaptation for Semantic Parsing. IJCAI 2020: 3723-3729(CCF Rank A)
  • Shen Gao, Xiuying Chen, Zhaochun Ren, Dongyan Zhao, Rui Yan: From Standard Summarization to New Tasks and Beyond: Summarization with Manifold Information. IJCAI 2020: 4854-4860(CCF Rank A)
  • Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Dongyan Zhao: Neighborhood Matching Network for Entity Alignment. ACL 2020: 6477-6487(CCF Rank A)
  • Xueliang Zhao, Wei Wu, Chongyang Tao, Can Xu, Dongyan Zhao, Rui Yan: Low-Resource Knowledge-Grounded Dialogue Generation. ICLR 2020
  • Shen Gao, Xiuying Chen, Chang Liu, Li Liu, Dongyan Zhao, Rui Yan: Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog. WWW 2020: 1138-1148(CCF Rank A)
  • Danyang Liu, Juntao Li, Meng-Hsuan Yu, Ziming Huang, Gongshen Liu, Dongyan Zhao, Rui Yan: A Character-Centric Neural Model for Automated Story Generation. AAAI 2020: 1725-1732(CCF Rank A)
  • Meng-Hsuan Yu, Juntao Li, Danyang Liu, Bo Tang, Haisong Zhang, Dongyan Zhao, Rui Yan: Draft and Edit: Automatic Storytelling Through Multi-Pass Hierarchical Conditional Variational Autoencoder. AAAI 2020: 1741-1748(CCF Rank A)
  • Juntao Li, Chang Liu, Jian Wang, Lidong Bing, Hongsong Li, Xiaozhong Liu, Dongyan Zhao, Rui Yan: Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce. AAAI 2020: 8212-8219(CCF Rank A)
  • Yuan Ye, Yansong Feng, Bingfeng Luo, Yuxuan Lai, Dongyan Zhao: Integrating Relation Constraints with Neural Relation Extractors. AAAI 2020: 9442-9449(CCF Rank A)
  • Xiuying Chen, Daorui Xiao, Shen Gao, Guojun Liu, Wei Lin, Bo Zheng, Dongyan Zhao, Rui Yan: RPM-Oriented Query Rewriting Framework for E-commerce Keyword-Based Sponsored Search (Student Abstract). AAAI 2020: 13769-13770(CCF Rank A)
  • Jie Wang, Zhenxin Fu, Moxin Li, Haisong Zhang, Dongyan Zhao, Rui Yan: Learning Sense Representation from Word Representation for Unsupervised Word Sense Disambiguation (Student Abstract). AAAI 2020: 13947-13948(CCF Rank A)
  • Youhuan Li, Lei Zou, M. Tamer Özsu, Dongyan Zhao: Time Constrained Continuous Subgraph Search Over Streaming Graphs. ICDE 2019: 1082-1093(CCF Rank A)
  • Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Dongyan Zhao: Jointly Learning Entity and Relation Representations for Entity Alignment. EMNLP/IJCNLP (1) 2019: 240-249
  • Jia Li, Chongyang Tao, Wei Wu, Yansong Feng, Dongyan Zhao, Rui Yan: Sampling Matters! An Empirical Study of Negative Sampling Strategies for Learning of Matching Models in Retrieval-based Dialogue Systems. EMNLP/IJCNLP (1) 2019: 1291-1296
  • Ran Le, Wenpeng Hu, Mingyue Shang, Zhenjun You, Lidong Bing, Dongyan Zhao, Rui Yan: Who Is Speaking to Whom? Learning to Identify Utterance Addressee in Multi-Party Conversations. EMNLP/IJCNLP (1) 2019: 1909-1919
  • Zhangming Chan, Juntao Li, Xiaopeng Yang, Xiuying Chen, Wenpeng Hu, Dongyan Zhao and Rui Yan: Modeling Personalization in Continuous Space for Response Generation via Augmented Wasserstein Autoencoders. EMNLP /IJCNLP (1) 2019: 1931-1940
  • Jizhi Tang, Yansong Feng, Dongyan Zhao: Learning to Update Knowledge Graph by Reading News. EMNLP /IJCNLP (1) 2019: 2632-2641
  • Shen Gao, Xiuying Chen, Piji Li, Zhangming Chan, Dongyan Zhao, Rui Yan: How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing. EMNLP /IJCNLP (1) 2019: 3739-3749
  • Mingyue Shang, Piji Li, Zhenxin Fu, Lidong Bing, Dongyan Zhao, Shuming Shi, Rui Yan: Semi-supervised Text Style Transfer: Cross Projection in Latent Space. EMNLP /IJCNLP (1) 2019: 4936-4945
  • Zhangming Chan, Xiuying Chen, Yongliang Wang, Juntao Li, Zhiqiang Zhang, Kun Gai, Dongyan Zhao, Rui Yan: Stick to the Facts: Learning towards a Fidelity-oriented E-Commerce Product Description Generation. EMNLP /IJCNLP (1) 2019: 4958-4967
  • Ran Le, Wenpeng Hu, Yang Song, Tao Zhang, Dongyan Zhao, Rui Yan: Towards Effective and Interpretable Person-Job Fitting. CIKM 2019: 1883-1892
  • Zhenxin Fu, Feng Ji, Wenpeng Hu, Wei Zhou, Dongyan Zhao, Haiqing Chen, Rui Yan: Query-bag Matching with Mutual Coverage for Information-seeking Conversations in E-commerce. CIKM 2019: 2337-2340
  • Jia Li, Chongyang Tao, Nanyun Peng, Wei Wu, Dongyan Zhao, Rui Yan: Evaluating and Enhancing the Robustness of Retrieval-based Dialogue System with Adversarial Examples. NLPCC 2019(1): 142-154
  • Yang Lin, Pengyu Huang, Yuxuan Lai, Yansong Feng, Dongyan Zhao: Evidence Distilling for Fact Extraction and Verification. NLPCC 2019(1): 211-222
  • Feiyu Xu, Hans Uszkoreit, Yangzhou Du, Wei Fan, Dongyan Zhao, Jun Zhu: Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges. NLPCC 2019(2): 563-574
  • Zechang Li, Yuxuan Lai, Yuxi Xie, Yansong Feng, Dongyan Zhao: A Sketch-Based System for Semantic Parsing. NLPCC 2019(2): 748-759
  • Zhengwei Tao, Waiman Si, Juntao Li, Dongyan Zhao, Rui Yan: Boosting Variational Generative Model via Condition Enhancing and Lexical-Editing. PRICAI2019(1): 377-391
  • Rui Yan, Ran Le, Yang Song, Tao Zhang, Xiangliang Zhang, Dongyan Zhao: Interview Choice Reveals Your Preference on the Market: To Improve Job-Resume Matching through Profiling Memories. KDD 2019: 914-922 (CCF Rank A)
  • Xiuying Chen, Zhangming Chan, Shen Gao, Meng-Hsuan Yu, Dongyan Zhao, Rui Yan: Learning towards Abstractive Timeline Summarization. IJCAI 2019: 4939-4945 (CCF Rank A)
  • Wenpeng Hu, Zhangming Chan, Bing Liu, Dongyan Zhao, Jinwen Ma, Rui Yan: A Graph-structured Neural Network for Dialogue Systems. IJCAI 2019: 5010-5016 (CCF Rank A)
  • Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, Dongyan Zhao: Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs. IJCAI 2019: 5278-5284 (CCF Rank A)
  • Xueliang Zhao, Chongyang Tao, Wei Wu, Can Xu, Dongyan Zhao, Rui Yan: A Document-grounded Matching Network for Response Selection in Retrieval-based Chatbots. IJCAI 2019: 5443-5449 (CCF Rank A)
  • Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, Rui Yan: One Time of Interaction May Not Be Enough: Go Deep with an Interaction-over-Interaction Network for Response Selection in Dialogues. ACL 2019(1): 1-11 (CCF Rank A)
  • Jiazhan Feng, Chongyang Tao, Wei Wu, Yansong Feng, Dongyan Zhao, Rui Yan: Learning a Matching Model with Co-teaching for Multi-turn Response Selection in Retrieval-based Dialogue Systems. ACL 2019(1): 3805-3815 (CCF Rank A)
  • Lisong Qiu, Juntao Li, Wei Bi, Dongyan Zhao, Rui Yan: Are Training Samples Correlated? Learning to Generate Dialogue Responses with Multiple References. ACL 2019(1): 3826-3835 (CCF Rank A)
  • Wenpeng Hu, Zhou Lin, Bing Liu, Chongyang Tao, Zhengwei Tao, Jinwen Ma, Dongyan Zhao, Rui Yan: Overcoming Catastrophic Forgetting via Model Adaptation. ICLR 2019 (Poster)
  • Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, Rui Yan: Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. WSDM 2019: 267-275
  • Shen Gao, Zhaochun Ren, Yihong Zhao, Dongyan Zhao, Dawei Yin, Rui Yan: Product-Aware Answer Generation in E-Commerce Question-Answering. WSDM 2019: 429-437
  • Juntao Li, Lidong Bing, Lisong Qiu, Dongmin Chen, Dongyan Zhao, Rui Yan: Learning to Write Stories with Thematic Consistency and Wording Novelty. AAAI 2019: 1715-1722 (CCF Rank A)
  • Shen Gao, Zhaochun Ren, Yihong Eric Zhao, Dongyan Zhao, Dawei Yin, Rui Yan: Product-Aware Answer Generation in E-Commerce Question-Answering. AAAI 2019: 6399-6406 (CCF Rank A)
  • Yuxuan Lai, Yansong Feng, Xiaohan Yu, Zheng Wang, Kun Xu, Dongyan Zhao: Lattice CNNs for Matching Based Chinese Question Answering. AAAI 2019: 6634-6641 (CCF Rank A)
  • Juntao Li, Lisong Qiu, Bo Tang, Dongmin Chen, Dongyan Zhao, Rui Yan: Insufficient Data Can Also Rock! Learning to Converse Using Smaller Data with Augmentation. AAAI 2019: 6698-6705 (CCF Rank A)
  • Lili Yao, Nanyun Peng, Ralph Weischedel, Kevin Knight, Dongyan Zhao, Rui Yan: Plan-and-Write: Towards Better Automatic Storytelling. AAAI 2019: 7378-7385 (CCF Rank A)
  • Mingyue Shang, Zhenxin Fu, Hongzhi Yin, Bo Tang, Dongyan Zhao, Rui Yan: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test? AAAI 2019: 10031-10032 (Student Abstracts)
  • Feifan Fan, Yansong Feng, Dongyan Zhao: Multi-grained Attention Network for Aspect-Level Sentiment Classification. EMNLP 2018: 3433-3442
  • Juntao Li, Yan Song, Haisong Zhang, Dongmin Chen, Shuming Shi, Dongyan Zhao, Rui Yan:
    Generating Classical Chinese Poems via Conditional Variational Autoencoder and Adversarial Training. EMNLP 2018: 3890-3900
  • Xiuying Chen, Shen Gao, Chongyang Tao, Yan Song, Dongyan Zhao, Rui Yan: Iterative Document Representation Learning Towards Summarization with Polishing. EMNLP 2018: 4088-4097
  • Haisong Zhang, Zhangming Chan, Yan Song, Dongyan Zhao, Rui Yan: When Less Is More: Using Less Context Information to Generate Better Utterances in Group Conversations. NLPCC (1) 2018: 76-84
  • Rui Yan, Dongyan Zhao: Coupled Context Modeling for Deep Chit-Chat: Towards Conversations between Human and Computer. KDD 2018: 2574-2583 (CCF Rank A)
  • Sen Hu, Lei Zou, Jeffrey Xu Yu, Haixun Wang, Dongyan Zhao: Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs (Extended Abstract). ICDE 2018: 1815-1816. (CCF Rank A)
  • Yansong Feng, Songfang Huang, Dongyan Zhao, Rui Yan, Bingfeng Luo, Zheng Wang:
    Marrying Up Regular Expressions with Neural Networks: A Case Study for Spoken Language Understanding. ACL (1) 2018: 2083-2093. (CCF Rank A)
  • Yanyan Jia, Yuan Ye, Yansong Feng, Yuxuan Lai, Rui Yan, Dongyan Zhao: Modeling discourse cohesion for discourse parsing via memory network. ACL (2) 2018: 438-443. (CCF Rank A)
  • Mingyue Shang, Zhenxin Fu, Nanyun Peng, Yansong Feng, Dongyan Zhao, Rui Yan: Learning to Converse with Noisy Data: Generation with Calibration. IJCAI 2018: 4338-4344. (CCF Rank A)
  • Yiping Song, Cheng-Te Li, Jian-Yun Nie, Ming Zhang, Dongyan Zhao, Rui Yan: An Ensemble of Retrieval-Based and Generation-Based Human-Computer Conversation Systems. IJCAI 2018: 4382-4388. (CCF Rank A)
  • Chongyang Tao, Shen Gao, Mingyue Shang, Wei Wu, Dongyan Zhao, Rui Yan: Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism. IJCAI 2018: 4418-4424 (CCF Rank A)
  • Xiaowei Tong, Zhenxin Fu, Mingyue Shang, Dongyan Zhao, Rui Yan: One "Ruler" for All Languages: Multi-Lingual Dialogue Evaluation with Adversarial Multi-Task Learning. IJCAI 2018: 4432-4438. (CCF Rank A)
  • Rui Yan, Dongyan Zhao: Smarter Response with Proactive Suggestion: A New Generative Neural Conversation Paradigm. IJCAI 2018: 4525-4531. (CCF Rank A)
  • Peng Peng, Lei Zou, M. Tamer Özsu, Dongyan Zhao: Multi-query Optimization in Federated RDF Systems. DASFAA (1) 2018: 745-765. (Best Paper)
  • Rui Yan, Dongyan Zhao: A NeuRetrieval Model for Human-Computer Conversations. WWW 2018: 305-312(Companion Volume) (CCF Rank A)
  • Ying Zeng, Yansong Feng, Rong Ma, Zheng Wang, Rui Yan, Chongde Shi, Dongyan Zhao, Scale Up Event Extraction Learning via Automatic Training Data Generation, AAAI 2018: pp6045-6052. (CCF Rank A)
  • Chongyang Tao, Lili Mou, Dongyan Zhao, Rui Yan. RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems, AAAI 2018: pp722-729. (CCF Rank A)
  • Zhenxin Fu, Xiaoye Tan, Nanyun Peng, Dongyan Zhao, Rui Yan. Style Transfer in Text: Exploration and Evaluation, AAAI 2018: pp663-670. (CCF Rank A)
  • Yiping Song, Rui Yan, Yansong Feng, Yaoyuan Zhang, Dongyan Zhao, Ming Zhang. Towards a Neural Conversation Model with Diversity Net Using Determinantal Point Processes. AAAI 2018: pp 5932-5939. (CCF Rank A)
  • Yiping Song, Dongyan Zhao, Ming Zhang, Rui Yan. Diversifying Neural Conversation Model with Maximal Marginal Relevance. IJCNLP(2) 2017: 169-174
  • Jizhi Tang, Chao Lv, Lili Yao, Dongyan Zhao: PKUICST at TREC 2017 Real-Time Summarization Track: Push Notifications and Email Digest. TREC 2017
  • Ying Zeng, Yansong Feng, Dongyan Zhao: WIP Event Detection System at TAC KBP 2017 Event Nugget Track. TAC 2017
  • Xinyi Lin, Rui Yan, Dongyan Zhao. A Hybrid Optimization Framework Fusing Word- and Sentence-Level Information for Extractive Summarization. NLPCC 2017: 124-135
  • Shuo Han, Lei Zou, Jeffrey Xu Yu, Dongyan Zhao: Keyword Search on RDF Graphs - A Query Graph Assembly Approach. CIKM 2017: 227-236
  • Lili Yao, Yaoyuan Zhang, Yansong Feng, Dongyan Zhao and Rui Yan. Towards Implicit Content-Introducing for Generative Short-Text Conversation Systems. EMNLP 2017: 2190-2199
  • Bingfeng Luo, Yansong Feng, Jianbo Xu, Xiang Zhang and Dongyan Zhao, Learning to Predict Charges for Criminal Cases with Legal Basis, EMNLP 2017: 2727-2736
  • Rui Yan, Dongyan Zhao, Weinan E, Joint Learning of Response Ranking and Next Utterance Suggestion in Human-Computer Conversation System, SIGIR 2017: 685-694 (CCF Rank A)
  • Bingfeng Luo, Yansong Feng, Zheng Wang, Zhanxing Zhu, Songfang Huang, Rui Yan, Dongyan Zhao, Learning with Noise: Enhance Distantly Supervised Relation Extraction with Dynamic Transition Matrix, ACL (1) 2017: 430-439. (CCF Rank A)
  • Zhiliang Tian, Rui Yan, Lili Mou, Yiping Song, Yansong Feng, Dongyan Zhao, How to Make Contexts More Useful? An Empirical Study to Context-Aware Neural Conversation Models, ACL (2) 2017: 231-236. (CCF Rank A)​

More publications data can be accessed by the following websites: