Visualizing and Understanding Deep Learning Models in NLP

Speaker: Jiwei Li

Time:2017-11-09 09:30-11:30

Location: 106 Lecture Hall, Institue of computer science & technology of Peking University

Title:  Visualizing and Understanding Deep Learning Models in NLP

Abstract: A long-standing criticism of neural network models is their lack of interpretability: results for neural models are hard to interpret.In this talk, I will discuss a few attempts trying to rationalize outputs from neural networks. Methods include simply calculating first-order gradients, computing the difference in log likelihood on gold-standard labels when some words are erased, and a more sophisticated model that uses a reinforcement learning model to find the minimal set of words that must be erased to change the model’s decision. 

The proposed attempts offer interpretable explanations for various aspects of neural models such as how words' meaning composes to form higher-level language units such as phrases or sentences; how neural models select and filter important words; and why some models perform better than others. More importantly, they provide efficient tools to conduct error analysis that can be used on different neural architectures across various NLP applications, which have potential to improve the effectiveness of a wide variety of NLP systems.

References:Understanding Neural Networks through Representation Erasure: https://arxiv.org/pdf/1612.08220.pdf Visualizing and Understanding Neural Models in NLP: https://arxiv.org/pdf/1506.01066.pdf

Bio: Jiwei Li got his B.S. in Biology from Peking University (2008-2012) and Ph.D in Computer Science from Stanford University (2014-2017). He was a winner of Facebook Fellowship 2015 and Baidu Fellowship 2016. He works on Natural Language Processing, advised by Prof. Dan Jurafsky. 

Contact us
Tel: 86-10-6275 4420    
Fax: 86-10-6275 4532
Dean MailBox:icst748 at pku.edu.cn
Address:No. 128 Zhongguancun North Street, Haidian District, Beijing, 100871, P. R. China
Links:
WangXuan
FOUNDER
PEKING University
© Copyright 2017 All Rights Reserved
Wangxuan Institute of Computer Technology, Peking University