Shubham Toshniwal

Final year PhD Student
Toyota Technological Institute at Chicago (TTIC)
Advisor: Kevin Gimpel, Karen Livescu

Contact Details

6045 S. Kenwood Ave., Chicago, IL
shtoshni at ttic dot edu
Google Scholar

Happy Me !!
Grand Canyon 2021

I am a final year graduate student working with Kevin Gimpel and Karen Livescu at Toyota Technological Institute at Chicago (TTIC). I also work closely with Sam Wiseman, and Allyson Ettinger.

Before TTIC, I spent two years as a software engineer in IBM Research, New Delhi. And even before that I did my undergrad in Computer Science from Indian Institute of Technology Kanpur.

My research is focused on natural language understanding, specifically on tracking the world state described in text, with particular focus on entities. This problem is essential to language comprehension, especially for long documents where understanding these abstractions are key to situating far apart evidences. We have used memory models for this task in recent work [1, 2]. We also worked on a fun project on language modeling for chess where the chess board is the world, and the chess pieces are the entities in this world [3]. We're currently working on integrating ideas from these two threads to endow language models with "built-in" probing capabilities with regards to entity tracking.

I have also worked at the intersection of speech and text understanding [4, 5, 6, 7], and end-to-end speech recognition [8, 9, 10].


  • I will be joining FAIR as a Research Scientist this coming January


Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks
Shubham Toshniwal, Sam Wiseman, Allyson Ettinger, Karen Livescu, Kevin Gimpel
EMNLP 2020 (short paper)
[Code] [Colab] [LitBank Data in HTML] [Slides] [Talk] [bib]

PeTra: A Sparsely Supervised Memory Model for People Tracking
Shubham Toshniwal, Allyson Ettinger, Kevin Gimpel, Karen Livescu
ACL 2020
[Code] [Colab] [Slides] [Talk] [bib]

A Cross-Task Analysis of Text Span Representations
Shubham Toshniwal, Haoyue Shi, Bowen Shi, Lingyu Gao, Karen Livescu, Kevin Gimpel
RepL4NLP 2020
[Code] [Slides] [Talk] [bib]

Pre-trained Text Embeddings for Enhanced Text-to-Speech Synthesis
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Shubham Toshniwal, Karen Livescu
Interspeech 2019

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Shubham Toshniwal, Anjuli Kannan, Chung-Cheng Chiu, Yonghui Wu, Tara N Sainath, Karen Livescu
IEEE Workshop on Spoken Language Technology (SLT), 2018

Hierarchical Multitask Learning for CTC-based Speech Recognition
Kalpesh Krishna, Shubham Toshniwal, Karen Livescu

Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information
Trang Tran*, Shubham Toshniwal*, Mohit Bansal, Kevin Gimpel, Karen Livescu, Mari Ostendorf
NAACL HLT 2018 (Oral)
[Code] [bib]

Multilingual Speech Recognition With A Single End-To-End Model
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro Moreno, Eugene Weinstein, Kanishka Rao
ICASSP 2018 (Oral)
[Slides] [Blog] [bib]

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Shubham Toshniwal, Hao Tang, Liang Lu, Karen Livescu
Interspeech 2017 (Oral)
[Slides] [Summary] [bib]

Jointly learning to align and convert graphemes to phonemes with neural attention models
Shubham Toshniwal, Karen Livescu
IEEE Workshop on Spoken Language Technology (SLT), 2016
[Code] [Poster] [bib]

USHER: An Intelligent Tour Companion
Shubham Toshniwal, Parikshit Sharma, Saurabh Srivastava, Richa Sehgal
International Conference on Intelligent User Interfaces (IUI) 2015
[MIT Technology Review]



Visitor counts

Started this counter in August 2020 out of curiosity :)

Flag Counter