The 4th International Workshop on Search-Oriented Conversational AI (SCAI)
at IJCAI-19, Macao 🇲🇴, China, August 12
More and more information is found and consumed in conversational form rather than through traditional search engines. Chatbots and personal assistants on our phones and eyes-free devices are increasingly used for a variety of purposes, including information retrieval and exploration. Conversely, information retrieval empowers dialogue systems to answer questions and to gather the context needed to assist users with their tasks. With the recent success of deep learning in different areas of natural language processing, it appears to be the right foundation to power the conversationalization of search. Yet we believe more can be done for the theory and practice of conversation-based search and search-based dialogues.
The aim of this edition of the SCAI workshop is to bring together researchers from the Natural Language Processing (NLP), Artificial Intelligence (AI), and Information Retrieval (IR) communities to investigate future directions of research in the area of search-oriented conversational systems. This installment focuses on broadening participation between research and industry. The previous editions identified a number of related research areas that warrant deeper exploration. To provide a broad forum, we solicit a variety of research and position paper submissions.
The 1st edition of the workshop was co-located with the International Conference on the Theory of Information Retrieval (ICTIR 2017).
The 2nd edition of the workshop was co-located with the Conference on Empirical Methods in Natural Language Processing (EMNLP 2018).
The 3rd edition (special half-day edition) of the workshop was co-located with The Web Conference 2019 (TheWebConf 2019).
Session 1 (08:30–10:00)
Session 2 (10:30–12:30)
- Rajhans Samdani, Ankit Goyal, Pierre Rappolt and Pratyus Patnaik: Practical User Feedback-driven Internal Search Using Online Learning to Rank [paper]
- Vishwajeet Kumar, Ganesh Ramakrishnan and Yuan-Fang Li: A Framework for Automatic Question Generation from Text using Deep Reinforcement Learning [paper] [slides]
Session 3 (14:00–15:30)
- Jiahuan Pei, Arent Stienstra, Julia Kiseleva, and Maarten de Rijke: SEntNet: Source-aware Recurrent Entity Networks for Dialogue Response Selection [paper] [slides] [poster]
- Yangjun Zhang, Pengjie Ren and Maarten de Rijke: Improving Background Based Conversation with Context-aware Knowledge Pre-selection [paper] [slides] [poster]
Session 4 (16:00–18:15)
- Title: Towards scalable multi-domain conversational agents
- Abstract: Task-oriented dialogue systems have become very popular with the advent of virtual assistants such as Google Assistant, Amazon Alexa, and Apple Siri. Such assistants need to support an ever-increasing number of services and APIs, and building a dialogue system that can seamlessly operate across all these services is the holy grail of conversational AI. This talk will highlight some of the challenges and practical considerations associated with building such large-scale virtual assistants and our efforts to address them through our newly released Schema-guided Dialogue dataset. It is the largest publicly available dataset of multi-domain task-oriented dialogues, containing around 16k conversations spanning 16 domains. The methodology used to create this dataset and our ongoing Dialogue System Technology Challenge based on it will also be introduced.
- Title: Towards Building More Intelligent Dialogue Systems: Semantics, Consistency, and Interactiveness
- Abstract: Building open-domain, open-topic dialogue systems (known as chatbots) is one of the most challenging AI tasks due to the difficulties of natural language understanding and the requirements of world knowledge and even semantic reasoning. The speaker will address three fundamental issues in current dialogue systems: semantics, consistency, and interactiveness. As preliminary attempts to address these problems, the speaker will present some recent studies towards building more intelligent dialogue systems: 1) how a dialogue system can behave more interactively via emotion detection and expression, proactive question generation, and sentence function control; 2) how a dialogue system can behave more consistently via explicit personalization given a specific profile; and 3) how a dialogue system can behave more intelligently (semantics) by using commonsense knowledge for language understanding and generation. These attempts are steps toward more intelligent, human-like chatting machines.
- Title: SERP-Based Conversations
- Abstract: How might we convey the information that is traditionally returned by a search engine in the form of a complex search engine result page (SERP) in a meaningful and natural conversation? In this talk I will sketch some of the challenges, some of the solutions to related problems (such as background-based conversations), and some of the ongoing work on this task. Based on joint work with Pengjie Ren, Nikos Voskarides, and Svitlana Vakulenko.
- Title: Integrating ‘hard’ knowledge with neural language models
- Abstract: Neural approaches show impressive performance on many dialogue-related tasks. But when it comes to real-world dialogue agents, we still need to add a lot of ‘hard’ knowledge, expressed not only in the form of databases but also as rules, heuristics, patterns, etc. We consider several ideas for how to integrate these two worlds.
- Title: Multi-modal Reasoning: Bridging Vision and Language
- Abstract: The relentless advancement of deep learning in recent years has brought dramatic performance gains in many applications, and deep learning has since become nearly synonymous with Artificial Intelligence (AI). Nevertheless, the classic scenario embodying AI envisioned by pioneering minds and science fiction authors, a human talking to an accompanying android about the surrounding world, is yet to become a reality. While specialized deep learning models have approached or even surpassed human performance in various well-defined individual tasks in computer vision and natural language processing, bridging the perception of visual information and communication through natural language remains a very challenging task. The speaker will talk about some recent studies in conversational AI at the Media Communications Lab at the University of Southern California. The critical role of multi-modal information will be discussed. Our approaches toward better alignment of modalities in vision-and-language systems, as well as our methodology for inference and extraction for improved performance, will be introduced.
- Title: Clova – Leveraging Search Results to make Conversational Agents Smarter
- Abstract: Clova is an AI platform developed by NAVER and LINE. NAVER has been the most popular search engine in South Korea for over a decade, and LINE is the Japan-based global messenger platform. A smart speaker called Clova Friends, which includes LINE's free voice-call function and infrared home-appliance control, has been launched in Korea and Japan. Clova aims to enable users to access relevant information and control devices intuitively and conveniently. NAVER adapts search result ranking based on implicit user feedback to better meet users' needs. One or more search results for a query are key features in inferring the user's intentions. In addition to search technologies such as integrated search and related-query suggestion, deep learning-based methods are utilized to understand users' intentions more precisely and to improve accuracy quickly. In this talk, I will cover some of the efforts and challenges in matching users' intentions to relevant information, and in connecting users based on shared interests to provide the best way to find information and services. I will introduce the technically challenging problems that we are currently tackling and future AI developments.
- Title: Conversational Agents with Emotion and Personality
- Abstract: For efficient interaction between humans and digital companions, i.e., machine agents, the digital companions need both human-like emotion and personality. We will start by reporting our continuing efforts and recent results in developing human-like emotional conversational agents as part of the Korean National Flagship AI Program. The developed emotional conversational agents engage in emotional dialogue, understand human emotion, and express their own emotions. The emotions of human users are estimated from text, audio, and facial expressions during verbal conversation, and the emotions of the intelligent agents are expressed in speech and facial images. We will first show how our ensemble networks won the Emotion Recognition in the Wild (EmotiW2015) challenge, recognizing seven emotions from facial expressions with 61.6% accuracy. A multimodal classifier then combines text, voice, and facial video for better accuracy. A learning-based Text-to-Speech (TTS) system will also be introduced to express emotions in the dialogue. The emotions of human users and agents interact with each other during the conversation. Our conversational agents have chitchat and Question-and-Answer (Q&A) modes, and the agents respond differently to different emotional states in chitchat mode. We will then discuss our current efforts to incorporate the personalities of both human users and conversational agents. Finally, we will extend emotions and personality towards minds, i.e., brain internal states, which include trustworthiness, implicit intention, memory, and preference. Since few studies are available, we start from hypotheses on the axes of the mind space and conduct cognitive neuroscience experiments to test them. We believe that this mind model is very important for efficient interactions and better bonds between humans and conversational agents.