community question answering cqa communities of millions
play

* Community Question Answering (CQA) Communities of millions of - PowerPoint PPT Presentation

* Design of CQA Systems for Flexible and Scalable Deployment and Evaluation Ivan Srba , Maria Bielikova Faculty of Informatics and Information Technologies Slovak University of Technology in Bratislava 6 th June 2016 ICWE 2016, Lugano 2 *


  1. * Design of CQA Systems for Flexible and Scalable Deployment and Evaluation Ivan Srba , Maria Bielikova Faculty of Informatics and Information Technologies Slovak University of Technology in Bratislava 6 th June 2016 ICWE 2016, Lugano

  2. 2 * Community Question Answering (CQA) • Communities of millions of users share their knowledge • by providing answers • on questions asked by the rest of the community • CQA systems have been adapted into additional contexts and environments • Educational domain • Crowd-based customer services, integrated development environments (IDEs) • Initial research how specifics of these new environments affect • Essential features (e.g. core question answering functions) • Collaboration support (e.g. question recommendation)

  3. 3 * Open Problems 1. Low adaptability of essential features to various settings • Possibility to be deployed in several different instances at the same time (e.g. in several educational or enterprise organizations) • How to make design of essential features flexible to handle various settings? 2. Ineffective integration and evaluation of collaboration support methods • Possibility to perform live experiments (very rare in standard CQA systems) • How to achieve loosely coupled integration of collaboration support methods with CQA systems? • How to make combination of offline and online experiments as effective as possible?

  4. 4 * Case Study on Our CQA System Askalot • A novel concept of an organization-wide educational CQA system 1150 users 430 questions 560 answers 410 comments Ruby on Rails Open source

  5. 5 * Case Study on Our CQA System Askalot • The version of Askalot was proposed specifically for our university • … and thus it lacked sufficient flexibility and scalability • We started a cooperation with: 1. Harvard University in order to transform Askalot into a plugin to MOOC system edX 2. University of Lugano in order to deploy at additional universities as a part of cooperation project in the SCOPES program

  6. * Designing Essential Features for Various Settings • Modular system architecture • Adaptable self-managed content organization • Flexible user management integration • Ubiquitous activity awareness and notifications

  7. 7 * Modular System Architecture • Necessity to develop two main configurations of our system • Askalot @university • Askalot @mooc • Solution based on one application and three components (RoR engines) • Shared • core features that are common for both configurations • University and MOOC • inherit all features from the core component and add specialized features

  8. 8 * Adaptable Self-managed Content Organization • Two-level topic structure which supports easy and flexible deployment • Category level • Reflects the formal structure of a university or a MOOC course • Tag level • Selected by asker to describe particular question topics • Categories should • be hierarchical • reflect repeating sessions (i.e. academic years or course sessions) • Solution based on nested set pattern • Hierarchical tree where each node has • domain-specific ID - to identify the same categories across all academic years or course sessions • shareable flag – whether questions from the previous sessions should be displayed also in the current session • askable flag – whether students can ask questions in this category

  9. 9 * • A: Global view loaded from shared component A • B: Unit view loaded from MOOC component B

  10. * Designing Universal Experimental Infrastructure • Modular approach where all methods are loosely coupled from other methods or system itself • Possibility to combine training/evaluation of methods on offline datasets with live experiments

  11. 11 * Three main parts of experimental infrastructure Stack Exchange Dataset Stack Exchange Dataset (XML format) (database) Dataset Conversion Stack Exchange XML-DB Convertor Resources Resources Askalot Event Simulation Job (live system) Event Events Dispatching Askalot Dataset Events Event Dispatcher (database) User/Content Profilers Method Feeders & Evaluators ... Listeners and Profiles User/Question/Answer Profiles

  12. 12 * Part 1: Dataset Conversion • Utilities to convert • any datasets from CQA systems Stack Exchange Dataset Stack Exchange Dataset • to a dedicated experimental (database) (XML format) Dataset database with the same database Conversion Stack Exchange XML-DB Convertor schema as Askalot system Resources • Existing convertors Resources Askalot Event Simulation Job (live system) Event • Stack Exchange datasets Events Dispatching Askalot Dataset Events Event Dispatcher (database) • edX datasets (in progress) User/Content Profilers Method Feeders & Evaluators ... Listeners and Profiles User/Question/Answer Profiles

  13. 13 * Part 2: Event Dispatching • Event is represented by • a resource Stack Exchange Dataset Stack Exchange Dataset • question, answer, comment, view, (database) (XML format) Dataset vote Conversion Stack Exchange • an action type XML-DB Convertor • create, update, delete Resources • an initiator Resources Askalot Event Simulation Job (live system) Event • who performed this action Events Dispatching Askalot Dataset Events Event Dispatcher (database) • Event sources • live system in online experiments User/Content Profilers Method Feeders & Evaluators ... Listeners • datasets in offline experiments and Profiles • either from Askalot itself User/Question/Answer Profiles • or from other CQA systems

  14. 14 * Part 2: Event Dispatching • Event simulation job • selects from the database all Stack Exchange Dataset Stack Exchange Dataset resources (database) (XML format) Dataset • converts them to a list of events Conversion Stack Exchange XML-DB Convertor • sorts events by time when they Resources originally happened Resources Askalot • sets the current time in the Event Simulation Job (live system) Event Events Dispatching experimental environment to this Askalot Dataset Events Event Dispatcher (database) event time • dispatches the event User/Content Profilers Method Feeders & Evaluators ... Listeners • Exact reproduction of events and Profiles • as they would be created by the User/Question/Answer Profiles live system

  15. 15 * Part 3: Listeners and Profiles • Listeners • Profilers - model users/content Stack Exchange Dataset Stack Exchange Dataset • user expertise, question difficulty (database) (XML format) Dataset • Method feeders - trigger and Conversion Stack Exchange XML-DB Convertor evaluate various research Resources methods • recommendation of new Resources Askalot Event Simulation Job (live system) Event questions to potential answerers Events Dispatching Askalot Dataset Events Event Dispatcher (database) • User/question/answer profiles • Universal data structures to store User/Content Profilers Method Feeders & Evaluators ... results of profilers Listeners and • attribute Profiles • value User/Question/Answer Profiles • probability • source

  16. 16 * Conclusion • Drawing upon redesign of CQA system Askalot, we proposed several design recommendations • how concepts of CQA systems can be adapted to an educational context and organizational environment • with achieving high flexibility and scalability • As the result, Askalot can be • deployed in three different environments (and many more if necessary) • characterized also as an open platform based on the universal experimental infrastructure askalot.fiit.stuba.sk/demo ( Try it! ) github.com/AskalotCQA/askalot askalot@fiit.stuba.sk

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend