Subjective Databases: Enabling Search by Experience Wang-Chiew Tan - PowerPoint PPT Presentation

Subjective Databases: Enabling Search by Experience Wang-Chiew Tan Megagon Labs EDBT 2019

Megagon Labs Recruit Holdings : A human resources and lifestyle company, 200+ online services. : EDBT 2019

An example hotel query “ Hotels with clean rooms near IST congress center in Lisbon, Portugal.” EDBT 2019

Today’s hotel websites EDBT 2019

EDBT 2019

Voyageur: An Experiential Travel Search Engine . WWW 2019 demonstration screenshot. ● Powered by our Subjective Database engine . EDBT 2019

Today’s hotel search systems ● Exposes as many attributes as they think important. ● Schema is fixed a priori. ● Results are objective: ○ A hotel either satisfies the objective criteria or not. EDBT 2019

Example subjective queries in different domains Hotels : “ Hotels with clean rooms near IST congress center in Lisbon, Portugal.” Restaurant : “ Restaurants which are romantic and decently priced .” Jobs : “ Companies working on cutting edge AI tech. and offers good benefits. ” EDBT 2019

Criteria for search are subjective ● Subjective : based on or influenced by personal feelings, tastes, or opinions. ● J. McAuley and A. Yang. Addressing Complex Subjective Product Related Queries with Customer Reviews . WWW 2016. “ around 20% of [product] queries were labeled as being ‘subjective’ by workers. ” EDBT 2019

Criteria for search are subjective Y.Li, A.Feng, J.Li, S.Mumick, A.Halevy, V.Li, T. Subjective Databases , ArXiv 2019. A.Halevy. The Ubiquity of Subjectivity. IEEE DEB 2019. EDBT 2019

Subjective/objective data and queries EDBT 2019

Subjective queries against subjective data Why is this a hard problem? ● Experiences are subjective and personal. ● Specified in a variety of ways. ○ Often in text, not in a database. ○ Their meanings are often imprecise. ○ Hard to model in a database. EDBT 2019

Subjective Data: Examples EDBT 2019

EDBT 2019

Subjective queries against subjective data Why is this a hard problem? Subjective data … Room is comfortably clean. The continental … Apartment was clean, ... breakfast is OK. ... staff friendly. Pool was adequate. ... ? … showerhead with “Hotels with really many settings, thick … Apartment was clean, clean rooms and is a luxurious towels, … staff friendly. Pool was friendly staff. romantic getaway.” adequate. ... … Apartment was clean, Subjective query staff friendly. Pool was adequate. ... EDBT 2019

The remainder of this talk Y.Li, A.Feng, J.Li, S.Mumick, A.Halevy, V.Li, T. OpineDB Subjective Databases , ArXiv 2019. ● Subjective database model ● Processing subjective database queries ● Building subjective databases ● Concluding remarks ● Demonstration screenshots EDBT 2019

Subjective database schema ● Relation schemas R ( K , A 1 , … , A n ). ● Objective attributes and subjective attributes ○ values are based on facts, indisputable ○ values are influenced by personal beliefs or feelings EDBT 2019

Subjective attributes Hotel (hotelname, capacity, address, price_pn, * room_cleanliness , * bathroom , * service , * comfort ) “ very clean ”, “ pretty clean ”, “ modern ”, “ old style ”, “ dated “ spotless ”, “ average ”, “ stained shower ”, “ recently ● Type of a subjective attribute: a marker summary over a carpet ”, “dirty”, “ quite dirty ”, remodeled”, “modernistic linguistic domain . “ very filthy ”, “ dusty”, “very style”, ... dirty”, “unclean”, ... Linguistic variations Linguistic domains EDBT 2019

Linguistic domain and marker summaries ● Linguistic domain (LD) of an attribute ○ a set of short linguistic variations that describe the attribute. ● Marker ○ a word in the LD ● Marker summary: ○ a set of markers in the LD representative of the LD ● Room_cleanliness[“ very clean ”, “ average ”, “ dirty ”, “ very dirty ”] EDBT 2019

Marker Summaries “rooms are pretty clean” ● Linearly-ordered 0.5 0.5 ○ Markers form a linear-scale. Room_cleanliness[“ very clean ”, “ average ”, “ dirty ”, “ very dirty ”] ○ “ extravagant old-fashioned bathrooms ” ● Categorical 1 1 ○ No two markers of the marker summary form a linear scale. Bathroom[“ old-fashioned ”, “ standard ”, “ modern ”, “ luxurious ”] ○ EDBT 2019

Subjective queries against subjective data Subjective data … Room is comfortably clean. The continental … Apartment was clean, ... breakfast is OK. ... staff friendly. Pool was adequate. ... Subjective database … showerhead with “Hotels with really many settings, thick … Apartment was clean, clean rooms and is a luxurious towels, … staff friendly. Pool was friendly staff. romantic getaway.” adequate. ... … Apartment was clean, Subjective query staff friendly. Pool was adequate. ... EDBT 2019

Subjective queries against subjective data Hotel (hotelname, capacity, address, price_pn, * room_cleanliness , * bathroom , Subjective data * service , * comfort ) … Room is comfortably Marker summaries clean. The continental … Apartment was clean, ... Room_cleanliness breakfast is OK. ... staff friendly. Pool was [ very_clean, average, dirty, very_dirty ] adequate. ... … showerhead with “Hotels with really Bathroom many settings, thick … Apartment was clean, clean rooms and is a [ old, standard, modern, luxurious ] luxurious towels, … staff friendly. Pool was Service friendly staff. romantic getaway.” adequate. ... [ exceptional, good, average, bad, very_bad ] … Apartment was clean, Subjective query Bed staff friendly. Pool was [ very_soft, soft, firm, very_firm, ok, worn_out ] adequate. ... Linguistic domains ... EDBT 2019

Subjective database queries “ Find hotels with cost less than $150 per night, has really clean rooms and is a romantic getaway. ” select * from Hotels where price_pn < 150 and “ has really clean rooms ” and “ is a romantic getaway ” EDBT 2019

Lots of related work (NLP and DB) ● Natural language interfaces to databases ○ Parse natural language into semantic structure (SQL). ○ Parsing objective queries. V. Zhong, C.Xiong, R.Socher. Seq2SQL: Generating structured queries from natural language using reinforcement learning . arXiv 2017. F.Li, H.V.Jagadish. Understanding Natural Language Queries over Relational Databases . SIGMOD Record 2016. A.Simitsis, G.Koutrika, Y. Ioannidis. Précis: from unstructured keywords as queries to structured databases as answers . VLDBJ 2008. Yael Amsterdamer, Anna Kukliansky, Tova Milo: A Natural Language Interface for Querying General and Individual Knowledge . PVLDB 2015. S. Iyer, I. Konstas, A. Cheung, J. Krishnamurthy, L. Zettlemoyer. Learning a neural semantic parser from user feedback . ACL 2017. A.Popescu, O.Etzioni, H.Kautz. Towards a theory of natural language interfaces to databases . IUI 2003. And more! EDBT 2019

Subjective database queries “ Find hotels with cost less than $150 per night, has really clean rooms and is a romantic getaway. ” select * from Hotels where price_pn < 150 and “ has really clean rooms ” and “ is a romantic getaway ” EDBT 2019

Processing subjective database queries select * from Hotels 0.7 0.7 Predicate “ has really clean rooms ” → where price_pn < 150 and “has really clean rooms” and Interpretation room_cleanliness[“very clean”] “is a romantic getaway” “ is a romantic getaway ” → “ has really clean rooms ”, Service[“exceptional”] ⨁ Compute degrees of 0.63 “ is a romantic getaway ” truth for each hotel Bathroom[“luxurious”] 0.82 Query result: Fuzzy aggregation 1. Holiday Hotel 2. Inn Hotel ... EDBT 2019

Predicate interpretation Interpret each predicate into a fuzzy logic expression over attribute markers. select * from Hotels h s elect * from Hotels h where price_pn < 150 where price_pn < 150 ⨂ and h.room_cleanliness ⩬ “really clean” “has really clean rooms” ⨂ and (h.service ⩬ “exceptional” ⨁ “is a romantic getaway” h.bathroom ⩬ “luxurious”) EDBT 2019

Predicate interpretation: The easy case ● Problem : Given a query predicate p , find the marker(s) that best represent p . “has really clean rooms” ? Query predicates match directly to markers. “is a romantic getaway” ? Marker summaries Room_cleanliness [ very_clean, average, dirty, very_dirty ] “ has firm beds ” Bathroom [ old, standard, modern, luxurious ] “ luxurious bathrooms ” Service [ exceptional, good, average, bad, very_bad ] Bed [ very_soft, soft, firm, very_firm, ok, worn_out ] EDBT 2019

Predicate interpretation: The harder case Query predicates have arbitrary phrases. ● Word embedding method: ○ Find variations similar to p based on its word embedding. ● Co-occurrence method: ○ Find a marker whose linguistic variations frequently co-occur with p in the reviews. ● When all else fails … text-retrieval method. EDBT 2019

Predicate interpretation: word embedding method ● Find best semantically matching variations to p . ○ p = query predicate, w2v( w ) = word vector of w , ○ idf( w ) = inverse document frequency of w in the review corpus. ○ Interpretation: corresponding marker of q with highest similarity score to p above a certain threshold. EDBT 2019

Subjective Databases: Enabling Search by Experience Wang-Chiew Tan - PowerPoint PPT Presentation

Subjective Databases: Enabling Search by Experience Wang-Chiew Tan Megagon Labs EDBT 2019 Megagon Labs Recruit Holdings : A human resources and lifestyle company, 200+ online services. : EDBT 2019 An example hotel query Hotels with

Three measures of subjective social status Subjective measures of social Centers (1949):

Creating Databases and Tables Introduction to Databases in Python Creating Databases

Inductive Inductive Inductive Inductive Databases Databases Databases Databases and

Lecture 11: Persistent Memory Databases 1 / 71 Persistent Memory Databases Recap

Module 3: Creating and Managing Databases Overview Creating Databases Creating

Generic Generic and Subjective and Subjective Assisting Assisting Conversational Conversational

Models for Inexact Reasoning Reasoning with Subjective Pseudo Reasoning with Subjective Pseudo

Subjective Expected Utility Tommaso Denti March 8, 2015 We will go over Savages subjective

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

T T T The CDO Blueprint: Enabling the he CDO Blueprint: Enabling the he CDO Blueprint:

GEMS/Food Databases and GEMS/Food Databases and GEMS/Food Databases and in the Food Supply

Image Databases Image Databases Image Databases Prof. Paolo Ciaccia Prof. Paolo Ciaccia

Lecture 10: Larger-than-Memory Databases 1 / 53 Larger-than-Memory Databases Recap

Databases and PHP Accessing databases from PHP PHP & Databases l PHP can connect to

3. Text and document databases Normal databases: formatted records; document databases:

Indexing Multimedia Multimedia Databases Databases Indexing Indexing Multimedia Databases

Pushing A Boulder Uphill Leading Organizational Change in an Entrenched Environment Presenters

Multi-Platform SCADA GUI Regression Testing at CERN Paul Burkimsher Manuel Gonzalez-Berges

I. Defining Prejudice Prejudice is the spoiled fruit that grows from the cognitive vine. It

Recent Developments in False Claims Act Litigation Introduction Thomas M. Greene Enacted in

Outline American Foundation for Suicide Preventions Interactive Screening Program

Engaging the Public through Content Marketing Welcome! The webinar will begin at 2:00 p.m. CT

Elastic Electron Scattering for Proton Charge Radius Determination Toshimi Suda Tohoku University

Embodiment 4-15-16 Good old-fashioned artificial intelligence In the early days of AI, symbol

Sambuz

Useful Links

Newsletter

Mail Us

Subjective Databases: Enabling Search by Experience Wang-Chiew Tan - PowerPoint PPT Presentation

Subjective Databases: Enabling Search by Experience Wang-Chiew Tan Megagon Labs EDBT 2019 Megagon Labs Recruit Holdings : A human resources and lifestyle company, 200+ online services. : EDBT 2019 An example hotel query Hotels with

Three measures of subjective social status Subjective measures of social Centers (1949):

Creating Databases and Tables Introduction to Databases in Python Creating Databases

Inductive Inductive Inductive Inductive Databases Databases Databases Databases and

Lecture 11: Persistent Memory Databases 1 / 71 Persistent Memory Databases Recap

Module 3: Creating and Managing Databases Overview Creating Databases Creating

Generic Generic and Subjective and Subjective Assisting Assisting Conversational Conversational

Models for Inexact Reasoning Reasoning with Subjective Pseudo Reasoning with Subjective Pseudo

Subjective Expected Utility Tommaso Denti March 8, 2015 We will go over Savages subjective

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

T T T The CDO Blueprint: Enabling the he CDO Blueprint: Enabling the he CDO Blueprint:

GEMS/Food Databases and GEMS/Food Databases and GEMS/Food Databases and in the Food Supply

Image Databases Image Databases Image Databases Prof. Paolo Ciaccia Prof. Paolo Ciaccia

Lecture 10: Larger-than-Memory Databases 1 / 53 Larger-than-Memory Databases Recap

Databases and PHP Accessing databases from PHP PHP &amp; Databases l PHP can connect to

3. Text and document databases Normal databases: formatted records; document databases:

Indexing Multimedia Multimedia Databases Databases Indexing Indexing Multimedia Databases

Pushing A Boulder Uphill Leading Organizational Change in an Entrenched Environment Presenters

Multi-Platform SCADA GUI Regression Testing at CERN Paul Burkimsher Manuel Gonzalez-Berges

I. Defining Prejudice Prejudice is the spoiled fruit that grows from the cognitive vine. It

Recent Developments in False Claims Act Litigation Introduction Thomas M. Greene Enacted in

Outline American Foundation for Suicide Preventions Interactive Screening Program

Engaging the Public through Content Marketing Welcome! The webinar will begin at 2:00 p.m. CT

Elastic Electron Scattering for Proton Charge Radius Determination Toshimi Suda Tohoku University

Embodiment 4-15-16 Good old-fashioned artificial intelligence In the early days of AI, symbol

Sambuz

Useful Links

Newsletter

Mail Us

Databases and PHP Accessing databases from PHP PHP & Databases l PHP can connect to