CMSC 676 Personalized Information Retrieval Meera Patel - - PowerPoint PPT Presentation

cmsc 676 personalized information retrieval
SMART_READER_LITE
LIVE PREVIEW

CMSC 676 Personalized Information Retrieval Meera Patel - - PowerPoint PPT Presentation

CMSC 676 Personalized Information Retrieval Meera Patel Motivation Searched FB side effects... FB (in medical terms): Foreign Body When the user writes a The user expresses the required information in query terms and expects query...


slide-1
SLIDE 1

CMSC 676 Personalized Information Retrieval

Meera Patel

slide-2
SLIDE 2

Motivation

Searched “FB side effects”...

FB (in medical terms): Foreign Body

slide-3
SLIDE 3

When the user writes a query...

The user expresses the required information in query terms and expects semantic content of document What information Retrieval returns? “Documents matching to query terms” , which might not contain the expected result… Then what’s the solution? Personalized Information Retrieval

slide-4
SLIDE 4

The Basic idea of Personalization...

slide-5
SLIDE 5

“Personalization” Approach: Detecting document genre[1]

Based on three aspects:

  • Familiarity
  • Genre
  • Geography
slide-6
SLIDE 6

Formulating

Run_score = baseline_score +

Σwi*metadata_scorei

where wi = the constant weight of the metadata_score

H1: People with low familiarity with a topic will prefer documents which have a high proportion

  • f concrete terms, and conversely, people with

high familiarity with a topic will prefer documents that have a high proportion of abstract terms. H2: Documents of a given genre will include terms characteristic of that genre, which can be mathematically modeled so as to differentiate the genre of documents retrieved on a specific topic. H3: The vocabulary of documents is specific for the geographic area to which they refer: US documents can be distinguished from non-US documents by virtue of vocabulary characteristics.

slide-7
SLIDE 7

Another approach of Personalization[2]

slide-8
SLIDE 8

Future work

Clustering of documents based on semantic contents can make things easier, eg. reranking can be done based on the intersection of user interests and cluster semantics Enhancing the disambiguation of natural language techniques may help in finding more relevant results Adding one more phase in IR simply increases the delay in retrieval, so further work should be focused on faster retrieval

slide-9
SLIDE 9

References

[1] G. Muresan, C. L. Smith, M. Cole, Lu Liu and N. J. Belkin, "Detecting Document Genre for Personalization of Information Retrieval," Proceedings

  • f the 39th Annual Hawaii International Conference on System Sciences

(HICSS'06), Kauia, HI, USA, 2006, pp. 50c-50c. [2] The Personalized Information Retrieval Model Based on User Interest by Songjie Gong [3] Personalized Information Retrieval Approach by Myriam Hadjouni, Mohamed Ramzi Haddad, Hajer Baazaoui, Marie-Aude Aufaure, and Henda Ben Ghezal [4]C. Bouhini, M. Géry and C. Largeron, "Personalized information retrieval models integrating the user's profile," 2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS), Grenoble, 2016, pp. 1-9. [5] Towards an architecture for personalized information retrieval implying the user’s profile and votes by Harbaoui Azza , Sahbi Sidhom , Malek Ghenima, Henda Ben Ghezala

slide-10
SLIDE 10

Thank you...

Any Questions?