so sorting ing do documents uments by b y base se the
play

So Sorting ing do documents uments by b y base se the heme me - PowerPoint PPT Presentation

UDC Seminar 2013, The Hague So Sorting ing do documents uments by b y base se the heme me wit ith h sy synt nthe hetic tic cla lass ssif ifica ication: tion: th the e doub ouble le query uery me meth thod od Claudio


  1. UDC Seminar 2013, The Hague So Sorting ing do documents uments by b y base se the heme me wit ith h sy synt nthe hetic tic cla lass ssif ifica ication: tion: th the e doub ouble le query uery me meth thod od Claudio udio Gn Gnoli oli & Alber berto to Cheti eti

  2. Knowledge organization A to Z ?... Friday Monday Sathurday Sunday Thursday Tuesday Wednesday

  3. Knowledge organization A to Z ?... 1 Sunday A solution: 2 Monday 3 Tuesday good old 4 Wednesday classification :-) 5 Thursday 6 Friday 7 Sathurday

  4. Knowledge organization A to Z ?... Systematic presentation can act as an intellectual guide to contents

  5. Knowledge organization A to Z ?... Original German term: Wissensordnung = “ ordering of knowledge”

  6. Classification Often poorly applied in online resources… Lack of integration between cataloguers’ and OPACmasters’ work [Bland & Stoffan, 2008; Rozman, 2009; Casson et al. 2011]

  7. Compound subjects Most real documents are about combinations of concepts, e.g.: «the corrosion of tinplace by acid fruit products»… [Foskett 1958]  Synthetic classmarks needed (subdivisions, auxiliaries, facets, roles, links…)

  8. Citation order matters 1:34 «philosophy – law» 34:1 «law – philosophy»

  9. The PRECIS-GRIS tradition (Verbal) subject strings should be ordered combinations of terms (concepts) Law – influence of philosophy – U.K. – dictionaries

  10. Base vs. particular theme Notions coming from text linguistics [Beaugrande & Dressler 1981] «Influence of the abundance of wild ungulates on wolf diet in Northern Apennines» Wolf – diet – effect of ungulate abundance – N Apennines

  11. Two-step search [GRIS] Interfaces should allow to: -- identify a concept (finding the right term, discarding homographs etc.) -- examine all combinations of it with other concepts …starting with those where it is the base theme!

  12. Double query method Let’s give the user what (s)he’s asked for: (1) all combinations where the search term is the base theme (2) all combinations where the search term is a particular theme

  13. An application

  14. (1) Results as base theme

  15. …either alone or combined…

  16. (2) Results as a particular theme

  17. Double query method $queryA = "SELECT * FROM `literature` WHERE `classmark` REGEXP '^757*' ORDER BY classmark"; $queryB = "SELECT * FROM `literature` WHERE `classmark` REGEXP ';757*' ORDER BY classmark";

  18. Position depends on search

  19. Conclusions -- Principles for combination in verbal indexing (base vs. particular themes) can be extended to classification -- They help users to locate what they are actually searching for among many possible results -- They can be applied to search interfaces by any script (e.g. PHP + MySQL) managing a double query

  20. Thank you! claudio.gnoli@unipv.it @scritur

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend