udc in action
play

UDC in Action Richard Smiraglia University of Wisconsin Milwaukee - PowerPoint PPT Presentation

UDC in Action UDC in Action Richard Smiraglia University of Wisconsin Milwaukee Andrea Scharnhorst, Almila Akdag Salah - eHumanities Cheng Gao Dahlian University (now Austin, Texas) Acknowledgement: We would like to thank Ed O Neill


  1. UDC in Action

  2. UDC in Action Richard Smiraglia – University of Wisconsin Milwaukee Andrea Scharnhorst, Almila Akdag Salah - eHumanities Cheng Gao – Dahlian University (now Austin, Texas) Acknowledgement: We would like to thank Ed O ’ Neill of the OCLC Office of Research who provided us with the OCLC dataset. We would also like to thank Johan Rademakers and Bart Peeters from KU Leuven who provided the Leuven dataset. Aida Slavic gave comments on the paper, and was an indispensable sparring partner for discussion. Part of this work has been funded by the Network of Excellence for Internet Science, FP7 – 288021.

  3. Classification of human knowledge production as complex phenomena

  4. Dataset OCLC – raw data 1. 9,055,623 records extracted from 214,596,487 bibliographic records using the “ 080 ” field in WorldCat 2. first column = internal ID number, second column = UDC numbers 3. Cleaning: 1. Lines not starting with an ‘ a ’ tag. 2. Lines with no numbers after “ a ” , or without “ a ” 3. 8,944,669 records 4. Another 570,629 dismissed as non-UDC numbers 4. Eventually we have 8,374,040

  5. Dataset KU Leuven The original file has 95,544 lines. The first column contains a string with the structure $$8 UDC number $$a UDC heading $$9 language of the heading. The second says how often this UDC number is used in bib records in the library.

  6. Use of UDC in KU Leuven

  7. Data processing or the beauty of a UDC ‘ string ’ 394.4 :[92(100+437) :329(437).15(091)+327.32(100)]

  8. Data processing or the beauty of a UDC ‘ string ’

  9. UDC as a complex system Not a hierarchy but a fully connected graph – still to be explored Evolution of the UDC over time Growth of UDC classes (AS, AAS, KS, CG, RS, 2011, Class&Ontolog) Entry and Exit of UDC numbers, changes in all tables including auxiliaries (AAS, CG, KS, AS, RS, 2012, ISKO) Structure of UDC UDC in collections How long is a UDC string? How are UDC classes connected by operations through auxiliary signs?

  10. Structure I – Profile of collections

  11. Structure II – Length of a UDC string

  12. Structure III – UDC six connecting symbols (or ‘ relators ” or ‘ operators ’ )

  13. Structure III – Networks views of UDC Matrix : The combined number 022:11.203+11.204 contributes one tick to the cell {row class 0, column class 1} in the matrix_colon, and in the matrix_plus between row class 1 and column class 1. Combinations between auxiliaries are not taken into account!

  14. Structure III – Network views of UDC No “ + ” in use in Leuven

  15. Conclusion I – UDC as a complex system

  16. Conclusion II Done To do Demonstration of analytic Systematic exploration of one (or several ‘ complete ’ perspectives from scientific datasets. Complete = UDC, plus full bibliographic visualization and complexity record research Possible applications Analysis of UDC numbers in collections = feedback to UDC editors about the use of classes. The temporal provenance of UDC numbers: Across the editions of the UDC, not only are UDC numbers added and deleted, they also are shifted (and re- labeled) and recombined, as well as receiving changed descriptions. Mapping out basic statistics on UDC classes as used in libraries for the information professionals Users might profit from mapping too, gaining an overview about the nature and focus of a specific collection.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend