 
              (Re-)examining the creation of an electronic collection on faculty scholarship John Sterbenz University of Michigan Kresge Library Services
Presentation Overview 2009 – Concept and background – Workflow and implementation – Issues and concerns – (Mostly) fully-realized example – Decision … then a LOT of things happened …
Presentation Overview 2018 – What’s changed – What hasn’t changed – Issues and concerns – A final analysis
The Concept • Create catalog records containing stable, proxied, article-level URLs for research authored by Ross School of Business faculty to load into III’s Millennium ILS – Aggregators/databases to include ProQuest, EBSCO, ScienceDirect, Ingenta Connect, Oxford University Press • Include “times cited” links to Google Scholar and Scopus • Provide catalog users options for capturing not only these records but also faculty working papers and RSB-authored books • Retrieve citation data for the (automated) creation of said catalog records
The “Why” • Desire to highlight institution-specific research • Local precedent already established – Faculty research and grey literature – Undergraduate and graduate research • Desire to ensure that these faculty scholarship records did not create “false hit” problems with OPAC searching
The Implementation • Millennium systems work • Universal overview – Identify full-text articles – Export citation data into citation management software (RefWorks) – Create tab-delimited (text) export file in RefWorks – Work with tab-delimited RefWorks export file in Excel – Create MARC records using MarcEdit – Load records into Millennium
Identify Full-text Articles • Compile a list of faculty, by discipline • Select an aggregator/resource • Search! • Identify/select records with full-text for export • Not surprisingly, this is where the greatest process variability lies
ProQuest (“Advanced Search”)
ProQuest (Search results, with selections)
Business Source Complete (“Advanced Search”)
Business Source Complete (Search results, with selections)
Import Citation Data to RefWorks • “Direct export to RefWorks ” was great— when it worked – Even when it didn’t (ProQuest), it didn’t pose undue difficulty
Export Tab-delimited File from RefWorks • From this point, processing mechanics are essentially identical between aggregators/databases
Unprocessed RefWorks file
Work with Tab-delimited RefWorks Export File in Excel • LOTS of massaging here (authors, quotation marks to identify text) • LOTS of data removal • What’s left at the end of this step? – Article title – Journal title – Year of publication – Abstract – URL – Authors (one author per column)
Exported RefWorks file in Excel (pre-processing)
Exported RefWorks file in Excel (post-processing)
Create MARC Records Using MarcEdit • Delimited text translator – Use .mrd template (based on number of authors) – Add other fields and subfields (not originally found in citation data) – Additional text cleanup (especially 520) • Compile file into MARC (.mrc) for loading
MarcEdit Delimited Text Translator (with template loaded)
A file of records!
A closer examination
Create MARC Records Using MarcEdit • Delimited text translator – Use .mrd template (based on number of authors) – Add other fields and subfields (not originally found in citation data) – Additional text cleanup (especially 520) • Compile file into MARC (.mrc) for loading
A completed record
Create MARC Records Using MarcEdit • Delimited text translator – Use .mrd template (based on number of authors) – Add other fields and subfields (not originally found in citation data) – Final text cleanup (especially 520) • Compile file into MARC (.mrc) for loading
Whoops….
Create MARC Records Using MarcEdit • Delimited text translator – Use .mrd template (based on number of authors) – Add other fields and subfields (not originally found in citation data) – Final text cleanup (especially 520) • Compile file into MARC (.mrc) for loading
Load Records into Millennium • Special Load Profile – Overlay based on 245 only – No matches? Insert record! – One or more matches? Reject record (do not load) – Insert fields common to all article-level records – Boilerplate 506, 538 fields common to all electronic titles
The Results • 1,791 records containing 2,024 article-level URLs – 1,621 Business Source Complete links – 403 ProQuest links – 233 records linked to both ProQuest and Business Source Complete • 48 “Times cited” links – Added after loading – 42 Google Scholar – 6 Scopus – Records examined with no “Times cited” link • What do they look like?
The Issues • No attempt was made to resolve issues, per se, after work on the project was halted • Need to craft specific guidelines for author/article inclusion – Level of affiliation – Secondary institutes and centers – Include only from peer-reviewed/scholarly sources? – All authorship?
The Issues • Work associated with appearance in more than one resource • Work associated with more than one RSB author • Identification of new works to add • Doesn’t necessarily capture *all* scholarship • Cataloging standards • What about authority control?
The Decision • You already know the results • Let’s examine the environment at the time
Kresge Business Administration Library (2009) • 120,000 volumes • 3,000 print subscriptions • 2,600 binding units • Fully-developed, mature, proprietary ILS • Monthly loading of Serials Solutions MARC records and ERM holdings data • Technical Services staffing – Five FT and one FTE temporary employees (one librarian)
Kresge Library Services (2018) • 156 non-circulating physical volumes • Five print subscriptions (all annuals) with “current issue only” retention • No binding • Self-supported and maintained open-source ILS (Koha) for cataloging and financials only • Monthly loading of Serials Solutions MARC records • Technical Services staffing – Three FT and zero FTE temporary employees (one librarian)
The Issues (revisited) • Need to craft specific guidelines for author/article inclusion • Work associated with appearance in more than one resource • Work associated with more than one RSB author • Identification of new works to add • May not necessarily capture *all* scholarship • Cataloging standards
Noted/Probable Improvements • Tighter integration of citation software • Increase/removal of seemingly arbitrary limits • Better working knowledge of MarcEdit • Better MarcEdit functionality • Software and hardware advances, in general
Millennium vs. Koha • Catalog scopes – Millennium: location-based scoping – Koha: Separate database instances • Public and staff displays – Millennium: A set of universal displays – Koha: Use of “Bibliographic frameworks” • Combining the universes
A Final Analysis
Questions?
Thank you! John Sterbenz Manager, Technical Services, Collections, and Library Automation University of Michigan Ross School of Business Kresge Library Services jsterben@umich.edu (734) 764-5746
Recommend
More recommend