This article was co-written by the e-Forum moderators.
An ALCTS e-Forum on “Adapting Cataloging Workflows to a Batch Processing Environment” was held November 14–15, 2017. Ethan Fenichel (Florida Atlantic University), Ariel Turner (Kennesaw State University), and Susan Wynne (University of Iowa) hosted the discussion.
Participants from many types of libraries shared experiences with a variety of Integrated Library System (ILS) platforms, including Sierra, Aleph, Alma, Voyager, Millennium, Symphony and Worldshare Management Services, as well as homegrown platforms. Common tools and services include MarcEdit, Python, Backstage Library Works, OCLC Collection Manager (WCM), SQLite, and OpenRefine.
Record quality was the major topic of discussion on day one. Participants described common errors or issues encountered in vendor or knowledgebase (KB) records, as well as methods for evaluating quality. A law librarian discussed a collaborative effort within the American Association of Law Libraries to certify vendor record quality. Determining “good enough” quality varies among record sets and scenarios. Participants use local guidelines and/or tools, such as MarcEdit or scripts, to evaluate quality. Issues of duplication and whether a record set accurately matches the library’s access or holdings are other considerations involved in evaluating quality.
We also discussed reasons for choosing among vendor records, an ILS-specific KB, or WCM—or in some cases, abandoning MARC records altogether when metadata is available in a discovery tool. Several respondents mentioned collaboration with public services staff when selecting records.
On the second day of the e-Forum, participants shared advice and local practices for batch loading and editing. MarcEdit was a favorite tool for many participants. MarcEdit’s Task Lists functionality was popular for automating repetitive tasks.
ILS-specific tools are also used for batch editing. For Millennium and Sierra, participants discussed Create Lists and Global Update as essential tools. In Alma, some use normalization rules during batch imports to add or remove fields or make other corrections and enhancements. Participants discussed considerations for matching, merging, or overlaying records.
For those new to batch processing, participants recommended talking with vendors, consulting the ILS’ documentation or electronic mailing list, consulting the MarcEdit electronic mailing list and YouTube tutorials, and—most importantly—talking with colleagues. Other tips included marking up a sample record printout with edits you’d like to do in batch, testing new import profiles with small sample sets, ensuring a way to backtrack, and maintaining thorough documentation.
Many libraries use local fields in collections for collocation, statistics, or other purposes. One has experimented with adding administrative data such as open access or dates of access data during batch loads.
We wrapped up the second day with discussions on training and hiring in relation to batch loading and processing. Some participants indicated that batch loading and processing is now integrated into training plans for more staff, or noted the increasing importance of hiring staff with aptitude and interest to learn batch processes. Willingness to experiment and problem solve is important for staff working with or managing batch processes.
Over the two days there were over 200 posts to the e-Forum, with many active participants sharing advice and perspectives on the topic of vendor records and batch processes.
Submitted by Ethan Fenichel, Ariel Turner, and Susan Wynne
Resources
Articles
- Flynn, Emily Alinder, and Erin Kilkenny. Cataloging from the Center: Improving e-Book Cataloging on a Consortial Level. Cataloging & Classification Quarterly 55, no. 7–8 (2017): 630–43.
- Lybarger, Kathryn. Keeping up with Ebooks: Automated Normalization and Access Checking with Normac. The Code4Lib Journal, no. 20 (2013).
- Martin, Kristin E., and Kavita Mundle. Cataloging E-Books and Vendor Records. Library Resources & Technical Services 54, no. 4 (2011): 227–37.
- Mugridge, Rebecca L., and Jeff Edmunds. Batchloading MARC Bibliographic Records. Library Resources & Technical Services 56, no. 3 (2012): 155–70.
- ———. Using Batchloading to Improve Access to Electronic and Microform Collections. Library Resources & Technical Services 53, no. 1 (2011): 53–61.
- Sapon-White, Richard. E-Book Cataloging Workflows at Oregon State University. Library Resources & Technical Services 58, no. 2 (2014): 127–36.
- Snow, Karen. Defining, Assessing, and Rethinking Quality Cataloging. Cataloging & Classification Quarterly 55, no. 7–8 (2017): 438–55.
- Traill, Stacie. Quality Issues in Vendor-Provided E-Monograph Records. Library Resources & Technical Services 57, no. 4 (2013): 213–26.
- Thompson, Kelly, and Stacie Traill. Leveraging Python to Improve Ebook Metadata Selection, Ingest, and Management. The Code4Lib Journal, no. 38 (2017).
- Van Kleeck, David, Hikaru Nakano, Gerald Langford, Trey Shelton, Jimmie Lundgren, and Allison Jai O’Dell. Managing Bibliographic Data Quality for Electronic Resources. Cataloging & Classification Quarterly 55, no. 7–8 (2017): 560–77.
- Wu, Annie, and Anne M. Mitchell. Mass Management of E-Book Catalog Records. Library Resources & Technical Services 54, no. 3 (2011): 164–74.
Presentations
- Lybarger, Kathryn. Fast, but Accurate? Pitfalls of Batch Metadata Editing. Presented at the ALA Midwinter Meeting, Dallas, Texas, 2012.
- Turner, Ariel. Implementation of Batch Cataloging: A Case Study. In Creating Sustainable Community: The Proceedings of the ACRL 2015 Conference. Portland, Oregon: Association of College and Research Libraries, 2015.
Working Groups
- American Association of Law Libraries (AAUL) Vendor-Supplied Records Advisory Working Group
- National Information Standards Organization (NISO) E-Book Metadata Working Group
Learning Resources
- MarcEdit 101 Workshop webinar series
- MarcEdit-L electronic mailing list
- MarcEdit Task List Guide from the University of Illinois at Urbana-Champaign
- MarcEdit YouTube Playlist
- Mashcat (Webinar recordings and Twitter chats on OpenRefine, MarcEdit, regex, etc.)
Workflow Documents
- Kent State University’s E-book Checklist by Roman Panchyshyn
- University of Minnesota’s E-Book Editing and Import Guidelines by Stacie Traill
- University of Tennessee’s E-Resources Checklist by Christina Harlow
Tools
- Perfect Keyboard (macro tool recommended by Ethan Fenichel)
- Marc_BatchLoading_Checks (Github repository by Ethan Fenichel)