Below is a list of questions Backstage Library Works posed to David W. Reser, Senior Cataloging Policy Specialist with the Library of Congress. David is working with OCLC on the non-Latin in authority references project. Backstage thought our clientele would be interested in this new development at the Library of Congress. We invite you to respond with your thoughts about these changes on this listserv or contact Backstage directly with any questions or concerns. Non-Latin Characters in Name Authority Records Questions 1. How will they be distributed, all at once or over a period of time? ANSWER: For the pre-population to be done by OCLC, they will contribute X number per day (number still being negotiated with the NACO nodes, but will likely be 25-30K per day on top of the regular daily files). Can I assume you subscribe to the weekly distribution file? If so, multiply that number by about 7. We don't know the total number of records that will be impacted, but hope that it will take only about a month to do the pre-population, assuming we can run them all through quickly (LC will be doing a version upgrade to its system in May, so if the pre-populated records aren't all done by then, there will be a hiatus for the upgrade). Once that is complete, regular NACO catalogers can begin adding/editing records with non-Latin characters, so that will obviously be an ongoing way of doing business. 2. Will all languages be included? If not, which will be included and which will be excluded? ANSWER: All languages that can be fully accommodated by one of the MARC-8 script repertoires (Arabic, Extended Arabic, CJK, Cyrillic, Extended Cyrillic, Greek, and Hebrew) are possible for the first phase. We plan to extend beyond the MARC-8 repertoire in a later phase, but no timelines have been set for that yet. 3. What normalization scheme will you be using for these headings? Currently NACO normalization is used and it is a scheme for MARC8 encoded data. What normalization scheme are you planning to use with UTF8 data? ANSWER: A revised NACO normalization scheme was approved recently, and is posted at: http://www.loc.gov/catdir/pcc/archive/PCCNormalization_Final.pdf (people smarter than I about such things assure me it covers the UTF-8 environment). 4. Will there be any special normalization rules followed for these records? ANSWER: Since the non-Latin forms will only appear in 4XXs, and since 4XXs are allowed to conflict (except within the same record), we're not expecting a major issue here, but we will receive reports from OCLC on those records that are flagged as normalization errors, just as we do now. 5. Will you also populate the 670 tag with Non-Latin Character data? ANSWER: The pre-population routines from OCLC will not generate 670 citations, but once NACO members begin adding data themselves after the pre-population, we expect non-Latin characters in 3 note fields, 667, 670, and 675 during the initial phase. We'll expand as/if we discover a need, but thought it would be nice to have a conservative target to start with. [Although our system doesn't care what fields non-Latin script data is used in, OCLC will check and notify us if they encounter records outside of the expected fields.] 6. Will you be making any changes to the bib records that you are harvesting the data from? ANSWER: If OCLC is planning to do anything to the bib records, I'm not aware of it. We expect that the harvesting will kick up a lot of dirt and make it more noticeable (e.g., typos, incorrect characters)-- I expect this will cause catalogers to update bibs on an as-needed basis. 7. Will this include NAME/TITLE and Corporate Bodies as well? ANSWER: We believe that OCLC's pre-population will cover personal names and corporate bodies tagged as X10s, but don't have a final analysis yet as to what exactly is covered. Once NACO members can begin adding non-Latin data after the pre-population, all name authority records are candidates, including geographic, titles, and name/titles. 8. Are there any plans for Subjects? ANSWER: We don't have plans to add non-Latin data to LCSH authorities at this time, but will probably re-evaluate this position from time to time. You are probably aware that we already distribute MARC Classification records with non-Latin scripts. Hope this helps, talk to you Wednesday at 9:00 am eastern. Dave John Reese Product Manager Backstage Library Works Voice (800) 391-5210 Ext. 249 Fax (801) 356-8220 Email jreese@bslw.com