koha-zebra

Re: [Koha-zebra] Zebra and non-filing characters


From: Sebastian Hammer
Subject: Re: [Koha-zebra] Zebra and non-filing characters
Date: Thu, 22 Dec 2005 15:19:35 -0500
User-agent: Mozilla Thunderbird 1.0.7 (Macintosh/20050923)

Joshua Ferraro wrote:

Hello everyone,

This is just a generic question regarding Zebra's handling of
MARC non-filing characters. I know there is a 'stopwords'-like
function available using the 'map' directive:

map (^The\s) @

but I'm wondering whether Zebra is also capable of examining the
non-filing character specs within each MARC field to decide
whether to index or not to index ...

You mean using an indicator in the field to determine how many characters to skip? To the best of my knowledge, this is not supported at present, sorry.
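
For reference, the indicator-driven behaviour being asked about can be sketched outside Zebra. In MARC 21 the second indicator of field 245 holds the number of non-filing characters (0-9). The Python below is only an illustration of that rule, not Zebra configuration; `filing_form` is a hypothetical helper, and treating a blank or non-digit indicator as zero is an assumption.

```python
def filing_form(title: str, nonfiling_indicator: str) -> str:
    """Drop the leading characters counted by a MARC non-filing indicator."""
    # Assumption: a blank or non-digit indicator means "skip nothing".
    is_digit = nonfiling_indicator.isascii() and nonfiling_indicator.isdigit()
    skip = int(nonfiling_indicator) if is_digit else 0
    return title[skip:]


# "The " is four characters, so a record would carry indicator value 4.
print(filing_form("The Hobbit", "4"))
```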

What I don't like about that approach anyway is that it leaves it ambiguous what happens when the user puts a leading article into a search term. I think you'd be better off just configuring the system to ignore the most common leading articles, as described above.
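
A rough sketch of why a fixed article list avoids that ambiguity: if the same rule is applied to the indexed value and to the search term, it no longer matters whether the user typed the article. This is plain Python, not Zebra syntax; the article list and the `normalize` helper are assumptions for illustration only.

```python
import re

# Assumption: a small English article list; a real setup would choose its own.
LEADING_ARTICLE = re.compile(r"^(?:the|a|an)\s+", re.IGNORECASE)


def normalize(text: str) -> str:
    """Strip one leading article and lowercase the rest."""
    return LEADING_ARTICLE.sub("", text.strip(), count=1).lower()


indexed = normalize("The Hobbit")
# Both forms of the query reduce to the same key as the indexed title.
print(normalize("the hobbit") == indexed, normalize("Hobbit") == indexed)
```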

It is true that this would require separate configuration for different languages, but you probably wouldn't get around that anyway: many non-English-speaking countries use record formats other than MARC21, and the use of indicators to control indexing is not universal. The Danish MARC format (cleverly named DANMARC), for instance, uses a special character inside the subfields to mark the part which should not be indexed.
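
To make the contrast with indicators concrete, here is a minimal sketch of a marker-based scheme of the kind described for DANMARC. The marker character `¤`, the rule that everything before it is skipped, and the helper name `strip_marked_prefix` are all assumptions for illustration; check the format's own documentation before relying on them.

```python
def strip_marked_prefix(value: str, marker: str = "¤") -> str:
    """Return the part of a subfield after an in-text non-filing marker."""
    # Assumption: the text before the marker is the part not to be indexed.
    head, sep, tail = value.partition(marker)
    # If the marker is absent, the whole value is kept unchanged.
    return tail if sep else value


print(strip_marked_prefix("Den ¤lille havfrue"))
```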

--Sebastian

Cheers,


--
Sebastian Hammer, Index Data
address@hidden   www.indexdata.com
Ph: (603) 209-6853





