[ale] Linux in libraries (a heads-up for June 2002)
Joseph A. Knapka
jknapka at earthlink.net
Wed Jul 18 12:11:02 EDT 2001
Ben Ostrowsky wrote:
>
> >Can you explain why that is so? Is it because the MARC data structure is
> >hierarchical?
>
> I think it's got to do with repeatable fields. Here's a sample MARC record:
>
> LEADER 00962nam 2200289Ki 4500
> 001 fla97047300
> 003 FLA
> 005 20010427183412.0
> 008 970901s1996 caua 001 0 eng d
> 020 1565921496 (pbk.) :$c$40.00
> 035 526575
> 040 FLA$cFLA$dWaOLN
> 245 00 Programming Perl /$cLarry Wall, Tom Christiansen, Randal L.
> Schwartz with Stephen Potter.
> 250 2nd ed.
> 260 Sebastopol, CA :$bO'Reilly & Associates,$cc1996.
> 300 xxi, 646 p. :$bill. ;$c24 cm.
> 490 1 A Nutshell handbook
> 500 Accompanied by booklet "Quick reference guide" (19 p. ; 21 cm)
> inserted at end.
> 500 "Covers Perl 5"--Cover.
> 500 Includes index.
> 650 0 Perl (Computer program language)
> 700 1 Wall, Larry.
> 830 0 Nutshell handbook.
> 920 970923
> 930 970923
> 945 997 FLA ils
> 949 10 $b3 2396 00329 1527$cFLAANF$d005.133P WALL$mTBLC$nFLA$p40.00$tFLABK
>
> Note that there's a fixed-width leader, and that each field has two
> characters called indicators (such as "00" in the 245 field), which are
> often null. There are also three 500 fields. Each of these numbers has a
> specific meaning like "Main title", "Publisher", "Edition statement",
> "Additional author", "Series title", etc.
>
> Subfields are listed as $ followed by one character. So the 300 field has
> two null indicators, contains the text "xxi, 646 p. :", and has two
> subfields, $b ("ill. ;") and $c ("24 cm.").
With a complete specification for MARC, designing a normalized
relational schema should be "not too hard". Repeating fields become
rows in an auxiliary table keyed to the master record. It's
pretty hard to come up with a data format that can't be
represented relationally (spatial data is the only example that
springs to mind); there may be performance issues
with a relational implementation, however. But if the current
state of the art is to search flat files that look like the
record above, it's hard to see how an RDBMS could be worse.
> It does get messier than this, of course...
Sounds like fun.
-- Joe Knapka
"You know how many remote castles there are along the gorges? You
can't MOVE for remote castles!" -- Lu Tze re. Uberwald
// Linux MM Documentation in progress:
// http://home.earthlink.net/~jknapka/linux-mm/vmoutline.html
* Evolution is an "unproven theory" in the same sense that gravity is. *
--
To unsubscribe: mail majordomo at ale.org with "unsubscribe ale" in message body.
More information about the Ale
mailing list