Greenstone tutorial exercise

Sample files: beatles.zip
Devised for Greenstone version: 2.60
Modified for Greenstone version: 2.70w

Looking at a multimedia collection

  1. Copy the entire folder

    sample_files → beatles → advbeat_large

    (with all its contents) into your Greenstone collect folder. If you have installed Greenstone in the usual place, this is

    My Computer → Local Disk (C:) → Program Files → Greenstone → collect

    Put advbeat_large in there.

  1. If the Greenstone Digital Library Local Library Server is already running, restart it by clicking the CD icon on the task bar and then pressing Restart Library. If it is not running, start it by selecting Greenstone Digital Library from the Start menu.

  1. Explore the Beatles collection. Note how the Browse button divides the material into seven different types. Within each category, the documents have appropriate icons. Some documents have an audio icon: when you click these you hear the music (assuming your computer is set up with appropriate player software). Others have an image thumbnail: when you click these you see the images.

  1. Look at the Titles A-Z browser. Each title has a bookshelf that may include several related items. For example, Hey Jude has a MIDI file, lyrics, and a discography item.

  1. Observe the low quality of the metadata. For example, the four items under A Hard Day's Night (under "H" in the Titles A-Z browser) have different variants as their titles. The collection would have been easier to organize had the metadata been cleaned up manually first, but that would be a big job. Only a tiny amount of metadata (fewer than ten items) was added by hand. The original metadata was left untouched, and Greenstone facilities were used to clean it up automatically. (You will find in Building a multimedia collection that this is possible but tricky.)

  1. In the Windows file browser, take a look at the files that make up the collection, in the

    sample_files → beatles → advbeat_large → import

    folder. What a mess! There are over 450 files under seven top-level sub-folders. Organization is minimal, reflecting the different times and ways the files were gathered. For example, html_lyrics and discography are excerpts of web sites, and images contains various images in JPEG format. For each type, drill down through the hierarchy and look at a sample document.
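
The copy in the first step and the survey of the import folder in the last step can also be done from a script. The following is a minimal Python sketch, not part of Greenstone itself: the function names copy_collection and count_files_by_folder are hypothetical, and the two paths in the main section are assumptions based on the default locations mentioned above, so adjust them to your own machine.

```python
import shutil
from collections import Counter
from pathlib import Path


def copy_collection(source: Path, collect_dir: Path) -> Path:
    """Copy a collection folder (with all its contents) into the collect folder."""
    target = collect_dir / source.name
    # copytree raises FileExistsError if the target already exists,
    # so an existing collection is never overwritten silently.
    shutil.copytree(source, target)
    return target


def count_files_by_folder(import_dir: Path) -> Counter:
    """Count the files under each top-level sub-folder of the import folder."""
    counts = Counter()
    for path in import_dir.rglob("*"):
        if path.is_file():
            relative = path.relative_to(import_dir)
            # Files lying directly in import_dir are grouped under ".".
            top = relative.parts[0] if len(relative.parts) > 1 else "."
            counts[top] += 1
    return counts


if __name__ == "__main__":
    # Assumed locations (hypothetical); change both to match your installation.
    sample = Path("sample_files") / "beatles" / "advbeat_large"
    collect = Path(r"C:\Program Files\Greenstone\collect")

    # Copy only if the sample exists and the collection is not already installed.
    if sample.is_dir() and collect.is_dir() and not (collect / sample.name).exists():
        copy_collection(sample, collect)

    import_dir = collect / sample.name / "import"
    if import_dir.is_dir():
        for folder, n in sorted(count_files_by_folder(import_dir).items()):
            print(f"{folder}: {n} files")
```

Run against the installed collection, the script prints one line per top-level sub-folder of import, which gives a quick overview before you drill down by hand. You still need to restart the Local Library Server afterwards, as described in the second step.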