Published in Probe Volume 6 (Final): July 1996
Susan McCarthy, Coordinator
Plant Genome Data and Information Center
USDA, ARS, National Agricultural Library
Beltsville, MD
Imagine being able to connect to the greatest archive of agricultural information from anywhere in the world 24 hours a day. Once connected, you search through the entire collection using key concepts. When the search is completed, you have a custom-tailored, synthesized document that retains source attributions but contains only the relevant pieces of the entire collection.
This scenario is still a dream, although we are closer to achieving it than ever before. Underlying this dream is Standard Generalized Markup Language (SGML). "Standard" and "Generalized" are the two key concepts. A standard allows searching across many different sources while providing the capacity to reconstitute the results in a common format. A generalized standard can adapt to many different formats and unique situations.
Markup languages help authors and publishers organize their work by structuring the content. The structure has a strong display component that helps the reader to understand the authors meaning. We all use markup language whenever we use puntuation, spacing--both horizontal and vertical--font selection, and the like.
The SGML mark-up language identifies key concepts for document content (for example: introduction, references, paragraphs) and display elements through a standard document type definition (DTD), and style sheets.
Our dream must again be anchored on the "standard" ideal to allow independence of hardware and software platforms. This standard should be internationally recognized to extend access to materials over time despite the rapid technological changes. Such recognition for ASCII has exploded the boundaries of telecommunication.
Today, we are beginning to build an experimental digital core based on SGML here at the USDAs National Agricultural Library. A collaboration has been forged with the Bean Improvement Cooperative to digitize its complete series of annual reports and conference proceedings.
This experiment will achieve its greatest success only with the full participation of NAL users. Accurately identifying and defining the keyword/keyphrase concepts is the only means to guarantee that users will be able to search and retrieve appropriate text across the entire body of knowledge.
To begin, we have placed sample articles on our homepage (http://www.nal.usda.gov/pgdic), and we invite everyone to take a look at the "digital core." You are cordially invited to participate in our experiment. Just remember: your participation at the ground floor may influence how you access information in the not-so-distant future.