This is a static archive of the previous Open Grid Forum Redmine content management system saved from host redmine.ogf.org file /dmsf_files/8906?download=13689 at Fri, 04 Nov 2022 20:10:22 GMT GridIR GGF Charter

GridIR GGF Charter

Administration

Name

Grid Information Retrieval (GridIR)

Chairs

Kevin Gamiel
OrganizationMCNC-RDI
Postal3021 Cornwallis Road, Research Triangle Park, NC 27709-2889
Phone+1-919-248-1915
Emailkgamiel at cnidr.org
Dr. Gregory B. Newby, Research Faculty
OrganizationArctic Region Supercomputing Center
University of Alaska Fairbanks
Postal910 Yukon Drive Suite 106 PO Box 756020 Fairbanks, AK 99775
Phone+1-907-474-7160
Fax+1-907-474-5494
Emailnewby at arsc.edu
Nassib Nassar
OrganizationEtymon Systems, Inc.
PostalP.O. Box AM, Princeton, NJ 08542, USA
Phone+1-609-851-4356
Emailnassar at etymon.com

Secretary

Sousan Karimi
OrganizationMCNC-RDI
Postal3021 Cornwallis Road, Research Triangle Park, NC 27709-2889
Phone+1-919-248-9297
Emailsousan at cnidr.org

Mailing List

gir-wg@gridforum.org

Subscription details available at: http://www.gridir.org/mailing_list.html.

Description and Objectives

Purpose

The GridIR WG will establish a specific set of requirements, an architecture, and detailed specifications for a particular Information Retrieval (IR) system on the OGSA Grid. GridIR will provide document collection management, indexing/searching, and query processing services to OGSA grid users and applications.

Goals

The GridIR WG will:

1. Establish requirements for GridIR.

The WG will describe specific functional system requirements. Initial requirements are identified as:

  • Distributed, asynchronous and independent of content and query type
  • Collection management capability
  • Indexing and searching capability
  • Query processing capability
  • Metadata capabilities to identify and interact with collections, indexes and query processors
2. Define an architecture for GridIR

The WG will describe a specific architecture in support of satisfying the IR system requirements in an efficient, modular manner. The initial architecture includes three major components:

  • Collection Manager services - Provides generalized methods for describing logical record collections, instantiating those collections (by crawling, copying, accepting streams, etc), providing metadata services on those collections, provide access methods to those collections, and provide event services for collection change notification.
  • Indexing/Searching services - Provides generalized methods for building indexes/representations of Collections for subsequent rapid search. Also provides generalized complex query and retrieval methods.
  • Query Processing services - Provides methods for distributed searching, merging result sets, query expansion, and event-driven change notification.
3. Describe detailed GridIR specifications

The WG will describe a set of specifications based on the OGSA suitable to implement GridIR systems. We will describe a specific set of OGSA services, initially including:

  • GridIR PortType
    • Includes method(s) for discovering metadata about available IR capabilities and collections
  • CollectionManagement PortType
    • Includes methods for defining logical document collections (including document harvesting and transformation rules) and delivering them (as full collections or partial updates)
  • SearchAndPresentation PortType
    • Includes methods for submitting a structured query and manipulating result sets.
  • InformationRetrieval PortType
    • Inherits SearchAndPresentation PortType.
    • Adds methods for index administration.
    • Supplies core IR functionality.
  • QueryProcessing PortType
    • Inherits SearchAndPresentation PortType.
    • Handles distributed asynchronous event-driven queries
    • Presents super-sets of InformationRetrieval PortTypes to user clients
    • Provides methods for result set merging
    • Provides query expansion and other processing services

Milestones

  • GridIR Requirements Document - Stakeholder-driven list of service-level requirements for building a grid-based IR system. Revised draft by GGF7, finalize by GGF8
  • GridIR Architecture Document - Describes overall system comprised of integrated grid services, scenarios, etc. First draft by GGF7, finalize by GGF12.
  • GridIR Specifications Document - Describes each service in detail, with an emphasis on WSDL interface specification. First draft by GGF7, finalize by GGF13.

Website

http://www.gridir.org

This is a static archive of the previous Open Grid Forum Redmine content management system saved from host redmine.ogf.org file /dmsf_files/8906?download=13689 at Fri, 04 Nov 2022 20:10:22 GMT