AVIDD (Analysis and Visualization of Instrument-Driven Data) is a
distributed computing facility designed to process data generated by large
scientific instruments. AVIDD opens new doors for research at Indiana
University because of its novel design, addressing the full life cycle of
data analysis including intake, storage, analysis, and visualization. The
implementation of AVIDD is aimed at a great diversity of sciences, including
the life sciences, geophysics, atmospheric sciences, physics, and chemistry.
There are few, if any, similar efforts in existence today.
As of March 2003, most major components of the AVIDD facility are in place
and operational. As with any experimental system, a considerable amount of
effort is being devoted to development and integration of new software
tools. At the same time we are discovering dramatic research benefits that
this system will produce. For example, Dr. Alex Dzierba of Indiana
University has data that were collected in 1995 from high-energy physics
experiments but have not yet been analyzed for want of adequate computing
facilities. These data will be analyzed using AVIDD.
There are three aspects of the AVIDD facility that are particularly
forward-looking in meeting the coming needs of scientists:
- Managing very large data sets. With a total of 10 TB of spinning disk, and close integration with IU's massive data storage system, scientists will easily be able to manage and analyze multi-TB data sets using the AVIDD facility.
- Real-time data analysis. AVIDD provides facilities for analysis of nonscheduled, real-time data streams while also maintaining high overall usage levels of the computational systems. AVIDD will have the capability to respond to requests for analysis of incoming data streams in real time by preempting analysis tasks that are not time-sensitive in order. This is accomplished through a mix of kernel-level checkpointing (with LAM/MPI), application level checkpointing, and new features being added to the Maui scheduler at IU's request (IU is a charter member of the Maui Consortium).
- Low-cost, distributed visualization environments. IU has developed three relatively low-cost 3D visualization environments, permitting the installation of immersive 3D visualization devices in several labs and shared research areas. Placing immersive 3D visualization environments in or next to research labs will enable scientists to interact with and analyze their data in ways far more valuable than is typically possible today.
Further details on the hardware comprising the AVIDD facility are provided
below:
Computational component AVIDD Linux clusters
The computational component of AVIDD consists of an integrated quartet of
Linux clusters, tied together as a single computational resource, as
follows:
- Two identical IA32-based clusters, one located at IU Bloomington and one located at IUPUI. Each includes 208 2.4 GHz Prestonia processors.
- One IA64-based cluster, containing 36 1.0 GHz McKinley processors.
- One smaller, IA32-based cluster located at IU Northwest in Gary, containing 18 1.3 GHz
PIII processors. This cluster is for instructional use at the IU Northwest campus. (This cluster was funded entirely via a Shared University Research grant from IBM It will play a particularly important role in the educational aspects of the AVIDD cluster, as it is located on a campus with a high percentage of students from traditionally underrepresented groups.)
The aggregate capacity of the entire system will be 1.1 TFLOPS, 0.5 TB RAM,
and 10 TB disk space.
Massive Data Storage
The massive data storage equipment for AVIDD is being incorporated into IU's
existing HPSS installation. Added equipment includes two IBM p640 servers,
1.7 TB of RAID disk, and six StorageTek 9940A drives. It will be possible
to move 1 TB of data from disk cache into the Linux clusters in 2.5 hours.
Visualization
IU has developed and tested three separate types of display systems to be
deployed as part of the AVIDD visualization component:
- Small, portable systems with 3' x 4' passive stereo displays.
- Wall displays, very similar in technology to the portable displays, but with ceiling mounted projectors. (These may be either rear or front projected.)
- Larger, tiled display walls.
Two of the smaller portable systems have been completed, and prototypes of
the other two displays have been tested. Parts for several additional
displays have been ordered. A production agreement for the smaller, portable
displays is being negotiated with a regional manufacturer.
AVIDD is a $3.25 million project funded in part by a grant of $1.8 million
from the National Science Foundation's Major Research Instrumentation (MRI)
Program.
Top of page
Created March 10, 2003
http://www.indiana.edu/~uits/rac/avidd/what.html
Send comments and suggestions to rac at indiana.edu
Copyright 2003, The Trustees of
Indiana University