Multi-Scale Computing: Integrating Local, Regional, National and - - PowerPoint PPT Presentation

multi scale computing integrating local regional national
SMART_READER_LITE
LIVE PREVIEW

Multi-Scale Computing: Integrating Local, Regional, National and - - PowerPoint PPT Presentation

Multi-Scale Computing: Integrating Local, Regional, National and International Grids through the TIGRE Project Alan Sill, Ph.D TIGRE Senior Scientist, High Performance Computing Center Adjunct Professor of Physics, Texas Tech University Goals


slide-1
SLIDE 1

Multi-Scale Computing: Integrating Local, Regional, National and International Grids through the TIGRE Project

Alan Sill, Ph.D TIGRE Senior Scientist, High Performance Computing Center Adjunct Professor of Physics, Texas Tech University

slide-2
SLIDE 2

Goals of the TIGRE Project

Provide a grid infrastructure that enables integration of computing systems, storage systems, databases, visualization labs and displays, even instruments and sensors across Texas. Facilitate new academic - government - private industry research partnerships by dramatically enhancing both computational capabilities and research infrastructure. Address research areas of interest to the State of Texas in which manifold increase of computing power, data access, and collaboration are necessary. Demonstrate new, preferred, enhanced or increased computing and storage handling capabilities offered by a statewide grid infrastructure.

slide-3
SLIDE 3

Background and History

SuperComputing 1998: representatives from five Texas institutions met and agreed to cooperate and exchange notes on a variety of topics of interest in computing. High Performance Computing Across Texas (HiPCAT) was born as a result. TIGRE was proposed as one of HiPCAT’s first projects. (Rice, Texas Tech. U. of Houston, Texas A&M, and UT/TACC.) Combined by Legislature into

  • ne funding bill with LEARN:

(Lone Star Education and Research Network)

TIGRE

$2.5M $7.3M Fiber Optic Networking Grid Software Development

slide-4
SLIDE 4

Project Organization

Targeted application areas were selected by the steering committee, which also has responsibility for the overall direction of the project. These initial areas are:

Biosciences and Medicine Air Quality Modeling Energy Exploration

Technical implementation is carried out by the development team, consisting of two people at each of the five primary TIGRE institutions. Technical work organized by the development team into activities to meet project milestones, with work targeted to meet these milestones in collaboration across institutions.

slide-5
SLIDE 5

Funding: Texas Enterprise Fund

Created as a tool to “create jobs in Texas.” Accompanied by a formal Economic Development Agreement with State. TEF intends to “create quality jobs and leverage private investment” to strengthen economic future of the state. This mandate differs from that of many grids.

slide-6
SLIDE 6

Project Progress to Date

Goal is to achieve “quick build” toward working status:

YEAR 1

Q1: Project plan ✔ Web site ✔ Certificate Authority ✔ Minimum testbed requirements ✔ Select 3 driving applications ✔ Q2: Alpha quality user portal ✔ Q3: Define server software stack ✔ Distribution Mechanism ✔ Simple demo of 1 TIGRE app ✔ Q4: Alpha client software stack and installation method distributed ✔

YEAR 2

Q1: Alpha customer management services system deployed & demonstrated; applications in three application areas (in progress) Q2: Project-wide global grid scheduler deployed (in progress) Q3: Stable software status (only bug fixes after this) Required services for TIGRE specified Q4: Complete hardening of software Complete documentation Finalized procedures and policies to join TIGRE Demonstrate TIGRE at SC

... So Far Making Excellent Progress!

slide-7
SLIDE 7

Details of TIGRE Software

Based on the VDT, working in close cooperation with OSG. Based on Gram4 (Web Services); pre-WS only upon request. Client and server software stacks available. Goal is “one page” installation instructions that can be implemented quickly by newcomers. Uses a simplified set including GSI-OpenSSH, omitting much monitoring in favor of GPIR for lightweight status reporting. Installed on systems at all five primary TIGRE institutions; also running at several other locations throughout the state. Authentication via X.509 (IGTF + TACC CA recently accredited); authorization local mostly via grid-mapfiles. (TTU uses GUMS/PRIMA.)

slide-8
SLIDE 8

Server and Client Software Stack Contents

Contents of the client software stack:

Globus 4.x pre-web services and web services clients GSI OpenSSH client UberFTP MyProxy client Condor-G

Contents of the cluster/compute server stack:

Globus 4.x GRAM4 (web services) server GSI-OpenSSH server (note: as of this version works with PRIMA) GPIR monitoring (added manually after the VDT pieces) The client software stack

Storage is not yet resolved in TIGRE, but initial investigations using SRM/DRM, SRB/iRODS, etc. are in progress.

slide-9
SLIDE 9

TIGRE User Portal

http://tigreportal.hipcat.net)

slide-10
SLIDE 10

Application Progress: Biosciences and Medicine

Initial demo was “UltraScan” : an analysis tool for reduction

  • f data from ultracentrifuge biomolecular optical spectra.

Researchers (Borries Demeler, UTHSC-San Antonio w/postdoc Emre Brookes): “We can do science we never did before!”

slide-11
SLIDE 11

Other Biosciences and Medicine Progress

Beginning exploration of cancer radiotherapy modeling involving MD Anderson Cancer Center Houston, Joe Arrington Cancer Center and TTUHSC Southwest Cancer Center (Lubbock). Computationally intensive, good fit to grid computing and secure data transfer methods. Good fit to educational medical physics modeling needs. One of our most exciting near- term applications.

Identity management, security, privacy, license and export control requirements!

slide-12
SLIDE 12

Application Progress: Air Quality Modeling

One of the most computationally intensive application areas in Texas. Researchers identified at Texas A&M, U. Houston and TTU; application areas diverse and hard to separate from other general atmospheric modeling topics. Relevant code (WRF, MM5, etc.) exists at most institutions. Large amount of data generated; archival and redundancy, local access to data are among the relevant issues. Decide to start with data redundancy aspects while investigating more general code and framework needs. Goal: 31 day pool cache of modeling output results (~2 TB) to be shared among participating institutions.

slide-13
SLIDE 13

Application Progress: Energy Exploration

Also an existing area of computationally intensive use within the state. Complications stem from industry sensitivities to shared access to resources and need to protect proprietary data. Research topics still nonetheless exist in abundance. Working with different researchers at all five TIGRE primary institutions. Beginning work with Kalman-filter based modeling methods for oil reservoir seismic data analysis. Some license management issues, minimized by degree to which researchers already own or have access to licenses.

slide-14
SLIDE 14

Authentication and Authorization

Identity management and local authorization for virtual

  • rganizations within institutional boundaries is a very hot

topic among institutional CIOs and the security organizations that serve them. Within Texas, we have one large TIGRE participant, the UT system, that has adopted Shibboleth for their own internal IdM federated across campuses, others that are looking at it and related products, and some that are not showing any signs

  • f adopting it, as far as we can tell.

Any solution we adopt has to work with both Shibbolized and non-Shibbolized campuses. So far we use X.509 certificates in combination with local gridmap or GUMS/PRIMA authz; TACC is working on an IGTF member-integrated (“MICS”) credential service profile. Other related work is in progress.

slide-15
SLIDE 15

Storage Topics

TIGRE needs to work with a wide variety of storage models and data file sizes, compositions and types. Project formulation did not address this topic specifically. Atmospheric modeling application area has driven us to consider this topic more quickly than we expected. So far we have been working to understand options for SRM- interfaced storage and tools for grid-authenticated parallel transfer, access and storage. Plain gridftp, srmcp, srmcopy, RFT and GSI interfaces to SRB and/or iRODS being studied and tested. GridImage from NCI CaBIG (CaGrid) project also of interest for DICOM image transfers in radiotherapy applications.

slide-16
SLIDE 16

Lessons Learned

It is important to note that the applications being pursued in this project are the result of selection of areas by the steering committee and not (although they are compatible) from the results of natural selection processes that tend to create collaboration-oriented VOs. People tend to need a reason to collaborate. The availability

  • f resources on which to run is more of a problem (in the

chicken-or-egg sense) when trying to create sub-VOs and collaborations from scratch. In each case we have been able to make progress by putting people into the field to learn needs of researchers and to try to pull people and resources together to pursue common needs.

slide-17
SLIDE 17

Outreach

Outreach consists of pursuing needs beyond your comfort zone (by definition, think about the meaning of the word). In a grid context, this means doing things with the tools that we have not done before. Explaining has to be balanced with learning on the part of both the grid development team and the researchers. When the tools do not do what we want, we have to make them do what we (and the researchers) want them to. In many cases, this involves outreach to university decision makers (IT CIOs, Presidents, Provosts) as well as to the reseachers themselves to explain the need for new methods of working together as well as for networks, computers, etc.

slide-18
SLIDE 18

Future Efforts

Expect to work more closely with other nearby regional and state-wide grids (SURA, LONI, Rio Grande effort, etc.) and continue to work with OSG, Teragrid in the near-term future. Project efforts have already led to one modest-scale (order $100K) proposal for commercialization with industry. Expect to complete demonstrations in all three targeted application areas soon, with more work coming. Web services continue to be our primary technical direction. Discussions with the Steering Committee in progress on other future efforts; many ideas being discussed. Value our relationship with other national-scale, international, regional and local projects. Collaborations welcome!

slide-19
SLIDE 19

Conclusions

TIGRE set out to involve staff and researchers at each of five major Texas universities in production-quality grid software. We have succeeded in this effort. Demos in targeted application areas are complete or in progress. We have a simple software stack that can serve as an entry point for researchers and institutions into other large-scale national, international, regional and local projects. TIGRE has primarily targeted outreach, defined for this purpose as bringing grid software to areas that need it and/or to researchers who are not yet familiar with its use. The mandate for this type of work is strong and extends, in my opinion, into the long term future.

slide-20
SLIDE 20

TIGRE

Texas Internet Grid for Research and Education A HiPCAT Project