Grid Computing for I ndustry Grid Computing for I ndustry – – Early Applications Early Applications
Hing-Yan LEE National Grid Office Singapore
International Symposium on Grid Computing 2007
Grid Computing for I ndustry Grid Computing for I ndustry Early - - PowerPoint PPT Presentation
Grid Computing for I ndustry Grid Computing for I ndustry Early Applications Early Applications Hing-Yan LEE National Grid Office Singapore International Symposium on Grid Computing 2007 Preparing for I ndustry Adoption Preparing
International Symposium on Grid Computing 2007
Resource Usage
Resource Usage Monitoring
Collection of raw data
Metering
Usage charging
Accounting
Organisational level consumer-provider business relationship
$ = f (CPU , memory, license,…) Bill for organization A = ∑ (usage of members
Only stores & reports on information of resource status, no information of users & their jobs Hence, no metering & accounting mechanism [Courtesy of A/Prof. Francis Lee, NTU]
AIST, Japan CNIC, China KISTI, Korea ASCC, Taiwan NCHC, Taiwan UoHyd, India MU, Australia BII, Singapore KU, Thailand USM, Malaysia NCSA, USA SDSC, USA CICESE, Mexico UNAM, Mexico UChile, Chile TITECH, Japan
14 Organizations deployed with MOGAS, 5 of them with GT4 Cindy Zheng, GGF13, 12/10/06 modified by A/Prof. Bu-Sung Lee
MIMOS IOIT-HCM
GT4 GT2
NGO, Singapore QUT, Australia OSAKAU, Japan
– Completed migration. All existing NGPP sites have migrated their host certificates to Netrust certificates. – Continual effort to issue certificates for
– Seeking accreditation from APGrid PMA (part of IGTF)
interfaces with local workload schedulers
(e.g. Sun’s N1GE, Platform’s LSF) of resources on NGPP, and
schedules the job to the
best available resources.
Meta-Scheduler include:
– MicroRNA project (BII) – Media Grid (AE@SG) – DMG Portal (NGPP) – Multipitch Speech project (I 2R)
NUS-SMA
Hydra3 60 CPUs
I MCB
4 CPUs
NGO-GOG
Soursop 78 CPUs
I HPC
Lime 6 CPUs
– To provide licenses and machines to Digital Media companies for their rendering needs, as a means to move them to utilizing services on the Grid – Nurturing the emergence of a Grid Service Provider
– Free access & no charge to commercial & tertiary users – Floating licenses – Support from mental images GmbH
(animation & games)
commissioned by the Surfrider
Foundation of America to
educate the public about how
pollution is harming the country’s coastline.
Legend of Ron Burgundy”, “Bewitched”) & director Ian O’Roarty
was televised nationally in USA in 2006 on various networks.
Tang Chi Sim MD, Omens Studios
Yeo Chun Cheng CIO, Media Development Authority
– IT Department, National Library Board
– Reduce record conversion time from 3 full days to an 8-hour overnight run
– Data processing for monthly reporting
– Distribute the workload across as many idle computers as possible to convert the records in parallel using Condor workload management system – Instead of adding converted records t database residing on central server, each computer has its local database to store converted records – Removes bottleneck during records insertions into a central database, but requires an additional step of merging local databases subsequently
– Speedup of processing using idle compute resources within the enterprise, with no additional investment required.
shared load with other jobs
– IT Department, National Library Board
– Undertake web archival with reasonable investment in compute resources
– Crawl, index & archive web materials of specific interest to Singapore
– Use available compute resources on NGPP to run NutchWAX indexing software to address scalability – Use distributed compute resources on NGPP to run Heritrix crawling software to partition crawling of web sites
– Benchmark results show:
compute resources availability on NGPP
– Short-term: No need for purchase of additional computing resources
– Speech & Dialogue Processing Lab at Institute for Infocomm Research
– Process huge no. of voice data files (36 GB) using multi-pitch program
– Identify & differentiate voices in sound recordings by identifying unique frequencies in each voice – Processing of input file & producing track files is inherently parallelizable
– Re-structure source code of multi-pitch program – Use meta-scheduler to distribute tasks to available compute resources
– Original time taken: 9 months to process on 1 CPU – Speedup: 2 days using 60 server class Xeons CPUs – Results validated by speech analysis expert
Speech Recordings Separate the pitch tracks (fundamental frequency) Determine if there are 0, 1 or 2 people speaking Detect single speaker segments