Torrent-based software distribution
Costin Grigoras Pablo Saiz
ALICE Offline Week – 24.06.2009
Torrent-based software distribution Costin Grigoras Pablo Saiz - - PowerPoint PPT Presentation
Torrent-based software distribution Costin Grigoras Pablo Saiz ALICE Offline Week 24.06.2009 Current way of distributing sw SLC4 SLC4 SLC5 SLC5 32bit 64bit 32bit 64bit Build servers SLC4 Mac Mac Ubuntu AliRoot & deps
Costin Grigoras Pablo Saiz
ALICE Offline Week – 24.06.2009
Build servers AliRoot & deps SLC4 32bit SLC4 64bit SLC5 32bit SLC5 64bit Ubuntu 64bit SLC4 Itanium Mac 32bit Mac 64bit AliEn ALICE::CERN::SE Catalogue Grid Site X VoBox PackMan Shared software area NFS/AFS/... Worker nodes
A single service/site
Shared software area
Difficult to update
Need to keep a short
Worker nodes are independent
Self-consistent software packages are required
No site-local software repository
Avoid overloading central software repositories
Would be nice to be able to quickly update
We are trying to use BitTorrent technology to
package.tar.bz2 package.tar.bz2.torrent (tens of KB) Metadata info of the original file:
* uniquely identifies the file
Chunks of equal size
Tracker Seeder Seeder Client Client Clients that have the complete file and serve it Discovery service: keeps track of who has which files/chunks. HTTP-based protocol Are in the process of downloading the file. Cooperate to download faster.
Build servers AliRoot & deps SLC4 32bit SLC4 64bit SLC5 32bit SLC5 64bit Ubuntu 64bit SLC4 Itanium Mac 32bit Mac 64bit AliEn http://alitorrent.cern.ch Seeder alitorrent:8092 Catalogue torrent://... Grid Site X VoBox Worker nodes Tracker alitorrent:8088
Worker nodes keep seeding the packages that
Other worker nodes will fetch the content mostly
Worker nodes from site A are usually firewalled
If initial download is not possible via torrent, fall
Multiple versions of the same file can co-exist
AliEn itself is packaged in a small (35MB) archive AliRoot, Root & deps. packaged in single archives:
Subatech is used as testbed
LDAP flag to switch modes:
name=Subatech-CREAM,ou=CE,ou=Services,ou=Subatech,ou=Sites,o=alice,dc=cern,dc=ch
Production jobs work fine Analysis jobs fail to load a particular library; most
You can download precompiled packages from
Full-scale testing of the solution Evaluate the need for caching
On worker nodes, as files On VoBox, as seeder Regional seeders All these would require managers
Try to use the solution for distributing data files
Latest version would be fetched at every execution,