debian data export
play

Debian Data Export A standard for publishing Debian information. - PowerPoint PPT Presentation

Debian Data Export A standard for publishing Debian information. Enrico Zini enrico@debian.org Feb 7, 2009 15 slides Enrico Zini (enrico@debian.org) Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 1/15 Debian Data Export Debian: the data


  1. Debian Data Export A standard for publishing Debian information. Enrico Zini enrico@debian.org Feb 7, 2009 15 slides Enrico Zini (enrico@debian.org) Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 1/15 Debian Data Export

  2. Debian: the data hell ● Package files, for binary packages ● Format: rfc822-like ● Split per distribution, then per architecture ● Package files, for source packages ● Same as before ● Maintainer <-> Source package mapping ● Available at DDPO, deprecated ● Available at BTS, somehow Enrico Zini enrico@debian.org ● Extractable from source packages ● Available at projectb ● Debtags information ● From /var/lib/debtags or from Alioth Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 2/15 Debian Data Export

  3. Debian: the data hell ● Debtags vocabulary ● Extra debtags sources ● Popcon rankings ● Bug information ● Changelogs ● .desktop files of packages not installed ● New queue ● Screenshots Enrico Zini enrico@debian.org ● Apt-file information ● http://ftp-master.debian.org/~joerg/pkg-nums ● http://ftp-master.debian.org/~joerg/arch-space ● License information Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 3/15 Debian Data Export

  4. Debian: the data hell ● Localisation information ● uscan status ● Buildd logs ● BTS data ● sloccount run results ● Debian Weather ● Debian Pure Blend specific information ● UDD! Enrico Zini enrico@debian.org Can you think of more? Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 4/15 Debian Data Export

  5. Debian: the data formats hell ● rfc822-like files ● Description subformat ● Tag subformat ● Dependency subformat ● SOAP interfaces ● LDAP interfaces ● SQL interfaces ● Lots of ad-hoc formats Enrico Zini enrico@debian.org ● HTML scraping ● post-processing occasionally needed Can you think of more? Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 5/15 Debian Data Export

  6. Debian: the data access hell ● Something in mirrors ● Something on people.debian.org ● Something on specific Debian machines ● Something other machines elsewhere ● Something can only be accessed FROM specific machines ● Something can only be computed on the user's system Enrico Zini enrico@debian.org Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 6/15 Debian Data Export

  7. My general goals ● Producing data should be easy. The major task should be computing it: all the rest should be a no brainer ● Finding data should be easy ● Getting data should be easy ● In terms of protocol to download it ● In terms of format to parse it More wishes? Enrico Zini enrico@debian.org Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 7/15 Debian Data Export

  8. My specific goals ● debtags.debian.net ● Must (ideally) index information for all packages in Debian, Ubuntu, Pure Blends, other derivatives, all distros, all arches. ● Only one version per package. If a package is in more than one distribution, I want to use the data in testing . ● Autocompletion in web form fields ● Of binary package names ● Of source package names Enrico Zini enrico@debian.org ● Of maintainer names ● Of ... ● Machine readable interface to all the data that I produce Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 8/15 Debian Data Export

  9. The solution (so far) A demo should happen now. Enrico Zini enrico@debian.org If you are reading the slides after the presentation, you may find the video in the Debian video archives: http://meetings-archive.debian.net/pub/debian-meetings/2009/ Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 9/15 Debian Data Export

  10. Scope of the DDE data space ● Export views corresponding to common use cases ● Do not reimplement SQL, or LDAP ● For special needs, people can craft a SQL or LDAP query. ● If the need becomes more general, we turn the query into a DDE plugin Enrico Zini enrico@debian.org Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 10/15 Debian Data Export

  11. Uses of DDE Current ● debtags.debian.net, screenshots.debian.net ● Completion in web forms ● Some example mashups ● apt-file without local database Future ● Extra features in package managers ● apt fetcher for extra data Enrico Zini enrico@debian.org ● More external sites to feed (Blends?) ● Switch existing tools to use blend-specific data sources More ideas? Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 11/15 Debian Data Export

  12. Deployment In theory It is a WSGI application, it is trivial to deploy it in any way you like In practice ● CGI does not scale ● Cherrypy 2.x does not allow WSGI apps to stream ● Cherrypy 3.x has a bad chain of conflicts ● Paste won't reload without killing running streams Enrico Zini enrico@debian.org ● Fastcgi needs careful tuning, or you are killed if you run for long ● mod_wsgi runs in apache's process space Help? Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 12/15 Debian Data Export

  13. Scalability In theory All data is read only, there is no state: it's a cache wonderland! ● Put varnish in front of it ● Aggressive cache headers ● Can be replicated, can use DNS round robins In practice ● Demand may reach insane levels Enrico Zini enrico@debian.org ● All web forms making lots of small queries! ● All package managers! ● It's not static data, cannot use the mirror network Ideas? Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 13/15 Debian Data Export

  14. New possibilities opening ● Javascript mashups ● We need to wait for FireFox 3.1 to fix multi-source XMLHTTPRequest ● Currently DDE supports JSONP ● JSONPP is easy to add, but this path makes me sick Enrico Zini enrico@debian.org Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 14/15 Debian Data Export

  15. Who will make it happen? Not me alone. I will chase a couple of itches of mine, but I won't reach DDE's full potential just on myself. Enrico Zini enrico@debian.org If DDE can scratch an itch of yours, I'll be happy to show you the ropes. Fosdem, Brussels, February 7, 2009 - 09:43:46 AM 15/15 Debian Data Export

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend