Internet Publication of Geneva Justice Decisions A case study - - PowerPoint PPT Presentation

internet publication of geneva justice decisions
SMART_READER_LITE
LIVE PREVIEW

Internet Publication of Geneva Justice Decisions A case study - - PowerPoint PPT Presentation

Internet Publication of Geneva Justice Decisions A case study laurent.dami@justice.ge.ch LD, PJ-GE, july 2006 2 Agenda context presentation justice.ge.ch/jurisprudence : short tour technical information some lessons about Perl


slide-1
SLIDE 1

Internet Publication of Geneva Justice Decisions

A case study

laurent.dami@justice.ge.ch

slide-2
SLIDE 2

LD, PJ-GE, july 2006 2

Agenda

context presentation justice.ge.ch/jurisprudence : short tour technical information some lessons about

Perl in the enterprise

slide-3
SLIDE 3

Context presentation

slide-4
SLIDE 4

LD, PJ-GE, july 2006 4

A justice decision

is a structured document

header / facts / law / conclusion may have a 2nd, anonymous version

has a unique identifier (e.g. ACJC/123/2005) has a context (metadata)

date / names / topic / keywords / summary / etc.

is archived into a collection

minutes du TA / CJC / TPI / etc.

slide-5
SLIDE 5

LD, PJ-GE, july 2006 5

Lifecycle

receive case investigate write project deliberate finalize send supply context archive clerk judge college

slide-6
SLIDE 6

LD, PJ-GE, july 2006 6

Electronic archive : requirements

store document

multiple formats fulltext indexing

store metadata

structured fields quick search (unstructured!)

intelligent presentation

automatic hyperlinks

  • ffline / CDROM copies per collection
slide-7
SLIDE 7

LD, PJ-GE, july 2006 7

Some figures

Intranet : 20 – 30 collections

Internet : only 2 collections for the moment

500 to 50000 decisions per collection

for about 10 years of data

2 – 50 pages per document

slide-8
SLIDE 8

LD, PJ-GE, july 2006 8

slide-9
SLIDE 9

LD, PJ-GE, july 2006 9

slide-10
SLIDE 10

Short tour

http://justice.geneve.ch/jurisprudence

slide-11
SLIDE 11

LD, PJ-GE, july 2006 11

slide-12
SLIDE 12

LD, PJ-GE, july 2006 12

slide-13
SLIDE 13

LD, PJ-GE, july 2006 13

metadata search metadata search fulltext search fulltext search

slide-14
SLIDE 14

LD, PJ-GE, july 2006 14

slide-15
SLIDE 15

LD, PJ-GE, july 2006 15 Qualité pour agir

slide-16
SLIDE 16

LD, PJ-GE, july 2006 16

slide-17
SLIDE 17

Technical information

slide-18
SLIDE 18

LD, PJ-GE, july 2006 18

Which kind of solution ?

Electronic Doc. Management System

not well suited for multiple disjoint collections approval / workflow not relevant

Database

many fields : too much structure for easy searches (SQL not well suited) see CPAN SQL::KeywordSearch !

slide-19
SLIDE 19

LD, PJ-GE, july 2006 19

Storage of a collection

metadata.txt words.bdb file.{doc,html,pdf} file.{doc,html,pdf} file.{doc,html,pdf} w2docs.bdb positions.bdb

fulltext index in BerkeleyDB format documents flat file

config.txt

slide-20
SLIDE 20

LD, PJ-GE, july 2006 20

Phases for a search

Parse request Metadata search Fulltext search Merge results Sort & slice Contextual excerpts Display

slide-21
SLIDE 21

LD, PJ-GE, july 2006 21

Main Modules

Search::QueryParser Search::Indexer BerkeleyDB Template toolkit File::Tabular::Web File::Tabular CGI AppConfig

not (yet) on CPAN

ModPerl::Registry

slide-22
SLIDE 22

Some lessons about Perl in the Enterprise

slide-23
SLIDE 23

LD, PJ-GE, july 2006 23

Context : Geneva Justice

finished phase 1 (collaborative software,

document management)

  • ngoing phase 2 : rewrite the old COBOL

application for case management using

mod_perl Catalyst DHTML + Ajax

smooth transition

COBOL and Perl must live side-by-side for several years

slide-24
SLIDE 24

LD, PJ-GE, july 2006 24

Acceptance

strong internal resistance

bad image : low-tech, hacking, scripting Perl5 features not known

→objects, namespaces, closures, etc.

not "standard" (i.e. not Java) fear not to be able to maintain and industrialize cheap means "not serious"

but: Perl productivity wins !

slide-25
SLIDE 25

LD, PJ-GE, july 2006 25

Perl Job Market

found more people than expected. But

all coming from US / UK used Perl several years ago, now on Java / PHP missing other skills (modeling, communication, project management)

apparently not enough "average profiles"

few top stars many low-level geeks Perl not taught at school !

slide-26
SLIDE 26

LD, PJ-GE, july 2006 26

Industrialization

Release management :

granularity mismatch

production guys want

→big tarballs →few updates →strict release process

development guys want

→small and frequent updates using cpan / cpanplus / minicpan →fast release process, short feedback loop

slide-27
SLIDE 27

LD, PJ-GE, july 2006 27

Development

TMTOWTDI

yet developers need guidance

→many thanks to Damian Conway ! →IDE

CPAN : how to manage proliferation

→cpanratings not exhaustive/reliable enough →which modules were rated by <some_guru> ? →inverse dependencies

slide-28
SLIDE 28