DTAI Thesis Topics Dept. Computer Science KU Leuven 2018-2019 - - PowerPoint PPT Presentation

dtai thesis topics
SMART_READER_LITE
LIVE PREVIEW

DTAI Thesis Topics Dept. Computer Science KU Leuven 2018-2019 - - PowerPoint PPT Presentation

DTAI Thesis Topics Dept. Computer Science KU Leuven 2018-2019 http://people.cs.kuleuven.be/~luc.deraedt/dtaithesis18-19.pdf Luc De Raedt Lab for Declarative Languages and Artificial Intelligence Machine Learning 4 ZAP , 1 res. manager, 1


slide-1
SLIDE 1

DTAI Thesis Topics

  • Dept. Computer Science

KU Leuven 2018-2019

http://people.cs.kuleuven.be/~luc.deraedt/dtaithesis18-19.pdf Luc De Raedt

slide-2
SLIDE 2

2

Lab for Declarative Languages and Artificial Intelligence

Machine Learning

Declarative Languages and Systems

4 ZAP , 1 res. manager, 1 res. expert ± 4 post-docs ± 25 Ph.D. students 5 ZAP ± 3 post-docs ±12 Ph.D. students

Bruynooghe retired

slide-3
SLIDE 3

AI is hot!

Self-driving cars - Eve (the robot scientist) Siri IBM Watson in Jeopardy and “Machine Reading” AlphaGo — (Deep) learning …

3

slide-4
SLIDE 4

DTAI's focus on AI

4

Machine Learning & Data Mining how to extract knowledge from data Uncertainty reasoning how to represent and reason about uncertainty Knowledge Representation how to represent and reason about knowledge

slide-5
SLIDE 5

DTAI's focus on Declarative Languages

5

Declarative = specify the what rather than the how Different types of languages Logic Functional Constraints Probabilistic

slide-6
SLIDE 6

DTAI's methodology involves

6

Fundamental research (theoretical as well as empirical) Systems, Solvers and Software Applications

Thesis can focus on one or more aspects, depending on interests student This presentation does not go in depth about techniques but every thesis does

slide-7
SLIDE 7

This presentation

Overview of research illustrations of possible thesis topics. List of contact persons for topics Full information — see online Own topic should be aligned with interests professor

7

slide-8
SLIDE 8

Research Topics

Probabilistic Programming and Statistical Relational Learning Predictive Learning and Clustering Graph and Network Mining Static Analysis for Declarative Programming Languages Exploratory Data Mining Privacy, Non-discrimination and Ethical aspects Knowledge-Base Systems Constraints

(dtai.cs.kuleuven.be/research)

Functional Programming Automated Data Science Verification of AI and ML

slide-9
SLIDE 9

9 Sports Analytics Text and Web Health Games

Applications

Engineering & Sensors Humor (Comp. Creativity) Robotics Applications of the Knowledge Base System Paradigm

(dtai.cs.kuleuven.be/research)
slide-10
SLIDE 10

Research topics

slide-11
SLIDE 11

11

“Standard” machine learning develop new algorithms for machine learning Decision Trees Predictive Clustering Probabilistic Graphical Models evaluation of machine learning (ROC etc.)

Predictive learning and clustering

Contact: Hendrik Blockeel, Jesse Davis

slide-12
SLIDE 12

12

Automated Data Science

Contact: Luc De Raedt, Anton Dries

Can we (partly) automate data science ? Can we automatically derive the right features ? the right representations ? Can we automatically discover what we can learn / predict ? Can we learn constraints ? Example database about students, professors, courses, and marks … The SYNTH project — the democratisation of Data Science the automation of Data Science

slide-13
SLIDE 13

13

Automated Data Science

Contact: Luc De Raedt, Anton Dries Inductive Programming

FlashFill in Excel

slide-14
SLIDE 14

14

Automated Data Science

Contact: Luc De Raedt, Anton Dries Learning constraints

I What are the formulas here? I

T1[:, 6] = SUM(T1[:, 3:5], row)

I

T2[:, 2] = SUMIF(T1[:, 1]=T2[:, 1], T1[:, 6])

slide-15
SLIDE 15

15

Key open question in AI — integrate

Probabilistic reasoning Logical or relational representations Machine learning

Contact: Luc De Raedt, Hendrik Blockeel, Jesse Davis, Gerda Janssens

statistical relational learning

Probabilistic Programming and Statistical Relational Learning

probabilistic programming

slide-16
SLIDE 16

16

E.g. ProbLog: a probabilistic Prolog P( hears_alarm(john) | burglary = true) ? Challenges on inference, learning, implementation, application, ...

i

0.05 :: burglary. 0.01 :: earthquake. 0.7 :: hears_alarm(john). 0.6 :: hears_alarm(mary). alarm :- burglary. alarm :- earthquake. calls(Pers) :- alarm, hears_alarm(Pers).

Probabilistic Programming and Statistical Relational Learning

slide-17
SLIDE 17

Action and activity learning / Dynamics

17

Travian: A massively multiplayer real-time strategy game Commercial game run by TravianGames GmbH ~3.000.000 players spread over different “worlds”

[Thon et al. ECML 08]

Can we build a model of this world ? Can we use it for playing better ?

Probabilistic Programming and Statistical Relational Learning

slide-18
SLIDE 18

18

Contact: Luc De Raedt, Hendrik Blockeel, Jesse Davis, Bettina Berendt & Wannes Meert

Verification of software has a long tradition (eg model checking techniques) How to verify systems that learn ? that use AI ? Our approach — combined principles of probabilistic logics with verification Topics inductive synthesis of specifications Markov Decision Processes (& reinforcement leanring) Derive properties of learned systems …

Verifying AI & ML systems

slide-19
SLIDE 19

19

Learn probabilistic - logic model

Shelf push Shelf tap Shelf grasp

Moldovan et al. ICRA 12, 13, 14

Robotics

Contact: Luc De Raedt

slide-20
SLIDE 20

Contact: Luc De Raedt

Robotics (and Vision)

The visual genome

slide-21
SLIDE 21

21

Contact: Bettina Berendt

Help users manage friends and privacy by data mining

Socially Aware Data Mining

Focus on Privacy and (anti-discrimination)

Graph and Network Mining

slide-22
SLIDE 22

Contact: Bettina Berendt, Jesse Davis

Extraction of information from the web / social media Taxonomy learning Machine reading / Natural language processing NaturalMachine reading …

22

Text and Web

slide-23
SLIDE 23

Contact: Marc Denecker, Gerda Janssens

IDP Advanced KBS system developed by group FO(.) language rooted in predicate logic and logic programming separation of domain knowledge and problem solving Language extensions to increase expressivity E.g. design patterns for FO(.) (past thesis) Better solvers and more inference methods E.g. a solver for rational numbers (past thesis)

23

Knowledge-Base Systems

slide-24
SLIDE 24

Contact: Marc Denecker, Gerda Janssens

Three themes for students : logical modeling of interesting AI problem + expressing AI knowledge domains logical analysis and implementation of software systems and tasks + software by applying inference

  • n specifications

Advanced algorithmics and implementation + extending/optimising the IDP software package.

24

Knowledge-Base Systems

slide-25
SLIDE 25

Analysing medieval manuscripts

  • monks copied texts
  • resulting in variants (colors)
  • reconstruct history

vocabulary Vms { extern vocabulary V IsSource(Manuscript ) } theory Tms : Vms { { ! x : IsSource(x) <- ~ ? y : CopiedBy(y, x) & VariantIn(y) = VariantIn(x). } } term NbOfSources : Vms { #{ x : IsSource(x) } } procedure minSources(feature) { setvocabulary(feature, Vms) return minimize(Tms, feature, NbOfSources)[1] }

25

  • special-purpose datamining-program: 400 lines of Perl, bugs
  • description problem in IDP: 15 lines, correct, somewhat faster

Logical modeling of AI problems

Applications of the Knowledge Base System Paradigm

slide-26
SLIDE 26

Contact: Marc Denecker, Gerda Janssens

Software = Knowledge Base + Logical Inference + User Interface E.g., An interactive configuration system for an insurance company AIM : Build cheap, correct, reusable, maintainable software from a logical specification

26 Applications of the Knowledge Base System Paradigm

slide-27
SLIDE 27

27 Applications of the Knowledge Base System Paradigm

Winning the RuleML Challenge Insurance application Propagation constraints and choices Fill out necessary values

slide-28
SLIDE 28

Contact: Marc Denecker, Gerda Janssens

Advanced algorithmics and implementation + extending/

  • ptimising the IDP software package.

help us win the next CP or ASP competition + E.g., structuring search space as a hierarchy of search problems + E.g., linear programming techniques in IDP + E.g., improved computation of definitions + E.g., algorithms for revision inference (updating solutions)

28

Knowledge-Base Systems

slide-29
SLIDE 29

Contact: Tom Schrijvers, Marc Denecker, & Luc De Raedt

29

Constraints

  • Hyper heuristics to solve constraint satisfaction

and optimization problems — formalisation

  • Search Heuristics
  • Role in IDP
  • Role in Data Mining
  • Learning of constraints
slide-30
SLIDE 30

30

Functional Programming Contact: Tom Schrijvers

Functional Programming

Haskell

★ Explicit Side-Effects


★ Advanced Type Systems


★ Domain-Specific Languages


★ Much more…

Monads Effect Handlers Transformers Type Classes Polymorphism Kinds Design Infrastructure Applications

UITLEG: Je kent Functional Programming van de taal Haskell uit het vak Declaratieve Talen. Op onderzoeksgebied werken we rond alle aspecten van functionele talen, en Haskell in het bijzonder. Actuele onderwerpen zijn:

  • expliciete side-effects zoals monads,
  • gevorderde type system features
  • domein-specifieke talen
  • en nog veel meer
slide-31
SLIDE 31

31 Functional Programming

λ calculus 1936 Alonzo Church 1958 Lisp John McCarthy 1973 ML Robin Milner 1987 Haskell Haskell Committee 2014 Java 8 Swift 2011 C++11

Functional Languages Mainstream

2007 C#

FP now mainstream

Widespread Adoption

Haskell Language + GHC Compiler Finance Many Others Telecom

Haskell in
 industry

UITLEG: Heel wat interessante uitdagingen komen voort uit de gr mainstream adoptie van Functional Pr Hoe langer hoe meer bedrijven gaan aan de slag met functionele talen zoals Haskell en F# (F-sharp), en mainstream talen zoals Java en C# adopter concepten.

slide-32
SLIDE 32

201: The Oracle of Haskell

32 Functional Programming

GHC compiler abs x
 | x >= 0 = x
 | x < 0 = -x your oracle

✓exhaustive guards

UITLEG:

  • ntwikkel een orakel dat nagaat of

guards in Haskell-programma’s alle gevallen dekken

slide-33
SLIDE 33

Static Analysis for
 Declarative Programming Languages Contact: Tom Schrijvers, Gerda Janssens 


33 Declarative Programming Languages

★ Type Checking
 ★ Termination Analysis
 ★ Reasoning about Coroutines

UITLEG: Je kent de Declaratieve Taal Prolog uit het vak Declaratieve Talen. Op onderzoeksgebied werken we rond de automatische analyse van Prolog-programma’s. Actuele onderwerpen zijn:

  • een type checker om Prolog statisch getypeerd te maken
  • de eindigheid van programma’s te bepalen
  • analyseren van complexe control flow zoals coroutines
slide-34
SLIDE 34

34 Declarative Programming Languages

Automatically Inferring Properties of Interest

append([],L,L). append([X|Xs],Ys,[X|Zs]) :-
 append(Xs,Ys,Zs).

powerful dynamic flexible

  • ptimisation

correctness termination

UITLEG: Delcaratieve talen zoals Prolog zijn heel krachtig, dynamisch en flexibel. De uitdaging bestaat erin om automatisch belangrijke eigenschappen af te leiden van Prolog programma’s om na te gaan of ze correct zijn, altijd eindigen en hoe je ze efficient kan compileren.

slide-35
SLIDE 35

bugs

Delcarative Programming Languages

Industrial-Strength
 Static Types for Prolog

35

Prosyn Expert System 
 1 MegaLoC Prolog

Case Study: Industrial Partner

Prolog Program Types

+

your type checker

UITLEG: Prolog is een ongetypeerde taal. Hierdoor is het makkelijke om via schrijffouten moeilijk op te sporen bugs te introduceren. In deze thesis ontwikkel je een type systeem voor Prolog: De programmeur schrijft type-signaturen voor zijn predikaten, en jouw type checker gebruikt die om bugs op te sporen. Je evalueert je type checker op het Prosyn expert systeem van onze industriele partner. Dat bestaat uit 1 miljoen lijnen Prolog code.

slide-36
SLIDE 36

Application Areas

slide-37
SLIDE 37

37

Contact: Wannes Meert

Industry

  • e

Theses with:

Boeing Jetairfly EuroMillions Basketball League 3E Sirris Thomson-Reuters Xenit Pepite Melexis Flanders Make imec Cern …

slide-38
SLIDE 38

38

Machine Learning for sports Soccer & basketball E-sports

Sports Analytics

Contact: Jesse Davis

slide-39
SLIDE 39

Tasks

Strategy detection Performance analysis & prediction Scouting

39

Sports Analytics

slide-40
SLIDE 40

Thesis Topics

Soccer analytics

Model flow of a game Quantify team performance Learn aging curves of players

Basketball analytics

Detect surprising events

40

Sports Analytics

slide-41
SLIDE 41

Tasks

Continuous monitoring Injury risk profiles

41

Health

slide-42
SLIDE 42

Thesis topics

Performance management and Injury prevention
 Sensor fusion for surface detection and skill detection in runners
 Kinect monitoring for qualitative feedback during rehabilitation

42

Health

slide-43
SLIDE 43

43

Engineering & Sensors

Contact: Wannes Meert, Jesse Davis

43

http://www.pom-sbo.org http://www.pom2sbo.org ejector

Mebios-KU Leuven setup

slide-44
SLIDE 44

Engineering & Sensors

Contact: Wannes Meert, Jesse Davis, Luc De Raedt

Badminton-spelende robot: http://www.youtube.com/watch?v=StPZLZq01Xs

See also Health & Sports

slide-45
SLIDE 45

Engineering & Sensors

Contact: Wannes Meert, Jesse Davis, Luc De Raedt

Large Hadron Collider maintenance (CERN)

  • ng

Analysing data from airplanes

slide-46
SLIDE 46

Games

learning to solve science tests formulated in natural language (like SAT, GMAT, GRE, …) Tests as a testbed for intelligent behavior, for “reasoning” Allen AI Institute, Levesque’s Winograd test, IBM Watson …

46

AI Challenges

Contact: Luc De Raedt, Jesse Davis, Anton Dries, Hendrik Blockeel

slide-47
SLIDE 47

T h r e e m a c h i n e s A , B a n d C p r

  • d

u c e 5 p e r c e n t , 3 p e r c e n t a n d 2 p e r c e n t

  • f

t h e t

  • t

a l p r

  • d

u c t i

  • n

r e s p e c t i v e l y . T h e p e r c e n t a g e

  • f

d e f e c t i v e p i e c e s i s 3 p e r c e n t , 4 p e r c e n t a n d 5 p e r c e n t r e s p e c t i v e l y . O n e c h

  • s

e s a p i e c e . I t i s d e f e c t i v e . W h a t i s t h e p r

  • b

a b i l i t y t h a t i t

  • r

i g i n a t e s f r

  • m

m a c h i n e A ?

Problem

A die is thrown 3 times. Find the probability that the sum of the dots is at least 5. M i k e h a s a b a g w i t h 4 r e d m a r b l e s a n d 3 g r e e n m a r b l e s . H e t a k e s

  • n

e m a r b l e f r

  • m

t h e b a g a n d i t i s r e d . W h a t i s t h e p r

  • b

a b i l i t y t h a t t h e s e c

  • n

d m a r b l e h e t a k e s f r

  • m

t h e b a g i s a l s

  • r

e d ? In a group of 10 people, 60 percent have brown eyes. Two people are selected from the group. What is the probability that neither of them has brown eyes? Suppose 0.1 percent of the population is infected with a certain disease. On a medical test for the disease, 98 percent of those infected give a positive result while 1 percent of those not infected give a positive

  • result. If a randomly chosen person is

tested and gives a positive result, what is the probability the person has the disease? A g i n h a n d c

  • n

s i s t s

  • f

1 c a r d s f r

  • m

a d e c k

  • f

5 2 c a r d s , c

  • n

t a i n i n g 1 3 h e a r t s , 1 3 d i a m

  • n

d s , 1 3 c l u b s , a n d 1 3 s p a d e s . F i n d t h e p r

  • b

a b i l i t y t h a t a g i n h a n d h a s a l l 1 c a r d s

  • f

t h e s a m e s u i t .

GOAL: solve the problem directly from text

slide-48
SLIDE 48

48 Luc De Raedt

Artificial intelligence, reasoning about uncertainty, action- and activity learning, machine learning, data mining, constraint programming, probabilistic programming (ProbLog), automated data science, language for mining and learning. Applications in natural language, vision, robotics, automatic programming. Verification of AI and ML.

Hendrik Blockeel

Machine learning, data mining, probabilistic logics, declarative languages for data mining. Application domains include bio-informatics, arts, history, compiler development,

  • ptimization.

Jesse Davis

Machine learning, data mining for personalized medicine. Artificial intelligence, statistical relational learning, transfer learning Applications in healthcare (e.g., clinical practice, physical therapy, medical and biological texts, etc.). Applications to sport (e.g., football and basketball)

Bettina Berendt

Web mining, privacy, social media, user issues

Wannes Meert

Probabilistic programming and methods. Data Science Applications. Applications in

  • engineering. Collaborations with industry.

Anton Dries

Constraint programming, probabilistic programming, data mining. Automated Data Science. Design and implementation of AI systems and their applications.

slide-49
SLIDE 49

49 Gerda Janssens

Performant probabilistic ILP data mining systems, integration of logic programming techniques in the knowledge representation language FO(.), program analysis and abstract interpretation, implementations of logic programs, verification of functional equivalence of C programs

Danny De Schreye

Computational creativity in Humor

Marc Denecker

Constraint programming, Knowledge Base Systems, SAT solving, declarative languages (formal modelling languages), Applications in configuration, scheduling, optimization, security, business rule systems, executable formal software specifications, logical workflow languages.

Tom Schrijvers

functional programming, constraint and logic programming, type systems, programming language theory, programming language design and implementation, program analysis

Check out dtai-web for more details

slide-50
SLIDE 50

Questions ?

Advisable to contact promotors or daily advisors before selecting a topic Also, attend thesis info market after Easter Holidays