Automatic Mathematical Information Retrieval to Perform Translations - - PowerPoint PPT Presentation

automatic mathematical information retrieval to perform
SMART_READER_LITE
LIVE PREVIEW

Automatic Mathematical Information Retrieval to Perform Translations - - PowerPoint PPT Presentation

Automatic Mathematical Information Retrieval to Perform Translations up to Computer Algebra Systems Andr Greiner-Petter August 13, 2018 University of Konstanz Germany @GreinerPetter 1/9 Motivation & Problems Motivation - Formulae


slide-1
SLIDE 1

Automatic Mathematical Information Retrieval to Perform Translations up to Computer Algebra Systems

André Greiner-Petter August 13, 2018

University of Konstanz Germany @GreinerPetter 1/9

slide-2
SLIDE 2

Motivation & Problems

slide-3
SLIDE 3

Motivation - Formulae Presentations

DLMF 18.3

A Jacobi polynomial in different systems. Rendered Version: P (α,β)

n

(cos(aΘ)) Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n,alpha,beta,cos(a*Theta)) CAS Mathematica: JacobiP[n,\[Alpha],\[Beta],Cos[a \[CapitalTheta]]]

2/9

slide-4
SLIDE 4

Motivation - Formulae Presentations

DLMF 18.3

A Jacobi polynomial in different systems. Rendered Version: P (α,β)

n

(cos(aΘ)) Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n,alpha,beta,cos(a*Theta)) CAS Mathematica: JacobiP[n,\[Alpha],\[Beta],Cos[a \[CapitalTheta]]]

2/9

slide-5
SLIDE 5

Motivation - Formulae Presentations

DLMF 18.3

A Jacobi polynomial in different systems. Rendered Version: P (α,β)

n

(cos(aΘ)) Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n,alpha,beta,cos(a*Theta)) CAS Mathematica: JacobiP[n,\[Alpha],\[Beta],Cos[a \[CapitalTheta]]]

2/9

slide-6
SLIDE 6

Motivation - Formulae Presentations

DLMF 18.3

A Jacobi polynomial in different systems. Rendered Version: P (α,β)

n

(cos(aΘ)) Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n,alpha,beta,cos(a*Theta)) CAS Mathematica: JacobiP[n,\[Alpha],\[Beta],Cos[a \[CapitalTheta]]]

2/9

slide-7
SLIDE 7

Presentation To Computation with semantic information

slide-8
SLIDE 8

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n, alpha, beta, cos(a*Theta)) Potential Problems:

  • Differences in syntax
  • Function is not implemented in one system,
  • Function has multiple representations in one system,
  • Differences in definitions.

3/9

slide-9
SLIDE 9

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n, alpha, beta, cos(a*Theta)) Potential Problems:

  • Differences in syntax
  • Function is not implemented in one system,
  • Function has multiple representations in one system,
  • Differences in definitions.

3/9

slide-10
SLIDE 10

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP($2, $0, $1, $3) Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • Function has multiple representations in one system,
  • Differences in definitions.

3/9

slide-11
SLIDE 11

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n, alpha, beta, cos(a*Theta)) Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • Function has multiple representations in one system,
  • Differences in definitions.

3/9

slide-12
SLIDE 12

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple:

DLMF 18.5.7

Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • translate equivalent presentations
  • Function has multiple representations in one system,
  • Differences in definitions.

n

  • ℓ=0

(n + α + β + 1)ℓ(α + ℓ + 1)n−ℓ ℓ! (n − ℓ)!

x − 1

2

3/9

slide-13
SLIDE 13

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP or Jacobi or JacobiPoly Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • translate equivalent presentations
  • Function has multiple representations in one system,
  • Differences in definitions.

3/9

slide-14
SLIDE 14

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP or Jacobi or JacobiPoly Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • translate equivalent presentations
  • Function has multiple representations in one system,
  • just pick a valid translation
  • Differences in definitions.

3/9

slide-15
SLIDE 15

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n, alpha, beta, cos(a*Theta)) Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • translate equivalent presentations
  • Function has multiple representations in one system,
  • just pick a valid translation
  • Differences in definitions.

3/9

slide-16
SLIDE 16

Problems of Translations

DLMF 18.3

Semantic L

AT

EX: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} CAS Maple: JacobiP(n, alpha, beta, cos(a*Theta)) Potential Problems:

  • Differences in syntax

← solved by translation patterns

  • Function is not implemented in one system,
  • translate equivalent presentations
  • Function has multiple representations in one system,
  • just pick a valid translation
  • Differences in definitions. ← wait... What?

3/9

slide-17
SLIDE 17

Problems of Translations

DLMF 4.23.9 Maple Inv. Trig. Functions

Rendered Version Semantic L

AT

EX CAS Maple arccot(z) \acot@{z} arccot(z)

4/9

slide-18
SLIDE 18

Problems of Translations

DLMF 4.23.9 Maple Inv. Trig. Functions

Rendered Version Semantic L

AT

EX CAS Maple arccot(z) \acot@{z} arccot(z) Maple

Figure 1: ℜ(arccot(z)) with branch cut at [−∞i, −i], [i, ∞i].

DLMF & Mathematica

Figure 2: ℜ(arccot(z)) with branch cut at [−i, i].

4/9

slide-19
SLIDE 19

Problems of Translations

DLMF 4.23.9 Maple Inv. Trig. Functions

Rendered Version Semantic L

AT

EX CAS Maple arccot(z) \acot@{z} arccot(z) Maple

Figure 1: ℜ(arccot(z)) with branch cut at [−∞i, −i], [i, ∞i].

DLMF & Mathematica

Figure 2: ℜ(arccot(z)) with branch cut at [−i, i].

4/9

slide-20
SLIDE 20

Problems of Translations

DLMF 4.23.9 Maple Inv. Trig. Functions

Rendered Version Semantic L

AT

EX CAS Maple arccot(z) \acot@{z} arctan(1/z) Maple

Figure 1: ℜ(arccot(z)) with branch cut at [−∞i, −i], [i, ∞i].

DLMF & Mathematica

Figure 2: ℜ(arccot(z)) with branch cut at [−i, i].

4/9

slide-21
SLIDE 21

Presentation To Computation (P2C) without semantic information

slide-22
SLIDE 22

Problems of Generic L

A

T EX

DLMF 18.3

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantics: \JacobiP{\alpha}{\beta}{n}@{\cos@{a\Theta}} Potential Problems:

  • Is P a function, variable, constant?
  • Is cos(aΘ) an argument of P or part of a multiplication?
  • What are α, β, n, a, and Θ?

5/9

slide-23
SLIDE 23

Problems of Generic L

A

T EX

DLMF 18.3

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantics: Jacobi polynomial or Legendre function or Ferrers function or ... Potential Problems:

  • Is P a function, variable, constant?
  • Is cos(aΘ) an argument of P or part of a multiplication?
  • What are α, β, n, a, and Θ?

5/9

slide-24
SLIDE 24

Problems of Generic L

A

T EX

DLMF 18.3

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantics: P(cos(aΘ)) vs P · (cos(aΘ)) Potential Problems:

  • Is P a function, variable, constant?
  • Is cos(aΘ) an argument of P or part of a multiplication?
  • What are α, β, n, a, and Θ?

5/9

slide-25
SLIDE 25

Problems of Generic L

A

T EX

DLMF 18.3

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Semantics: Variable or 2nd Feigenbaum constant or ... Potential Problems:

  • Is P a function, variable, constant?
  • Is cos(aΘ) an argument of P or part of a multiplication?
  • What are α, β, n, a, and Θ?

5/9

slide-26
SLIDE 26

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ))

6/9

slide-27
SLIDE 27

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-28
SLIDE 28

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-29
SLIDE 29

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-30
SLIDE 30

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-31
SLIDE 31

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-32
SLIDE 32

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-33
SLIDE 33

Human Approach

Rendered L

AT

EX: P (α,β)

n

(cos(aΘ)) The Naive Approach How does a reader understands the mathematical formula?

  • he knows the symbols and structure,
  • knowledge-based pattern recognition
  • it was previously introduced in the paper (e.g. in definitions,

the text or in other referenced publications),

  • analyse the context from near to far
  • he searching the formula in books or online
  • dictionary-based pattern recognition

6/9

slide-34
SLIDE 34

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-35
SLIDE 35

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-36
SLIDE 36

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-37
SLIDE 37

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-38
SLIDE 38

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-39
SLIDE 39

Adopt Human Approach

Generic L

AT

EX: P_n^{(\alpha,\beta)}(\cos(a\Theta)) Adopt Human Behavior Let’s try to adopt the previous steps

  • pattern recognition
  • narrow down possible meanings from the structure of the

expression

  • context analysis
  • Near-Field-Analysis (NFA), e.g., extract identifier-definien

pairs from text, analyze definition environments, ...

  • Far-Field-Analysis (FFA), e.g., overall topic of the paper,

citations, author’s field of interest, ...

7/9

slide-40
SLIDE 40

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-41
SLIDE 41

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-42
SLIDE 42

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-43
SLIDE 43

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-44
SLIDE 44

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-45
SLIDE 45

Previous Research Projects

MathMLBen We developed a MathML gold standard and performed evaluations

  • f nowadays L

AT

EX to MathML conversion tools.

mathmlben.wmflabs.org

Results: Many tools has trouble to understand formulae. MathMLTools We developed an open Java API for convenient MathML handling and easy access to useful services.

www.github.com/ag-gipp/MathMLTools

Evaluating Translations

www.dlmf.org

We performed automatic evaluation techniques on extracted formulae from the Digital Library of Mathematical Functions (DLMF) with the Computer Algebra System Maple. Results: We discovered two errors in the DLMF and one bug in Maple.

8/9

slide-46
SLIDE 46

Upcoming Projects

A real-time recommender system for semantic version of mathematical input included in the editor of Wikipedia articles.

  • real-time recommendations
  • ordered from most likely to impossible
  • consider the context

9/9

slide-47
SLIDE 47

Thank you for your attention!