Which One Is Better: Presentation-Based or Content-Based Math Search?
Minh-Quoc NGHIEM, Giovanni Yoko KRISTIANTO, Goran TOPÍC, Akiko AIZAWA
Which One Is Better: Presentation-Based or Content-Based Math - - PowerPoint PPT Presentation
Which One Is Better: Presentation-Based or Content-Based Math Search? Minh-Quoc NGHIEM, Giovanni Yoko KRISTIANTO, Goran TOPC, Akiko AIZAWA Outline Introduction Math Search Systems Method Evaluation Conclusion 2
Which One Is Better: Presentation-Based or Content-Based Math Search?
Minh-Quoc NGHIEM, Giovanni Yoko KRISTIANTO, Goran TOPÍC, Akiko AIZAWA
2
– Presentation-based
– Content-based
– http://ntcir-math.nii.ac.jp/
3
LaTeXML for semantic enrichment
module contribute to search system
presentation-based search
4
– Springer LaTeX Search – MathFind – The Digital Library of Mathematical Functions – EgoMath – Math Indexer and Searcher – ActiveMath – …
5
– Wolfram Function – MathWebSearch – MathGO! – MathDA – The system of Nguyen et. al – …
6
Presentation to Content MathML
MathML
7
Content MathML expressions
Presentation MathML expressions Indexing Semantic Enrichment Ranking
8
(CICM 2013)
– Segmentation rules: segment Presentation MathML trees into smaller trees – Translation rules: translate Presentation MathML trees to Content MathML trees – Each rule is associated with a probability
9
– Opaths: path in XML tree with order – Upaths: no order – Sisters: sister nodes in subtree
10
– 20k Math expressions in WFS – 15 queries (modified from NTCIR)
– Presentation MathML (PMathML) – Content MathML (CMathML) – Semantic Enrichment (SE)
11
– Precision at 10 (P@10)
– Normalize Discounted Cumulative Gain (nDCG)
12
∞
𝑦𝑒𝑦 𝑦2 + 𝑧2
∞
ⅇ−𝑦2 𝑒𝑦 𝑏𝑠𝑑𝑡𝑗𝑜(𝑦) 𝑙2 coshⅇ𝑨 + sinhⅇ𝑨 ⅇ ℛ𝑨𝜔𝜉(𝑨), ∞
~
∫ 𝑏𝑒+𝑐𝑨 𝑨 𝑒𝑨 lim
𝜉→∞
𝑀𝛽+𝜉 𝑀𝜉 ℬ𝒬
𝑨𝔔𝜉 𝜈(𝑨)
𝜉 ∈ ℕ 𝜔𝜉(𝑨) log(𝑨 + 1) 𝐼𝑜(𝑨) 1 𝜌
𝜌
cos𝑢𝑜 − 𝑨sin𝑢 𝑒𝑢
14
Using content markup improve search performance
0.6 0.7 0.8 0.9 1 PMathML CMathML SE
nDCG and Precision at 10
nDCG P@10
15
Using content markup improve search performance Relevant results are ranked higher
0.6 0.7 0.8 0.9 1 1 3 5 7 9
Precision at k
PMathML CMathML SE
– Functions have specific meanings
– More than one way to represent math expression
– Elementary functions
math search
search performance
their strong points
17
18