Codes over Trees, Lev Yohananov and Eitan Yaakobi, Technion, Israel

SLIDE 1

Codes over Trees

Lev Yohananov and Eitan Yaakobi
Technion – Israel Institute of Technology
2020 IEEE International Symposium on Information Theory

SLIDE 2

Motivation

  • Trees and their properties are useful in numerous applications:
  • In biology.
  • In chemistry.
  • In programming languages.
  • In cyber applications.
  • And more.
  • In coding theory, a novel family of codes, codes over trees, is presented.

SLIDE 3

Trees

  • A finite undirected tree over $n$ nodes is a connected undirected graph with $n - 1$ edges.
  • $\mathbf{T}(n)$: the set of all trees over $n$ nodes.
  • By Cayley's formula [1] it holds that $|\mathbf{T}(n)| = n^{n-2}$.
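Cayley's formula can be checked by brute force for small $n$. The following is a minimal sketch, not part of the original slides: the helper names `is_tree`, `all_trees` and `count_trees` are hypothetical, nodes are labeled 0 to n-1, and every set of n-1 edges of the complete graph is tested for being acyclic.

```python
from itertools import combinations


def is_tree(n, edges):
    """Return True if `edges` (pairs over nodes 0..n-1) form a spanning tree."""
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:  # this edge would close a cycle
            return False
        parent[ru] = rv
    # n - 1 acyclic edges over n nodes are necessarily connected.
    return len(edges) == n - 1


def all_trees(n):
    """Enumerate all labeled trees over n nodes as frozensets of edges."""
    all_edges = list(combinations(range(n), 2))
    return [frozenset(c) for c in combinations(all_edges, n - 1) if is_tree(n, c)]


def count_trees(n):
    return len(all_trees(n))


for n in range(2, 7):
    print(n, count_trees(n), n ** (n - 2))
```

The enumeration grows very quickly, so it is only meant as a sanity check for small $n$.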

[Figure: an example tree over the nodes 1 to 9.]

[1] M. Aigner and G. M. Ziegler, Proofs from THE BOOK, pp. 141–146, Springer-Verlag, New York, 1998.

SLIDE 4

Codes (over Trees)

  • $\mathcal{C}_T$: a code (over trees) denoted by $(n, M)_T$, such that
  • $n$: the number of nodes in a tree.
  • $M$: the size of $\mathcal{C}_T$.
  • Example: $n = 5$, $M = 8$.

[Figure: the eight trees of the example code.]

SLIDE 5

Edge Erasure

  • An edge erasure is a removal of an edge from a tree.

SLIDE 6

Tree Distance

  • Given two trees $T_1 = (V_n, E_1)$ and $T_2 = (V_n, E_2)$.
  • $d_T(T_1, T_2)$: the tree distance (or distance) between $T_1$ and $T_2$ is

$$d_T(T_1, T_2) = n - 1 - |E_1 \cap E_2|.$$

  • This distance is a metric.

[Figure: two trees over the nodes 1 to 9 that share 7 of their 8 edges, so $d_T(T_1, T_2) = 8 - 7 = 1$.]
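The tree distance translates directly into a set intersection. The sketch below is illustrative only: the function name `tree_distance` and the three example trees are assumptions, not the trees drawn on the slide.

```python
def tree_distance(edges1, edges2):
    """d_T(T1, T2) = n - 1 - |E1 ∩ E2| for two trees over the same n nodes."""
    e1 = {frozenset(e) for e in edges1}
    e2 = {frozenset(e) for e in edges2}
    if len(e1) != len(e2):
        raise ValueError("both trees must have n - 1 edges")
    # Each tree has exactly n - 1 edges, so len(e1) stands for n - 1.
    return len(e1) - len(e1 & e2)


# Hypothetical trees over the nodes 1..5.
path = [(1, 2), (2, 3), (3, 4), (4, 5)]
almost_path = [(1, 2), (2, 3), (3, 4), (3, 5)]  # one edge replaced
star = [(1, 2), (1, 3), (1, 4), (1, 5)]

print(tree_distance(path, almost_path))
print(tree_distance(path, star))
```

Edges are stored as unordered pairs, so `(1, 2)` and `(2, 1)` are treated as the same edge.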

SLIDE 7

Codes with Minimum Distance $d$

  • $(n, M, d)_T$: a code over trees of minimum distance $d$.
  • $r$: the redundancy of the code $\mathcal{C}_T$, defined as $r = (n - 2) \log n - \log M$.
  • $A(n, d)$: the largest size of a code of distance $d$.
  • $r(n, d)$: the minimal redundancy of a code of distance $d$.
  • Theorem: An $(n, M)_T$ code over trees $\mathcal{C}_T$ is of tree distance $d$ if and only if it can correct any $d - 1$ edge erasures.
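The theorem can be illustrated on a small code. The sketch below is an assumption-laden example rather than the authors' decoder: it uses a hypothetical code of five star trees over $n = 5$ nodes (minimum distance 3), decodes by listing the codewords that contain the received forest, and takes the logarithm in the redundancy to base 2, which the slides do not specify.

```python
import math
from itertools import combinations


def star(center, n):
    """Star tree over nodes 0..n-1 with the given center, as a set of edges."""
    return frozenset(frozenset((center, v)) for v in range(n) if v != center)


def min_distance(code, n):
    """Minimum tree distance n - 1 - |E1 ∩ E2| over all pairs of codewords."""
    return min(n - 1 - len(a & b) for a, b in combinations(code, 2))


def redundancy(n, size):
    """r = (n - 2) log n - log M, with base-2 logarithms (an assumption)."""
    return (n - 2) * math.log2(n) - math.log2(size)


def correct_erasures(code, forest):
    """Return every codeword that contains all the surviving edges."""
    return [tree for tree in code if forest <= tree]


n = 5
code = [star(c, n) for c in range(n)]
d = min_distance(code, n)
print("distance:", d, "redundancy:", round(redundancy(n, len(code)), 3))

# Any d - 1 = 2 erasures leave a forest contained in exactly one codeword.
received = code[0] - frozenset(list(code[0])[:2])
print(correct_erasures(code, received) == [code[0]])
```

With $d$ erasures the decoding can become ambiguous: erasing three edges of a star leaves a single edge, which belongs to two different stars.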

SLIDE 8

Forests

  • An undirected graph that consists only of a disjoint union of trees is called a forest.
  • $\mathcal{F}(n, t)$: the set of all forests over $n$ nodes with exactly $t$ trees.
  • Note that $\mathcal{F}(n, 1) = \mathbf{T}(n)$.

[Figure: an example forest over the nodes 1 to 9.]

SLIDE 9

Number of Forests with Exactly $t$ Trees

  • The value of $|\mathcal{F}(n, t)|$ was shown to be

$$F(n, t) = \binom{n}{t} n^{n-t-1} \sum_{i=0}^{t} \left(-\frac{1}{2}\right)^{i} \binom{t}{i} \frac{(t+i)\,(n-t)!}{n^{i}\,(n-t-i)!},$$

where the terms with $i > n - t$ are taken to be zero.

  • Another representation of it:

$$F(n, t) = n^{n-t} \sum_{i=0}^{t} \left(-\frac{1}{2}\right)^{i} \binom{t}{i} \binom{n-1}{t-1+i} \frac{(t+i)!}{n^{i}\, t!}.$$

  • J. Moon [2], 1970.
  • B. Bollobás [3], 1979.

[2] J. W. Moon, Counting labeled trees, 1970.
[3] B. Bollobás, Graph Theory: An Introductory Course, Springer-Verlag, New York, 1979.
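Both representations of the forest count can be evaluated exactly and compared against a brute-force enumeration for small parameters. This is a sketch under stated assumptions: the function names are hypothetical, exact arithmetic is done with `fractions.Fraction`, and in the first representation the terms with $i > n - t$ are treated as zero.

```python
from fractions import Fraction
from itertools import combinations
from math import comb, factorial


def forests_formula_1(n, t):
    """First representation; terms with i > n - t are treated as zero."""
    total = Fraction(0)
    for i in range(min(t, n - t) + 1):
        total += (Fraction(-1, 2) ** i * comb(t, i) * (t + i)
                  * Fraction(factorial(n - t), n ** i * factorial(n - t - i)))
    return comb(n, t) * Fraction(n) ** (n - t - 1) * total


def forests_formula_2(n, t):
    """Second representation; math.comb returns 0 when t - 1 + i > n - 1."""
    total = sum(Fraction(-1, 2) ** i * comb(t, i) * comb(n - 1, t - 1 + i)
                * Fraction(factorial(t + i), n ** i * factorial(t))
                for i in range(t + 1))
    return n ** (n - t) * total


def is_acyclic(n, edges):
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return True


def forests_brute_force(n, t):
    """A forest with exactly t trees over n nodes has n - t edges."""
    edges = list(combinations(range(n), 2))
    return sum(1 for s in combinations(edges, n - t) if is_acyclic(n, s))


for t in range(1, 6):
    print(t, forests_formula_1(5, t), forests_formula_2(5, t), forests_brute_force(5, t))
```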

SLIDE 10

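Forest Ball of a Tree

  • $\mathcal{P}_T(n, t)$: the forest ball of a tree $T$ of radius $t$, that is, the forests obtained from $T$ by erasing $t$ of its edges.
  • Note that $\mathcal{P}_T(n, t) \subseteq \mathcal{F}(n, t + 1)$.
  • Note that $|\mathcal{P}_T(n, t)| = \binom{n-1}{t}$ for every tree $T$, so the forest balls are regular.

[Figure: a tree $T$ and its forest ball $\mathcal{P}_T(n = 5, t = 1)$, which consists of four forests.]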
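A forest ball can be listed by erasing every choice of $t$ edges. The sketch below assumes that the ball consists of the forests obtained by erasing exactly $t$ edges, which matches the stated size $\binom{n-1}{t}$; the function name `forest_ball` and the example tree are hypothetical.

```python
from itertools import combinations
from math import comb


def forest_ball(tree_edges, t):
    """All forests obtained from the tree by erasing exactly t edges."""
    edges = frozenset(frozenset(e) for e in tree_edges)
    return [edges - frozenset(removed) for removed in combinations(edges, t)]


# Hypothetical tree over n = 5 nodes (a path 0-1-2-3-4).
tree = [(0, 1), (1, 2), (2, 3), (3, 4)]
ball = forest_ball(tree, 1)
print(len(ball), comb(4, 1))
```

Each forest in the ball has $n - 1 - t$ edges and therefore exactly $t + 1$ trees.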

SLIDE 11

Sphere Packing Bound

  • Theorem: For all $n \ge 1$ and $1 \le d \le n$, it holds that

$$A(n, d) \le F(n, d) \Big/ \binom{n-1}{d-1}.$$

[Figure: the forest balls of radius $d - 1$ around two codewords $T_1$ and $T_2$.]

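The sphere packing bound can be evaluated numerically for small parameters. The sketch below is illustrative: `num_forests` implements the second representation of $F(n, t)$ from the previous slide, and `sphere_packing_bound` (a hypothetical name) rounds the ratio down because a code size is an integer.

```python
from fractions import Fraction
from math import comb, factorial, floor


def num_forests(n, t):
    """F(n, t): number of forests over n nodes with exactly t trees."""
    total = sum(Fraction(-1, 2) ** i * comb(t, i) * comb(n - 1, t - 1 + i)
                * Fraction(factorial(t + i), n ** i * factorial(t))
                for i in range(t + 1))
    return n ** (n - t) * total


def sphere_packing_bound(n, d):
    """Upper bound on A(n, d): F(n, d) / C(n - 1, d - 1), rounded down."""
    return floor(num_forests(n, d) / comb(n - 1, d - 1))


for n in range(4, 9):
    print(n, [sphere_packing_bound(n, d) for d in range(1, n)])
```

For $d = n - 1$ the ratio equals $\binom{n}{2} / (n - 1) = n / 2$, which is the first of the bounds listed on the results slide.
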
SLIDE 12

Corollary

  • It was also proved [2] that for any fixed $d$,

$$\lim_{n \to \infty} \frac{F(n, d)}{n^{n-2}} = \frac{1}{2^{d-1} (d-1)!}.$$

  • Thus,

$$A(n, d) \le \frac{F(n, d)}{\binom{n-1}{d-1}} = O(n^{n-1-d}).$$

[2] J. W. Moon, Counting labeled trees, 1970.
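The step from the limit to the asymptotic bound can be spelled out as follows. This is a short worked derivation for fixed $d$, added for clarity and not taken from the slides; it only uses the limit above and the standard estimate of a binomial coefficient with a fixed lower index.

```latex
\begin{align*}
\binom{n-1}{d-1} &= \frac{(n-1)(n-2)\cdots(n-d+1)}{(d-1)!}
                  = \frac{n^{d-1}}{(d-1)!}\,\bigl(1 + o(1)\bigr), \\
\frac{F(n,d)}{\binom{n-1}{d-1}}
  &= \frac{n^{n-2}}{2^{d-1}(d-1)!} \cdot \frac{(d-1)!}{n^{d-1}}\,\bigl(1 + o(1)\bigr)
   = \frac{n^{n-1-d}}{2^{d-1}}\,\bigl(1 + o(1)\bigr)
   = O\bigl(n^{n-1-d}\bigr).
\end{align*}
```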

SLIDE 13

Results from Sphere Packing Bound

  • Correcting $n - 2$ erasures: $A(n, n - 1) \le n/2$.
  • Correcting $n - 3$ erasures: $A(n, n - 2) = O(n^2)$.
  • Correcting $n - 4$ erasures: $A(n, n - 3) = O(n^3)$.

SLIDE 14

The Results of this Work

  • Correcting $n - 2$ erasures: $A(n, n - 1) = \lfloor n/2 \rfloor$.
  • Correcting $n - 3$ erasures: $A(n, n - 2) = n$.
  • Correcting $n - 4$ erasures: $A(n, n - 3) = O(n^2)$.
  • For fixed $d$ and $n \ge 2d$,

$$\Omega(n^{n-2d}) \le A(n, d) \le O(n^{n-1-d}).$$

For comparison, the sphere packing bound alone gives $A(n, n - 1) \le n/2$, $A(n, n - 2) = O(n^2)$ and $A(n, n - 3) = O(n^3)$.

SLIDE 15

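Lower Bound on $A(n, n - 1)$

  • A line tree $T$ is a tree whose edges form a single path through all $n$ nodes.
  • Our code is constructed from $\lfloor n/2 \rfloor$ line trees with pairwise disjoint edge sets.
  • Thus, this code is a set of $\lfloor n/2 \rfloor$ edge-disjoint Hamiltonian paths, see Lucas [4].

[Figure: a line tree over the nodes 1 to 4, and edge-disjoint line trees over the nodes 1 to 7.]

Result: $A(n, n - 1) = \lfloor n/2 \rfloor$.

[4] E. Lucas, "Les rondes enfantines," Récréations mathématiques, vol. 2, Paris, 1894.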
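One classical way to obtain $\lfloor n/2 \rfloor$ edge-disjoint Hamiltonian paths is the zigzag construction. The sketch below is an assumption about how such a code could be built, not a transcription of the slide's figure: nodes are labeled 0 to n-1, and `zigzag_path` and `line_tree_code` are hypothetical names. For odd $n$, the last node is appended to the end of each zigzag path over the first $n - 1$ nodes.

```python
def zigzag_path(start, m):
    """Visit start, start+1, start-1, start+2, ... modulo 2m (2m nodes)."""
    path = [start % (2 * m)]
    for k in range(1, m + 1):
        path.append((start + k) % (2 * m))
        if k < m:
            path.append((start - k) % (2 * m))
    return path


def line_tree_code(n):
    """Return floor(n/2) Hamiltonian paths of K_n with disjoint edge sets."""
    if n < 2:
        raise ValueError("n must be at least 2")
    m = n // 2
    paths = [zigzag_path(i, m) for i in range(m)]
    if n % 2 == 1:
        # Odd n: attach the extra node n - 1 to the end of every path.
        paths = [p + [n - 1] for p in paths]
    return paths


def path_edges(path):
    return {frozenset(pair) for pair in zip(path, path[1:])}


for p in line_tree_code(7):
    print(p)
```

In path number $i$ the two endpoints of every zigzag edge sum to $2i$ or $2i + 1$ modulo $2m$, so different paths cannot share an edge, and every two codewords are at tree distance $n - 1$.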

SLIDE 16

Lower Bound on $A(n, n - 2)$

  • A star tree $T$ is a tree in which one node (the center) is adjacent to all the other nodes.
  • Our code is constructed from the $n$ star trees, one centered at each node.
  • Every two trees have exactly one edge in common.

[Figure: a star tree, and the star trees over the nodes 1 to 4.]

Result: $A(n, n - 2) \ge n$ (the sphere packing bound gives $A(n, n - 2) = O(n^2)$).
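The star construction is easy to verify directly. In this minimal sketch the names `star_tree` and `star_code` are hypothetical and nodes are labeled 0 to n-1; the stars centered at $u$ and $v$ share exactly the edge $\{u, v\}$.

```python
from itertools import combinations


def star_tree(center, n):
    """Edges of the star over nodes 0..n-1 with the given center."""
    return frozenset(frozenset((center, v)) for v in range(n) if v != center)


def star_code(n):
    """The n star trees over n nodes (n >= 3, so that they are distinct)."""
    if n < 3:
        raise ValueError("n must be at least 3")
    return [star_tree(c, n) for c in range(n)]


n = 6
code = star_code(n)
common = {len(a & b) for a, b in combinations(code, 2)}
print(len(code), common)  # n codewords, every pair shares one edge
```

Since every pair shares exactly one edge, the tree distance between any two codewords is $n - 1 - 1 = n - 2$.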

SLIDE 17

Upper Bound on $A(n, n - 2)$

  • Let $G$ be a bipartite graph as in the example: one side holds the codewords $T_1, T_2, \ldots, T_M$, the other side holds the edges $e_1, e_2, \ldots, e_{\binom{n}{2}}$ of the complete graph, and every codeword is connected to the edges it contains, so each codeword vertex has degree $n - 1$.
  • Any code of tree distance $n - 2$ yields a bipartite graph $G$ of girth at least 6.
  • Using Reiman's inequality [5] (1958), it is shown that every such bipartite graph $G$ satisfies $M \le n$.

Result: $A(n, n - 2) \le n$ (the sphere packing bound gives $A(n, n - 2) = O(n^2)$).

[5] I. Reiman, "Über ein Problem von K. Zarankiewicz," Acta Mathematica Hungarica, vol. 9, no. 3–4, pp. 269–273, Budapest, Hungary, Sep. 1958.
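A bound of this kind can be obtained by a standard double-counting argument. The derivation below is a sketch of that argument under the assumption $n > 2$; it is not necessarily the exact form in which the authors apply Reiman's inequality. Here $\deg(e)$ denotes the number of codewords that contain the edge $e$. Girth at least 6 in a bipartite graph means there is no 4-cycle, so two codewords share at most one edge.

```latex
\begin{align*}
\sum_{e} \deg(e) &= M(n-1), \qquad
\sum_{e} \binom{\deg(e)}{2} \le \binom{M}{2}
  \quad \text{(each pair of codewords shares at most one edge)}, \\
\binom{n}{2} \cdot \frac{1}{2} \cdot \frac{2M}{n} \left( \frac{2M}{n} - 1 \right)
  &\le \sum_{e} \binom{\deg(e)}{2} \le \frac{M(M-1)}{2}
  \quad \text{(convexity, average degree } 2M/n\text{)}, \\
(n-1)(2M - n) &\le n(M-1)
  \;\Longrightarrow\; M(n-2) \le n(n-2)
  \;\Longrightarrow\; M \le n.
\end{align*}
```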

SLIDE 18

Upper Bound on $A(n, n - 3)$

  • Also done using Reiman's inequality (skipped here for lack of time).
  • In the long version [6]: $A(n, 3n/4) = \Omega(n^2)$.
  • $n$ is a prime number.

Result: $A(n, n - 3) = O(n^2)$ (the sphere packing bound gives $A(n, n - 3) = O(n^3)$).

[6] L. Yohananov and E. Yaakobi, "Codes over trees," arXiv:2001.01791, Jan. 2020.

SLIDE 19

Lower Bound on General $A(n, d)$

  • Theorem: for fixed $d$ and $n \ge 2d$ it holds that

$$A(n, d) = \Omega(n^{n-2d}).$$

  • Construction: Let $(e_1, e_2, \ldots, e_{\binom{n}{2}})$ be some order of all the edges of the complete graph over $n$ nodes.
  • Each tree $T$ is represented as a characteristic vector of length $\binom{n}{2}$ and weight $n - 1$, as in the example:

[Figure: a tree over the nodes 0 to 3 and its characteristic vector of weight 3, with the edge order {0,1}, {0,2}, {0,3}, {1,2}, {1,3}, {2,3}.]
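The characteristic-vector representation can be sketched as follows. The function names and the two example trees are hypothetical (the tree in the slide's figure is not recoverable); the edge order is the lexicographic one shown above, which is what `itertools.combinations` produces.

```python
from itertools import combinations


def characteristic_vector(tree_edges, n):
    """0/1 vector of length C(n, 2) indexed by the edges of K_n in lex order."""
    order = list(combinations(range(n), 2))  # (0,1), (0,2), ..., (n-2,n-1)
    edge_set = {tuple(sorted(e)) for e in tree_edges}
    return [1 if e in edge_set else 0 for e in order]


def hamming_distance(u, v):
    return sum(a != b for a, b in zip(u, v))


# Hypothetical trees over the nodes 0..3.
star = [(0, 1), (0, 2), (0, 3)]
path = [(0, 1), (1, 2), (2, 3)]

u = characteristic_vector(star, 4)
v = characteristic_vector(path, 4)
print(u, v, hamming_distance(u, v))
```

Both vectors have weight $n - 1$, so their Hamming distance is $2(n - 1 - |E_1 \cap E_2|)$, twice the tree distance. This is why a binary code with good Hamming distance yields a code over trees.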

SLIDE 20

The Proof

  • A linear binary code of length $N = \binom{n}{2}$ and Hamming distance $D = 2d - 1$ can correct at most $d - 1$ substitutions.
  • These correspond to $d - 1$ edge erasures.
  • Applying BCH codes, we pay a redundancy of

$$r = (d - 1) \log \binom{n}{2} + O(1) = 2(d - 1) \log n + O(1).$$

  • The $2^r$ cosets of such a code are also $\left(\binom{n}{2}, K, 2d - 1\right)$ codes.
  • Thus, by the pigeonhole principle there is a code of cardinality at least

$$\frac{n^{n-2}}{2^{2(d-1) \log n}} = \Omega(n^{n-2d}).$$
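The coset and pigeonhole argument can be tried out on a tiny instance. The sketch below makes several assumptions that are not in the slides: it treats only $d = 2$, where Hamming distance $D = 3$ suffices and a shortened Hamming code (parity-check columns $1, 2, \ldots, \binom{n}{2}$ written in binary) stands in for the BCH code, and all function names are hypothetical. Trees are grouped by syndrome, so two trees in the same group differ by a codeword of the linear code.

```python
from collections import defaultdict
from itertools import combinations


def is_tree(n, edges):
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return len(edges) == n - 1


def largest_coset_code(n):
    """Group all trees by Hamming-code syndrome and return the largest group."""
    edges = list(combinations(range(n), 2))
    column = {e: i + 1 for i, e in enumerate(edges)}  # distinct nonzero columns
    classes = defaultdict(list)
    for candidate in combinations(edges, n - 1):
        if is_tree(n, candidate):
            syndrome = 0
            for e in candidate:
                syndrome ^= column[e]  # XOR of the columns of the tree's edges
            classes[syndrome].append(frozenset(candidate))
    return max(classes.values(), key=len)


def min_tree_distance(code, n):
    return min(n - 1 - len(a & b) for a, b in combinations(code, 2))


code = largest_coset_code(5)
print(len(code), min_tree_distance(code, 5))
```

Two distinct trees with the same syndrome differ in at least 3 positions, and since both vectors have weight $n - 1$ the difference is even, hence at least 4, which gives tree distance at least 2. For $n = 5$ there are 125 trees and at most 16 syndromes, so the largest group has at least 8 trees.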

SLIDE 21

Tree Balls of Trees

SLIDE 22

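Tree Ball of Trees

  • $B_T(n, t)$: the tree ball of trees of a tree $T$ of radius $t$, that is, the trees at tree distance at most $t$ from $T$.
  • The forest ball is regular: $|\mathcal{P}_T(n, t)| = \binom{n-1}{t}$ for every tree $T$.
  • The tree ball of trees is not regular: the size $|B_T(n, t)|$ depends on $T$.

[Figure: a tree $T$ over the nodes 1 to 4, its forest ball $\mathcal{P}_T(n, t)$, and the trees that contain these forests.]
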
SLIDE 23

Radius One

[Figure: three trees $T$: one with ball size $\Theta(n^2)$, one with ball size $\Theta(n^3)$, and an arbitrary $T$.]

  • Average ball size: $\Theta(n^{2.5})$.
  • Explicit formulas.

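The two extreme orders of growth can be reproduced with a direct count. The sketch below assumes that the tree ball of radius one consists of the trees at tree distance at most 1 from $T$; the choice of a star and a path as examples, and the function names, are assumptions rather than a reading of the slide's figure. A tree at distance exactly 1 is obtained by erasing one edge and reconnecting the two resulting components with a different edge.

```python
def component_of(node, edges):
    """Nodes reachable from `node` using the given edges."""
    adjacent = {}
    for u, v in edges:
        adjacent.setdefault(u, []).append(v)
        adjacent.setdefault(v, []).append(u)
    seen, stack = {node}, [node]
    while stack:
        x = stack.pop()
        for y in adjacent.get(x, []):
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return seen


def ball_size_radius_one(n, tree_edges):
    """Number of trees T' with d_T(T, T') <= 1 (assumed definition)."""
    total = 1  # the tree T itself
    for removed in tree_edges:
        rest = [e for e in tree_edges if e != removed]
        a = len(component_of(removed[0], rest))
        # Any of the a * (n - a) connecting edges except the erased one.
        total += a * (n - a) - 1
    return total


def star(n):
    return [(0, v) for v in range(1, n)]


def path(n):
    return [(v, v + 1) for v in range(n - 1)]


for n in (5, 10, 20, 40):
    print(n, ball_size_radius_one(n, star(n)), ball_size_radius_one(n, path(n)))
```

For the star the count is $1 + (n - 1)(n - 2)$, which is $\Theta(n^2)$, while for the path the sum of $k(n - k)$ over the edges grows as $\Theta(n^3)$.
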
SLIDE 24

Radius $t$ (fixed)

[Figure: three trees $T$: one with ball size $\Theta(n^{2t})$, one with ball size $\Theta(n^{3t})$, and an arbitrary $T$.]

  • Average ball size: $\Theta(n^{2.5t})$.
  • Recursive formulas.

SLIDE 25

The Results of this Work

  • Correcting $n - 2$ erasures: $A(n, n - 1) = \lfloor n/2 \rfloor$.
  • Correcting $n - 3$ erasures: $A(n, n - 2) = n$.
  • Correcting $n - 4$ erasures: $A(n, n - 3) = O(n^2)$.
  • For fixed $d$ and $n \ge 2d$,

$$\Omega(n^{n-2d}) \le A(n, d) \le O(n^{n-1-d}).$$

  • Studying tree balls of trees.

SLIDE 26

Conclusions and Future Work

  • Improve the lower and upper bounds on $A(n, d)$.
  • Study codes over trees under different metrics, such as the tree edit distance.
  • Study the problem of reconstructing trees based upon several forests in the forest ball of trees.

Thank You!