Motivation Problem Statement Construction THANK YOU!
Mass Error-Correction Codes for Polymer-Based Data Storage Ryan - - PowerPoint PPT Presentation
Mass Error-Correction Codes for Polymer-Based Data Storage Ryan - - PowerPoint PPT Presentation
Motivation Problem Statement Construction THANK YOU! Mass Error-Correction Codes for Polymer-Based Data Storage Ryan Gabrys A joint work with S. Pattabiraman and O. Milenkovic ISIT June 8 th , 2020 Motivation Problem Statement Construction
Motivation Problem Statement Construction THANK YOU!
Motivation
Motivation Problem Statement Construction THANK YOU!
Protein Sequencing
▸ A protein is a long sequence of amino acids whose composition and order determine the protein’s functionality.
Motivation Problem Statement Construction THANK YOU!
Protein Sequencing
▸ A protein is a long sequence of amino acids whose composition and order determine the protein’s functionality. ▸ Mass spectrometry (M/S) has emerged an an important technique for sequencing proteins.
Motivation Problem Statement Construction THANK YOU!
Protein Sequencing
▸ A protein is a long sequence of amino acids whose composition and order determine the protein’s functionality. ▸ Mass spectrometry (M/S) has emerged an an important technique for sequencing proteins. ▸ The molecular masses of fragments of the protein sequence are then determined as the output of this mass spectrometry.
Motivation Problem Statement Construction THANK YOU!
Protein Sequencing
▸ A protein is a long sequence of amino acids whose composition and order determine the protein’s functionality. ▸ Mass spectrometry (M/S) has emerged an an important technique for sequencing proteins. ▸ The molecular masses of fragments of the protein sequence are then determined as the output of this mass spectrometry. ▸ From these molecular masses, the identities of the corresponding amino acids can be determined.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Under this model, previous work in [ADMOP15] studied how to recover a string s
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Under this model, previous work in [ADMOP15] studied how to recover a string s
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Under this model, previous work in [ADMOP15] studied how to recover a string s provided its composition multi-set C(s).
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Under this model, previous work in [ADMOP15] studied how to recover a string s provided its composition multi-set C(s).
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Model
▸ The composition multi-set C(s) of a string s = (s1, . . . , sn) ∈ {0, 1}n is the multiset C(s) = {{si, si+1, . . . , sj} ∶ 1 ≤ i ≤ j ≤ n}
- f all ( n + 1
2 ) contiguous substrings of s. ▸ As an example, if s = (0, 1, 0, 0), then, C(s) = {{0}, {1}, {0}, {0}, {01}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Under this model, previous work in [ADMOP15] studied how to recover a string s provided its composition multi-set C(s).
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Previous Work
▸ The following results are known from [ADMOP15]:
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Previous Work
▸ The following results are known from [ADMOP15]: Theorem All strings of length one less than a prime are reconstructable.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Previous Work
▸ The following results are known from [ADMOP15]: Theorem All strings of length one less than a prime are reconstructable. Theorem All strings of length one less than twice a prime are reconstructable.
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
Problem Statement
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (1/2)
▸ Motivated by applications in polymer-based storage, we wish to consider the setup where s belongs to a code and errors may occur.
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (1/2)
▸ Motivated by applications in polymer-based storage, we wish to consider the setup where s belongs to a code and errors may occur. ▸ For any string s ∈ {0, 1}n, we say that t-composition errors have occurred to C(s) resulting in ˜ Ct(s) if
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (1/2)
▸ Motivated by applications in polymer-based storage, we wish to consider the setup where s belongs to a code and errors may occur. ▸ For any string s ∈ {0, 1}n, we say that t-composition errors have occurred to C(s) resulting in ˜ Ct(s) if ∣ ˜ Ct(s)∣ = ∣C(s)∣,
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (1/2)
▸ Motivated by applications in polymer-based storage, we wish to consider the setup where s belongs to a code and errors may occur. ▸ For any string s ∈ {0, 1}n, we say that t-composition errors have occurred to C(s) resulting in ˜ Ct(s) if ∣ ˜ Ct(s)∣ = ∣C(s)∣, and ∣C(s) △ ˜ Ct(s)∣ ≤ 2t, where C(s) △ ˜ Ct(s) denotes their symmetric difference.
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (1/2)
▸ Motivated by applications in polymer-based storage, we wish to consider the setup where s belongs to a code and errors may occur. ▸ For any string s ∈ {0, 1}n, we say that t-composition errors have occurred to C(s) resulting in ˜ Ct(s) if ∣ ˜ Ct(s)∣ = ∣C(s)∣, and ∣C(s) △ ˜ Ct(s)∣ ≤ 2t, where C(s) △ ˜ Ct(s) denotes their symmetric difference.
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (2/2)
▸ We say that any a code C is a t-composition error-correcting code (t-CECC for short) if for any distinct s, s′ ∈ C ˜ Ct(s) ≠ ˜ Ct(s′),
Motivation Problem Statement Construction THANK YOU!
Composition Error-Correcting Codes (2/2)
▸ We say that any a code C is a t-composition error-correcting code (t-CECC for short) if for any distinct s, s′ ∈ C ˜ Ct(s) ≠ ˜ Ct(s′), where ˜ Ct(s), ˜ Ct(s′), are the result of at most t-composition errors to C(s), C(s′), respectively.
Motivation Problem Statement Construction THANK YOU!
Example of Composition Errors
▸ Suppose s = (0, 1, 0, 0) so that
Motivation Problem Statement Construction THANK YOU!
Example of Composition Errors
▸ Suppose s = (0, 1, 0, 0) so that C(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}.
Motivation Problem Statement Construction THANK YOU!
Example of Composition Errors
▸ Suppose s = (0, 1, 0, 0) so that C(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Let ˜ C1(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 0}, {0, 1, 0, 0}}.
Motivation Problem Statement Construction THANK YOU!
Example of Composition Errors
▸ Suppose s = (0, 1, 0, 0) so that C(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Let ˜ C1(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 0}, {0, 1, 0, 0}}. Since ∣ ˜ C1(s) △ C(s)∣ = 2,
Motivation Problem Statement Construction THANK YOU!
Example of Composition Errors
▸ Suppose s = (0, 1, 0, 0) so that C(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 1}, {0, 1, 0, 0}}. ▸ Let ˜ C1(s) = {{0}, {1}, {0}, {0}, {0, 1}, {0, 1}, {0, 0}, {0, 0, 1}, {0, 0, 0}, {0, 1, 0, 0}}. Since ∣ ˜ C1(s) △ C(s)∣ = 2, we say that ˜ C1(s) is the result of a single composition error occurring to C(s).
Motivation Problem Statement Construction THANK YOU!
Problem Statement
▸ We will be interested in the following two problems:
Motivation Problem Statement Construction THANK YOU!
Problem Statement
▸ We will be interested in the following two problems:
1
How many bits of redundancy are sufficient to construct a t-CECC?
Motivation Problem Statement Construction THANK YOU!
Problem Statement
▸ We will be interested in the following two problems:
1
How many bits of redundancy are sufficient to construct a t-CECC?
2
How can we construct a t-CECC that has a small amount of redundancy but possesses an efficient encoding/decoding algorithm?
Motivation Problem Statement Construction THANK YOU!
Summary of Results
1
We show that at most O(t) + log n bits of redundancy are sufficient to construct a t-CECC.
- S. Pattabiraman, R. Gabrys, and O. Milenkovic, “Reconstruction and
error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Summary of Results
1
We show that at most O(t) + log n bits of redundancy are sufficient to construct a t-CECC.
2
We construct a systematic code with O(t2 log n) bits of redundancy and decoding complexity O(n3).
- S. Pattabiraman, R. Gabrys, and O. Milenkovic, “Reconstruction and
error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Summary of Results
1
We show that at most O(t) + log n bits of redundancy are sufficient to construct a t-CECC.
2
We construct a systematic code with O(t2 log n) bits of redundancy and decoding complexity O(n3). ▸ Our results from this work are an extension of our preliminary paper [PGM19], where we constructed a 1-CECC.
- S. Pattabiraman, R. Gabrys, and O. Milenkovic, “Reconstruction and
error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Summary of Results
1
We show that at most O(t) + log n bits of redundancy are sufficient to construct a t-CECC.
2
We construct a systematic code with O(t2 log n) bits of redundancy and decoding complexity O(n3). ▸ Our results from this work are an extension of our preliminary paper [PGM19], where we constructed a 1-CECC. ▸ For the remainder of the talk, we will focus on 2).
- S. Pattabiraman, R. Gabrys, and O. Milenkovic, “Reconstruction and
error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Construction
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3. ▸ From [ADMOP15], Ps(x, y)Ps( 1 x, 1 y ) = (n + 1) + Ss(x, y) + Ss( 1 x, 1 y ).
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (1/2)
▸ In the following, and similar to [ADMOP15], we will interpret the vectors as polynomials. ▸ For example, for the vector s = (0, 1, 0, 0), we associate s with the following bivariate polynomial: Ps(x, y) = 1 + y + xy + xy2 + xy3, where we associate the symbol x with 1 and y with 0. ▸ Note that we can also interpret the composition multiset as a polynomial Ss(x, y) where Ss(x, y) = 1 + 3y + x + 2xy + y2 + 2xy2 + xy3. ▸ From [ADMOP15], Ps(x, y)Ps( 1 x, 1 y ) = (n + 1) + Ss(x, y) + Ss( 1 x, 1 y ).
[ADMOP15] J. Acharya, H. Das, O. Milenkovic, A. Orlitsky, and S. Pan, “String reconstruction from substring compositions,” SIAM J. Discrete Math, 2015.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (2/2)
▸ For a bivariate polynomial f(x, y), let f ∗(x, y) denote its reciprocal polynomial where f ∗(x, y) = xdxydyf ( 1 x, 1 y ) , where dx is the x-degree of f(x, y) and dy is the y-degree.
Motivation Problem Statement Construction THANK YOU!
A Polynomial Interpretation (2/2)
▸ For a bivariate polynomial f(x, y), let f ∗(x, y) denote its reciprocal polynomial where f ∗(x, y) = xdxydyf ( 1 x, 1 y ) , where dx is the x-degree of f(x, y) and dy is the y-degree. ▸ Using this interpretation, we can write Ps(x, y)P ∗
s (x, y) = xdxydy (n + 1 + Ss(x, y)) + S∗ s(x, y),
where dx is the x-degree of Ps(x, y) and dy is the y-degree.
Motivation Problem Statement Construction THANK YOU!
Some Useful Results
Lemma Suppose wt(s) mod 2t + 1 ≡ 0. Then, given ˜ Ss(x, y) one can generate Ps(x, y)P ∗(x, y) + ˜ E(x, y), where ˜ E(x, y) has at most 4t non-zero terms.
Motivation Problem Statement Construction THANK YOU!
Some Useful Results
Lemma Suppose wt(s) mod 2t + 1 ≡ 0. Then, given ˜ Ss(x, y) one can generate Ps(x, y)P ∗(x, y) + ˜ E(x, y), where ˜ E(x, y) has at most 4t non-zero terms. Lemma Suppose f(x, y) has at most 4t non-zero terms where f(x, y) has total degree at most n. Then, we can uniquely recover f(x, y) provided f(αℓ1, αℓ2) = 0, where α ∈ Fq is a primitive element of a field, q ≥ 2n + 1, and ℓ1, ℓ2 ∈ [[4t]].
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
1
Step 1: From ˜ Ss(x, y) generate Z(x, y) = Ps(x, y)P ∗
s (x, y) + ˜
E(x, y).
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
1
Step 1: From ˜ Ss(x, y) generate Z(x, y) = Ps(x, y)P ∗
s (x, y) + ˜
E(x, y).
2
Step 2: Since Z(αℓ1, αℓ2)
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
1
Step 1: From ˜ Ss(x, y) generate Z(x, y) = Ps(x, y)P ∗
s (x, y) + ˜
E(x, y).
2
Step 2: Since Z(αℓ1, αℓ2) = Ps(αℓ1, αℓ2)P ∗
s (αℓ1, αℓ2) + ˜
E(x, y)
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
1
Step 1: From ˜ Ss(x, y) generate Z(x, y) = Ps(x, y)P ∗
s (x, y) + ˜
E(x, y).
2
Step 2: Since Z(αℓ1, αℓ2) = Ps(αℓ1, αℓ2)P ∗
s (αℓ1, αℓ2) + ˜
E(x, y) = ˜ E(αℓ1, αℓ2),
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Our Approach
Theorem Let C ⊆ {0, 1}n be such that for any s ∈ C, Ps(αℓ1, αℓ2) = 0, wt(s) = 0 mod (2t + 1), where α ∈ Fq is primitive, q ≥ 2n + 1, ℓ1, ℓ2 ∈ [[4t]]. Then, C is a t-ECC. In light of the previous lemmas the proof of correctness follows from two basic steps:
1
Step 1: From ˜ Ss(x, y) generate Z(x, y) = Ps(x, y)P ∗
s (x, y) + ˜
E(x, y).
2
Step 2: Since Z(αℓ1, αℓ2) = Ps(αℓ1, αℓ2)P ∗
s (αℓ1, αℓ2) + ˜
E(x, y) = ˜ E(αℓ1, αℓ2), we can recover ˜ E(x, y). Finally, from Ps(x, y)P ∗
s (x, y), one can recover
Ps(x, y) [PGM19].
[PGM19]S. Pattabiraman et. al, “Reconstruction and error-correction codes for polymer-based data storage,” Information Theory Workshop, 2019.
Motivation Problem Statement Construction THANK YOU!
Conclusion and Future Work
▸ Is it possible to generate an efficient systematic t-CECC with less redundancy?
Motivation Problem Statement Construction THANK YOU!
Conclusion and Future Work
▸ Is it possible to generate an efficient systematic t-CECC with less redundancy? ▸ We have shown that t + log n bits of redundancy is sufficient to correct t composition errors.
Motivation Problem Statement Construction THANK YOU!
Conclusion and Future Work
▸ Is it possible to generate an efficient systematic t-CECC with less redundancy? ▸ We have shown that t + log n bits of redundancy is sufficient to correct t composition errors. ▸ Upper bounds on t-CECC.
Motivation Problem Statement Construction THANK YOU!