TY - JOUR
T1 - Coding over Sets for DNA Storage
AU - Lenz, Andreas
AU - Siegel, Paul H.
AU - Wachter-Zeh, Antonia
AU - Yaakobi, Eitan
N1 - Publisher Copyright:
© 1963-2012 IEEE.
PY - 2020/4
Y1 - 2020/4
N2 - In this paper, we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where a data set is represented by an unordered set of M sequences, each of length L. Errors within that model are losses of whole sequences and point errors inside the sequences, such as insertions, deletions, and substitutions. We derive Gilbert-Varshamov lower bounds and sphere packing upper bounds on achievable cardinalities of error-correcting codes within this storage model. We further propose explicit code constructions that can correct errors in such a storage system and that can be encoded and decoded efficiently. Comparing the sizes of these codes to the upper bounds, we show that many of the constructions are close to optimal.
AB - In this paper, we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where a data set is represented by an unordered set of M sequences, each of length L. Errors within that model are losses of whole sequences and point errors inside the sequences, such as insertions, deletions, and substitutions. We derive Gilbert-Varshamov lower bounds and sphere packing upper bounds on achievable cardinalities of error-correcting codes within this storage model. We further propose explicit code constructions that can correct errors in such a storage system and that can be encoded and decoded efficiently. Comparing the sizes of these codes to the upper bounds, we show that many of the constructions are close to optimal.
KW - Coding over sets
KW - DNA data storage
KW - Gilbert-Varshamov bound
KW - insertion and deletion errors
KW - sphere packing bound
UR - http://www.scopus.com/inward/record.url?scp=85082517678&partnerID=8YFLogxK
U2 - 10.1109/TIT.2019.2961265
DO - 10.1109/TIT.2019.2961265
M3 - Article
AN - SCOPUS:85082517678
SN - 0018-9448
VL - 66
SP - 2331
EP - 2351
JO - IEEE Transactions on Information Theory
JF - IEEE Transactions on Information Theory
IS - 4
M1 - 8937735
ER -