Project Overview

Project Code: CIT 05

Project name:

Error-Correction Schemes in DNA-based Storage Systems

TUM Department:

CIT - Electrical and Computer Engineering

TUM Chair / Institute:

Coding and Cryptography

Research area:

Coding Theory

Student background:

Computer EngineeringComputer Science/ InformaticsElectrical EngineeringMathematics

Further disciplines:

Participation also possible online only:

Planned project location:

Building N4 - Central City Campus
Theresienstrasse 90
80333 München

Project Supervisor - Contact Details


Title:

Given name:

Anisha

Family name:

Banerjee

E-mail:

anisha.banerjee@tum.de

Phone:

+49(89)289 - 23492

Additional Project Supervisor - Contact Details


Title:

Prof. Dr.-Ing.

Given name:

Antonia

Family name:

Wachter-Zeh

E-mail:

antonia.wachter-zeh@tum.de

Phone:

+49 (89) 289 - 23495

Additional Project Supervisor - Contact Details


Title:

Given name:

Family name:

E-mail:

Phone:

Project Description


Project description:

Nowadays there is a growing demand of large scale storing systems due to the availability of massive data. Synthetic DNA is recognized as a promising, powerful storage medium to archive digital information. Since DNA is a marcomolecule, it can function as an ultra-dense storage system. In an experiment researchers stored approximately two petabyte of data per gram. Another advantage is its time durability because of its chemical stability, i.e. information can still be retrieved after a long amount of time, e.g. after 10000 years. Since the current biological writing and reading process is cumbersome, for now, DNA is seen as an ultra-dense, long-term storage system and will not replace a flash drive soon. The storing and retrieving methods work as follows: For writing, firstly your binary data will be mapped to multiple strings of DNA nucleotides representations. These strings will be then synthesized to DNA sequences. Reading out information is normally split into two parts. First the PCR, which is a method to generate thousands of copies of one DNA sequence, and after that the DNA sequencing, which is the actual reading process. During these overall processes many errors in the sequences can occur, such as mutations, duplications, or deletions of some nucleotides. To circumvent these errors special error-correcting codes need to be applied.
The objective of this project is to investigate the DNA storage channel with particular focus on the error-correction coding schemes.
[1] Yazdi et al. “DNA-Based Storage: Trends and Methods”, 2015.

Working hours per week planned:

40

Prerequisites


Required study level minimum (at time of TUM PREP project start):

2 years of bachelor studies completed

Subject related:

Linear Algebra, Probability Theory

Other:

  • Keine Stichwörter