Over the past years, technological advances in sequencing (i.e., the process of reading out genomic information) have led to a faster and more cost-efficient approach to sequence individual genomes. Because of the enormous amount of sequencing data generated by high-throughput sequencing (HTS) machines, the processing, storage, and analysis of sequencing data entails novel challenges for the scientific community. Novel processes and tools have to be developed to overcome the current limitations in terms of storage space, processing speed, and many more.
Raw sequencing data generated by HTS machines passes through a great number of different analysis steps. Our goal is to develop novel algorithms to enhance the information processing "from the tissue to the hard drive".