“One thousand petabytes of genomic data are already being stored worldwide, and the aggregate cost of genomic storage is expected to grow from US $0.5 billion today to US $5 billion by 2021. There will be a need to reduce the footprint of genomic datasets in FASTQ and BAM, at the same time preserving genotyping…
Enabling the digitalization of personalized medicine
We are developing a new generation of genomic information compressors that build on established expertise in digital information processing and entropy coding.
Efficient Data Access
We are working on a new genomic data format that supports most stages of existing sequencing and analysis pipelines, a clear advantage over the current practice of using a different file format at each stage.
Genomic Data Transport
We are improving a genomic data transport layer that provides essential features such as streaming and incremental updates of data and metadata.