Hammer Lab has built and maintains Pageant (https://github.com/hammerlab/pageant), a parallel genomic analysis toolkit, which contains tools for analyzing genomic data on Spark as well as libraries for more general computations using RDDs.
Ryan will discuss some of the most interesting applications and algorithms therein:
• coverage-depth (https://github.com/hammerlab/coverage-depth): joint histograms of coverage depth for one or two genomic-read datasets
• guacamole (https://github.com/hammerlab/guacamole): work-in-progress somatic variant caller
• suffix-arrays (https://github.com/hammerlab/suffix-arrays): proof-of-concept implementations of distributed construction of suffix arrays and FM-indices
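As a rough illustration of the first item above, a joint coverage-depth histogram counts genomic positions by their pair of depths across two read datasets. The sketch below is a single-machine Python toy, not the project's Spark implementation; the function names, the half-open (start, end) read representation, and the sample reads are all assumptions made for illustration.

```python
from collections import Counter


def depths(reads, length):
    """Per-position coverage depth over a contig of the given length.

    Each read is a hypothetical half-open (start, end) interval.
    """
    depth = [0] * length
    for start, end in reads:
        # Clamp the read to the contig bounds before counting.
        for pos in range(max(start, 0), min(end, length)):
            depth[pos] += 1
    return depth


def joint_histogram(reads_a, reads_b, length):
    """Count positions keyed by (depth in dataset A, depth in dataset B)."""
    return Counter(zip(depths(reads_a, length), depths(reads_b, length)))


# Made-up reads on an 8-base contig.
hist = joint_histogram([(0, 4), (2, 6)], [(1, 3)], 8)
```

Here `hist[(2, 1)]` is the number of positions covered by two reads in the first dataset and one read in the second. A distributed version would presumably compute per-position depths in parallel and then aggregate the pairs, but that is not shown here.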
Session hashtag: #SFeco6