Marco Capuccini is a data scientist and bioinformatician. He started his career as a software engineer, working for IBM and Sopra Steria in Europe. After completing his undergraduate studies in computer science and bioinformatics, he started a PhD at Uppsala University (Sweden), where he is currently enrolled. Marco is developing methods to run scientific applications, traditionally run on HPC clusters, on cloud resources. He uses Spark as the main tool to enable large-scale data processing in his research.


EasyMapReduce: Leverage the Power of Spark and Docker to Scale Scientific Tools in MapReduce Fashion

High-throughput methods in various scientific fields have produced massive datasets in the past decade, and Big Data frameworks such as Apache Spark are a natural choice for large-scale analysis. In scientific applications, many tools…