Marco Capuccini is a data scientist and bioinformatician. He started his career as a software engineer, working for IBM and Sopra Steria in Europe. After completing his undergraduate studies in computer science and bioinformatics, he started a PhD at Uppsala University (Sweden), where he is currently enrolled. Marco is developing methods for running scientific applications, traditionally run on HPC clusters, on cloud resources. He uses Spark as the main tool to enable large-scale data processing in his research.