Photo of

Michael Malak

Senior Software Engineer, Oracle

Michael Malak has been implementing Spark solutions for two Fortune 200 companies since early 2013. He is currently at Oracle in Colorado in a team developing a Spark-based Big Data cloud app. He has an M.S. Math from George Mason University. His book Spark GraphX In Action is due to be published later in 2015.


Extending Word2Vec for Performance and Semi-Supervised Learning

MLLib Word2Vec is an unsupervised learning technique that can generate vectors of features that can then be clustered. But the weakness of unsupervised learning is that although it can say an apple is close to…