May 03, 2016
by Francesco Gadaleta
Produced by: worldofpiggy.com
How would you perform accurate classification on a very large dataset, by just looking at a sample of it?
In this episode I interview friend and colleague Rocco Langone, Machine Learning Researcher at the University of Leuven, Belgium. One of his recent papers is about big data and similarity metrics.
In this work Rocco proposes a deterministic method to obtain subsets from Big Data which are a good representative of the inherent structure in the data itself. This allows one to consider only a subset of the entire dataset, still performing at high accuracy if not better than traditional (eg. random) sampling.
As you can see, there is always a solution in Big Data. More details in this episode. Enjoy!
</a> Algorithm Flowchart[/caption]
<img src=h”ttps://s3-eu-west-1.amazonaws.com/wopcontent/uploads/2016/05/mapreduce.jpg”> Distributed Map-Reduce graph construction
I provide high quality training on statistics, data science, computer programming, in order to facilitate setting data analytics pipelines in the most optimized and cost effective ways. Feel free to schedule a meeting with me
Designing algorithms is what I have been doing in the last 10 years and more. I can set up data analytics solutions for small and large enterprises and apply machine learning algorithms to detect patterns and trends, extract knowledge and support decisions within several commercial domains, from finance, healthcare, traffic, and sales forecasting. I design and deploy algorithms and cloud-based software systems for production environments with high industrial standards. Feel free to schedule a meeting with me
I can provide technical consulting regarding data science, deep learning, computing architectures and the most prominent software packages currently available. My skills in distributed infrastructure are essential to give the insights you need to start your own data analytics pipeline in the most optimized and efficient environment. Feel free to schedule a meeting with me