Probabilistic Programming versus Machine Learning In the past ten years, we’ve seen an explosion in Machine Learning applications, these applications have been particularly successful in search, e-commerce, advertising, social media and other verticals. These applications have been particularly focused on predictive accuracy and often involve large amounts of data — sometimes in the region of terabytes — in fact this… Continue reading Why would I ever NEED Bayesian Statistics?
I’ve been recently playing around with ‘arviz’. For those of you who don’t know Arviz is a library for exploratory analysis Bayesian Models. I’ve got a Bayesian model built – or someone has built one for me, how do I explore it? How do I plot it? This is the fundamental question that Arviz answers.… Continue reading How do I visualise the results of a Bayesian Model: Rugby models in Arviz
The insightful Data Scientist Trey Causey talks about Software Development Skills for Data Scientists I’m going to write about my views on Code Review – as a Data Scientist with a few years experience, and experience delivering Data Products at organizations of varying sizes. I’m not perfect and I’m still maturing as an Engineer. A good… Continue reading Why Code review? Or why should I care as a data scientist.
I recently caught up with Alice Zheng a Director of Data Science at Dato – Alice is an expert on building scalable Machine Learning models and currently works for www.dato.com who are a company providing tooling to help you build scalable machine learning models easily. She is also a keen advocate of encouraging women in… Continue reading Interview with a Data Scientist: Alice Zheng
I recently caught with with Maria Rosario Mestre – she shared her personal views on Data Science – like all these interviewee subjects – these do not reflect her employers views. Biography – Maria: I completed a PhD in signal processing at Cambridge developing models of user behaviour using brain data. After the PhD I… Continue reading Interview with a Data Scientist: Maria Rosario Mestre
Trey Causey is a blogger with experience as a professional data scientist in sports analytics and e-commerce. He’s got some fantastic views about the state of the industry, and I was privileged to read this. 1. What project have you worked on do you wish you could go back to, and do better? The easy and… Continue reading Interview with a Data Scientist: Trey Causey
(Linkedin picture) I was very happy to interview Natalie about her data science stuff – as she gave a really cool Machine Learning focused talk at PyData in London this year, which was full of insights into the challenges of doing Machine Learning with Imbalanced data sets. Natalie leads the data team at GoCardless, a… Continue reading Interview with a Data Scientist: Nathalie Hockham
Recently I decided to do some quick Data Analysis of my interviews with data scientists. It seems natural when you collect a lot of data to explore it and do some data analysis on it. You can access the code here. The code isn’t in much depth but it is a simple example of how… Continue reading Interviews with Data Scientists: NLP for the win
As part of my interviews with Data Scientists I recently caught up with Erik Bernhardsson who is famous in the world of ‘Big Data’ for his open source contributions, his leading of teams at Spotify, and his various talks at various conferences. 1. What project have you worked on do you wish you could go back to,… Continue reading Interview with a Data Scientist: Erik Bernhardsson
J.D.Long is the current AVP Risk Management at RenaissanceRe and has a 15 year history of working as an analytics professional. I sent him an interview recently to see what he would say. Good questions Peadar. Here’s a really fast attempt at answers: 1. What project have you worked on do you wish you could go back… Continue reading An interview with a data artisan