(Repost from 2015) I recently interviewed Hadley Wickham, the creator of ggplot2 and a well-known figure in the R statistics community. He works for RStudio, and his job is to build open source software aimed at data geeks. Hadley is famous for his contributions to data science tooling, and his work has inspired tools in a lot of other languages! I include some light edits. 1.… Continue reading Interview with a Data Scientist (Hadley Wickham)
Learning to use the Cloud A common question from developers and students is ‘how do I learn AWS or GCP?’ For those of you who don’t know, AWS stands for Amazon Web Services and GCP stands for Google Cloud Platform. At the moment I have no opinions on Azure from… Continue reading I’m lost, how do I learn to use the Cloud?
Bayesian Analysis: the good parts One of the questions I’m often asked is: what’s so powerful about Bayesian analysis? I speak regularly to analysts who’ve heard about some of its powerful aspects, but haven’t heard enough to commit the time to learning it. I’ve thought about this on and off for a few… Continue reading Think you need to learn Bayesian Analysis? Read this first
I did a review like this in 2017. It’s fun to look back on what I’ve learned and what I’ve accomplished. This one will focus on contracting and side projects. (Apologies to Julia Evans, I basically just stole her format.) I got married! I won’t embarrass my wife online. But this year, after a… Continue reading 2018: Year in Review
What is Bayesian Statistics? Bayesian Statistics (or Probabilistic Programming) takes a deeper and often more effective approach to analysing any given data and situation. A/B testing is one of the hottest topics on the internet nowadays. In this kind of test, you simply consider two different groups, A and B, to analyze the performance of both… Continue reading 3 reasons to learn Bayesian Statistics in the new year
Bayesian Statistics and Supply Chain A supply chain can be thought of as a set of coordinated procedures that link manufacturers, suppliers, warehouses, and stores so that material is produced and distributed in the right quantities, to the right location, and at the right time. This, in turn, ensures that the total supply… Continue reading Applications of Bayesian Statistics: Supply Chain
I’ve been contributing on and off to PyMC3 and other projects for a few years now. I’m still learning a lot about Bayesian Statistics and building software, and I intend to keep working on this stuff. Chris Fonnesbeck recently gave a talk at NeurIPS where he talked about some of the stuff we’ve learned in… Continue reading 3 key lessons from being an OSS developer
One question I’m often asked by people who know machine learning is: how do I build a Bayesian logistic regression model? If you know how to build a logistic regression model in sklearn or another standard machine learning library, it’s quite easy to learn how to do the Bayesian version. In this screencast… Continue reading New Screencast: How do I build a Logistic Regression model the Bayesian way?
You have a problem that you think might need some Bayesian modelling. A common question I’m asked is: how do you start? In this tutorial I take you through a fresh data set, in this case an educational one. I don’t know anything about the data, and I have no specific domain knowledge. I… Continue reading How to build a bayesian model in 30 minutes?
I recently gave a talk to the excellent research team at Signal Media, and got asked the question I didn’t want to be asked: what is the BFMI in PyMC3? The way I largely think of it, on a practical level, is that if the BFMI metric is below the… Continue reading What is BFMI (Bayesian Fraction of Missing Information)?