Interview with a Data Scientist (Hadley Wickham)

(Repost from 2015) I recently interviewed Hadley Wickham, the creator of ggplot2 and a well-known figure in the R community. He works for RStudio, where his job is to build open source software aimed at data geeks. Hadley is famous for his contributions to data science tooling, and his work has inspired tools in many other languages. I include some light edits. 1.…

Think you need to learn Bayesian Analysis? Read this first

Bayesian Analysis: the good parts. One of the questions I’m often asked is: what’s so powerful about Bayesian analysis? I speak regularly to analysts who’ve heard of some of its powerful aspects, but haven’t heard enough to commit time to learning it. I’ve thought about this on and off for a few…

3 reasons to learn Bayesian Statistics in the new year

What is Bayesian Statistics? Bayesian Statistics (or Probabilistic Programming) takes a deeper and more effective approach to analysing a given dataset and situation. A/B testing is one of the hottest topics on the internet nowadays. In an A/B test, you consider two different groups, A and B, and analyze the performance of both…

Applications of Bayesian Statistics: Supply Chain

Bayesian Statistics and Supply Chain. A supply chain can be thought of as a set of coordinated procedures that link manufacturers, suppliers, warehouses, and stores to ensure that goods are produced and distributed in the right quantities, at the right location, and at the right time. This, in turn, ensures that the total supply…

3 key lessons from being an OSS developer

I’ve been contributing on and off to PyMC3 and other projects for a few years now. I’m still learning a lot about Bayesian Statistics and building software, and I intend to keep working on both. Chris Fonnesbeck recently gave a talk at NeurIPS about some of the things we’ve learned in…

Why would I ever NEED Bayesian Statistics?

Probabilistic Programming versus Machine Learning. In the past ten years, we’ve seen an explosion in Machine Learning applications. These applications have been particularly successful in search, e-commerce, advertising, social media and other verticals. They have focused on predictive accuracy and often involve large amounts of data (sometimes in the region of terabytes); in fact this…

I’m an Analyst and the software engineers made fun of my code!

“I’m seeing a very fuzzy line between where technical business analysts end & IT teams begin. Y’all know of good articles/books on managing responsibilities when coding analysts on the biz side interface with IT dev teams? Seems tricky. #rstats #pydata #python” (JD Long, @CMastication, October 19, 2018)

My friend JD Long has been a…

How to use AWS Lambda to build a tweetbot

Inspired by Vicki, I decided to build a tweetbot; the code is available here. You can follow the tweetbot online. The architecture that Vicki proposed is basically what I used, with a few changes in the code. Time taken: It’s worth pointing out first that it took me approximately 32 hours of…

Interview with a Data Scientist – Vicki Boykis

It gives me great pleasure to interview Vicki Boykis. We’ve chatted a lot on Twitter over the past few years, and her blog and side projects have been an inspiration for my own. Vicki is a Data Scientist and Engineer who tweets awesome stuff. She’s well worth following. Her Twitter bio: Born: Jewish in Russia. Raised:…

Why Probabilistic Programming is the next big thing in Data Science

TLDR: This is an opinionated post, but it is based on recent trends. What is Probabilistic Programming? I recently wrote a course teaching this. Probabilistic Programming is a newish paradigm used in Quantitative Finance, Biology, Insurance and Sports Analytics. It allows you to build generative models to infer latent parameters and the uncertainty of those parameters. It’s been…