Interview with a Data Scientist (Hadley Wickham)

(Repost from 2015) I recently interviewed Hadley Wickham, the creator of ggplot2 and a famous R Stats person. He works for RStudio, and his job is to work on Open Source software aimed at Data Geeks. Hadley is famous for his contributions to Data Science tooling, and his work has inspired tools in a lot of other languages! I include some light edits. 1.…

I’m lost, how do I learn to use the Cloud?

Learning to use the Cloud: A common question from a Developer or a student is ‘how do I learn AWS or GCP’. For those of you who don’t know, AWS stands for Amazon Web Services and GCP stands for Google Cloud Platform. At the moment I’ve no opinions on Azure from…

Think you need to learn Bayesian Analysis? Read this first

Bayesian Analysis, the good parts: One of the questions I’m often asked is what’s so powerful about Bayesian analysis? I speak regularly to analysts who’ve heard of some of its powerful aspects, but haven’t heard enough to emotionally invest in learning it. I’ve thought about this on and off for a few…

How to build a Bayesian model in 30 minutes?

You have a problem that you think might need some Bayesian modelling: A common question I’m asked is how do you start? In this tutorial I start with a fresh educational data set. I don’t know anything about the data, and I have no specific domain knowledge. I…

Why would I ever NEED Bayesian Statistics?

Probabilistic Programming versus Machine Learning: In the past ten years, we’ve seen an explosion in Machine Learning applications. These applications have been particularly successful in search, e-commerce, advertising, social media and other verticals. They have focused on predictive accuracy and often involve large amounts of data, sometimes in the region of terabytes; in fact this…

I’m an Analyst and the software engineers made fun of my code!

“I’m seeing a very fuzzy line between where technical business analysts end & IT teams begin. Y’all know of good articles/books on managing responsibilities when coding analysts on the biz side interface with IT dev teams? Seems tricky. #rstats #pydata #python” (JD Long, @CMastication, October 19, 2018) My friend JD Long has been a…

Adding value as Data Scientists

Lately I’ve been thinking about, and discussing with various people, the ‘career path for data science’. Someone said to me recently: Go become a research scientist and specialise in specific machine learning models, say NLP, at a specific company such as Google, Amazon, etc. Become a data scientist at a startup or growth company and accept…

Interview with a Data Scientist – Vicki Boykis

It gives me great pleasure to interview Vicki Boykis. We’ve chatted a lot on Twitter over the past few years, and her blog and side projects have been inspiring for my own. Vicki is a Data Scientist and Engineer who tweets awesome stuff. She’s well worth following. Her Twitter bio: Born: Jewish in Russia. Raised:…