(Repost from 2015) I recently interviewed Hadley Wickham the creator of Ggplot2 and a famous R Stats person. He works for RStudio and his job is to work on Open Source software aimed at Data Geeks. Hadley is famous for his contributions to Data Science tooling and inspires a lot of other languages! I include some light edits. 1.… Continue reading Interview with a Data Scientist (Hadley Wickham)
Learning to use the Cloud A common question from a Developer or a student is ‘how do I learn AWS or GCP’. For those of you who don’t know AWS – stands for Amazon Web Services and GCP stands for Google Cloud Platform. At the current moment in time I’ve no opinions on Azure from… Continue reading I’m lost, how do I learn to use the Cloud?
Bayesian Analysis the good parts One of the questions I’m often asked is what’s so powerful about Bayesian analysis? I speak regularly to analysts, who’ve heard of some of the powerful aspects of it, but haven’t heard enough to emotionally invest time in learning it. I’ve thought about this on and off for a few… Continue reading Think you need to learn Bayesian Analysis? Read this first
You have a problem that you think might need some Bayesian modelling A common question I’m asked is how do you start? In this tutorial I take you from a fresh data set, the data set is an educational dataset. I don’t know anything about the data, and I have no specific domain knowledge. I… Continue reading How to build a bayesian model in 30 minutes?
Probabilistic Programming versus Machine Learning In the past ten years, we’ve seen an explosion in Machine Learning applications, these applications have been particularly successful in search, e-commerce, advertising, social media and other verticals. These applications have been particularly focused on predictive accuracy and often involve large amounts of data — sometimes in the region of terabytes — in fact this… Continue reading Why would I ever NEED Bayesian Statistics?
Today after over a decade of programming, I was reminded that programming is hard. I was committing a fix to PyMC3 (but it could have been any project, and I definitely make those mistakes professionally), and Thomas was good enough to point out ‘break-point committed’. JD Long shared his own war stories. The lesson I… Continue reading Coding is hard
I’m seeing a very fuzzy line between where technical business analysts end & IT teams begin. Y’all know of good articles/books on managing responsibilities when coding analysts on the biz side interface with IT dev teams? Seems tricky. #rstats #pydata #python — JD Long (@CMastication) October 19, 2018 My friend JD Long, has been a… Continue reading I’m an Analyst and the software engineers made fun of my code!
I’ve been thinking and discussing with various people lately – ‘career path for data science’. Someone said to me recently: Go become a research scientist and specialise in specific machine learning models say NLP at a specific company such as Google, Amazon, etc. Become a data scientist at a startup or growth company and accept… Continue reading Adding value as Data Scientists
It gives me great pleasure to interview Vicki Boykis – we’ve chatted a lot on Twitter over the past few years and her blog/ side projects have been inspiring for my own. Vicki is a Data Scientist and Engineer who tweets awesome stuff. She’s well worth following. Her twitter bio – Born: Jewish in Russia. Raised:… Continue reading Interview with a Data Scientist – Vicky Boykis
Machine Learning hipster effect Machine Learning is very in vogue at the moment. I feel that a pressure some junior data scientists and engineers feel is the need to do ML just to be a cool hipster, or as a friend of mine calls it ‘the ML hipster trap’. What is the ML hipster trap?… Continue reading Avoiding the ML hipster trap