Probabilistic Programming versus Machine Learning In the past ten years, we’ve seen an explosion in Machine Learning applications, these applications have been particularly successful in search, e-commerce, advertising, social media and other verticals. These applications have been particularly focused on predictive accuracy and often involve large amounts of data — sometimes in the region of terabytes — in fact this… Continue reading Why would I ever NEED Bayesian Statistics?
I’m on https://www.datacamp.com/community/podcast/human-centered-design-data-science at 38:10 talking about interpretability and fairness in Machine Learning.
I’ve been recently playing around with ‘arviz’. For those of you who don’t know Arviz is a library for exploratory analysis Bayesian Models. I’ve got a Bayesian model built – or someone has built one for me, how do I explore it? How do I plot it? This is the fundamental question that Arviz answers.… Continue reading How do I visualise the results of a Bayesian Model: Rugby models in Arviz
Today after over a decade of programming, I was reminded that programming is hard. I was committing a fix to PyMC3 (but it could have been any project, and I definitely make those mistakes professionally), and Thomas was good enough to point out ‘break-point committed’. JD Long shared his own war stories. The lesson I… Continue reading Coding is hard
I’m seeing a very fuzzy line between where technical business analysts end & IT teams begin. Y’all know of good articles/books on managing responsibilities when coding analysts on the biz side interface with IT dev teams? Seems tricky. #rstats #pydata #python — JD Long (@CMastication) October 19, 2018 My friend JD Long, has been a… Continue reading I’m an Analyst and the software engineers made fun of my code!
Inspired by Vicki I decided to build a Tweetbot – the code is available here. You can follow the tweetbot online – the architecture that Vicki proposed is basically what I did, only I made a few changes in the code. Time taken It’s worth pointing out first that it took me approximately 32 hours of… Continue reading How to use AWS Lambda to build a tweetbot
I’ve been thinking and discussing with various people lately – ‘career path for data science’. Someone said to me recently: Go become a research scientist and specialise in specific machine learning models say NLP at a specific company such as Google, Amazon, etc. Become a data scientist at a startup or growth company and accept… Continue reading Adding value as Data Scientists
It gives me great pleasure to interview Vicki Boykis – we’ve chatted a lot on Twitter over the past few years and her blog/ side projects have been inspiring for my own. Vicki is a Data Scientist and Engineer who tweets awesome stuff. She’s well worth following. Her twitter bio – Born: Jewish in Russia. Raised:… Continue reading Interview with a Data Scientist – Vicky Boykis
TLDR: This is an opinionated post, but based on recent trends. What is Probabilistic Programming? I recently wrote a course teaching this. Probabilistic Programming is a newish paradigm used in Quantitative Finance, Biology, Insurance and Sports Analytics – it allows you to build generative models to infer latent parameters and the uncertainty of those parameters. It’s been… Continue reading Why Probabilistic Programming is the next big thing in Data Science