(Repost from 2015) I recently interviewed Hadley Wickham, the creator of ggplot2 and a well-known figure in the R community. He works for RStudio, where his job is to build open source software aimed at data geeks. Hadley is famous for his contributions to data science tooling, and his work has inspired tools in many other languages. I include some light edits. 1.… Continue reading Interview with a Data Scientist (Hadley Wickham)
It gives me great pleasure to interview Vicki Boykis – we’ve chatted a lot on Twitter over the past few years, and her blog and side projects have been inspiring for my own. Vicki is a Data Scientist and Engineer who tweets awesome stuff. She’s well worth following. Her Twitter bio – Born: Jewish in Russia. Raised:… Continue reading Interview with a Data Scientist – Vicki Boykis
TLDR: This is an opinionated post, but it is based on recent trends. What is Probabilistic Programming? I recently wrote a course teaching it. Probabilistic Programming is a newish paradigm used in Quantitative Finance, Biology, Insurance and Sports Analytics. It allows you to build generative models and infer both their latent parameters and the uncertainty of those parameters. It’s been… Continue reading Why Probabilistic Programming is the next big thing in Data Science
On being a Senior Data Scientist. This post is partly for myself and is based on conversations with various people. It is also inspired by https://www.kitchensoap.com/2012/10/25/on-being-a-senior-engineer/ I’m trying to answer questions like ‘what do we expect from a Senior Data Scientist?’. My job title is ‘Senior Data Scientist’ and I often joke I’ve no idea what… Continue reading What does it mean to be a Senior Data Scientist?
I was recently speaking to some data science friends on Slack, and we were discussing projects and war stories. One thing that came across was that ‘data science’ projects aren’t always successful. Somewhere in this discussion a lightbulb went off in my head about some of the problems we have when embarking on data science projects.… Continue reading One weird tip to improve the success of Data Science projects
I interviewed the fascinating Ian Wong – he’s the technical co-founder of OpenDoor, which I personally think is amazing as a concept! 1. What project that you have worked on do you wish you could go back to and do better? Pretty much any project I’ve worked on in the past. Two projects stick… Continue reading Interview with a Data Scientist – Ian Wong of OpenDoor
I’ve been in the Data Science space for a number of years now. I first got interested in AI/Machine Learning in 2009 and have a background typical of many people in my field – I come from Physics and Mathematics. One trend I’ve run into at both corporates and startups is that there… Continue reading Building Full-Stack Vertical Data Products
I recently gave a keynote at www.pycon.co, the first PyCon conference in Colombia. I spoke on Data Science Models in Production, covering lessons learned and the cultural aspects. I interviewed a Colombian Data Scientist – Juan Pablo Isaza Aristizábal. 1. What project that you have worked on do you wish you could go back to and do better?… Continue reading Interview with a Data Scientist: Juan Pablo Isaza Aristizábal
I caught up with Greg Linden via email recently. Greg was one of the first people to work on data science in industry – he invented the item-to-item collaborative filtering algorithm at Amazon.com in the late 90s. I’ll quote his bio from LinkedIn: “Much of my past work was in artificial intelligence, personalization, recommendations, search,… Continue reading Interview with a Data Scientist: Greg Linden
I was recently experimenting with RNNs in Keras. I used the example and edited it slightly. This is what I got for Nietzsche – as you can see, the answer to the question above is no. ----- diversity: 0.2 ----- Generating with seed: “iginal text, homo natura; to bring it ab” iginal text, homo natura;… Continue reading Are RNN’s ready to replace journalists?