One term that has surprised me on data projects is ‘governance’ or ‘data quality’ or ‘master data management’. It’s surprised me because I’m not an expert in this discipline and it’s quite different to my Machine Learning work. The aim of this blog post is to just jot down some ideas on ‘data governance’ and… Continue reading What does a Data Scientist need to know about Data Governance?
I spent about a year working as an Energy Analyst in Procurement at a large Telecommunications company. I’m by no means an expert but these are my own thoughts on where I feel ‘big data’ fits into procurement. Firstly for the stake of this argument let us consider procurement as a the purchase of goods… Continue reading Where does ‘Big Data’ fit into Procurement?
I recently attended and keynoted at PyData Amsterdam 2016. (Clockwise from top right – ‘The Sunset when the event was closing’, ‘Peadar Coyle giving a keynote at PyDataAmsterdam’, ‘Video interviews with Holden Karau a Spark expert from IBM’, ‘The organizing committee’, ‘Maciej Kula of Lyst talking about Recommendation Engines’.) Firstly this was a wonderful conference,… Continue reading PyData Amsterdam
One question you have when you use Python is what do I do with my data. How do I process it and analyze it. The aim of this flow chart is to simply provide a simple to use ‘map’ of the PyData stack. At PyData Amsterdam I’ll present this and explain it in more detail… Continue reading A map of the PyData Stack