Dataist Dogma

Reflections and projects in Data Science, Machine Learning and AI "A critical examination of the Dataist dogma is likely to be not only the greatest scientific challenge of the twenty-first century, but also the most urgent political and economic project" - Yuval Noah Harari - Homo Deus: a Brief History of the Future (2016)

Articles


Using NLP to extract terms and conditions

A project that seeks to solve the business problem: "what obligations do I have as a business buried in all my terms and conditions documents" using pre-trained Natural Language Processing (NLP) models available for Python.

Six of the best podcasts on ML, AI and Data Science

Technology futurists tell us that one day we'll be able to plug our brains into a computer and download the complete knowledge of humanity. Until that day comes, putting headphones in your ears and listening to podcasts is the next best thing. There are a plethora of good podcasts on data science, machine learning and AI. They can teach basic concepts to the uninitiated or go right into the detail of the latest research. Below are the podcasts I listen to on a regular basis and would recommend you take a look at.

Social media sentiment analysis

A project to analyze social media Tweets relating to a bank (NAB) to get an understanding of changing sentiment over time, as well as the topics that are driving that sentiment. The application uses the Tweepy Twitter API to collect the tweets, the NLTK Natural Language Processing Toolkit to analyse the text and the Vader library to analyse sentiment.

Recommender system: Australian suburbs using clustering and the Foursquare API

This project utilizes publicly available data to recommend 10 similar suburbs in Australia to a given suburb. This could be useful for people trying to decide where to work, rent or buy property. It utilizes K-Means clustering from Scikit-learn to cluster venue and cultural data, the pandasdmx package to interface with the Australian Bureau of Statistics API, and the Beautiful Soup package to scrape demographic information from the web.

Five great books on Artificial Intelligence

There is a LOT of AI literature out there. Amazon.com has over 7000 AI non-fiction books in their catalogue. I've read a bunch of them (and have a bunch more on my reading list) but there are five that stand out to me as mandatory reading. The books I've listed here are by leading computer scientists and thinkers and really expanded my understanding and even my worldview. The first two get into technical details which really require reading the text, but the last three are well suited to the Audiobook format if you don't have the time to curl up on the couch with a paperback.