AI & Machine Learning

Shortcut Learning, The Reason ML Models Often Fail in Practice

Training machine learning models is far from easy. Shortcut learning typically arises when there isn’t enough data to force algorithms into learning the task properly.

Good Data Scientists Don’t Gather Project Requirements. They Dig For Them

The majority of Data Science projects fail. The reasons for the high failure rate are many and varied. However, as surprising as this may sound, one of the main reasons is the lack of clearly defined project goal(s) and the associated requirements.

4 MINUTES READ Continue Reading »

Understanding Graph Embeddings

This blog post gives you a better intuitive feel for what graph embeddings are and how they are used to accelerate real-time analytics. Within the next few years we will see graph embedding take center stage in the area of innovative analytics.

11 MINUTES READ Continue Reading »
  • Top articles, research, podcasts, webinars and more delivered to you monthly.

  • How We Use Machine Learning to Help Predict Customer Churn

    There are dozens of companies that lost their customers due to a million tiny issues or a couple of big ones. This article shows how Machine Learning can help predict customer churn.

    3 MINUTES READ Continue Reading »

    A Layman’s Guide to Data Science Part 3: Data Science Workflow

    By now, you have already gained enough knowledge and skills about Data Science and have built your first (or even your second and third) project. At this point, it is time to improve your workflow to facilitate further development process.

    7 MINUTES READ Continue Reading »

    Beyond Weisfeiler-Lehman: Approximate Isomorphisms And Metric Embeddings

    This post argues that the graph isomorphism setting is too limiting for analysing the expressive power of graph neural networks and suggest a broader setting based on metric embeddings.

    5 MINUTES READ Continue Reading »

    Linear Regression And Gradient Descent For Absolute Beginners

    Linear regression is about finding the line of best fit for a dataset. This line can then be used to make predictions. Gradient descent is a tool to arrive at the line of best fit.

    5 MINUTES READ Continue Reading »

    Beyond Weisfeiler-Lehman: Using Substructures For Provably Expressive Graph Neural Networks

    This post discusses how to design local and computationally efficient provably powerful graph neural networks that are not based on the Weisfeiler-Lehman tests hierarchy.

    7 MINUTES READ Continue Reading »

    Guide to Classification on Imbalanced Datasets

    This is a comprehensive tutorial on handling imbalanced datasets. Whilst these approaches remain valid for multiclass classification, the main focus of this article will be on binary classification for simplicity.

    22 MINUTES READ Continue Reading »

    Expressive Power Of Graph Neural Networks And The Weisfeiler-Lehman Test

    Do you have a feeling that deep learning on graphs is a bunch of heuristics that work sometimes and nobody has a clue why? In this post, I discuss the graph isomorphism problem, the Weisfeiler-Lehman heuristic for graph isomorphism testing, and how it can be used to analyse the expressive power of graph neural networks.

    9 MINUTES READ Continue Reading »

    101 Data Science Quotes

    This article compiled a large collection of inspirational quotes on data science. The quotes are categorised alphabetically. Hopefully, these words of wisdom will bring perspectives and inspire your data science journey.

    14 MINUTES READ Continue Reading »

    Why Automated Machine Learning Is Becoming a Must-Have Business Intelligence Skill

    Predictive analytics used to be in the domain of more technical employees, but today, no-code automated machine learning (AutoML) tools mean anyone can deploy AI.

    2 MINUTES READ Continue Reading »