A Fourth-Grade Discovery That Shaped My Career
Back in 2010, I was a curious fourth-grader staring at the colorful posters on my math classroom walls. Among them was one that etched itself into my memory: the Pythagorean Theorem. Memorizing the iconic triples like (3-4-5) and (5-12-13) felt like solving magical puzzles. I didn’t know it then, but that simple formula would one day power some of the most critical algorithms I use in my career.
Fast forward 15 years, and I’m a data scientist clustering customers based on their transaction patterns and account growth. My go-to algorithm? K-Means clustering, a machine learning technique that owes its elegance and efficiency to none other than the Pythagorean theorem.
The Pythagorean Theorem’s Hidden Superpower
The Pythagorean theorem states:

$$c^2 = a^2 + b^2$$

where $c$ is the hypotenuse of a right triangle, and $a$ and $b$ are the other two sides.
But here’s the twist: this formula is the secret sauce behind measuring distances in machine learning.
By simply reinterpreting the sides of the triangle, we can measure distances in higher-dimensional spaces, a technique that underpins many algorithms.
The Secret Superpower: Euclidean Distance
Let’s start with the Euclidean distance, the straight-line distance between two points. Imagine two points on a 2D plane, $p = (x_1, y_1)$ and $q = (x_2, y_2)$. The distance between them is:

$$d(p, q) = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$$

This formula is essentially the Pythagorean theorem in disguise! The differences in the $x$- and $y$-coordinates form the triangle’s “legs,” while the straight-line distance between the points plays the role of the hypotenuse.
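To make this concrete, here’s a tiny Python sketch (the function name is mine, purely illustrative) that computes the 2D distance exactly as the formula reads:

```python
import math

def euclidean_distance_2d(p, q):
    """Straight-line distance between two 2D points, via the Pythagorean theorem."""
    dx = q[0] - p[0]  # horizontal "leg" of the right triangle
    dy = q[1] - p[1]  # vertical "leg" of the right triangle
    return math.sqrt(dx**2 + dy**2)  # the hypotenuse

print(euclidean_distance_2d((0, 0), (3, 4)))  # 5.0 -- the classic 3-4-5 triple
```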
Why Distance Matters in Machine Learning
In machine learning, distance = similarity. The closer two data points are in a feature space, the more alike they are.
For example, consider two customers:
- Customer A: 25 years old, earning $50K. Represented as: $A = (25, 50)$
- Customer B: 40 years old, earning $80K. Represented as: $B = (40, 80)$

To measure their similarity, calculate the Euclidean distance (with income in thousands of dollars):

$$d(A, B) = \sqrt{(40 - 25)^2 + (80 - 50)^2} = \sqrt{225 + 900} \approx 33.5$$
The smaller the distance, the more similar the customers. This simple concept becomes the backbone of clustering algorithms like K-Means.
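A quick sanity check in Python: NumPy’s `np.linalg.norm` of the difference vector computes exactly this Euclidean distance.

```python
import numpy as np

# Age and income (in $K) as 2D feature vectors
customer_a = np.array([25, 50])
customer_b = np.array([40, 80])

# Euclidean distance = length of the difference vector
distance = np.linalg.norm(customer_a - customer_b)
print(round(distance, 1))  # 33.5
```

One practical caveat: features on very different scales (say, income in raw dollars versus age in years) can dominate the distance, so in practice you would usually standardize features first.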
Scaling to Higher Dimensions (and Real-World Problems)
What if we add more features, like number of purchases or average transaction amount? The Euclidean distance formula adapts effortlessly. For two points $p$ and $q$ with $n$ features each:

$$d(p, q) = \sqrt{\sum_{i=1}^{n} (p_i - q_i)^2}$$

Even in 100-dimensional space, the principle remains the same: sum the squared differences along every axis, then take the square root.
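The code barely changes either. Here’s a sketch with four made-up features per customer; the numbers are invented for illustration:

```python
import numpy as np

# Illustrative features: age, income ($K), purchase count, avg transaction ($)
customer_a = np.array([25, 50, 12, 42.0])
customer_b = np.array([40, 80, 30, 55.5])

# Same formula, any number of dimensions
distance = np.sqrt(np.sum((customer_a - customer_b) ** 2))
print(distance)  # equivalent to np.linalg.norm(customer_a - customer_b)
```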
K-Means Clustering: Geometry in Action
K-Means is one of the most popular clustering algorithms in machine learning. Here’s how it works:
- Initialization: Start by guessing initial cluster centers (centroids).
- Assignment: Assign each data point to the nearest centroid, using Euclidean distance.
- Update: Recalculate the centroids as the average of all points assigned to them.
- Repeat: Continue until the centroids stabilize.
Euclidean distance is the heart of this process, ensuring that clusters group together similar points.
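Here’s a minimal NumPy sketch of that loop. It’s illustrative only: it assumes no cluster ever ends up empty and caps the iterations, whereas a production library like scikit-learn’s `KMeans` adds smarter initialization and convergence handling.

```python
import numpy as np

def kmeans(points, k, n_iters=100, seed=0):
    """Minimal K-Means. `points` has shape (n_samples, n_features)."""
    rng = np.random.default_rng(seed)
    # 1. Initialization: pick k random data points as starting centroids
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iters):
        # 2. Assignment: label each point with its nearest centroid (Euclidean distance)
        distances = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = distances.argmin(axis=1)
        # 3. Update: move each centroid to the mean of its assigned points
        new_centroids = np.array([points[labels == j].mean(axis=0) for j in range(k)])
        # 4. Repeat until the centroids stabilize
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels

# Demo on two well-separated blobs
rng = np.random.default_rng(1)
data = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(8, 1, (50, 2))])
centroids, labels = kmeans(data, k=2)
print(np.round(centroids, 1))  # one centroid near (0, 0), the other near (8, 8)
```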
Full Circle: A Fourth-Grade Formula in Action
In my customer segmentation project, every customer’s transaction history became a vector in multi-dimensional space. By calculating Euclidean distances, I grouped customers with similar behavior patterns into clusters. This allowed my team to design targeted marketing strategies and predict account growth effectively.
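The skeleton of that kind of segmentation looks roughly like this in scikit-learn; the feature names and values below are hypothetical stand-ins, not my actual project data:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

# Hypothetical features: monthly spend ($), transaction count, account growth (%)
X = np.array([
    [1200.0, 34, 2.5],
    [90.0, 3, 0.1],
    [1500.0, 40, 3.0],
    [110.0, 5, 0.2],
])

# Standardize so no single feature dominates the Euclidean distance
X_scaled = StandardScaler().fit_transform(X)

# KMeans assigns points to clusters using Euclidean distance under the hood
model = KMeans(n_clusters=2, n_init=10, random_state=42).fit(X_scaled)
print(model.labels_)  # e.g., [0 1 0 1] -- high-activity vs. low-activity customers
```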
Looking back, it’s incredible to see how a formula I first encountered in elementary school has grown with me, becoming a tool I use every day.
Final Thoughts
Math isn’t just a subject; it’s a lens for understanding the world. The Pythagorean theorem, once a tool for solving triangles, now powers machine learning models that drive real-world decisions. Whether it’s triangles on a chalkboard or billion-dollar ML models, the fundamentals remain timeless. Next time you see a right triangle, remember: you’re staring at the foundation of modern AI.