Decision Trees through an Example
Here I blog on topics related to Big Data and Data Science. Articles may be executive summaries, tool analyses, tool comparisons, architectural patterns, introductions to complex topics, or 'how to' tutorials that share code snippets.

So far, we have seen what decision trees are, why we need them, which measures help in creating a decision tree and how...

Sai Geetha M N
- Sep 17, 2022
- 4 min
Decision Trees - Feature Selection for a Split
In the previous two articles, "Decision Trees - How to decide the split?" and "Decision Trees - Homogeneity Measures", I have laid the...
233 views, 1 comment

Sai Geetha M N
- Sep 4, 2022
- 4 min
Decision Trees - Homogeneity Measures
Having been introduced to what homogeneity is and to the 3 basic types of measures that can be used in the previous article on...
135 views, 0 comments

Sai Geetha M N
- Aug 16, 2022
- 3 min
Decision Trees - How to decide the split?
In the introduction to Decision trees, we have seen that the whole process is to keep splitting one node into two based on certain...
53 views, 0 comments

Sai Geetha M N
- Jul 30, 2022
- 2 min
Why Decision Trees?
As we saw in the last article introducing Decision Trees, decision trees can be used for classification or regression. But the same can...
101 views, 0 comments

Sai Geetha M N
- Jan 23, 2022
- 4 min
Hierarchical Clustering Through an Example
I have taken a problem statement of an NGO wanting to find the top 5-10 countries from a list of 169 who are in dire need of aid, in the...
144 views, 0 comments

Sai Geetha M N
- Jan 16, 2022
- 3 min
Hierarchical Clustering - Types of Linkages
In the previous post on Hierarchical Clustering, we saw when it is used and why. We glossed over the criteria for creating...
876 views, 0 comments

Sai Geetha M N
- Nov 26, 2021
- 5 min
Hierarchical Clustering: A Deep Dive
In the last five blog posts, I have discussed the basics of Clustering and then, K-Means clustering in detail. In my "Introduction to...
83 views, 0 comments

Sai Geetha M N
- Nov 4, 2021
- 10 min
K-Means Clustering through An Example
Now that we have understood the basics of K-Means Clustering, let us dive a little deeper today. Let us look at one practical problem and...
138 views, 0 comments

Sai Geetha M N
- Oct 21, 2021
- 4 min
Steps towards Data Science or Machine Learning Models
Having completed the basics of K-Means clustering in the last 3 weeks, I was tempted to take you through an example problem through code....
80 views, 0 comments


Sai Geetha M N
- Oct 8, 2021
- 6 min
K-Means Clustering: Part 3 of 3
Theoretically and mathematically, we have understood a great deal about K-Means Clustering through Part 1 and Part 2 of this series. If...
32 views, 0 comments

Sai Geetha M N
- Oct 1, 2021
- 4 min
K-Means Clustering: Part 2 of 3
Last week, we looked at the basic understanding of how K-Means Clustering works through the 5-step process where the two steps of...
25 views, 0 comments

Sai Geetha M N
- Sep 23, 2021
- 4 min
K-Means Clustering: Part 1 of 3
Having looked at Clustering in general and also having heard that K-Means is one of the simplest and most popular clustering algorithms,...
79 views, 0 comments

Sai Geetha M N
- Sep 16, 2021
- 6 min
Introduction to Clustering Algorithms
As mentioned in one of my introductory articles on "Machine Learning Algorithms Categories", Clustering Algorithms are a set of...
63 views, 0 comments

Sai Geetha M N
- Sep 9, 2021
- 5 min
When can you use Linear Regression?
It's been a while since my last post, as I was caught up with a couple of speaking engagements - one at a university for engineering...
79 views, 1 comment

Sai Geetha M N
- Aug 3, 2021
- 4 min
Prediction Vs Forecasting in Supervised Learning
In supervised learning and especially in the context of Linear regression, we often use these two terms: Prediction and Forecast. We also...
76 views, 0 comments

Sai Geetha M N
- Jul 8, 2021
- 8 min
Machine Learning - Rendezvous Architecture
The Rendezvous architecture proposed by Ted Dunning and Ellen Friedman in their book on Machine Learning Logistics was a wonderful...
339 views, 0 comments

Sai Geetha M N
- Jul 1, 2021
- 7 min
Big Data Architecture for Machine Learning
Machine Learning by itself is a branch of Artificial Intelligence that has a large variety of algorithms and applications. One of my...
189 views, 0 comments

Sai Geetha M N
- Jun 25, 2021
- 6 min
Data Scientists, Data Engineers, ML Engineers And More - Demystified
As the world of Big Data, Machine Learning and Artificial Intelligence is taking off, there is an overlap of roles and responsibilities...
152 views, 0 comments

Sai Geetha M N
- May 27, 2021
- 5 min
MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine Learning algorithm. This has to be understood...
112 views, 0 comments