top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search


Decision Trees - An Introduction
Decision trees are an algorithm class that form the foundation for Random Forests, a class of algorithms that is extensively used in...
Sai Geetha M N
Jul 23, 20224 min read
279 views
0 comments


K-Means Clustering: Part 3 of 3
Theoretically and mathematically, we have understood a great deal about K-Means Clustering through Part 1 and Part 2 of this series. If...
Sai Geetha M N
Oct 8, 20216 min read
59 views
0 comments


K-Means Clustering: Part 2 of 3
Last week, we looked at the basic understanding of how K-Means Clustering works through the 5-step process where the two steps of...
Sai Geetha M N
Oct 1, 20214 min read
40 views
0 comments


K-Means Clustering: Part 1 of 3
Having looked at Clustering in general and also having heard that K-Means is one of the simplest and most popular clustering algorithms,...
Sai Geetha M N
Sep 23, 20214 min read
106 views
0 comments


Feature Selection in Machine Learning
Selecting the right features that contribute to your model is an art and a science. I call it art because much pain can be saved if you...
Sai Geetha M N
Jul 19, 20217 min read
926 views
0 comments


HBase Architecture
We looked at the basics of HBase in the previous article, last week. Today we will understand the Architecture of HBase. We all agree...
Sai Geetha M N
Jun 10, 20217 min read
566 views
0 comments


HBase Fundamentals
HBase is a NoSQL DB that uses some capabilities of the Hadoop ecosystem to provide its features. NoSQL DBs (a.k.a Not Only SQL) are...
Sai Geetha M N
Jun 3, 20219 min read
1,495 views
2 comments


MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine learning Algorithm. This has to be understood...
Sai Geetha M N
May 27, 20215 min read
196 views
0 comments


Feature Scaling and its Importance
Feature Scaling is a very important aspect of data preparation for many Machine Learning Algorithms. Let us understand what is feature...
Sai Geetha M N
May 13, 20215 min read
98 views
1 comment


Regression Algorithms
#ExecutiveSummary #MLModels What is Regression? Regression is a statistical model/method used to determine the strength and character of...
Sai Geetha M N
Apr 8, 20212 min read
110 views
0 comments


Machine Learning Algorithms Categories
Machine Learning Algorithms learn from data as humans learn from experience. But the type of learning and the goal varies from algorithm...
Sai Geetha M N
Mar 16, 20214 min read
358 views
0 comments


The Machine Learning Landscape
If you are looking to start learning about the basics of Machine learning, you are at the right place. My blog will cover overviews of...
Sai Geetha M N
Mar 10, 20213 min read
613 views
3 comments
bottom of page