Linear Regression with Louis from "What is Artificial Intelligence"

This lecture talks simply talks about Linear Regression. The lecture also shows how to get the job done on Python and with the help of sklearn.

Hands-on Linear Regression Using Sklearn

To predict the cereal ratings of the columns that give ingredients from the given dataset using linear regression with sklearn

Speeding up a sklearn model pipeline to serve single predictions with very low latency

Speeding up a sklearn model pipeline to serve single predictions with very low latency. Writing your own sklearn functions, (for now final)

Combining tree based models with a linear baseline model to improve extrapolation

Combining tree based models with a linear baseline model to improve extrapolation. Writing your own sklearn functions.

Applying Machine Learning To The E.Coli Class Imbalance Dataset

Applying Machine Learning To The E.Coli Class Imbalance Dataset. The E.Coli dataset is a very popular dataset to experiment on because it is a multi-classification that has several imbalances

Top 3 programming mistakes every data scientist makes

How to use Pandas, Sklearn and functions. This article identifies the most common mistakes prospect data scientists make and discuss how to avoid them. Without further ado, let’s jump straight into it.

The best Machine Learning algorithm for Email Classification

Email Classification works on the same basic concepts. By going through the text of the email, we will use Machine Learning algorithms to predict whether the email has been written by one person or the other. Implementing Machine Learning Algorithms to Classify Emails. The best Machine Learning algorithm for Email Classification

Apply Machine Learning on a Cancer Dataset

Apply Machine Learning on a Cancer Dataset. In this article, take a look at how to apply machine learning on a cancer dataset.

Machine learning basic library abstract

Machine learning basic library abstract. Before starting machine learning i’m starting machine learning required library using python.

Singapore Housing Prices ML Prediction — Analyse Singapore’s Property Price

I will share some popular machine learning algorithms to predict the housing prices and the live model that I have built. My objective is to find a model that can generate a high accuracy of the housing prices, based on the available dataset, such that given a new property and with the required information, we will know whether the property is over or under-valued.

Don’t Overfit

We will also see by how using the simple machine learning models like KNeighborsClassifier and LogisticRegression we can reduce overfitting and help our model generalize better on unseen data even with a less amount of training data that we have.

Enrich your train fold with a custom sampler inside an imblearn pipeline

Enrich your train fold with a custom sampler inside an imblearn pipeline. I wouldn’t be able to write this article without the help of my colleagues and people from StackOverflow!

Bank Data: Accuracy | F1 Score | ROC AUC

We discussed that the metric that we will be basing our results on was F1 Score using the Confusion Matrix. This blog will discuss, in depth, why.

Expanding your regression repertoire with regularisation

Classifying drum samples with logistic regression using a large number of features generated using tsfresh. You can also find this on GitHub. This GitHub repository includes everything you need to run the analyses yourself.

Titanic Survival Dataset Part 2/2: Logistic Regression

Welcome back! In my previous post I wrote an EDA (Exploratory Data Analysis) on Titanic Survival dataset. Check it out now if you haven’t already. Anyway, in this article I would like to be more focusing on how to create a machine learning model which is able to predict whether a Titanic passenger survived based on their attributes i.e. gender, title, age and many more.

Could a server with 64 cores be 100x slower than my laptop?

No spoilers here ;-)A long time ago I asked on Twitter if someone could help me with a puzzling problem. A tool I was using utilized the scipy linear algebra package to perform the calculations. Most of the time was spent, running the pinv function, which makes calculates the inverse matrix. There are four functions in the scipy.linalg module, that can calculate the inverse matrix: pinv, pinv2, pinvh, and inv.

Heart-Disease Classification(Classical algorithms vs Neural Networks)

Find out whether the person has heart disease or not in two ways. Well,well guys..Heart Disease Classification using structured data.

Imputing Missing Data Using Sklearn SimpleImputer

Imputing Missing Data Using Sklearn SimpleImputer. In this post, learn how to use Python's Sklearn SimpleImputer for imputing/replacing numerical and categorical missing data using different strategies.

Dimensionality Reduction

It is easy for us to visualize two or three dimensional data, but once it goes beyond three dimensions, it becomes much harder to see what high dimensional data looks like.

Does TF-IDF work differently in textbooks and sklearn routine?

TF-IDF is a simple twist in the bag of words approach. Bag of words just means (# times word w appears in a document d). TF-IDF stands for term frequency times inverse document frequency.