Explore the advantages of facebook new Prophet API developed for time series data specially is designed to be easier to use and find a good set of hyper parameters automatically in an effort to make skillful forecasts for data with trends without needing the requirement of special domain knowledge.
Prophet is an open source library published by Facebook designed to forecast Uni-variate time series data based on decomposition (trend+Seasonality+holidays) components. …
Full Guide to 15 different types of clustering methods
Hi there whatsup? I hope its all good!
Today we will take clustering into another level with more than 12 types of clustering methods.
Its goanna be a long chapter, shall we get started?
Here are the lists of clustering techniques that we will be going through:
1.) K-means
2.) H-Clust
3.) DBSCAN / Density-Based Spatial Clustering
4.) HDBSCAN / Hierarchical Density-Based Spatial Clustering
5.) Cure Algorithm
6.) Mini-Batch K-means Clustering
7.) Fuzzy Clustering
8.) Spectral Clustering
9.) Mean Shift
10.) BRICH / Balanced Iterative Reducing and Clustering using Hierarchies
11.)…
Transform your regular classifier into a Bagging Classifier in 2 lines of code
Hi do you know we can transform any of our classifier into Bagging Classifier like Random Forest for more accuracy. Sounds pretty interesting?
But before we get started lets recap what is Bagging?
Bagging is also known as Bootstrap aggregating, a machine learning ensemble meta-algorithm designed to improve the stability and accuracy of machine learning algorithms used for classification and regression analysis.
It helps to reduce variance and helps to avoid overfitting. The best example is Random Forest. …
Transform your regular classifier into a Bagging Classifier in 2 lines of code
Hi do you know we can transform any of our classifier into Bagging Classifier like Random Forest for more accuracy. Sounds pretty interesting?
But before we get started lets recap what is Bagging?
Bagging is also known as Bootstrap aggregating, a machine learning ensemble meta-algorithm designed to improve the stability and accuracy of machine learning algorithms used for classification and regression analysis.
It helps to reduce variance and helps to avoid overfitting. The best example is Random Forest. …
Let’s see how can we create a cryptocurrency fraud detection system
Hi, it’s been a while haven’t wrote anything since. But i’m back with some cool data to apply machine learning with. It’s Ethereum transactional dataset where we will create a model to identify fraud detection in the crypto currency world. You can download the dataset from here @ github.com/rupak-roy
I will try to keep it short and simple neat and clean with a few advance techniques like automation of best model selection.
Shall we get started?
import pandas as pd
dataset = pd.read_csv("transaction_dataset.csv",sep=",")
Guide to find the best ML algorithms in just 2 lines of code.
Hi there…………how’s things going on! been a while….. today i got again something really cool for you. Using the function we can easily sort out the best fit algorithms from the list of more than 500 algorithms.
Shall we get started?
QUICK ~ Install the package using
pip install lazypredict
Now if you get any errors while installing the package probably will be the package compatibility. OLD VERSIONS VS NEW VERSIONS…. ufff ya always hate that part. …
Combine the power of Stacking and Voting Classifiers for a high accuracy prediction.
Hi there how are you doing today? great? thanks for the continuous support. Today i show you 2 interesting way to increase the power of your model.
Yes! its Ensembling via 1.) Stacking and 2.) Voting Classifier
I will try to keep it short and simple focusing mainly on the core part of the topic.
Let’s begin with Stacking.
What is Stacking?
In short is an ensemble machine learning algorithm. …
Ordering Points To Identify Cluster Structure (OPTICS) ) is a density based clustering technique that allows to partition data into groups with similar characteristics(clusters)
Its addresses one of the DBSCAN’s major weaknesses. The problem of detecting meaningful clusters in data of varying density.
In a density based clustering, clusters are defined as dense regions of data points separated by low-density regions.
It adds two more terms to the concepts of DBSCAN clustering. They are:
Things i write about frequently on Medium: Data Science, Machine Learning, Deep Learning, NLP and many other random topics of interest. ~ Let’s stay connected!