Outlier detection and removal using IQR | Feature engineering tutorial python # 4

codebasics
codebasics
84.5 هزار بار بازدید - 4 سال پیش - IQR is another technique that
IQR is another technique that one can use to detect and remove outliers. The formula for IQR is very simple. IQR = Q3-Q1. Where Q3 is 75th percentile and Q1 is 25th percentile. Once you have IQR you can find upper and lower limit by removing this formula,
lower_limit = Q1-1.5*IQR
upper_limit = Q3 +1.5*IQR
Anything less than a lower limit or above the upper limit is considered outlier. We will use python pandas to remove outliers on a sample dataset and in the end, as usual, I have an interesting exercise for you to practice

Code & Exercise: https://github.com/codebasics/py/blob...
Link for kaggle dataset: https://www.kaggle.com/mustafaali96/w...

Topics

00:00 What is percentile and IQR
04:15 Remove outliers using IQR
06:55 Exercise

Do you want to learn technology from me? Check https://codebasics.io/ for my affordable video courses.

Website: https://codebasics.io/
Facebook: Facebook: codebasicshub
Twitter: Twitter: codebasicshub
4 سال پیش در تاریخ 1399/03/09 منتشر شده است.
84,539 بـار بازدید شده
... بیشتر