Welcome to my Data Science Portfolio. My passion is understanding complex data sets and derieve insights from it and also study and implement various state of the art models in deep learning and machine learning.
Recent Projects
Project 1: Phishing Domain Detection
This application takes inputs related to a URL’s features and depending upon that, it predicts whether the website is malicious or not. After doing various Experimentations it was decided that XGBoost was well performing ML algorithm for this use case The inputs are preprocessed and then given to the XGBoost model Depending upon the output of the model ,Output(whether the website is malicious or not) is displayed Link to Github
read more
Project 2: High-risk-appicant-identifier
It is Machine Learning Classification Problem After Taking various inputs from the user data preprocessing is done like(Handling missing values,handling categorical values,handling outliers,handling distribution,bringing data into a same scale) Various Ml algorithms were considered during this experimentation and by using the optuna library Performance metrics - Recall was used as False Negative were the main concern in this usecase Link to Github
read more
Project 3: T20I-Score-predictor
It is Machine Learning Regression Problem After Taking various inputs from the user data preprocessing is done like(Handling missing values,handling categorical values,handling outliers,handling distribution,bringing data into a same scale) Various Ml algorithms were considered during this experimentation and by using the optuna library and model was saved as a pickle file Performance metrics - Adjusted R square was used as a performance metric as this is a regression problem Link to Github
read more