Projects

Spotify Track Popularity – Data pipelines, Analysis & Prediction

Created three pipelines for data transformations, Features most clearly distinguish a popular song, Most Popular Song and Artists

Advancing Healthcare with PyTorch – Disease Analysis using Deep Learning Models

Disease Analysis in the United States - to gain meaningful insights from the disease data

Experiental Learning – Unleasing AutoML in talent management - GTA

This project is sponsored by Georgetown Analytics and Technology and goal is to predict the hiring status of the candidates

AirBnb Price Prediction- Explainable AI (LIME and SHAP)

Model Interpretation, Predict the price for listings using listing descriptions and features that affected the price

Leveraging Big Data Analytics in Aviation Industry

Apache Spark as our big data analytics tool within the Hadoop ecosystem

Time Series ARIMA and SARIMAX for Covid-19 Forecasting

Vector Autoregressive (VAR) model, Granger causality test:, ARIMA and SARIMAX used to determine the covid-19 forecasting in Boston

Portfolio Management Using Python for Healthcare Stocks

Built a portfolio using Modern Portfolio Theory (MPT) with one-year data [2020], calculated Value at Risk (VaR) and Conditional Value at Risk (CVaR) at a 99% confidence level for each trading day and week, forward tested the portfolio with the next year's data [2021], conducted hypothesis testing on the calculation described in step 3, and compared the results to the market benchmark - S&P 500, with returns presented in percentage and annualized.

Support_Vector_Machine_&_PCA_CreditCard_Default

From the perspective of risk management, we are trying to classify - credible or not credible clients