Transforming complex data into actionable business strategies. Skilled in PySpark, SQL, and Pandas for scalable pipelines & RFM analytics.
Pinned
- Bank-Loan-Default-Prediction (Public): Distributed Bank Loan Default Prediction using Apache Spark & Hadoop. Processes 2.2M records (~1.55 GB) with a balanced Random Forest model. Achieves AUC ≈ 0.70 through strategic undersampling and P…
Jupyter Notebook 1
- Olist-Ecommerce-Data-Analysis (Public): End-to-end e-commerce data analysis using PySpark: customer segmentation (RFM), market insights, and a logistics performance dashboard.
Jupyter Notebook 1