Popular repositories Loading
-
-
Spark-Kafka-Streaming-Project-Docker-
Spark-Kafka-Streaming-Project-Docker- PublicThis project implements a real-time data pipeline using Apache Kafka and Apache Spark Structured Streaming. Incoming sensor events are consumed from Kafka, processed in micro-batches, and persisted…
Python
-
SP500-SPY-Data-Pipeline-on-Synology-NAS
SP500-SPY-Data-Pipeline-on-Synology-NAS PublicThis project implements an end-to-end batch data pipeline that ingests intraday market data for the SPY ETF (proxy for the S&P 500) from a public API, stores raw data in PostgreSQL, processes it da…
Python
-
housing_price_prediction
housing_price_prediction PublicThis project focuses on predicting house prices using supervised machine learning techniques. Beyond achieving predictive performance, the main goal is to **understand model behavior, analyze error…
Python
-
CSV_ETL_Python_Spark_PowerBI
CSV_ETL_Python_Spark_PowerBI PublicThis project analyzes global earthquake data from the last 10 years using an end-to-end data pipeline.
HTML
If the problem persists, check the GitHub status page or contact support.