Projects

Things I've Built

End-to-end machine learning systems — from statistical monitoring pipelines to LLM applications and paper reproductions.

End-to-end ML monitoring pipeline that detects distribution drift between reference and production data streams using Kolmogorov–Smirnov tests.
Python, NumPy, Pandas, SciPy, Streamlit
2024
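As a minimal sketch of the kind of check a drift monitor like this might run, the snippet below compares a reference sample with a production sample using SciPy's two-sample Kolmogorov–Smirnov test. The function name `detect_drift`, the `alpha` threshold of 0.05, and the synthetic data are illustrative assumptions, not details taken from the project.

```python
import numpy as np
from scipy.stats import ks_2samp


def detect_drift(reference, production, alpha=0.05):
    """Flag drift when the two-sample KS test rejects 'same distribution'.

    `alpha` is an assumed significance level, not a value from the project.
    """
    result = ks_2samp(reference, production)
    return {
        "statistic": float(result.statistic),
        "p_value": float(result.pvalue),
        "drift": bool(result.pvalue < alpha),
    }


# Synthetic data for illustration only: production mean shifted by 1.0.
rng = np.random.default_rng(0)
reference = rng.normal(loc=0.0, scale=1.0, size=2000)
production = rng.normal(loc=1.0, scale=1.0, size=2000)

report = detect_drift(reference, production)
same = detect_drift(reference, reference)
print(report["drift"], same["drift"])
```

In practice a check like this would likely run per feature, and testing many features at once would call for some multiple-comparison correction.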
Model Context Protocol server that connects technical PDFs to LLMs through an automated RAG pipeline with vector-database retrieval.
Python, LangChain, Vector Databases, LLMs, MCP
2025
From-scratch PyTorch reproduction of “An Image is Worth 16×16 Words” with modular OOP design and custom training pipelines.
Python, PyTorch
2024
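The "16×16 words" in the paper title refers to splitting an image into fixed-size patches that are then flattened into token vectors. The sketch below illustrates that step in NumPy rather than PyTorch so it runs without a deep learning framework; the function name `patchify` and the 224×224×3 input size are assumptions for illustration and are not taken from the project's code.

```python
import numpy as np


def patchify(image, patch=16):
    """Split an (H, W, C) image into flattened, non-overlapping patches.

    Returns an array of shape (num_patches, patch * patch * C),
    ordered row by row across the image.
    """
    h, w, c = image.shape
    if h % patch or w % patch:
        raise ValueError("image height and width must be divisible by patch size")
    # (rows, patch, cols, patch, C) -> (rows, cols, patch, patch, C)
    grid = image.reshape(h // patch, patch, w // patch, patch, c)
    grid = grid.transpose(0, 2, 1, 3, 4)
    return grid.reshape(-1, patch * patch * c)


# Assumed input size for illustration: a 224x224 RGB image.
image = np.arange(224 * 224 * 3, dtype=np.float32).reshape(224, 224, 3)
tokens = patchify(image)
print(tokens.shape)
```

In a full Vision Transformer each flattened patch would then pass through a learned linear projection, with a class token and position embeddings added before the encoder.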
A GPT-style language model trained on diverse text corpora (Amazon, IMDB, Reddit), covering architecture optimization and tokenization.
Python, PyTorch, NLP
2026
Motion detection from acoustic signals using signal processing, with no dedicated hardware sensor required.
Python, Signal Processing, NumPy
2026
ML pipeline for network anomaly detection: EDA on ~125k records, Random Forest with Bayesian tuning, and an interactive Streamlit dashboard.
Python, Scikit-learn, skopt, Streamlit, Pandas
2024