Projects
Things I've Built
End-to-end machine learning systems — from statistical monitoring pipelines to LLM applications and paper reproductions.
Featured
End-to-end ML monitoring pipeline that detects distribution drift between reference and production data streams using Kolmogorov–Smirnov tests.
PythonNumPyPandasSciPyStreamlit
Featured
Model Context Protocol server that connects technical PDFs to LLMs through an automated RAG pipeline with vector-database retrieval.
PythonLangChainVector DatabasesLLMsMCP
From-scratch PyTorch reproduction of “An Image is Worth 16×16 Words” with modular OOP design and custom training pipelines.
PythonPyTorch
A GPT-style language model trained on diverse text corpora (Amazon, IMDB, Reddit), covering architecture optimization and tokenization.
PythonPyTorchNLP
Motion detection using acoustic signals and signal processing — no dedicated hardware sensor required.
PythonSignal ProcessingNumPy
ML pipeline for network anomaly detection: EDA on ~125k records, Random Forest with Bayesian tuning, and an interactive Streamlit dashboard.
PythonScikit-learnskoptStreamlitPandas