Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
portfolio
Collaborative Filtering Methods for Paper Recommendation Systems
Describing collaborative filtering methods from classical SVD to NeuMF/GraphNeuMF/DMF and an ensemble to recommend papers.
The MCMC Inference Engine Behind a PPL
Overview of building an MCMC engine with practical implementation details.
KV-Cache Refresh Methods for Long Generation Permalink
In this blog, we show how helpful KV cache refreshes can be for long generation from small models, along with efficient ways of finding when to refresh using inference algorithms.
publications
Percutaneous cannulated screw fixation in the treatment for diabetic ankle fractures
Published in European Journal of Orthopedics, 2020
Inferring Interpretable Semantic Cognitive Maps from Noisy Document Corpora
Published in ICAART, 2024
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
Published in ICLR (Oral, top 1.8%), 2025
Language Models over Canonical Byte-Pair Encodings
Published in ICML, 2025
Fine-Tuning GPT-5 for GPU Kernel Generation
arXiv Preprint, 2026
talks
Language Models and Using Instagram Data for Investment Opportunities
Published:
Language Models are now heavily used in Investment, but a big portion of data analysis done by companies in the financial sector has not fully integrated its capabilites. In this presentation, we show different methods of obtaining customer experience data on Instagram and present a solution to link it to the performance of a company’s stock, providing information on whether to invest or not
