Posts by Collection

portfolio

KV-Cache Refresh Methods for Long Generation

Authors: Yahya Emara, Woojeong Kim, Mohamed Abdelfattah

In this blog, we show how helpful KV cache refreshes can be for long generation from small models, along with efficient ways of finding when to refresh using inference algorithms.

publications

talks

Language Models and Using Instagram Data for Investment Opportunities

Published:

Language Models are now heavily used in Investment, but a big portion of data analysis done by companies in the financial sector has not fully integrated its capabilites. In this presentation, we show different methods of obtaining customer experience data on Instagram and present a solution to link it to the performance of a company’s stock, providing information on whether to invest or not

teaching

Natural Language Processing

Masters course, ETH Zurich, 2024

Teaching Assistant for Natural Language Processing Class; Taught by Prof. Ryan Cotterell