This is a detailed reading of Google’s paper “Deep Neural Networks for YouTube Recommendations”.
So Google, fxxk you.
After implementing grid search myself, then having a horrible time writing pyplot code to visualize the results, I finally decided to find an existing tool to do the hyperparameter (HP) tuning for me.
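As an illustration only (the excerpt does not say which tool was chosen), here is a minimal sketch of handing HP tuning to an existing library, assuming Optuna; the search space and the fake objective are placeholders for a real train/validate run.

```python
import optuna

def objective(trial):
    # Hypothetical search space; replace with your model's hyperparameters.
    lr = trial.suggest_float("lr", 1e-5, 1e-1, log=True)
    n_layers = trial.suggest_int("n_layers", 1, 4)
    # Stand-in for a real training run that returns a validation loss.
    return (lr - 1e-3) ** 2 + 0.01 * n_layers

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=50)
print(study.best_params)
```

A nice side effect of such tools is that they ship their own result visualizations, so the pyplot pain largely goes away.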
Before this, see 2024/06/17 Conducting Multi-Round Conversation with Transformers for why we need a cache at all. But attention has three matrices: query, key, and value. Why do we cache only the past keys and values? What about the past queries?
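A minimal sketch of the answer, using made-up names rather than any library’s API: at decode step t, the output for the new token is computed from its own query q_t attending over all past keys and values, while the queries of earlier tokens are never used again, so there is nothing to gain from caching them.

```python
import torch

d = 8
W_q, W_k, W_v = (torch.randn(d, d) for _ in range(3))
k_cache, v_cache = [], []          # grows by one entry per generated token

def decode_step(x_t):
    """x_t: (1, d) embedding of the newest token only."""
    q_t = x_t @ W_q                # query of the NEW token: used once, never cached
    k_cache.append(x_t @ W_k)      # keys and values ARE reused by every future step
    v_cache.append(x_t @ W_v)
    K = torch.cat(k_cache, dim=0)  # (t, d)
    V = torch.cat(v_cache, dim=0)  # (t, d)
    attn = torch.softmax(q_t @ K.T / d ** 0.5, dim=-1)  # (1, t)
    return attn @ V                # output for the new token only

for _ in range(5):                 # toy autoregressive loop
    out = decode_step(torch.randn(1, d))
```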
I was using LLaVA to ask how many characters there are in an image. For higher accuracy, I decided to employ Chain of Thought (CoT), but struggled to implement it: CoT is conducted through a multi-round conversation, which is easy in a graphical chat interface, but how is it done internally in code?
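A minimal sketch of the multi-round mechanics, assuming the Hugging Face transformers chat-template API: the model is stateless, so each round you re-serialize the whole message history (including the model’s own previous answer) and generate again. The model name and prompts are placeholders; the actual LLaVA setup additionally needs an image processor.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2-0.5B-Instruct"   # placeholder chat model, not the LLaVA pipeline
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

def ask(messages):
    # Serialize the full history every round; the model itself keeps no state.
    ids = tok.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
    out = model.generate(ids, max_new_tokens=128)
    return tok.decode(out[0, ids.shape[-1]:], skip_special_tokens=True)

# Round 1: elicit the intermediate reasoning step.
messages = [{"role": "user", "content": "List every character you can identify, one per line."}]
messages.append({"role": "assistant", "content": ask(messages)})

# Round 2: append the model's own answer, then ask for the final count.
messages.append({"role": "user", "content": "Now count them and reply with a single number."})
print(ask(messages))
```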
One day before Google I/O, OpenAI held its Spring Update event, introducing the end-to-end multimodal model GPT-4o.
CLIP investigates whether the success of task-agnostic, web-scale pre-training in NLP can be transferred to another domain: computer vision (CV).
Loss Scaling / Gradient Scaling was mentioned in Mixed-Precision Training as one of the three techniques, but there are many points to be careful about in practice.
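As one common realization of the technique (not necessarily the exact setup discussed in the post), here is a minimal sketch of dynamic loss scaling with PyTorch’s torch.cuda.amp.GradScaler; the toy model and data are only there to make the loop runnable, and a CUDA device is assumed for fp16 autocast.

```python
import torch

model = torch.nn.Linear(16, 1).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
scaler = torch.cuda.amp.GradScaler()                    # dynamic loss scaling

for _ in range(10):
    x = torch.randn(32, 16, device="cuda")
    y = torch.randn(32, 1, device="cuda")
    optimizer.zero_grad()
    with torch.autocast(device_type="cuda", dtype=torch.float16):
        loss = torch.nn.functional.mse_loss(model(x), y)
    scaler.scale(loss).backward()    # multiply the loss by the scale so tiny fp16 grads don't underflow
    scaler.step(optimizer)           # unscales grads first; skips the step if any grad is inf/nan
    scaler.update()                  # grows the scale after successes, shrinks it after an overflow
```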