RAG |

The Creativity-Control Spectrum For RAG

November 19, 2024

From full creative freedom to strict fact matching, organizations can design chatbots that meet their specific risk tolerance and communication needs. The key is choosing an approach that serves the user while protecting the brand.

Tiny Retriever vs ChatGPT 🥊

October 7, 2023

The Tiny Retriever vs OpenAI RAG showdown! Did the fine-tuned underdog beat the champ? See the nDCG results, the tech stack (GPL, SBERT), and why losing isn’t the end.

Training so far

October 3, 2023

While my model is training, let’s take a moment to pause and reflect on the process so far, its thorns and roses, and make a few more Bert puns while we’re at it!

Building up my BERT stack

September 29, 2023

What makes a good retriever for RAG? Join me on a journey from classic BM25 to the world of BERT, DistilBERT, fine-tuning tricks like GPL, and why these ‘smaller’ models still punch above their weight.

Creating the tiniest information retriever

September 25, 2023

Why use giant LLMs for RAG? This post kicks off a mission to build a tiny, specialized information retriever using BERT, aiming for better performance on niche/non-English data, faster speeds, and no GPU required!