Description
Your queries are on a spectrum.
Head Tail
High Volume Low Volume
General Specific
Few Queries Many Queries
When we talk about volume, we talk about the amount of searches with the same query term.
Tail queries still have a large volume of search volume, but as a distribution.
What counts into the head, torso, and tail queries will always be application specific, so you have to log the queries and create some analytics for it to identify them.
Before you use them, you have to find them.
Examine your query logs (start logging if you haven't).Categorize queries based on frequency: Head: Most frequentTorso: Moderately frequentTail: Least frequentUse volume and percentiles for categorization.If you find any mistakes or performance improvements, please shout it out.
You can follow along with me on:
LinkedIn: Nicolay GeroldTwitter / X: Nicolay GeroldPodcast: How AI Is BuiltNewsletter: Nicolay GeroldYouTube: How AI Is BuiltDo you want to implement an AI or data solution? Hire me or my company; Aisbach.
Documentation quality is the silent killer of RAG systems. A single ambiguous sentence might corrupt an entire set of responses. But the hardest part isn't fixing errors - it's finding them.
Today we are talking to Max Buckley on how to find and fix these errors.
Max works at Google and has built...
Published 11/21/24
Ever wondered why vector search isn't always the best path for information retrieval?
Join us as we dive deep into BM25 and its unmatched efficiency in our latest podcast episode with David Tippett from GitHub.
Discover how BM25 transforms search efficiency, even at GitHub's immense scale.
BM25,...
Published 11/15/24