Built a small benchmark to decide which LLM should power a “find the
official site” feature for a hobby project.
Task = take a brand name, spit back the canonical URL (or “none”).
Results: GPT-4o-Mini and Llama-3.1-70B both hit 90 % accuracy at ~2 ¢/hit;
Perplexity is perfect but 45× the price; Gemini Flash is dirt-cheap but
only 70 % accurate.
Tables + code →
https://new.knife.day/blog/using-llms-for-knife-brand-research
Would love suggestions on making the parser bullet-proof, or on other
cheap model options I missed.
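
On the parser question, here is a minimal sketch of one way to harden it, assuming the model answers in free text that may wrap the URL in a sentence, markdown link, or quotes. `parse_reply` and the regex are hypothetical and are not the code from the linked post; the choice to reduce every answer to `https://<host>` is also an assumption about what "canonical" should mean here.

```python
import re
from urllib.parse import urlsplit

# Hypothetical pattern: an optional scheme, a dotted hostname, an optional path.
# It is deliberately loose and can false-match text like "us.Today".
_URL_RE = re.compile(
    r"(?:https?://)?(?:[a-z0-9-]+\.)+[a-z]{2,}(?:/[^\s<>\"')\]]*)?",
    re.IGNORECASE,
)


def parse_reply(reply: str) -> str:
    """Return 'https://<host>' for the first URL-like token, else 'none'."""
    text = reply.strip()
    # Treat empty output and variants such as "None." or '"none"' as no site.
    if not text or text.lower().strip(" .\"'`") == "none":
        return "none"
    match = _URL_RE.search(text)
    if match is None:
        return "none"
    candidate = match.group(0)
    # Bare domains get a scheme so urlsplit can find the hostname.
    if not candidate.lower().startswith(("http://", "https://")):
        candidate = "https://" + candidate
    host = urlsplit(candidate).hostname  # lowercased by urlsplit
    if not host:
        return "none"
    return f"https://{host}"
```

Comparing normalized hosts instead of raw strings means a trailing slash, a path, or different capitalization does not count as a wrong answer when scoring accuracy.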
submitted by /u/Putrid-Television981